Page History

Versions Compared

Old Version 2

changes.mady.by.user user-3484a

Saved on 30 Nov, 2017

compared with

New Version 3

changes.mady.by.user user-3484a

Saved on 30 Nov, 2017

Key

This line was added.
This line was removed.
Formatting was changed.

...

In order to configure the H2O step, you will need to connect to an instance of H2O by providing a valid URL, then choose a data science model. Next, the input fields of the model will need to be set up. This is done by mapping them with the data from the transformation flow. Unlike configuration with other outputs of data science models (such as PMML), there is no need to configure an output field. The generated result is defined during the creation of the model.

Model Output

Supported Model Categories

The types of models that Yellowfin supports can be generalized into four categories. Note: To check the category of a model, refer to the model’s Output section in H2O.

Image Added

Following describes the type of output each of these categories generate:

Regression: Models belonging to this category will generate actual predicted value for every row of data.
Binomial: These types of models will output the text label of the class that was predicted for every row.
Multinomial: (Same as above.)
Clustering: Such models will result in the index number of the cluster to which every row belongs.

Checking your model’s outputYour Model’s Output

In most cases, the user would know the output of the model. But you can still determine the output by selecting the model from your instance of H2O and checking the output settings. For example, for a binary model, the output can be checked in the model’s parameters.

Image Added

Datatype of your model:Your Model Output

The datatype of the output column will also depend on what has been configured in the model. It will be NUMERIC for models belonging to the “Clustering” and “Regression” categories. For other cases it will be TEXT.

Guideline: Using H2O.ai in Yellowfin

...

H2O is a modern open source AI platform that allows users to work with predictive models. You can download the latest version of H2O from here: http://h2o-release.s3.amazonaws.com/h2o/rel-weierstrass/7/index.html

You can use H2O either locally by starting it and using it on your machine, or by using a publicly available space accessible through a URL.

The following procedure shows how to run H2O locally:

Download H2O.ai
Unzip the file into a directory.
Open a terminal (Apple terminal or MSDOS) and go to the extracted folder.
Run the jar via “java –jar h2o.jar” – this will start the H2O.ai server.
By default, H2O.ai server will run at http://localhost:54321/ (once set up correctly). Note: You can customize the URL and other settings of your H2O instance.

...

H2O URL: To establish a connection to an instance of H2O, you would need to provide the instance’s URL. This can be done by either giving the default one if set up locally (e.g. http://localhost:54321/) or including the IP address (such as, http://127.0.0.1:54321/) for both, a local setup or remote access (ensure that you have a stable internet connection if trying to access it remotely). Note: You need to include http:// as part of the URL for it to be recognized correctly. The transformation step will not function if an incorrect URL is provided..