Like what you see? Have a play with our trial version.

Error rendering macro 'rw-search'

null

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  1. Build a model in PMML and save it in .xml format. (This file will be referred to as the PMML file in this guide.)
  2. Navigate to Yellowfin’s Data Transformation model. (Create button > Transformation Flow)
  3. Create a transformation flow beginning with an input step to extract data. (Click here if you want to learn how to create a basic flow, or here to learn about the different ways data can be extracted).



  4. The extracted data will appear in the data preview panel. You can add more steps to further transform the data.
  5. Once you are ready to use the PMML model, follow the procedure below.
  6. Drag the PMML model prediction step from the transformation step list onto the canvas. (Note: If you don’t see this step, ensure that you have installed the PMML plug-in.)



  7. Using a connection point, create a connection from the last step to the PMML transformation step.



  8. Next, you need to configure the PMML step. (Make sure it is selected to bring up its configuration screen.)



  9. Upload your PMML file. There are two ways of doing this.
    1. Drag the file into the specified panel, or browse and select it. (This is the preferred method if the file is located on your local machine.)
    2. Or enable the Load from Path toggle and load the file by providing its path or URL. Then click on the Load File button.



    3. Provide the full URL to access the PMML file, for example: http://dmg.org/pmml/pmml_examples/knime_pmml_examples/ElNinoLinearReg.xml
      (Note: To be able to load a PMML file using a URL, you would need an internet connection, otherwise the file will not load.)

      Example of a full file path on a Windows machine is given below:
      C:\Users\admin\Desktop\audit_cluster.xml

      Example of a file path on a Linux machine is given below:
      /root/audit_cluster.xml

      Note: If the file cannot be parsed, or if the path provided is invalid, then the following error message will appear.



      Note 2: When using the drag and drop method to upload a file, the file name might disappear when you first upload it, but will reappear once an action on the step is performed.
  10. Once the file is successfully uploaded, detailed configuration options will appear to map the input fields and specify the output. The input and output fields that appear will be dependent on the model.



  11. First, match the input columns required by your model with the appropriate columns in the extracted file. For example, our model requires input in the form of income, age and gender.
    Note: It is essential to make the correct mapping here in order to produce the correct result. The data fields may not have the same names as they do in the model. You should ensure that the selected fields match the datatype that the model is expecting. (If an incorrect mapping is made, the Errors tab will generate errors for each of the data values.)



    Note 2: All the input fields are mandatory for mapping, the step will not run unless all these are properly matched.
  12. Next, select which model output result you would like to generate. The configure panel will display all the possible outputs that your model is designed to produce. You can choose one or more outputs (however at least one output field must be selected).
    Note: The “Default Target” output field contains the default output of the model. If the PMML file does not explicitly specify an output field, then only the “Default Target” option will be available.



  13. Once the input and output fields have been properly configured, click Save. (Note: If you click anywhere outside of the configure panel after configuring the model and before clicking Save, you will lose the configuration details.)
  14. On doing so, columns(s) with the model’s result will appear in the data preview panel. (Note: Since we selected two output fields, therefore our example shows two new result columns.)

    Note

    The datatype of these output columns will depend on what has been specified in the model. However, if not specified, then the system will use Text as the default type.





  15. You can perform further transformations or save your result in a database.

...