All Downloads are FREE. Search and download functionalities are using the official Maven repository.

www.flow.help.2_2.html Maven / Gradle / Ivy

The newest version!

After you have imported your data, parse the data.

Flow - Parse options

The read-only Sources field displays the file path for the imported data selected for parsing.

The ID contains the auto-generated name for the parsed data (by default, the file name of the imported file with .hex as the file extension). Use the default name or enter a custom name in this field.

Select the parser type (if necessary) from the drop-down Parser list. For most data parsing, H2O automatically recognizes the data type, so the default settings typically do not need to be changed. The following options are available:

  • Auto

  • ARFF

  • XLS

  • XLSX

  • CSV

  • SVMLight

    Note: For SVMLight data, the column indices must be >= 1 and the columns must be in ascending order.

If a separator or delimiter is used, select it from the Separator list.

Select a column header option, if applicable:

  • Auto: Automatically detect header types.
  • First row contains column names: Specify heading as column names.
  • First row contains data: Specify heading as data. This option is selected by default.

Select any necessary additional options:

  • Enable single quotes as a field quotation character: Treat single quote marks (also known as apostrophes) in the data as a character, rather than an enum. This option is not selected by default.
  • Delete on done: Check this checkbox to delete the imported data after parsing. This option is selected by default.

A preview of the data displays in the “Edit Column Names and Types” section.

To change or add a column name, edit or enter the text in the column’s entry field. In the screenshot below, the entry field for column 16 is highlighted in red.

Flow - Column Name Entry Field

To change the column type, select the drop-down list to the right of the column name entry field and select the data type. The options are:

  • Unknown
  • Numeric
  • Enum
  • Time
  • UUID
  • String
  • Invalid

You can search for a column by entering it in the Search by column name… entry field above the first column name entry field. As you type, H2O displays the columns that match the specified search terms.

Note: Only custom column names are searchable. Default column names cannot be searched.

To navigate the data preview, click the <- Previous page or -> Next page buttons.

Flow - Pagination buttons

After making your selections, click the Parse button.

After you click the Parse button, the code for the current job displays.

Flow - Parse code

Since we’ve submitted a couple of jobs (data import & parse) to H2O now, let’s take a moment to learn more about jobs in H2O.






© 2015 - 2024 Weber Informatics LLC | Privacy Policy