Skip to content

Overview: Import dataset settings

Before a successful dataset import, you need to define a few dataset settings before the dataset can be imported and used for an experiment. The required dataset settings will depend upon the structure and content of the dataset; in other words, it will depend on the problem type the dataset aims to solve. To learn about the particular required import dataset settings for a supported problem type, see import dataset settings.

Note

  • After importing a dataset through one of the supported data connectors for a problem type, H2O Hydrogen Torch will automatically define the dataset settings of the problem type by exploring the content of the imported dataset. Though, before saving the defined settings, you have the option to modify an erroneous value or option given to a dataset.

  • To learn how to edit the settings of a saved (imported) dataset, see Edit imported dataset.

Import dataset

To import a dataset to H2O Hydrogen Torch, consider the following instructions:

  1. In the H2O Hydrogen Torch navigation menu, click Import dataset.
  2. In the Source list, select the source that you want to use (e.g., S3).

    Consider one of the following instruction sections depending on the source you selected.

    1. In the S3 bucket name box, enter the name of the S3 bucket name.
    2. In the S3 file name list, select the S3 file name.
    3. In the AWS access key box, enter the AWS access key.
    4. In the AWS secret key box, enter the AWS secret key.
    1. In the Kaggle API command box, enter a Kaggle API command.
    2. In the Kaggle username box, enter your username.
    3. In the Kaggle secret key box, enter your kaggle secret key.
    1. Click Browse.
      • Or drag and drop the file (dataset)
    2. Click Upload.
  3. Define the import dataset settings according to the dataset's problem type.

    Note

    To learn about the import dataset settings for a particular problem type, see Import dataset settings.

  4. Click Continue.

Note

  • The dataset must be in a .zip or .csv file format.

  • After importing a preprocessed dataset, you will be able to use it for an experiment.

  • To learn how to preprocess your dataset for a particular supported problem type, see Dataset formats.

Edit imported dataset

To edit the import dataset settings of a dataset, consider the following instructions:

  1. In the H2O Hydrogen Torch navigation menu, click View datasets.
  2. In the View datasets card, click Edit dataset.
  3. Select the dataset you want to edit.
  4. Click Edit dataset.
  5. Make your desired changes.
  6. Click Continue.


Back to top