Skip to content

Dataset connectors

H2O Hydrogen Torch provides a number of data connectors to access external data sources. The following data connection types are available:

  • Local: Standard upload feature

    The following parameter is required:

    • File location
  • S3: Amazon S3

    The following parameters are required:

    • S3 Bucket Name
    • S3 File Name
    • AWS Access Key
    • AWS Secret Key
  • Kaggle: Kaggle datasets

    The following parameters are required:

    • Kaggle API Command
    • Kaggle Username
    • Kaggle Secret Key

Note

  • Each data connector requires either a single .csv file or the data to be in a .zip file for a successful import.

  • The format of a dataset differs for different problem types. For more information, see Dataset Formats.

  • Before a successful dataset import, you will have to specify a set of dataset settings before the dataset can be used for a given experiment. The required dataset settings will differ upon the structure and content of the dataset. For more information, see Import Dataset Settings.

  • For the S3 and Kaggle connector, you can save your AWS and Kaggle credentials in your H2O Hydrogen Torch credentials to avoid the reenter of often used credentials. For more information, see global settings.

Back to top