Accessing training data - 7.3
Machine Learning
- Version
- 7.3
- Language
- English
- Product
- Talend Big Data
- Talend Big Data Platform
- Talend Data Fabric
- Talend Real-Time Big Data Platform
- Module
- Talend Studio
- Content
- Data Governance > Third-party systems > Machine Learning components
- Data Quality and Preparation > Third-party systems > Machine Learning components
- Design and Development > Third-party systems > Machine Learning components
Procedure
-
Add a tFileDelimitedInput component to the palette.
-
Set the Property Type to Repository, then choose HDFS:MarketingCampaignData.
-
Click the ellipsis to the right of Folder/File and navigate to the training dataset in HDFS, in this case it is located at /user/puccini/machinelearning/marketing/marketing_campaign_train.csv.
-
Click OK.
-
For Schema, choose Repository and select the schema you created earlier.