Reading the input files from your local folder - 7.1

Data Preparation

author: Talend Documentation Team
EnrichVersion: Cloud, 7.1
EnrichProdName: Talend Big Data, Talend Big Data Platform, Talend Data Fabric, Talend Data Integration, Talend Data Management Platform, Talend Data Services Platform, Talend ESB, Talend MDM Platform, Talend Real-Time Big Data Platform
task: Data Governance > Third-party systems > Data Preparation components; Data Quality and Preparation > Third-party systems > Data Preparation components; Design and Development > Third-party systems > Data Preparation components
EnrichPlatform: Talend Data Preparation, Talend Studio

Procedure

  1. In the design workspace, select tFileList_1 and click the Component tab to define its basic settings.

    This first tFileList reads the customers_files folder and retrieves the paths of its two subfolders so that they can be reused later.

  2. In the Directory field, enter the path to the customers_files folder, which contains the customer datasets in their respective subfolders.
  3. Click the + button in the Filemask table to add a new line, and set its value to "*", including the double quotes.
  4. In the design workspace, select tFileList_2 and click the Component tab to define its basic settings.

    This second tFileList reads the four .csv datasets contained in the two subfolders and retrieves their file paths.

  5. To fill the Directory field with the expression that dynamically retrieves the subfolder paths, drag that expression from the tFileList_1 list of expressions in the Outline panel.
  6. Select the Includes subdirectories check box.
  7. Click the + button in the Filemask table to add a new line, and set its value to "*.csv", including the double quotes.
  8. In the design workspace, select tFileInputDelimited and click the Component tab to define its basic settings.
  9. To fill the File name/Stream field with the expression that dynamically retrieves the input file paths, drag that expression from the tFileList_2 list of expressions in the Outline panel.
  10. In the Row Separator and Field Separator fields, enter the values that correspond to your datasets, between double quotes.
  11. Click the Edit schema button to define the columns of the source datasets and their data types.

    The schema is the same for all the datasets in the customers_files folder. Make sure that it matches the schema expected by the tDataprepRun component. In other words, the input schema must be the same as the schema of the datasets on which the preparations were originally built.
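
The iteration configured in the steps above can be pictured as two nested loops followed by a delimited read. The following is a minimal standalone Java sketch of that logic, not the code that Talend Studio generates. The class name, the method names, the ";" field separator and the default folder path are all assumptions made for illustration. In the Studio itself, an expression dragged from the Outline panel typically has a form such as ((String)globalMap.get("tFileList_1_CURRENT_FILEPATH")); which variable you need depends on your Job.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical illustration class; not part of Talend.
public class CustomerFilesReader {

    // Rough analogue of tFileList_1: list the immediate subfolders of the root folder.
    static List<Path> listSubfolders(Path root) throws IOException {
        try (Stream<Path> entries = Files.list(root)) {
            return entries.filter(Files::isDirectory).sorted().collect(Collectors.toList());
        }
    }

    // Rough analogue of tFileList_2 with a "*.csv" filemask and subdirectories included.
    static List<Path> listCsvFiles(Path folder) throws IOException {
        try (Stream<Path> entries = Files.walk(folder)) {
            return entries
                    .filter(Files::isRegularFile)
                    .filter(p -> p.getFileName().toString().endsWith(".csv"))
                    .sorted()
                    .collect(Collectors.toList());
        }
    }

    // Rough analogue of tFileInputDelimited: split each non-empty line on the field separator.
    // This naive split does not handle quoted fields that contain the separator.
    static List<String[]> readDelimited(Path file, String fieldSeparator) throws IOException {
        List<String[]> rows = new ArrayList<>();
        for (String line : Files.readAllLines(file, StandardCharsets.UTF_8)) {
            if (!line.isEmpty()) {
                rows.add(line.split(Pattern.quote(fieldSeparator), -1));
            }
        }
        return rows;
    }

    public static void main(String[] args) throws IOException {
        // Assumed default path; pass the real customers_files location as the first argument.
        Path root = Paths.get(args.length > 0 ? args[0] : "customers_files");
        for (Path subfolder : listSubfolders(root)) {
            for (Path csv : listCsvFiles(subfolder)) {
                List<String[]> rows = readDelimited(csv, ";");
                System.out.println(csv + ": " + rows.size() + " rows");
            }
        }
    }
}
```

Because every file is split with the same separator and read into the same column layout, the sketch also shows why a single schema has to fit all the datasets in the folder.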