Configuring the components - 6.5

tFileList

author
Talend Documentation Team
EnrichVersion
6.5
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for ESB
Talend Open Studio for MDM
Talend Real-Time Big Data Platform
task
Data Governance > Third-party systems > File components (Integration) > File management components > tFileList
Data Quality and Preparation > Third-party systems > File components (Integration) > File management components > tFileList
Design and Development > Third-party systems > File components (Integration) > File management components > tFileList
EnrichPlatform
Talend Studio

Procedure

  1. In the Basic settings view of the first tFileList component, fill the Directory field with the path to the first folder you want to read filenames from, E:/DataFiles/DI/images in this example, and leave the other settings as they are.
  2. Double-click the first tIterateToFlow component to show its Basic settings view.
  3. Double-click the [...] button next to Edit schema to open the [Schema] dialog box and define the schema of the text file the next component will write filenames to. When done, click OK to close the dialog box and propagate the schema to the next component.
    In this example, the schema contains only one column: Filename.
  4. In Value field of the Mapping table, press Ctrl+Space to access the autocomplete list of variables, and select the global variable ((String)globalMap.get("tFileList_1_CURRENT_FILE")) to read the name of each file in the input directory, which will be put into a data flow to pass to the next component.
  5. In the Basic settings view of the first tFileOutputDelimited component, fill the File Name field with the path of the text file that will store the filenames from the incoming flow, D:/temp/tempdata.csv in this example. This completes the configuration of the first subjob.
  6. Repeat the steps above to complete the configuration of the second subjob, but:
    • fill the Directory field in the Basic settings view of the second tFileList component with the other folder you want to read filenames from, E:/DataFiles/DQ/images in this example.

    • select the Append check box in the Basic settings view of the second tFileOutputDelimited component so that the filenames previously written to the text file will not be overwritten.

  7. In the Basic settings view of the tFileInputDelimited component, fill the File name/Stream field with the path of the text file that stores the list of filenames, D:/temp/tempdata.csv in this example, and define the file schema, which contains only one column in this example, Filename.
  8. In the Basic settings view of the tUniqRow component, select the Key attribute check box for the only column, Filename in this example.
  9. In the Basic settings view of the tLogRow component, select the Table (print values in cells of a table) option for better display effect.