Configuring the iteration - 7.2

Applies to: Talend Big Data, Talend Big Data Platform, Talend Data Fabric, Talend Open Studio for Big Data, Talend Real-Time Big Data Platform (Talend Studio, version 7.2)

Procedure

  1. Double-click tHDFSList to open its Component view.
  2. In the Version area, select the Hadoop distribution you are connecting to and its version.
  3. In the Connection area, enter the values of the parameters required to connect to HDFS.
    In real-world practice, you can use tHDFSConnection to create a connection and reuse it in the current component. For further information, see tHDFSConnection.
  4. In the HDFS Directory field, enter the path to the folder that contains the files to iterate on. In this example, as presented earlier, the directory is /user/ychen/data/hdfs/out/.
  5. In the FileList Type field, select File.
  6. In the Files table, click the [+] button to add one row and enter * between the quotation marks to iterate on all existing files.
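
The effect of the * file mask entered in the last step can be illustrated with a minimal Java sketch. It assumes the mask behaves like a standard glob pattern and uses the JDK's own glob matcher on plain file names rather than a real HDFS connection. The class name FileMaskSketch, the method matching, and the sample file names are hypothetical and are not part of Talend or of the generated Job code.

```java
import java.nio.file.FileSystems;
import java.nio.file.PathMatcher;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class FileMaskSketch {

    // Returns the file names accepted by the given glob mask.
    // Assumption: the mask is interpreted as a glob, as in the JDK's PathMatcher.
    static List<String> matching(String mask, List<String> fileNames) {
        PathMatcher matcher = FileSystems.getDefault().getPathMatcher("glob:" + mask);
        List<String> result = new ArrayList<>();
        for (String name : fileNames) {
            if (matcher.matches(Paths.get(name))) {
                result.add(name);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Hypothetical file names standing in for the content of the HDFS directory.
        List<String> names = Arrays.asList("part-00000", "customers.csv", "orders.csv");

        // "*" accepts every file name, so the iteration would cover all files.
        System.out.println(matching("*", names));

        // A narrower mask restricts the iteration to a subset of the files.
        System.out.println(matching("*.csv", names));
    }
}
```

With the * mask, all three sample names are kept, while a mask such as *.csv would keep only the two CSV files.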