Skip to main content Skip to complementary content

Configuring the iteration

Procedure

  1. Double-click tHDFSList to open its Component view.
  2. In the Version area, select the Hadoop distribution you are connecting to and its version.
  3. In the Connection area, enter the values of the parameters required to connect to the HDFS.
    In the real-world practice, you may use tHDFSConnection to create a connection and reuse it from the current component. For further information, see tHDFSConnection.
  4. In the HDFS Directory field, enter the path to the folder where the files to be iterated on are. In this example, as presented earlier, the directory is /user/ychen/data/hdfs/out/.
  5. In the FileList Type field, select File.
  6. In the Files table, click to add one row and enter * between the quotation marks to iterate on any files existing.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!