Configuring the connection to HDFS - 7.0

HDFS

author
Talend Documentation Team
EnrichVersion
7.0
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Open Studio for Big Data
Talend Real-Time Big Data Platform
task
Data Governance > Third-party systems > File components (Integration) > HDFS components
Data Quality and Preparation > Third-party systems > File components (Integration) > HDFS components
Design and Development > Third-party systems > File components (Integration) > HDFS components
EnrichPlatform
Talend Studio

Procedure

  1. Double-click tHDFSExist to open its Component view.
  2. In the Version area, select the Hadoop distribution you are connecting to and its version.
  3. In the Connection area, enter the values of the parameters required to connect to HDFS.
    In real-world practice, you may use tHDFSConnection to create a connection and reuse it from the current component. For further information, see tHDFSConnection.
  4. In the HDFS Directory field, browse to, or enter the path to, the folder that contains the file to be checked. In this example, browse to /user/ychen/data/hdfs/out/dest.
  5. In the File name or relative path field, enter the name of the file whose existence you want to check. For example, output.csv.
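
To sanity-check the same path outside the Studio, the sketch below combines the HDFS Directory and File name values from steps 4 and 5 and queries the Hadoop WebHDFS REST API (the GETFILESTATUS operation). It is a minimal illustration, not how tHDFSExist works internally. It assumes that WebHDFS is enabled on the cluster and that no Kerberos or other authentication is required. The host name namenode.example.com, the port 9870, and the class and method names are hypothetical placeholders.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class HdfsExistSketch {

    // Joins the "HDFS Directory" value and the "File name or relative path"
    // value into one absolute HDFS path, avoiding a doubled slash.
    static String joinPath(String directory, String fileName) {
        String dir = directory.endsWith("/")
                ? directory.substring(0, directory.length() - 1)
                : directory;
        String file = fileName.startsWith("/") ? fileName.substring(1) : fileName;
        return dir + "/" + file;
    }

    // Builds the WebHDFS GETFILESTATUS URL for the joined path.
    // Host and port are assumptions; use your NameNode's HTTP address.
    static URI statusUri(String host, int port, String directory, String fileName) {
        return URI.create("http://" + host + ":" + port + "/webhdfs/v1"
                + joinPath(directory, fileName) + "?op=GETFILESTATUS");
    }

    // Returns true on HTTP 200 (status found) and false on HTTP 404
    // (file not found); any other status is reported as an error.
    static boolean exists(URI statusUri) throws Exception {
        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = HttpRequest.newBuilder(statusUri).GET().build();
        HttpResponse<Void> response =
                client.send(request, HttpResponse.BodyHandlers.discarding());
        if (response.statusCode() == 200) {
            return true;
        }
        if (response.statusCode() == 404) {
            return false;
        }
        throw new IllegalStateException(
                "Unexpected HTTP status " + response.statusCode());
    }

    public static void main(String[] args) throws Exception {
        // Values taken from steps 4 and 5 of the procedure.
        URI uri = statusUri("namenode.example.com", 9870,
                "/user/ychen/data/hdfs/out/dest", "output.csv");
        System.out.println(uri);
        // Call exists(uri) only when a reachable NameNode is configured:
        // System.out.println(exists(uri));
    }
}
```

The path joining is kept separate from the HTTP call so that the URL can be inspected before any request is sent to the cluster.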