Setting up the Job

HDFS

author
Talend Documentation Team
EnrichVersion
6.5
EnrichProdName
Talend Data Fabric
Talend Big Data Platform
Talend Open Studio for Big Data
Talend Real-Time Big Data Platform
Talend Big Data
task
Data Quality and Preparation > Third-party systems > File components (Integration) > HDFS components
Data Governance > Third-party systems > File components (Integration) > HDFS components
Design and Development > Third-party systems > File components (Integration) > HDFS components
EnrichPlatform
Talend Studio

Procedure

  1. Drop the following components from the Palette onto the design workspace: tFixedFlowInput, tFileOutputDelimited, tHDFSPut, tHDFSGet, tFileInputDelimited and tLogRow.
  2. Connect tFixedFlowInput to tFileOutputDelimited using a Row > Main connection.
  3. Connect tFileInputDelimited to tLogRow using a Row > Main connection.
  4. Connect tFixedFlowInput to tHDFSPut using an OnSubjobOk connection.
  5. Connect tHDFSPut to tHDFSGet using an OnSubjobOk connection.
  6. Connect tHDFSGet to tFileInputDelimitedusing an OnSubjobOk connection.