Setting up the Job - 7.0

HDFS

author
Talend Documentation Team
EnrichVersion
7.0
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Open Studio for Big Data
Talend Real-Time Big Data Platform
EnrichPlatform
Talend Studio

Procedure

  1. Drop the following components from the Palette onto the design workspace: tFixedFlowInput, tFileOutputDelimited, tHDFSPut, tHDFSGet, tFileInputDelimited and tLogRow.
  2. Connect tFixedFlowInput to tFileOutputDelimited using a Row > Main connection.
  3. Connect tFileInputDelimited to tLogRow using a Row > Main connection.
  4. Connect tFixedFlowInput to tHDFSPut using an OnSubjobOk connection.
  5. Connect tHDFSPut to tHDFSGet using an OnSubjobOk connection.
  6. Connect tHDFSGet to tFileInputDelimited using an OnSubjobOk connection.
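
The steps above chain four subjobs: write a delimited file locally, put it on HDFS, get it back, then read and log it. The following is only a rough sketch of that data flow in plain Java, not the code Talend Studio generates. It assumes local temporary directories as a stand-in for HDFS so that it runs without a cluster. The class name HdfsRoundTripSketch, the method names, the file name in.csv, the sample rows and the ";" separator are all hypothetical.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

public class HdfsRoundTripSketch {

    // Subjob 1 (tFixedFlowInput -> tFileOutputDelimited, Row > Main):
    // write fixed rows to a local delimited file.
    static Path writeDelimited(Path file, List<String[]> rows, String sep) throws IOException {
        List<String> lines = new ArrayList<>();
        for (String[] row : rows) {
            lines.add(String.join(sep, row));
        }
        return Files.write(file, lines, StandardCharsets.UTF_8);
    }

    // Subjobs 2 and 3 (stand-ins for tHDFSPut and tHDFSGet):
    // ASSUMPTION: a plain local copy replaces the real HDFS transfer.
    static Path copyInto(Path source, Path targetDir) throws IOException {
        Files.createDirectories(targetDir);
        return Files.copy(source, targetDir.resolve(source.getFileName()),
                StandardCopyOption.REPLACE_EXISTING);
    }

    // Subjob 4 (tFileInputDelimited -> tLogRow, Row > Main):
    // read the retrieved file back into rows.
    static List<String[]> readDelimited(Path file, String sep) throws IOException {
        List<String[]> rows = new ArrayList<>();
        for (String line : Files.readAllLines(file, StandardCharsets.UTF_8)) {
            rows.add(line.split(Pattern.quote(sep), -1));
        }
        return rows;
    }

    public static void main(String[] args) throws IOException {
        Path work = Files.createTempDirectory("job-sketch");
        List<String[]> rows = Arrays.asList(
                new String[] {"1", "hello"},
                new String[] {"2", "world"});

        // Each call starts only after the previous one returns,
        // which mirrors the OnSubjobOk ordering between subjobs.
        Path local = writeDelimited(work.resolve("in.csv"), rows, ";");
        Path remote = copyInto(local, work.resolve("fake-hdfs"));
        Path fetched = copyInto(remote, work.resolve("download"));

        // Plays the role of tLogRow: print each row to the console.
        for (String[] row : readDelimited(fetched, ";")) {
            System.out.println(String.join("|", row));
        }
    }
}
```

In the sketch, each method call corresponds to one subjob, and the sequential calls in main play the role of the OnSubjobOk triggers: a subjob starts only once the previous one has finished without error.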