Setting up the Job

HDFS

author: Talend Documentation Team
EnrichVersion: 6.5
EnrichProdName:
  Talend Open Studio for Big Data
  Talend Real-Time Big Data Platform
  Talend Big Data Platform
  Talend Big Data
  Talend Data Fabric
task:
  Data Governance > Third-party systems > File components (Integration) > HDFS components
  Design and Development > Third-party systems > File components (Integration) > HDFS components
  Data Quality and Preparation > Third-party systems > File components (Integration) > HDFS components
EnrichPlatform: Talend Studio

  1. Drop the following components from the Palette onto the design workspace: tFixedFlowInput, tFileOutputDelimited, tHDFSPut, tHDFSGet, tFileInputDelimited and tLogRow.
  2. Connect tFixedFlowInput to tFileOutputDelimited using a Row > Main connection.
  3. Connect tFileInputDelimited to tLogRow using a Row > Main connection.
  4. Connect tFixedFlowInput to tHDFSPut using an OnSubjobOk connection.
  5. Connect tHDFSPut to tHDFSGet using an OnSubjobOk connection.
  6. Connect tHDFSGet to tFileInputDelimited using an OnSubjobOk connection.
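
The wiring above chains four subjobs: write a local delimited file, upload it, download it again, then read and display it. The following Java sketch only illustrates that order of execution and is not the code Talend Studio generates. It makes several assumptions: plain local directories stand in for HDFS, the class name JobFlowSketch, the method names, the file name in.csv and the two sample rows are all hypothetical, and each OnSubjobOk trigger is modeled as a sequential call that is reached only if the previous step finished without throwing an exception.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.List;

// Illustrative only: plain directories stand in for HDFS; all names are hypothetical.
final class JobFlowSketch {

    // Subjob 1: tFixedFlowInput -> tFileOutputDelimited (Row > Main).
    static void writeLocalFile(Path localFile, List<String> rows) throws IOException {
        Files.write(localFile, rows);
    }

    // Subjob 2: tHDFSPut, simulated by copying into the stand-in "HDFS" directory.
    static Path put(Path localFile, Path fakeHdfsDir) throws IOException {
        return Files.copy(localFile, fakeHdfsDir.resolve(localFile.getFileName()),
                StandardCopyOption.REPLACE_EXISTING);
    }

    // Subjob 3: tHDFSGet, simulated by copying back out of the stand-in directory.
    static Path get(Path remoteFile, Path downloadDir) throws IOException {
        return Files.copy(remoteFile, downloadDir.resolve(remoteFile.getFileName()),
                StandardCopyOption.REPLACE_EXISTING);
    }

    // Subjob 4: tFileInputDelimited -> tLogRow (Row > Main): read rows and print them.
    static List<String> readAndLog(Path file) throws IOException {
        List<String> rows = Files.readAllLines(file);
        rows.forEach(System.out::println);
        return rows;
    }

    // Runs the four subjobs in order; an exception in one step prevents the next,
    // which is how this sketch models the OnSubjobOk triggers.
    static List<String> run() {
        try {
            Path work = Files.createTempDirectory("job-sketch");
            Path fakeHdfs = Files.createDirectory(work.resolve("hdfs"));
            Path downloads = Files.createDirectory(work.resolve("downloads"));
            Path local = work.resolve("in.csv");

            writeLocalFile(local, List.of("1;Hello", "2;HDFS")); // hypothetical sample rows
            Path remote = put(local, fakeHdfs);
            Path fetched = get(remote, downloads);
            return readAndLog(fetched);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        run();
    }
}
```

Running the sketch prints the two sample rows, which corresponds to what tLogRow would display once the downloaded file has been read back by tFileInputDelimited.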