Reading data from the HDFS and saving the data locally - 7.0

HDFS

Author: Talend Documentation Team
Version: 7.0
Products: Talend Big Data, Talend Big Data Platform, Talend Data Fabric, Talend Open Studio for Big Data, Talend Real-Time Big Data Platform
Platform: Talend Studio

Procedure

  1. Double-click tFileInputDelimited to open its Basic settings view and define the component.
  2. Set Property type to Built-In.
  3. Next to the File Name/Stream field, click the three-dot button and browse to the file you have obtained from HDFS. In this scenario, the file path is C:/hadoopfiles/getFile/in.txt.
  4. Set Schema to Built-In and click Edit schema to define the data to pass on to the tLogRow component.
  5. Click the plus button to add a new column.
  6. Click OK to close the dialog box and, when prompted by the Studio, accept the propagation of the changes.
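
For readers who want to see what this configuration amounts to outside the Studio, the following is a minimal Java sketch of reading a delimited local file and printing each row, roughly what tFileInputDelimited followed by tLogRow does at run time. It is not the code Talend generates. The class name DelimitedFileSketch, the helper methods, the semicolon field separator, the UTF-8 encoding and the pipe used when printing are all assumptions made for illustration; only the file path comes from this scenario.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical illustration class, not part of any Talend API.
public class DelimitedFileSketch {

    // Split one line into fields. The separator is quoted so it is treated
    // literally, and the -1 limit keeps trailing empty fields.
    static List<String> parseLine(String line, String separator) {
        return Arrays.asList(line.split(Pattern.quote(separator), -1));
    }

    // Read every non-empty line of the file and split it into fields.
    // UTF-8 is an assumption; use the encoding of your own file.
    static List<List<String>> readRows(Path file, String separator) throws IOException {
        List<List<String>> rows = new ArrayList<>();
        for (String line : Files.readAllLines(file, StandardCharsets.UTF_8)) {
            if (line.isEmpty()) {
                continue; // skip empty rows
            }
            rows.add(parseLine(line, separator));
        }
        return rows;
    }

    public static void main(String[] args) throws IOException {
        // Path taken from this scenario; pass another path as the first argument to override it.
        Path file = Paths.get(args.length > 0 ? args[0] : "C:/hadoopfiles/getFile/in.txt");
        // ";" is an assumed field separator; match it to the component settings.
        for (List<String> row : readRows(file, ";")) {
            System.out.println(String.join("|", row)); // print each row to the console
        }
    }
}
```

With a one-column schema such as the one defined in this procedure, each printed row contains a single field.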