Reading data from the HDFS and saving the data locally - 7.3


Version: 7.3
Language: English
Product: Talend Big Data, Talend Big Data Platform, Talend Data Fabric, Talend Real-Time Big Data Platform
Module: Talend Studio
Last publication date: 2024-02-21

Procedure

  1. Double-click tFileInputDelimited to open its Basic settings view and define the component.
  2. Set Property type to Built-In.
  3. Next to the File Name/Stream field, click the [...] button to browse to the file you have obtained from the HDFS. In this scenario, the file path is C:/hadoopfiles/getFile/in.txt.
  4. Set Schema to Built-In and click Edit schema to define the data to pass on to the tLogRow component.
  5. Click the plus button to add a new column.
  6. Click OK to close the dialog box, and accept the propagation of the changes when prompted by the Studio.
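
For readers who want to see what this configuration amounts to outside the Studio, the following is a minimal Java sketch of reading the local file and printing each row, roughly what the tFileInputDelimited to tLogRow link does. It is not the code the Studio generates. The class name DelimitedFileSketch, the helper methods readRows and formatRow, the ";" field separator, and the "|" output separator are all assumptions made for illustration; only the path C:/hadoopfiles/getFile/in.txt comes from this scenario.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical illustration only; not the code Talend Studio generates.
public class DelimitedFileSketch {

    // Reads the file line by line, skips empty lines, and splits each
    // remaining line on the given separator (treated as a literal string).
    static List<String[]> readRows(Path file, String fieldSeparator) throws IOException {
        List<String[]> rows = new ArrayList<>();
        for (String line : Files.readAllLines(file, StandardCharsets.UTF_8)) {
            if (line.isEmpty()) {
                continue;
            }
            // The -1 limit keeps trailing empty fields instead of dropping them.
            rows.add(line.split(Pattern.quote(fieldSeparator), -1));
        }
        return rows;
    }

    // Joins the fields of one row for console output.
    // The "|" separator is an assumption chosen for this sketch.
    static String formatRow(String[] fields) {
        return String.join("|", fields);
    }

    public static void main(String[] args) throws IOException {
        // Path from this scenario; pass another path as the first argument to override it.
        Path file = Paths.get(args.length > 0 ? args[0] : "C:/hadoopfiles/getFile/in.txt");
        // The ";" field separator is an assumption; match it to the component settings.
        for (String[] row : readRows(file, ";")) {
            System.out.println(formatRow(row));
        }
    }
}
```

Each String[] returned by readRows plays the role of one row flowing to tLogRow; the number of fields per row should match the columns defined in the schema editor.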