Writing output data in HDFS - 7.3

Machine Learning

Version
7.3
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Real-Time Big Data Platform
Module
Talend Studio
Last publication date
2024-02-21

Procedure

  1. Double-click the first tHDFSOutput to open its Component view.
  2. Click the [...] button next to the Folder field and browse to the folder in which you want to write the region data.
  3. From the Type list, select the data format for the records to be written. In this example, select Text file.
  4. From the Action list, select the operation to perform on the target file. If the file already exists, select Overwrite; otherwise, select Create.
  5. Select the Merge result to single file check box, then enter the path to the file in which to write the merged output data, or browse to that file.
  6. If the file for the merged data already exists, select the Override target file check box to overwrite it.
  7. Double-click the second tHDFSOutput to open its Component view.
  8. Configure this component in the same way to write the client channel data from the second cluster to an output HDFS folder.
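
The settings above can be illustrated with a small local-filesystem sketch. This is not the code that Talend Studio generates and it does not talk to HDFS; it only mimics, under stated assumptions, what the Action, Merge result to single file, and Override target file options appear to do. The function names write_parts and merge_parts, the part-file naming, and the rule that Create fails when the output folder already exists are all assumptions made for illustration.

```python
from pathlib import Path
import shutil
import tempfile


def write_parts(folder, chunks, action="create"):
    """Write each chunk of records to its own part file under `folder`.

    Assumption: "create" refuses to touch an existing folder, while
    "overwrite" discards the previous output first.
    """
    if action not in ("create", "overwrite"):
        raise ValueError(f"unknown action: {action}")
    folder = Path(folder)
    if folder.exists():
        if action == "create":
            raise FileExistsError(f"{folder} already exists; use 'overwrite'")
        shutil.rmtree(folder)  # overwrite: remove the previous output
    folder.mkdir(parents=True)
    paths = []
    for i, lines in enumerate(chunks):
        part = folder / f"part-{i:05d}"  # hypothetical part-file naming
        part.write_text("".join(line + "\n" for line in lines), encoding="utf-8")
        paths.append(part)
    return paths


def merge_parts(folder, target, override=False):
    """Concatenate the part files in `folder` into the single file `target`.

    Assumption: an existing target is replaced only when `override` is True.
    """
    target = Path(target)
    if target.exists() and not override:
        raise FileExistsError(f"{target} already exists; set override=True")
    target.parent.mkdir(parents=True, exist_ok=True)
    with target.open("w", encoding="utf-8") as out:
        for part in sorted(Path(folder).glob("part-*")):
            out.write(part.read_text(encoding="utf-8"))
    return target


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        out_folder = Path(tmp) / "regions"
        # Two chunks stand in for the partitions written by the Job.
        write_parts(out_folder, [["north", "south"], ["east"]], action="create")
        merged = merge_parts(out_folder, Path(tmp) / "merged" / "regions.txt")
        print(merged.read_text(encoding="utf-8"))
```

In this sketch, a second run against the same folder has to use action="overwrite" and override=True, which corresponds to selecting Overwrite in step 4 and the Override target file check box in step 6.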