Configuring the merging process

Data matching with Talend tools

Version
Cloud
8.0
Language
English
Product
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Last publication date
2024-02-06

Procedure

  1. Double-click tUnite to open its Basic settings view.
  2. Click [...] next to Edit schema to check that the output schema corresponds to the schema from the input tFileInputDelimited components.
  3. Double-click the first tFileOutputDelimited component to display the Basic settings view and define the component properties.
    You already agreed to propagate the schema to the output components when you defined the input component.
  4. Clear the Define a storage configuration component check box to use the local system as your target file system.
  5. In the Folder field, set the path to the folder that will hold the output data.
  6. From the Action list, select the operation for writing data:
    • Select Create when you run the Job for the first time.
    • Select Overwrite to replace the file every time you run the Job.

  7. Set the row and field separators in the corresponding fields.
  8. Select the Merge results to single file check box, and in the Merge file path field, enter the path of the file that will hold the clean, deduplicated data set.
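
The effect of the Action list and the Merge results to single file option can be pictured with a small Python sketch. This is only an illustration, not how the Talend components are implemented: the function names `write_output` and `merge_parts`, the `part-*` file naming, and the assumption that Create refuses to write into an existing output folder are all hypothetical.

```python
import shutil
import tempfile
from pathlib import Path


def write_output(folder: Path, parts: dict, action: str = "create") -> None:
    """Write one file per entry of `parts` into `folder`.

    Assumption: "create" refuses an existing folder, while
    "overwrite" deletes it and writes the data again.
    """
    if action not in ("create", "overwrite"):
        raise ValueError(f"unknown action: {action}")
    if folder.exists():
        if action == "create":
            raise FileExistsError(f"{folder} already exists; use 'overwrite'")
        shutil.rmtree(folder)
    folder.mkdir(parents=True)
    for name, text in parts.items():
        (folder / name).write_text(text, encoding="utf-8")


def merge_parts(folder: Path, merge_file: Path) -> int:
    """Concatenate the part-* files of `folder`, in name order,
    into `merge_file`. Returns the number of files merged."""
    part_files = sorted(folder.glob("part-*"))
    with merge_file.open("w", encoding="utf-8") as out:
        for part in part_files:
            out.write(part.read_text(encoding="utf-8"))
    return len(part_files)


if __name__ == "__main__":
    # Hypothetical sample data using ";" as field separator
    # and a newline as row separator.
    work_dir = Path(tempfile.mkdtemp())
    output_folder = work_dir / "output"
    sample = {"part-00000": "1;Alice\n", "part-00001": "2;Bob\n"}

    write_output(output_folder, sample, action="create")      # first run
    write_output(output_folder, sample, action="overwrite")   # later runs
    merged = work_dir / "merged.csv"
    merge_parts(output_folder, merged)
    print(merged.read_text(encoding="utf-8"))
```

In this sketch the merge file is kept outside the output folder so that a later Overwrite run does not delete it along with the partial files.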