Configuring the components - Cloud - 8.0

Data matching with Talend tools

Version: Cloud, 8.0
Language: English
Product: Talend Big Data Platform, Talend Data Fabric, Talend Data Management Platform, Talend Data Services Platform, Talend MDM Platform, Talend Real-Time Big Data Platform
Module: Talend Studio
Last publication date: 2024-02-06

Procedure

  1. Double-click the tFileInputDelimited component to display its Basic settings view.
  2. Click the [...] button next to the File Name/Stream field to browse to your input file.
  3. Define the header and footer rows.
    In this example, the first row of the input file is the header row, so set Header to 1.
  4. Click Edit schema to define the schema for this component.
    1. Click the [+] button to add columns. In this example, the input file has five columns: Id, FirstName, LastName, Age, and City.
    2. Click OK.
  5. Double-click the tSortRow component to display its Basic settings view.
  6. To sort the entries alphabetically by name, click the [+] button twice to add two rows to the Criteria table, then configure each row:
    1. Under Schema column, select FirstName in one row and LastName in the other.
    2. Select alpha as the sorting type.
    3. Select asc as the sorting order.
  7. Double-click the tUniqRow component to display its Basic settings view.
  8. In the Unique key area, select the columns on which to deduplicate.
    In this example, deduplication is carried out on the names.
  9. In the Basic settings view of each tLogRow component, select the Table option to display the Job execution result as a table.
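
For readers who want to see the logic of this Job outside Talend Studio, the steps above can be sketched in plain Python. This is only an illustration under stated assumptions, not the code that Talend Studio generates: the sample data, the semicolon field separator, the choice of FirstName and LastName as the unique key, and all function names are hypothetical. Python sorts strings by code point, which may differ from the alpha ordering that tSortRow applies to some characters.

```python
import csv
import io

# Hypothetical sample data; the actual input file is not shown in this procedure.
# The ";" separator is an assumption.
SAMPLE = """Id;FirstName;LastName;Age;City
1;Paul;Smith;34;Boston
2;Anna;Jones;28;Denver
3;Paul;Smith;41;Austin
4;Anna;Brown;52;Miami
"""

COLUMNS = ["Id", "FirstName", "LastName", "Age", "City"]


def read_rows(stream, delimiter=";"):
    """Read delimited text whose first row is the header (role of tFileInputDelimited)."""
    return list(csv.DictReader(stream, delimiter=delimiter))


def sort_rows(rows):
    """Sort ascending by FirstName, then LastName (role of tSortRow)."""
    return sorted(rows, key=lambda r: (r["FirstName"], r["LastName"]))


def split_unique(rows, key_columns):
    """Separate the first occurrence of each key from later duplicates (role of tUniqRow)."""
    seen, uniques, duplicates = set(), [], []
    for row in rows:
        key = tuple(row[c] for c in key_columns)
        if key in seen:
            duplicates.append(row)
        else:
            seen.add(key)
            uniques.append(row)
    return uniques, duplicates


def print_table(title, rows):
    """Print rows as a fixed-width table (role of tLogRow in Table mode)."""
    widths = [max([len(c)] + [len(r[c]) for r in rows]) for c in COLUMNS]
    print(title)
    print(" | ".join(c.ljust(w) for c, w in zip(COLUMNS, widths)))
    for r in rows:
        print(" | ".join(r[c].ljust(w) for c, w in zip(COLUMNS, widths)))


if __name__ == "__main__":
    sorted_rows = sort_rows(read_rows(io.StringIO(SAMPLE)))
    uniques, duplicates = split_unique(sorted_rows, ["FirstName", "LastName"])
    print_table("Uniques", uniques)
    print_table("Duplicates", duplicates)
```

With the sample data, the two Paul Smith rows share the same key, so the first one encountered after sorting goes to the unique output and the second goes to the duplicate output.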