Applying the matching model on the data set - Cloud - 8.0

Data matching with Talend tools

Version: Cloud 8.0
Language: English
Product: Talend Big Data Platform, Talend Data Fabric, Talend Data Management Platform, Talend Data Services Platform, Talend MDM Platform, Talend Real-Time Big Data Platform
Module: Talend Studio
Last publication date: 2024-02-06

Procedure

  1. Double-click tMatchPredict to display the Basic settings view and define the component properties.
  2. Click Sync columns to retrieve the schema defined in the input component.
  3. From the Input type list, select paired, because the input data has already been paired by tMatchPairing.
  4. From the Matching model location list, select from file system, and then set the path to the matching model in the Folder field.
  5. In the Clustering classes table, add one or more of the labels you assigned to the sample of suspect records generated by tMatchPairing (YES in this example).

    The labels were set manually or through Talend Data Stewardship. If you labeled the sample of suspect records using Talend Data Stewardship, add the answer(s) defined in the Grouping campaign to the table.

    The tMatchPredict component then groups the suspect records that match the YES label.
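
To make the effect of the Clustering classes table more concrete, here is a minimal conceptual sketch in Python. It is not Talend code and does not reproduce how tMatchPredict is implemented. It assumes that each suspect pair already carries a label, and it groups records linked by pairs whose label is listed in the clustering classes. The function name group_pairs, the record IDs and the sample labels are all hypothetical.

```python
from collections import defaultdict


def group_pairs(labeled_pairs, clustering_classes):
    """Group record IDs linked by pairs whose label is in clustering_classes.

    Hypothetical illustration only: a union-find over labeled pairs,
    not the algorithm used by tMatchPredict.
    """
    parent = {}

    def find(x):
        # Register unseen records as their own group root.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    for left, right, label in labeled_pairs:
        # Only pairs carrying one of the selected labels link records.
        if label in clustering_classes:
            union(left, right)

    groups = defaultdict(list)
    for record in list(parent):
        groups[find(record)].append(record)
    return sorted(sorted(members) for members in groups.values())


# Hypothetical suspect pairs: (record ID, record ID, label).
pairs = [
    ("r1", "r2", "YES"),
    ("r2", "r3", "YES"),
    ("r4", "r5", "NO"),
    ("r6", "r7", "YES"),
]
groups = group_pairs(pairs, {"YES"})
print(groups)
```

In this sketch, r1, r2 and r3 end up in one group and r6 and r7 in another, while the pair labeled NO is left ungrouped. Adding more labels to the set passed as clustering_classes mirrors adding more rows to the Clustering classes table.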