Applying the matching model on the data set - 7.0

Matching with machine learning

author
Talend Documentation Team
EnrichVersion
7.0
EnrichProdName
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
task
Data Governance > Third-party systems > Data Quality components > Matching components > Matching with machine learning components
Data Quality and Preparation > Third-party systems > Data Quality components > Matching components > Matching with machine learning components
Design and Development > Third-party systems > Data Quality components > Matching components > Matching with machine learning components
EnrichPlatform
Talend Data Stewardship
Talend Studio

Procedure

  1. Double-click tMatchPredict to display the Basic settings view and define the component properties.
  2. Click Sync columns to retrieve the schema defined in the input component.
  3. From the Input type list, select paired as the input data is already paired with tMatchPairing.
  4. From the Matching model location list, select from file system and then set the path to the matching model in the folder field.
  5. In the Clustering classes table, add one or more of the labels you used on the sample suspects generated by tMatchPairing, YES in this example.

    The labels were set manually or through Talend Data Stewardship. If you labeled the sample of suspect records using Talend Data Stewardship, add the answer(s) defined in the Grouping campaign to the table.

    The tMatchPredict component will group suspect records which match the YES label.