Executing the Job to compute suspect pairs and suspect sample - 7.0

Matching with machine learning

Author: Talend Documentation Team
Version: 7.0
Products: Talend Big Data Platform, Talend Data Fabric, Talend Data Management Platform, Talend Data Services Platform, Talend MDM Platform, Talend Real-Time Big Data Platform
Platform: Talend Data Stewardship, Talend Studio

Procedure

Press F6 to execute the Job.

Results

tMatchPairing computes the pairs of suspect records and the pairs sample, based on the blocking key definition, and writes the results to the output files.

tMatchPairing excludes unique rows from the suspect pairs and writes them to the output file.

tMatchPairing excludes exact duplicates from the suspect pairs and writes them in the Run view.

The component adds an extra read-only column, LABEL, to the output of the Pairs sample link.
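The split between suspect pairs, unique rows, and exact duplicates can be illustrated with a small standalone sketch. This is a conceptual illustration only, not how tMatchPairing is implemented: the function `split_records`, the sample records, and the blocking key (first letter of the name plus the city) are all hypothetical, and the sketch simply pairs every record that shares a blocking key value with another record.

```python
from collections import defaultdict
from itertools import combinations


def split_records(records, blocking_key):
    """Split records into suspect pairs, unique rows and exact duplicates.

    Simplified illustration; `records` must be hashable (tuples here).
    """
    # Step 1: set aside exact duplicates, keeping the first occurrence.
    seen = set()
    distinct, exact_duplicates = [], []
    for rec in records:
        if rec in seen:
            exact_duplicates.append(rec)
        else:
            seen.add(rec)
            distinct.append(rec)

    # Step 2: group the remaining records by their blocking key value.
    blocks = defaultdict(list)
    for rec in distinct:
        blocks[blocking_key(rec)].append(rec)

    # Step 3: a record alone in its block is unique; records that share
    # a block are paired with each other as suspects.
    suspect_pairs, unique_rows = [], []
    for block in blocks.values():
        if len(block) == 1:
            unique_rows.extend(block)
        else:
            suspect_pairs.extend(combinations(block, 2))
    return suspect_pairs, unique_rows, exact_duplicates


# Hypothetical input: (name, city) tuples.
records = [
    ("John Smith", "Paris"),
    ("Jon Smith", "Paris"),
    ("John Smith", "Paris"),  # exact duplicate of the first record
    ("Ann Lee", "Lyon"),
]

# Hypothetical blocking key: first letter of the name plus the city.
pairs, uniques, duplicates = split_records(
    records, lambda r: (r[0][0].upper(), r[1])
)
```

With this input, the two Paris records form one suspect pair, the Lyon record is unique, and the repeated record is reported as an exact duplicate.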

What to do next

You can use the LABEL column to label suspect records manually before using them with the tMatchModel component.
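As a rough sketch of what manual labeling amounts to, the snippet below fills the LABEL column of a small pairs sample held in CSV form. Everything apart from the LABEL column name is an assumption made for illustration: the `PAIR_ID`, `NAME` and `CITY` columns, the comma-separated layout, the `YES`/`NO` label values, and the helper `apply_labels` are hypothetical and may not match the files your Job produces.

```python
import csv
import io


def apply_labels(sample_csv, labels):
    """Return the CSV text with LABEL filled in for each labeled pair.

    `labels` maps a (hypothetical) PAIR_ID value to a label string.
    Rows whose pair has no entry in `labels` keep their current LABEL.
    """
    reader = csv.DictReader(io.StringIO(sample_csv))
    out = io.StringIO()
    writer = csv.DictWriter(
        out, fieldnames=reader.fieldnames, lineterminator="\n"
    )
    writer.writeheader()
    for row in reader:
        row["LABEL"] = labels.get(row["PAIR_ID"], row["LABEL"])
        writer.writerow(row)
    return out.getvalue()


# Hypothetical pairs sample: two rows per pair, LABEL initially empty.
sample = (
    "PAIR_ID,NAME,CITY,LABEL\n"
    "1,John Smith,Paris,\n"
    "1,Jon Smith,Paris,\n"
    "2,Ann Lee,Lyon,\n"
    "2,Anna Leigh,Lyon,\n"
)

# A reviewer's decisions, keyed by the hypothetical PAIR_ID.
labeled = apply_labels(sample, {"1": "YES", "2": "NO"})
```

Both rows of a pair receive the same label, so each labeled pair can then serve as one training example.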

For an example of how to generate a matching model using tMatchModel, see Generating a matching model.