Analyzing duplicates - 6.5

Talend Open Studio for MDM User Guide

You can use the match analysis in the Profiling perspective of the studio to compare columns in databases or delimited files and create groups of similar records using the VSR or the T-Swoosh algorithm.

This analysis provides you with a simple way to create match rules, test them on a set of columns, and see the results directly in the editor.
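To make the idea of grouping similar records concrete, the following is a minimal Python sketch of a single match rule applied to one column. It is not the VSR or T-Swoosh algorithm and does not use any studio API: the function names (`similarity`, `group_records`), the 0.8 threshold, the sample data, and the use of Python's standard `difflib.SequenceMatcher` as the matching function are all assumptions chosen for illustration.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Return a score between 0.0 and 1.0 for two strings (case-insensitive)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def group_records(records, key, threshold=0.8):
    """Group records whose `key` values score at or above `threshold`.

    Hypothetical helper: pairs that match are merged into the same group
    with a small union-find structure, so matches are transitive.
    """
    parent = list(range(len(records)))

    def find(i):
        # Follow parent links to the group representative, compressing the path.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Compare every pair of records once and merge the groups of matching pairs.
    for i in range(len(records)):
        for j in range(i + 1, len(records)):
            if similarity(records[i][key], records[j][key]) >= threshold:
                parent[find(j)] = find(i)

    # Collect records under their group representative.
    groups = {}
    for i, record in enumerate(records):
        groups.setdefault(find(i), []).append(record)
    return list(groups.values())


# Illustrative data only; not taken from any real data set.
sample = [
    {"name": "John Smith"},
    {"name": "Jon Smith"},
    {"name": "Mary Jones"},
]
groups = group_records(sample, "name")
for group in groups:
    print([record["name"] for record in group])
```

Raising the threshold makes the rule stricter and produces more, smaller groups; lowering it merges more records together. This is the kind of trade-off you can explore interactively when testing a match rule in the editor.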

You can also use the Profiling perspective to define match rules in a match rule editor and save them in the studio repository. For further information, see Creating a match rule.