The match analysis enables you to compare a set of columns in databases or in
delimited files and create groups of similar records using blocking and matching
keys and/or survivorship rules.
About this task
This analysis enables you to create match rules and test them on data to assess the number of duplicates before using the match rules in the tMatchGroup component, for example. Currently, you can test match rules only on columns in the same table.
Prerequisite(s): You have selected the Profiling perspective of Talend Studio. At least one database or file connection is defined under the Metadata node.
The sequence of setting up a match analysis involves the following steps: