Big Data Platform
Cloud API Services Platform
Cloud Big Data Platform
Cloud Data Fabric
Cloud Data Management Platform
Data Management Platform
Data Services Platform
Real-Time Big Data Platform
The match analysis enables you to compare a set of columns in databases or in delimited files, and create groups of similar records using blocking and matching keys and survivorship rules.
At least one database or file connection is defined under the Metadata node.
Before you begin
This analysis enables you to create match rules and test them on data to assess the number of duplicates. You can test match rules only on columns in the same table.
About this task
Creating the connection to a data source from inside the editor if no
connection has been defined under the Metadata
folder in the Studio tree view.
For further information, see Configuring the match analysis.
Defining the table or the group of columns you want to search for similar
records using match processes.
For further information, see Defining a match analysis from the Analysis folder or Defining a match analysis from the Metadata folder.
Defining blocking keys to reduce the number of pairs that need to be
For further information, see Defining a match rule.
- Defining match keys, the match methods according to which similar records are grouped together. For further information, see Defining a match rule.
Exporting the match rules from the match analysis editor and centralize them
in the Studio repository.
For further information, see Importing or exporting match rules.
- Generating reports on the match analyses and save them in a distant database. These reports let you compare current and historical statistics to determine the evolution of data. For more information, see What are reports?.