Configuration wizard - 7.0

Data matching

author
Talend Documentation Team
EnrichVersion
7.0
EnrichProdName
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
task
Data Governance > Third-party systems > Data Quality components > Matching components > Data matching components
Data Quality and Preparation > Third-party systems > Data Quality components > Matching components > Data matching components
Design and Development > Third-party systems > Data Quality components > Matching components > Data matching components
EnrichPlatform
Talend Studio

The configuration wizard enables you to create different production environments, Configurations, and their match rules.

You can also use the configuration wizard to import match rules created and tested in the studio and use them in your match Jobs. For further information, see Importing match rules from the studio repository.

You can not open the configuration wizard unless you link the input component to the tMatchGroup component.

To open the configuration wizard:

Procedure

  1. In the studio workspace, design your job and link the components together, for example as below:
  2. Double-click tMatchGroup; or right-click it and from the contextual menu select Configuration Wizard; or click Preview in the basic settings view of tMatchGroup.
  3. In the popup that opens, click Skip Computation if you want to open the Configuration Wizard without running the match rules defined in it.

Results

The configuration wizard is composed of three areas:
  • the Configuration view, where you can set the match rules and the blocking column(s).

  • the matching chart, which presents the graphic matching result,

  • the matching table, which presents the details of the matching result.

The Limit field at the upper-left corner indicates the maximum number of rows to be processed by the match rule(s) in the wizard. The by-default maximum row number is 1000.