Creating and defining a match rule - 6.2

Talend Data Fabric Studio User Guide

In MDM, match rules are used to decide whether two or more data records match, and how to handle them if they do.

In this scenario, a match rule is defined to match the staging data records that belong to the Product entity, based on the entity's Name field. If two or more staging data records have product names that are similar enough, they are matched and merged into a golden record, which is then written to the master database as a master data record.

For more information about how to create a data model, see Setting up a data model and creating some business entities.

To create a match rule, do the following:

  1. In the MDM Repository tree view, right-click Match Rule and then select New from the contextual menu.

  2. In the [New Match Rule] dialog box that opens, enter a name for the new match rule.

    If needed, enter information in the Purpose and Description fields to better describe your match rule.

  3. Click Finish to close the dialog box.

The newly created match rule is displayed under the Match Rule node.

In the Match Rule Editor that opens, you need to further define the characteristics of the match rule.

To define a match rule, do the following:

  1. In the Record linkage algorithm section, select T-Swoosh.

  2. In the Match and Survivor section, define the criteria to use when matching staging data records.

    In this example, add one match key named Productname, select Jaro-Winkler as the matching function, set the threshold to 0.85, and select Most trusted source as the survivorship function.
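To give a feel for what these settings mean, the following Python sketch computes a textbook Jaro-Winkler similarity, compares it against the 0.85 threshold, and picks a surviving value by source trust. It is an illustration only, not the Studio's or the MDM server's implementation. The function names, the source names (ERP, CRM), the trust order, and the assumption that a score equal to or above the threshold counts as a match are all hypothetical.

```python
from typing import Dict, List


def jaro(s1: str, s2: str) -> float:
    """Textbook Jaro similarity between two strings, in the range 0.0 to 1.0."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    # Characters match only if they are equal and close enough in position.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        for j in range(max(0, i - window), min(len2, i + window + 1)):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Count matched characters that appear in a different order.
    k = 0
    out_of_order = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                out_of_order += 1
            k += 1
    transpositions = out_of_order // 2
    return (matches / len1 + matches / len2
            + (matches - transpositions) / matches) / 3


def jaro_winkler(s1: str, s2: str, prefix_scale: float = 0.1) -> float:
    """Jaro similarity boosted for a shared prefix of up to four characters."""
    base = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1[:4], s2[:4]):
        if a != b:
            break
        prefix += 1
    return base + prefix * prefix_scale * (1 - base)


def is_match(name1: str, name2: str, threshold: float = 0.85) -> bool:
    # Assumption: a score equal to or above the threshold counts as a match.
    return jaro_winkler(name1, name2) >= threshold


def most_trusted_value(records: List[Dict[str, str]],
                       trust_order: List[str], field: str) -> str:
    """Return the field value from the record whose source ranks highest."""
    rank = {source: i for i, source in enumerate(trust_order)}
    best = min(records, key=lambda r: rank.get(r["source"], len(rank)))
    return best[field]


# Hypothetical staging records for the Product entity.
staging = [
    {"source": "CRM", "Productname": "Talend Cofee Mug"},
    {"source": "ERP", "Productname": "Talend Coffee Mug"},
]

if is_match(staging[0]["Productname"], staging[1]["Productname"]):
    # Hypothetical trust order: ERP is trusted more than CRM.
    golden_name = most_trusted_value(staging, ["ERP", "CRM"], "Productname")
    print(golden_name)
```

With this threshold, two names that differ by a single missing letter score well above 0.85 and are treated as the same product, while unrelated names score far lower and stay separate. Raising the threshold makes matching stricter; lowering it merges more aggressively.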

Once you have defined a match rule, you must attach it to a specific entity of a data model.

Note

You cannot deploy a match rule directly to the MDM server. Rather, match rules are deployed along with the data model to which they are attached.