Configuring the process of matching data - 7.3

Name standardization

Version
7.3
Language
English
Product
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Data Quality components > Standardization components > Name standardization components
Data Quality and Preparation > Third-party systems > Data Quality components > Standardization components > Name standardization components
Design and Development > Third-party systems > Data Quality components > Standardization components > Name standardization components
Last publication date
2024-02-21
You need to select the data columns of interest before matching them using tFirstnameMatch.

Procedure

  1. Click the tFilterColumns component to display its Basic settings view and define the component properties.
  2. Click the [...] button next to Edit schema to open a dialog box.
  3. Select the name and gender columns from the input schema and move them to the output schema.
  4. Click OK to validate your changes and close the dialog box.
  5. Click tFirstnameMatch to display the Basic settings view and define the component properties.
  6. Click the [...] button next to Edit schema to view the input and output schemas, and then click OK to close the dialog box.

    The output schema of this component is the same as the input schema plus one fixed column: FIRSTNAMEMATCH.

  7. From the First Names list, select the column that holds the first names, name in this example.
  8. If required, select the Use Gender or the Use Country check box and, from the list, select the column that contains the gender or country respectively.
    This will optimize system performance and will give more precise results.
  9. If required, select the Fuzzy Search check box if you want to get the first-name best match possible, in case several matches are available.