Resolving master data conflicts resulting from data matching - 6.5

Talend Data Stewardship Console User Guide

EnrichVersion
6.5
EnrichProdName
Talend Data Fabric
Talend MDM Platform
task
Data Governance
Data Quality and Preparation
EnrichPlatform
Talend MDM Web UI

Talend Data Stewardship Console provides the possibility to resolve data conflicts in a data record resulting from the matching processes performed on data. These matching processes may be carried out using a Match Rule attached to a data model, or using the Talend data quality component tMatchGroup.

The results of the matching processes will point out any data conflicts among the matched elements across heterogeneous sources. Conflicted record values will be listed in the stewardship console in Talend MDM Web User Interface in the form of tasks.

Validity checks are now necessary through the stewardship console in order to improve and enhance data quality. An authorized data steward must manually define and merge data in the listed tasks.

Prerequisite(s): You have logged in to Talend MDM Web User Interface and accessed the stewardship console. At least one data resolution task is assigned to you by the administrator. For further information, see Assigning a task.

To resolve data match problems related to multiple sources of data, do the following:

  1. In the Task Navigator panel, expand Common and then click New to display all new tasks.

    Alternatively, expand Types and click Resolution to display all resolution tasks, whether new or resolved.

    Note

    Your list will show only the tasks that have been assigned to you by the administrator. For further information, see Assigning a task.

  2. Double-click the data resolution task that holds the data record you want to resolve.

    The detail view of the selected task is displayed on the console.

    Note

    You can use the and buttons in the upper right corner of the task detail view in order to open the detail view of the previous or next task in the task list. When you need to leave the task detail view, you can click to go back to the task list.

    In the task detail view, all data items of the selected data record coming from different sources are displayed in a table which may be partially filled thanks to the auto-suggest feature. In order to appropriately use the auto-suggest feature, avoid using the same name for these sources. For further information about the auto-suggest feature, see the table presented later.

    In the table, the rows are the source record(s) together with the golden record, which is shown at the top, and the columns are the data record fields.

    Note

    The Use Auto Suggest button , the Use Common Value button and the Use Trusted Score button are all activated by default. They use the common values or the trusted values, if any, from the SOURCE records to pre-fill the GOLDEN RECORD.

  3. If needed, you can drag-and-drop the rows or columns to the exact position you want to place them in the table.

    To do so, firstly hover the pointer over the left-hand side of a row or column until the pointer turns into two crossing arrows, then drag the row or column to the desired position and hold the pointer until the desired position becomes totally blank, and finally drop the row or column.

    Note

    If you change the positions of rows or columns in a table, this change will be maintained and will also be applied to any other existing tasks with the same rows (source records) and columns (fields).

    In this example, the golden record comes from five different sources. The result of the match and survivorship process is represented by the golden record and reveals that the values contained in the record differ from one source to the other.

    Note

    Different from matching scores that represent the distance between a record and a master record, a trusted score indicates how trustworthy a source value is: it gets one point each time when the steward selects it as the target value.

  4. Click the Underline Equal tab on top of the page to underline all similar values in the different source rows.

  5. Now you need to manually coordinate the data for the master data record: click the value in the source that you want to use to create the master record.

    The selected value from the source record row is displayed in the corresponding field of the golden record row.

    Note

    If any of the values in the source record row has a red mark, place your pointer on this mark to display its attached information in this task. Such information can help you in resolving the data record. For further information about adding this information to the task, see the tStewardshipTaskOutput section in the Talend Components Reference Guide.

  6. To write all the values from a SOURCE row to the GOLDEN RECORD row, click in the corresponding source record row.

  7. If required, enter a completely new value in the corresponding field of the GOLDEN RECORD row and consider it as the correct value for the master data record.

  8. Click the Highlight Selected tab to highlight all values in the source rows that you have selected for the GOLDEN RECORD row.

  9. Click the Show Scores tab to display the overall score of the source record (next to the source name) and the matching weights (next to individual values).