Adding a Merging campaign to deduplicate records - 6.4

Talend Data Stewardship Examples

author
Talend Documentation Team
EnrichVersion
6.4
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
task
Administration and Monitoring > Managing users
Data Governance > Assigning tasks
Data Governance > Managing campaigns
Data Governance > Managing data models
Data Quality and Preparation > Handling tasks
EnrichPlatform
Talend Data Stewardship

A Merging campaign enables data stewards to merge several potential duplicate data records into one single master record. Source records can come from the same source (data deduplication) or different sources (data reconciliation).

As a campaign owner, you need to create the campaign to determine the structure of the data to be managed, the actions to be taken on data and which data stewards to work on what tasks.

One common use case of data deduplication is same customers appear as separate records in your CRM system. You would like here to match records in order to identify duplicates. A MERGING campaign enables you to decide what fields to use to determine a match and merge the records. Once data is deduplicated, a Talend Job can be used to reupload the cleansed data to CRM.

For a real world use case about data reconciliation, see Adding a Merging campaign to reconciliate data.

Before you begin

  • An administrator has created Talend Data Stewardship users and assigned them roles in Talend Administration Center. For further information, see Creating Data Stewardship users.

  • You have been assigned a campaign owner role in Talend Administration Center.

  • You have defined a data model for the campaign in Talend Data Stewardship.

  • You have accessed Talend Data Stewardship as a campaign owner.