This use case describes how you can match and cleanse data coming from different sources in order to build master records, using a Merging campaign in Talend Data Stewardship.
Let's suppose that you are facing data quality and anomalies issues in your customer data. You have found duplicates lead information due to lack of synchronization between the different CRMs used in your enterprise. A Merging campaign enables you to solve the duplicates by surviving only the appropriate data.
- How do you identify the match groups which group potentially duplicate records together? This question is resolved through using a Talend Job in the Studio.
- How do you pick the best attribute values from the data sources and presents the most accurate and reliable master records for consumptions by users and systems? This issue is resolved through the Merging campaign in the web application.
To replicate the example and use the exact client data, we assume that:
- An administrator has installed and launched Talend Data Stewardship. For more information, see the Talend Administration Center Installation Guide.
An administrator has created Talend Data Stewardship users and assigned them roles in Talend Administration Center. For further information, see Creating Data Stewardship users.
- A campaign owner has downloaded the input data and the Talend Job used in this example. They
can be used to load tasks in the Merging campaign once it is created.
Retrieve the tds_gettingstarted_source_files.zip file from the Downloads tab in the left panel of this page.