Writing tasks in a Merging campaign - 6.5

Data Stewardship

Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Talend Data Stewardship
Talend Studio
Data Governance > Third-party systems > Data Stewardship components
Data Quality and Preparation > Third-party systems > Data Stewardship components
Design and Development > Third-party systems > Data Stewardship components

This Job loads tasks into a Merging campaign defined in Talend Data Stewardship according to the criteria you define in the basic settings of the tDataStewardshipTaskOutput component.

The data records in these tasks have duplicates. But once they are in the application, authorized campaign participants can intervene and merge the records.

For more technologies supported by Talend, see Talend components.

This scenario applies only to subscription-based Talend products.

In this Job:

  • The tFileInputDelimited component reads the customer data.

  • The tMatchGroup component compares data using matching and blocking methods and creates groups of similar encountered duplicates.

  • The tMap component maps the group identifier, GID, generated by tMatchGroup to TDS_GID.

    When the input data has a column which holds the names of the data sources, tMap can also map the input column to TDS_SOURCE.

  • The tDataStewardshipTaskOutput component writes the data in the CRM Data Deduplication campaign in Talend Data Stewardship.