Campaigns and tasks - 7.0

Talend Data Stewardship User Guide

Version
7.0
Language
English (United States)
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Data Stewardship
Content
Administration and Monitoring > Managing users
Data Governance > Assigning tasks
Data Governance > Managing campaigns
Data Governance > Managing data models
Data Quality and Preparation > Handling tasks
Installation and Upgrade
The main units of work in Talend Data Stewardship are campaigns and tasks.

What are campaigns?

A campaign is the main unit of work for campaign owners. It contains all the required configuration assets that are determined by the campaign owner:
  • What are the tasks about (data structure, validation constraints, etc.)?

  • What do data stewards have to do to resolve the campaign tasks (task type)?

  • Which data stewards work on the campaign tasks (campaign participants)?

  • How data stewards collaborate to resolve the campaign tasks (campaign workflow)?

Several campaign types are supported including Arbitration, Resolution, Grouping and Merging. A campaign contains tasks of homogeneous type. The task type defines what the data steward has to do to resolve tasks.

A campaign can be owned by one or several campaign owners.

What are tasks?

A task is the main unit of work for data stewards. A task belongs to a campaign and is assigned to a data steward. It has a lifecycle where it passes through different states according to the workflow defined in the campaign.

When tasks are created, they belong to a campaign and they are initially unresolved and unassigned unless the campaign owner decides to assign them to a specific data steward through the parameters used in the tDataStewardshipTaskOutput component.

Tasks have a lifecycle where they pass through different states according to the workflow defined in the campaign. When the current assignee performs some modification on a task, the task is transitioned to the next step in the workflow till it reaches the resolved state. The task type defines what the data steward has to do to resolve it.

Talend Data Stewardship supports several task types including:
  • Arbitration tasks: allow authorized data stewards to assign them to a label chosen among a list of arbitration choices defined by the campaign owner. The outcome of an arbitration task is the arbitration choice made by data stewards.

  • Resolution tasks: allow authorized data stewards to fix one or several fields of the data record. The outcome of a resolution task is the curated record fixed by data stewards.

  • Merging tasks: allow authorized data stewards to merge several potential duplicate source records into one single record (golden record). The outcome of a merging task is the golden record produced by data stewards.

    Source records can come from the same source (database deduplication) or different sources (databases reconciliation).

  • Grouping tasks: allow authorized data stewards to validate a relationship between pairs or groups of records. The outcome of a grouping task is the list of records associated to each other.