What are campaigns?
What are the tasks about (data structure, validation constraints, etc.)?
What do data stewards have to do to resolve the campaign tasks (task type)?
Which data stewards work on the campaign tasks (campaign participants)?
How data stewards collaborate to resolve the campaign tasks (campaign workflow)?
Several campaign types are supported including Arbitration, Resolution, Grouping and Merging. A campaign contains tasks of homogeneous type. The task type defines what the data steward has to do to resolve tasks.
A campaign can be owned by one or several campaign owners.
What are tasks?
A task is the main unit of work for data stewards. A task belongs to a campaign and is assigned to a data steward. It has a lifecycle where it passes through different states according to the workflow defined in the campaign.
When tasks are created, they belong to a campaign and they are initially unresolved and unassigned unless the campaign owner decides to assign them to a specific data steward through the parameters used in the tDataStewardshipTaskOutput component.
Tasks have a lifecycle where they pass through different states according to the workflow defined in the campaign. When the current assignee performs some modification on a task, the task is transitioned to the next step in the workflow till it reaches the resolved state. The task type defines what the data steward has to do to resolve it.
Arbitration tasks: allow authorized data stewards to assign them to a label chosen among a list of arbitration choices defined by the campaign owner. The outcome of an arbitration task is the arbitration choice made by data stewards.
Resolution tasks: allow authorized data stewards to fix one or several fields of the data record. The outcome of a resolution task is the curated record fixed by data stewards.
Merging tasks: allow authorized data stewards to merge several potential duplicate source records into one single record (golden record). The outcome of a merging task is the golden record produced by data stewards.
Source records can come from the same source (database deduplication) or different sources (databases reconciliation).
- Grouping tasks: allow authorized data stewards to validate a relationship between pairs or groups of records. The outcome of a grouping task is the list of records associated to each other.