Defining a data model for the Merging campaign - Cloud

Talend Cloud Data Stewardship Examples

Talend Documentation Team
Talend Cloud
Data Governance > Assigning tasks
Data Governance > Managing campaigns
Data Governance > Managing data models
Data Quality and Preparation > Handling tasks
Talend Data Stewardship

Create a data model to determine the structure of the data to be managed in the CRM data deduplication campaign which you create to allow data stewards to merge duplicate customer data stored in the enterprise CRM.

Talend Cloud Data Stewardship has data model awareness which makes possible the syntactic and semantic validation of data. You can define the attributes in the data model and select their types out of a predefined standard or semantic types.


  2. Enter a name and a description for the new model.
  3. In the Attributes section, define the columns you want to have in the data model as the following:
    1. In the IDENTIFIER field, enter the technical identifier for the first column.
    2. Enter a name and a description for the column in the corresponding fields, if needed.
      What you set in the NAME field is the name displayed in the task list. If no name is set, the technical identifier will be displayed.
    3. From the attribute type list, select the type of the column.

      Standard and semantic types are integrated in the application by default.

      • For the standard types, additional fields are displayed according to the type you select. These fields are optional and they enable you to define some constraints on the attribute you define such as defining a minimum and/or maximum length or defining a pattern against which to validate the attribute.

        For Date and Timestamp columns, you have access to a date and time picker which helps you set the date and time automatically in the right format.

      • For the semantic types, you can use the Talend Dictionary Service to manage the semantic types. However, the availability of this service depends on the license you have.
  4. Optionally, toggle the ALLOW EMPTY VALUES option to disable the upload of empty fields. This option is enabled by default.
  5. Click ADD ATTRIBUTE in the left panel and repeat the above steps to create all the columns you need in the data model.


    The columns defined for the CRM Data Deduplication campaign include information about the customers, their addresses, email addresses, occupation and the company in which they work as shown in the capture.

  6. To display email address as hyperlinks in the task list, set the semantic type for the Email column to MailTo url.
    This enables you to click the email address directly from the task list to open a new window where you can send an email to the recipient.