Defining a data model for the campaign - 7.3

Talend Data Stewardship Getting Started Guide

Version
7.3
Language
English (United States)
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Data Stewardship
Content
Data Governance > Managing campaigns
Data Governance > Managing data models
Data Quality and Preparation > Deduplicating data
Data Quality and Preparation > Handling tasks

Create a data model to determine the structure of the data to be managed in the Reconciling client data campaign. This campaign helps data stewards to merge redundant client data coming from different systems.

Data Stewardship has data model awareness which makes syntactic and semantic validation of data possible. Therefore, a campaign relies on a data model to ensure that the data matches the expected structure and format.

About this task

Define the attributes in the data model and select their types out of a predefined standard or semantic types

Procedure

  1. Log in as a campaign owner.
  2. Select DATA MODELS > ADD DATA MODEL.
  3. Enter a name and a description for the new model in the Name and Description fields respectively. Optional fields are marked as optional next to their names.
  4. In the Attributes section, define the columns you want to have in the data model as the following:
    1. In the IDENTIFIER field, enter the technical identifier for the first column.
    2. Enter a name and a description for the column in the corresponding fields, if needed.
      What you set in the NAME field is the name displayed in the task list. If no name is set, the technical identifier will be displayed.
    3. From the attribute type list, select the type of the column.

      Standard and semantic types are integrated in the application by default.

      • For the standard types, additional fields are displayed according to the type you select. These fields are optional and they enable you to define some constraints on the attribute you define such as defining a minimum and/or maximum length or defining a pattern against which to validate the attribute.

        To make sure the entire value matches your validation pattern, it is best practice to surround the validation pattern with ^ and $.

        Some examples:
        • [A-Z] matches A and ABC.
        • ^[A-Z]$ matches A but does not match ABC.

        For Date and Timestamp columns, you have access to a date and time picker which helps you set the date and time automatically in the right format.

      • For the semantic types, you can use the Talend Dictionary Service to manage the semantic types. However, the availability of this service depends on the license you have.
  5. Toggle the DEFINE LIST OF VALUES option to display fields where you can set specific values for the attribute.
    Any values that are not in this list are marked as invalid in the task list.
  6. Optionally, toggle the ALLOW EMPTY VALUES option to disable the upload of empty fields. This option is enabled by default.
  7. Click ADD ATTRIBUTE and repeat the above steps to create all the columns you need in the data model.