Create a data model to determine the structure of the data to be
managed in the Reconciling client data campaign.
This campaign helps data stewards to merge redundant client data coming from different
systems.
Data Stewardship has data model awareness,
which makes syntactic and semantic validation of data possible. Therefore, a
campaign relies on a data model to ensure that the data matches the expected
structure and format.
About this task
Define the attributes in the data
model and select their types from the predefined standard or semantic types.
Procedure
-
Log in as a campaign owner.
-
Select .
-
Enter a name and a description for the new model in the
Name and Description
fields respectively. Required fields are marked with * next to
their names.
-
In the Attributes
section, define the columns you want to have in the data model as
follows:
-
In the Identifier field, enter the technical identifier for the first
column.
-
Enter a name and a description for the column in the
corresponding fields, if needed.
What you set in
the Name field is the name displayed in the task list. If no name
is set, the technical identifier will be displayed.
-
From the attribute type list, select the type of the
column.
Standard and semantic types are integrated in the application by
default.
- For the standard types, additional fields are
displayed according to the type you select. These fields are
optional and let you set constraints on the attribute,
such as a minimum and/or maximum length, or a pattern
against which to validate the attribute.
To make sure the entire value matches your
validation pattern, it is best practice to surround the
validation pattern with ^ and $.
Some examples:
-
[A-Z] matches A and ABC.
-
^[A-Z]$ matches A but does not match ABC.
For Date and Timestamp columns, you have
access to a date and time picker which helps you set the
date and time automatically in the right format.
- For the semantic types, you can use the Talend Dictionary Service to manage the semantic types. However, the
availability of this service depends on the license you
have.
-
Toggle the Define list of values option to display fields
where you can set specific values for the attribute.
Any values that are not in this list are marked as invalid in the task
list.
-
Optionally, turn off the Allow empty
values option to prevent the upload of empty fields. This option is
enabled by default.
-
Click Add attribute
and repeat the above steps to create all the columns you need in the data
model.
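The effect of anchoring a validation pattern with ^ and $, described in the attribute type step, can be checked outside the application. The following is a minimal sketch in Python, assuming the pattern is applied with search semantics (a match may start anywhere in the value), which is what the examples in that step imply. The helper name matches is hypothetical, and the regular expression engine used by the application may differ in details from Python's re module.

```python
import re


def matches(pattern: str, value: str) -> bool:
    """Return True if the pattern is found anywhere in the value.

    Hypothetical helper for illustration only; it is not part of the
    application. It uses search semantics, so an unanchored pattern
    succeeds as soon as any part of the value matches.
    """
    return re.search(pattern, value) is not None


# Unanchored: one uppercase letter anywhere in the value is enough.
print(matches(r"[A-Z]", "A"))      # True
print(matches(r"[A-Z]", "ABC"))    # True

# Anchored: the whole value must be exactly one uppercase letter.
print(matches(r"^[A-Z]$", "A"))    # True
print(matches(r"^[A-Z]$", "ABC"))  # False
```

With the anchors in place, a value such as ABC is rejected because the pattern has to cover the value from its first to its last character.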
Tip: In the
Rules tab, you can apply data quality
rules. A data quality rule is a set of business requirements which helps you
detect anomalies in campaigns. It defines the values your data must comply with. A
condition can be added to make the data quality rule apply to some data only. For
more information, see
Working with data quality rules.
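To make the attribute options from the procedure concrete, the following sketch models one attribute and the constraints it can carry: minimum and maximum length, validation pattern, list of values, and the Allow empty values option. It is an illustration only, not the application's API. The class Attribute, its field names, and the sample attributes are all hypothetical, and treating an empty value as valid whenever empty values are allowed is an assumption.

```python
import re
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Attribute:
    """Hypothetical stand-in for one column of a data model."""
    identifier: str                                 # technical identifier
    name: Optional[str] = None                      # optional display name
    min_length: Optional[int] = None
    max_length: Optional[int] = None
    pattern: Optional[str] = None                   # validation pattern
    allowed_values: Optional[Sequence[str]] = None  # list of values
    allow_empty: bool = True                        # enabled by default

    @property
    def display_name(self) -> str:
        # The name is shown if set, otherwise the technical identifier.
        return self.name or self.identifier

    def is_valid(self, value: str) -> bool:
        if value == "":
            # Assumption: an empty value passes only if empties are allowed.
            return self.allow_empty
        if self.min_length is not None and len(value) < self.min_length:
            return False
        if self.max_length is not None and len(value) > self.max_length:
            return False
        if self.pattern is not None and re.search(self.pattern, value) is None:
            return False
        if self.allowed_values is not None and value not in self.allowed_values:
            return False
        return True


# Sample attributes, invented for this sketch.
country = Attribute("country_code", name="Country",
                    pattern=r"^[A-Z]{2}$", allowed_values=("FR", "US", "DE"))
client_id = Attribute("client_id", min_length=3, allow_empty=False)

print(country.display_name, client_id.display_name)
print(country.is_valid("FR"), country.is_valid("IT"), client_id.is_valid(""))
```

In this sketch, IT is rejected for country because it is not in the list of values, and the empty string is rejected for client_id because empty values are turned off for that attribute.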