Removing a semantic type - 7.0

Talend Data Stewardship User Guide

Version
7.0
Language
English (United States)
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Data Stewardship
Content
Administration and Monitoring > Managing users
Data Governance > Assigning tasks
Data Governance > Managing campaigns
Data Governance > Managing data models
Data Quality and Preparation > Handling tasks
Installation and Upgrade

You can delete a semantic type in Talend Dictionary Service to remove it from the list of recognized data types in Talend Data Stewardship.

You can delete both predefined semantic types as well as predefined standard types.

The variety of semantic types that are present by default in Talend Data Stewardship can be troublesome in certain situations. For example, a five-digit number can be interpreted as a American ZIP code, but also as a French or German one since they share the same format.

Let's say that you are working in an American company, and you only have to deal with data coming from American clients, including ZIP codes. You would prefer to keep only the American ZIP code in the list of recognized semantic types.

Using Talend Dictionary Service, you will simply remove the other semantic types that match the five-digit format and only leave US Postal Code. The change will then be ported instantly in Talend Data Stewardship, and you will from now on validate a ZIP code column against the semantic type US Postal Code.

Procedure

  1. In the homepage, click SEMANTIC TYPES.
  2. Click the search icon on the top-right corner of the page and enter postal to filter the list of semantic types and display only the Zip code types.
  3. Point to the name of a Zip code semantic type and click the trash can which displays to the right.
  4. Accept to remove the semantic type when prompted and repeat the operation to remove the other semantic types that match five-digit numbers till you are left with only US Postal Code.
    When you delete a semantic type which is already used in a data model attached to a campaign, the attributes in the data model automatically switch to text and the validity of all tasks is recomputed. This means data which could display as invalid with the initial semantic type may look as valid with the text semantic type.

Results

You have deleted all the semantic types compatibles with five-digit numbers but one. From now on, when adding new data models, you can set only US Postal Code as the semantic type for columns with Zip code data.