You can edit an existing semantic type in Talend Dictionary Service to impact how your data is validated in Talend Data Stewardship.
Predefined semantic types in Talend Data Stewardship are based on standard values, but you may need to tailor them to match your own data. Some data that you would expect to fall under a predefined category may be considered invalid.
Take the example of a dataset containing a list of customers, with their email addresses, dates of birth, and countries of residence. In this dataset, all the United States of America entries are considered invalid, even though this is the official name of the country.
The problem here is that United States of America is not one of the expected values for the Country semantic type in Talend Data Stewardship. The valid entry in this case would be United States.
To avoid this problem in the future, you need to update the Country semantic type in Talend Dictionary Service and add United States of America to the list of valid entries. The change will be automatically available in Talend Data Stewardship.
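To illustrate why the entry is rejected and what adding it changes, here is a minimal sketch of dictionary-based validation. It is not the Talend implementation: the `country_values` set, the `is_valid` helper, and the exact-match rule are all assumptions made for illustration only.

```python
# Hypothetical stand-in for the list of valid entries of a dictionary-based
# semantic type. Only a small illustrative subset of values is shown.
country_values = {"United States", "France", "Germany"}


def is_valid(value, valid_values):
    """Return True if the trimmed value exactly matches a dictionary entry.

    Exact matching is an assumption of this sketch; the real matching rules
    in the product may differ.
    """
    return value.strip() in valid_values


records = ["United States", "United States of America", "France"]

# Before the update, the official country name is not in the dictionary.
before = [is_valid(r, country_values) for r in records]

# Adding the value to the dictionary makes the same records valid.
country_values.add("United States of America")
after = [is_valid(r, country_values) for r in records]

print(before)
print(after)
```

In this sketch, the second record is invalid before the update and valid afterwards, which mirrors what happens to the customer entries once the Country semantic type is edited.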
- On the homepage, click SEMANTIC TYPES.
- Click the search icon in the top-right corner of the page and enter country to filter the list of semantic types.
- Click Country in the list.
- Click the icon next to Values and enter United States of America in the field that appears.
- Click to add the new value to the top of the list of valid entries for the Country semantic type.
- Click SAVE AND PUBLISH to send the semantic type to the Talend Dictionary Service server and make it available to be used by the system.
Clicking SAVE AS DRAFT stores the new type on the server without propagating it to the system. The new type is not usable until it is published. As a use case for this option, suppose that you have new semantic types to deploy as part of a new project. You can prepare the work by creating the semantic types and saving them as drafts before the project goes live, then publish them only on the day of go-live.
- Go back to Talend Data Stewardship and refresh the task list containing the customers' countries, or reopen it.
The Country semantic type has been manually updated to support the new value.
From now on, when working with data that is matched with the Country semantic type, United States of America will be considered a valid value.
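The difference between SAVE AS DRAFT and SAVE AND PUBLISH described in the procedure can be pictured with a small model. This is a hedged sketch only: the `SemanticTypeStore` class and its methods are hypothetical names, and it assumes that validation only ever reads the published copy of a semantic type.

```python
from dataclasses import dataclass, field


@dataclass
class SemanticTypeStore:
    """Hypothetical model of one semantic type held on a server.

    The draft copy is stored but not used; the published copy is the one
    that validation reads. This split is an assumption of the sketch.
    """

    published: set = field(default_factory=set)
    draft: set = field(default_factory=set)

    def save_as_draft(self, values):
        # Store the edited values without exposing them to validation.
        self.draft = set(values)

    def save_and_publish(self, values):
        # Store the edited values and make them the active definition.
        self.draft = set(values)
        self.published = set(values)

    def is_valid(self, value):
        # Validation only consults the published values.
        return value in self.published


store = SemanticTypeStore(published={"United States"})
edited = store.published | {"United States of America"}

store.save_as_draft(edited)
draft_result = store.is_valid("United States of America")

store.save_and_publish(edited)
published_result = store.is_valid("United States of America")

print(draft_result, published_result)
```

In this model, saving as a draft leaves validation unchanged, and the new value only becomes valid after publishing, which matches the go-live scenario described above.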