You can edit an existing semantic type in Talend Dictionary Service to impact how your data is validated in Talend Data Preparation.
Predefined semantic types in Talend Data Preparation are based on standard values, but you may need to tailor them to match your own data. Some data that you would expect to fall under a predefined category, may be considered invalid.
Let's take the example of a dataset containing a list of customers, with their email addresses, date of birth, and the country they live in. You can notice that all the entries for America are considered invalid. While it is indeed not a valid country name, it is the value that your company is using and you would like to make it a valid value.
The problem here is that America
is not one of the expected value for the country
semantic type in Talend Data Preparation. The valid
entry in this case would be United States or
United States of America.
To avoid having this problem in the future, you will update the country
semantic type in Talend Dictionary Service, and add America to the list of valid entries. The change will be
automatically available in Talend Data Preparation.
Procedure
Results
The country
semantic type has been manually updated to support a new
value.
From now on, when dealing with data that are matched with the country
semantic type, America will be considered a valid value.