When you add a dataset, the application automatically suggests one of the supported semantic type for each column.
The semantic type corresponds to the category (names, emails, phone numbers, etc) of the data. If the semantic type that has been applied on a column is not the desired one, you have the possibility to manually change it to one of the predefined types, based on your own experience.
Let's take the example of a dataset containing client data, including the job title of
your customers. You can see in the header of the job title column
that the data type has only been recognized as String
. You are going to
change the semantic type of the column so that it more accurately reflects the data.
Procedure
Results
Job
Title
, as you can see in the header of the job title
column.Every time that the semantic type of a column is modified, the dataset quality is calculated again.