The data classification operation uses the data pattern to match data classes to
imported objects based on the matching criteria.
When you approve or reject a learning data class, Talend Data Catalog absorbs
the information and improve its understanding of the data pattern.
Procedure
-
Open the object page you want to use as a basis to learn from.
-
Assign the data class manually to that object.
-
Go to to open the properties of the learning data class.
If you see numbers in blue next to the values in the Data
Pattern area, it means that they have learned.
The numbers in blue next to the values are the percentage of instances of the data
which matched that particular value, with a minimum of 10%.
Talend Data Catalog picks up all the possible values or patterns that fit the
percentage specified in the Matching threshold
field.
The data patterns which have the higher values in blue next to them are likely to
be more accurate. You can adjust the list of possible values or patterns.
-
Clear the Auto Learning check box to disable the
option.
-
Adjust the list of data patterns by removing the less accurate patterns.
-
Save your changes.
Results
When you have a good set of patterns, you can invoke data
classification on other objects to automatically associate the data class with these
objects.