Managing data classification - 8.0

Talend Data Catalog Administration Guide

Version: 8.0
Language: English
Product: Talend Big Data Platform, Talend Data Fabric, Talend Data Management Platform, Talend Data Services Platform, Talend MDM Platform, Talend Real-Time Big Data Platform
Module: Talend Data Catalog
Content: Administration and Monitoring, Data Governance
Last publication date: 2023-09-26

Once you have data classes defined, you can apply them to harvested data elements:
  • Manually: You apply the data-detected classes from individual object pages, or to multiple objects at the same time using a worksheet.
  • Automatically: You invoke a data classification process where data classes are proposed based on the patterns and metadata queries defined for the different data classes. The automatic data classification process can be initiated during the model harvesting or using the Classify Data feature.

    Once data classes have been proposed, you can approve, reject, or remove the suggestions.

Prerequisite: you have been assigned an object role with the Data Classification Editing capability.

Data Class proposal and approval process

  • For data-detected data classes, when the confidence level is higher than the matching threshold specified for that data class, Talend Data Catalog proposes to classify the harvested object with the data-detected data class.

  • For metadata-detected data classes, when the associated MQL query produces the harvested object as a match, Talend Data Catalog proposes to classify the harvested object with the metadata-detected data class.
  • For compound data classes, when either of the two above conditions applies to any of the contained data classes, Talend Data Catalog proposes to classify the harvested object with the compound data class.
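
The three proposal rules above can be read as a small decision function. The following is only a sketch of that logic, not Talend Data Catalog code: the `DataClass` and `HarvestedObject` types, the `confidences` dictionary, and the Python callable standing in for an MQL query are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class HarvestedObject:
    """Hypothetical stand-in for a harvested data element (e.g. a column)."""
    name: str
    # Assumed: confidence level per data-detected class name, in [0, 1].
    confidences: Dict[str, float] = field(default_factory=dict)


@dataclass
class DataClass:
    """Hypothetical model of a data class; 'kind' selects the rule to apply."""
    name: str
    kind: str  # "data", "metadata", or "compound"
    threshold: float = 0.0  # matching threshold (data-detected only)
    # A plain callable stands in for the MQL query (metadata-detected only).
    query: Optional[Callable[[HarvestedObject], bool]] = None
    members: List["DataClass"] = field(default_factory=list)  # compound only


def is_proposed(data_class: DataClass, obj: HarvestedObject) -> bool:
    """Return True when the data class would be proposed for the object."""
    if data_class.kind == "data":
        # Proposed when the confidence is strictly higher than the threshold.
        return obj.confidences.get(data_class.name, 0.0) > data_class.threshold
    if data_class.kind == "metadata":
        # Proposed when the query returns the object as a match.
        return data_class.query is not None and data_class.query(obj)
    if data_class.kind == "compound":
        # Proposed when any contained data class would be proposed.
        return any(is_proposed(member, obj) for member in data_class.members)
    raise ValueError(f"unknown data class kind: {data_class.kind!r}")


# Illustrative data only.
email_by_data = DataClass("Email", "data", threshold=0.8)
email_by_name = DataClass(
    "Email column name", "metadata",
    query=lambda o: "email" in o.name.lower(),
)
contact_info = DataClass(
    "Contact info", "compound", members=[email_by_data, email_by_name]
)
column = HarvestedObject("CUST_EMAIL", confidences={"Email": 0.65})
```

In this sketch, `column` falls below the data-detected threshold but matches the name-based query, so the compound class `contact_info` is still proposed through its metadata-detected member.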

When the data classification is done automatically, you have to approve or reject the suggestions made by Talend Data Catalog. The proposed data classes appear in the Data Classifications area in the Overview tab of the object page.

  • When you approve the assignment, Talend Data Catalog creates the "classifies" relationship between the data class and the imported object. It creates the same relationship when you assign a data class to an imported object manually.
  • When you reject the assignment, Talend Data Catalog remembers this action and never proposes this data class for the object in future automatic data classifications of that object.

Editing the data classifications manually from each object page

  • Click the tick to approve the suggestion.

    When you approve the assignment, the state of that data class assignment changes to approved.

  • Click the cross to reject the suggestion.

    When you reject the assignment, Talend Data Catalog remembers this action and never proposes this data class for the object in future automatic data classifications of that object.

    To revert a rejection, you need to manually re-assign the data class.

  • Remove the data class from the list to remove the assignment without rejecting it.

    When you remove a suggestion, Talend Data Catalog does not remember this action and can propose the data class for the object again in future automatic data classifications of that object.
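
The difference between approving, rejecting, and removing a suggestion can be sketched as a small state model. This is an illustration only, under the assumption that each assignment is in one of three states (matched, approved, rejected, echoing the worksheet column names); the class and method names are hypothetical and do not correspond to a Talend Data Catalog API.

```python
from enum import Enum
from typing import Dict, Iterable


class State(Enum):
    MATCHED = "matched"    # proposed by automatic classification
    APPROVED = "approved"  # approved suggestion or manual assignment
    REJECTED = "rejected"  # remembered rejection


class ObjectClassifications:
    """Hypothetical per-object record of data class assignments."""

    def __init__(self) -> None:
        self.states: Dict[str, State] = {}

    def auto_classify(self, matching_classes: Iterable[str]) -> None:
        """Propose every matching class that has no recorded state yet."""
        for name in matching_classes:
            # A remembered rejection (or an existing approval) is left alone.
            if name not in self.states:
                self.states[name] = State.MATCHED

    def approve(self, name: str) -> None:
        self.states[name] = State.APPROVED

    def reject(self, name: str) -> None:
        # The rejection is kept, so later runs will not propose it again.
        self.states[name] = State.REJECTED

    def remove(self, name: str) -> None:
        # Nothing is remembered, so a later run may propose it again.
        self.states.pop(name, None)

    def assign_manually(self, name: str) -> None:
        # Manual assignment also reverts an earlier rejection.
        self.states[name] = State.APPROVED


# Illustrative walk-through.
column = ObjectClassifications()
column.auto_classify(["Email", "Phone"])
column.reject("Email")   # remembered
column.remove("Phone")   # forgotten
column.auto_classify(["Email", "Phone"])
```

After the second run in this sketch, "Email" stays rejected while "Phone" is proposed again; calling `assign_manually("Email")` is the only way to bring the rejected class back.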

Editing the data classifications manually in a worksheet

You must include the Data Classifications column in the worksheet.

You can also add the Data Classifications Approved, Data Classifications Matched and Data Classifications Rejected columns.