Managing data classification - Cloud

Talend Cloud Data Catalog User Guide

Version
Cloud
Language
English
Product
Talend Cloud
Module
Talend Data Catalog
Content
Data Governance
Last publication date
2023-11-13

Once you have data classes defined, you can apply them to harvested data elements:
  • Manually: You apply the data-detected classes from the object pages or multiple at the same time using a worksheet.
  • Automatically: You invoke a data classification process where data classes are proposed based on the patterns and metadata queries defined for the different data classes. The automatic data classification process can be initiated during the model harvesting or using the Classify Data feature.

    After there are data classes proposed, you can approve or remove the suggestions.

You have been assigned an object role with the Data Classification Editing capability.

Data Class proposal and approval process

  • For data-detected data classes, when the confidence level is higher than the matching threshold specified for that data class, Talend Cloud Data Catalog proposes to classify the harvested object with the data-detected data class.

  • For metadata-detected data classes, when the associated MQL query produces the harvested object as a match, Talend Cloud Data Catalog proposes to classify the harvested object with the metadata-detected data class.
  • For compound data classes, when either of the two above conditions applies to any of the contained data classes, Talend Cloud Data Catalog proposes to classify the harvested object with the compound data class.

When the data classification is done automatically, you have to approve or reject the suggestions made by Talend Cloud Data Catalog. The proposed data classes appear in the Data Classifications area in the Overview tab of the object page.

  • When approving the assignment, Talend Cloud Data Catalog creates the "classifies" relationship between the data class and the imported object. It creates the same relationship when you assign a data class to an imported object manually.
  • When rejecting the assignment, Talend Cloud Data Catalog will remember this action and will never assign this data class to the object in the future automatic data classification of that object.

Editing the data classifications manually from each object page

  • Click the tick to approve the suggestion.

    When approving the assignment, the state of that data class assignment changes to approved.

  • Click the cross to reject the suggestion.

    When rejecting the assignment, Talend Cloud Data Catalog will remember this action and will never assign this data class to the object in the future automatic data classification of that object.

    To revert a rejection, you need to re-assign manually the data class.

  • Remove the data class from the list to remove the assignment without rejecting it.

    When removing a suggestion, Talend Cloud Data Catalog will not remember this action and will assign the data class to the object in the future automatic data classification of that object.

Editing the data classifications manually in a worksheet

You must include the Data Classifications column in the worksheet.

You can also add the Data Classifications Approved, Data Classifications Matched and Data Classifications Rejected columns.