Finding and hiding sensitive data automatically with a semantic type - 7.3

Talend Data Catalog Getting Started Guide

Version
7.3
Language
English
Product
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Data Catalog
Content
Data Governance
Last publication date
2021-04-15
Apply the PII semantic type to the email_address column in the anonymized_orders table.

Before you begin

  • You have been assigned a Data Manager security role.

  • An administrator has configured a semantic type to hide data.

About this task

By default, the PII semantic type exists in the Talend Data Catalog dictionary and is configured to hide data.
When applying this semantic type, you identify a data asset as sensitive and its data is hidden automatically from unauthorized users.

Procedure

  1. In the search box, type in email columns.
    The email_address columns from the ACME customer hub database display in the search results.
  2. On the toolbar, click Columns and select the Grid view.
  3. Drag and drop the Semantic Types column from Available columns to Selected columns to display it.
    PII tags are already applied to the email_address columns in the orders_us, orders_eu and customers tables.
    Talend Data Catalog suggests the PII semantic type for email_address in the anonymized_orders table.
  4. Click the tick icon to approve the semantic type.
  5. Open the details page of the anonymized_orders table.
  6. Click the Data Sample tab.
    The email_address column header is highlighted in yellow and its data is grayed out.
    When previewing sample data from the anonymized_orders table without the Data Manager security role, data values of PII are replaced with a string of # number signs.

Results

You are ready to tag the table to indicate that it is compliant with the data protection rules.