tPatternMasking - Cloud - 8.0

Data privacy

Version
Cloud
8.0
Language
English
Product
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Last publication date
2024-03-28

Masks data that follows a specific pattern and, if needed, transforms the original data in a consistent manner.

If you need to mask heterogeneous data, you can use the tDataMasking component. For more information, see tDataMasking.

tPatternMasking replaces pattern-specific and generic data with random characters drawn from a specified range of date and numeric values or from a set of named values. The actual data stays protected, while the substitute data remains functional in situations where showing real sensitive data is not advisable.

The masked data still looks real and consistent and remains usable for purposes such as testing and training. Masking is most commonly needed for data that contains Personally Identifiable Information (PII) or Sensitive Personal Data (SPD).
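To illustrate what masking in a consistent manner can mean, the following Python sketch derives each replacement digit from a keyed hash of the input, so the same input and key always produce the same masked value. This is only an illustration of the idea and not the algorithm used by tPatternMasking: the function name mask_digits_consistently and the key value are hypothetical, and the sketch does not guarantee that two different inputs get different outputs.

```python
import hashlib
import hmac


def mask_digits_consistently(value, key):
    """Replace each digit with a digit derived from HMAC-SHA256(key, value).

    Hypothetical helper for illustration only. The same value and key
    always give the same output; non-digit characters are copied as-is.
    """
    digest = hmac.new(key, value.encode("utf-8"), hashlib.sha256).digest()
    masked = []
    used = 0  # number of digest bytes consumed so far
    for ch in value:
        if ch.isdigit():
            # Map one digest byte to a digit (slightly biased, fine for a sketch).
            masked.append(str(digest[used % len(digest)] % 10))
            used += 1
        else:
            masked.append(ch)
    return "".join(masked)


key = b"demo-key"  # assumed secret, chosen only for this example
first = mask_digits_consistently("615/67/7489", key)
second = mask_digits_consistently("615/67/7489", key)
print(first == second)  # repeated runs give the same masked value
```

Keeping the key secret matters in a sketch like this, because anyone who holds it can recompute the masked value for a guessed input.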

tPatternMasking reads a data set row by row and creates a structurally similar but inauthentic version of the data after performing the data masking operations you defined on data fields:
  • The component identifies spaces, slashes (/), dashes (-) and periods (.) in the input as separators.
  • The component preserves the pattern of the input values in the masked output.
  • The component generates one row for each input row.

For example, the masked output for 615/67/7489 could be 379/48/1789.

When the input data does not match the pattern you defined, tPatternMasking outputs null.
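The behavior described above (separators kept, pattern preserved, one output row per input row, null for non-matching input) can be sketched in a few lines of Python. This is a simplified illustration, not the component's implementation: the function mask_value and the regular expression used as the pattern are assumptions made for this example, and Python's None stands in for null.

```python
import random
import re


def mask_value(value, pattern=r"\d{3}/\d{2}/\d{4}", rng=None):
    """Return a masked copy of value, or None if it does not match pattern.

    Hypothetical helper: the pattern here is a plain regular expression,
    which is an assumption and not how the component's fields are configured.
    """
    rng = rng or random.Random()
    if value is None or re.fullmatch(pattern, value) is None:
        return None  # mirrors the null output for non-matching input
    # Replace every digit with a random digit; copy separators unchanged.
    return "".join(
        str(rng.randrange(10)) if ch.isdigit() else ch
        for ch in value
    )


rows = ["615/67/7489", "not-a-match", "123/45/6789"]
masked = [mask_value(row) for row in rows]  # one output row per input row
print(masked)
```

Because the digits are random, each run prints different masked values, but the slashes stay in place and the second row is always None.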

In local mode, Apache Spark 2.4.0 and later versions are supported.

This component is not shipped with your Talend Studio by default. You need to install it using the Feature Manager. For more information, see Installing features using the Feature Manager.

For more technologies supported by Talend, see Talend components.