Consistent data masking - 7.3

Version: 7.3
Language: English
Product: Talend Big Data Platform, Talend Data Fabric, Talend Data Management Platform, Talend Data Services Platform, Talend MDM Platform, Talend Real-Time Big Data Platform
Module: Talend Studio
Last publication date: 2024-04-03

When the same value appears more than once in the input data, consistent masking functions output the same masked value for every occurrence within the same Job execution.

However, consistent masking does not guarantee unique outputs: two different input values can be masked with the same output value.

For example, the following diagram shows how the tDataMasking component can mask data consistently:
  • The A value is masked with D, regardless of the number of occurrences in the input dataset.
  • The B and C values are masked with E.
Figure: Example of a consistent masking operation
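
The behavior described above can be illustrated with a minimal Python sketch. It is not Talend's implementation: the function name make_consistent_masker, the keyed-hash approach, and the replacement pool ["D", "E"] are assumptions chosen only to show the two properties. A key generated once per run stands in for "the same Job execution". Because three distinct inputs are mapped onto two replacement values, at least two of them necessarily share a masked value.

```python
import hashlib
import hmac
import secrets


def make_consistent_masker(replacements, key=None):
    """Return a mask(value) function that is consistent for one run.

    Hypothetical helper, not part of any Talend API. The key plays the
    role of a per-execution seed: the same key gives the same mapping.
    """
    if not replacements:
        raise ValueError("replacements must not be empty")
    run_key = key if key is not None else secrets.token_bytes(16)

    def mask(value):
        # A keyed hash of the input selects one replacement deterministically.
        digest = hmac.new(run_key, value.encode("utf-8"), hashlib.sha256).digest()
        index = int.from_bytes(digest[:8], "big") % len(replacements)
        return replacements[index]

    return mask


mask = make_consistent_masker(["D", "E"])
inputs = ["A", "B", "A", "C", "A"]
masked = [mask(value) for value in inputs]

# Every occurrence of "A" receives the same masked value.
print(list(zip(inputs, masked)))
```

Which of the inputs collide depends on the key, so the exact pairing shown in the diagram (A to D, B and C to E) is only one possible outcome.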

Consistent data masking examples

The following table shows examples of generated masked values using the Mask email left part of domain with consistent items function:
Input values       | Extra Parameter           | Examples of masked values
-------------------|---------------------------|--------------------------
newuser@domain.com | "talend,value,newcompany" | newuser@newcompany.com
admin@company.com  | "talend,value,newcompany" | admin@value.com
newuser@domain.com | "talend,value,newcompany" | newuser@newcompany.com
user@company.com   | "talend,value,newcompany" | user@value.com
user@domain.com    | "talend,value,newcompany" | user@newcompany.com
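
The pattern in this table (the part of the address before @ is kept, and each distinct domain name is replaced by one item from the comma-separated list) can be approximated with the following Python sketch. It is an illustration under stated assumptions, not the function's actual algorithm: the name mask_email_domain, the fixed demo key, the keyed-hash selection, and the choice to treat everything before the last dot as the "left part" of the domain are all hypothetical. The specific item chosen for a given domain will generally differ from the values in the table.

```python
import hashlib
import hmac


def mask_email_domain(email, extra_parameter, key=b"demo-key"):
    """Replace the left part of an email domain with a consistent item.

    Hypothetical sketch: `extra_parameter` mirrors the comma-separated
    list from the table, and `key` stands in for a per-execution seed.
    """
    local, at, domain = email.rpartition("@")
    if not at or not local or not domain:
        raise ValueError(f"not an email address: {email!r}")

    items = [item.strip() for item in extra_parameter.split(",") if item.strip()]
    if not items:
        raise ValueError("extra_parameter must list at least one item")

    # Assumption: the "left part" is everything before the last dot.
    left, dot, suffix = domain.rpartition(".")
    if not dot:
        left, suffix = domain, ""

    # The same domain name always selects the same item for a given key.
    digest = hmac.new(key, left.lower().encode("utf-8"), hashlib.sha256).digest()
    replacement = items[int.from_bytes(digest[:8], "big") % len(items)]
    return f"{local}@{replacement}{dot}{suffix}"


params = "talend,value,newcompany"
rows = ["newuser@domain.com", "admin@company.com", "newuser@domain.com",
        "user@company.com", "user@domain.com"]
for row in rows:
    print(row, "->", mask_email_domain(row, params))
```

With three items and many possible domains, two different domains can receive the same replacement, which matches the caveat stated at the top of this page.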