Bijective data masking - 7.3

Data privacy

Version
7.3
Language
English
Product
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Data Quality components > Data privacy components
Data Quality and Preparation > Third-party systems > Data Quality components > Data privacy components
Design and Development > Third-party systems > Data Quality components > Data privacy components
Last publication date
2024-04-03
Bijective masking functions have the following characteristics:
  • They are consistent masking functions.
  • They are injective, meaning that they output two different masked values for two different input values.
  • They check that the input data is in a valid format. If the input value is valid, bijective masking functions output a valid value. If the input value is not valid, they output an invalid value or replace values with null, depending of the masking function used.
For example, the following diagram shows an example of how the tDataMasking component can mask data bijectively:
  • The A value is masked with D, regardless of the number of occurrences in the input dataset.
  • The B value is masked with E.
  • The C value is masked with F.
Example of a bijective data masking operation

Bijective data masking examples

The following table shows examples of generated masked values using the Mask French SSN number function:
Input values Example of masked values
190049418437621 289052428331901
271083561478941 234112758889352
190049418437621 289052428331901
117029 null