Configuring the masking operations - 7.3

Data privacy

Version
7.3
Language
English
Product
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Data Quality components > Data privacy components
Data Quality and Preparation > Third-party systems > Data Quality components > Data privacy components
Design and Development > Third-party systems > Data Quality components > Data privacy components
Last publication date
2024-03-28
Configure one masking operation for each part of the input phone numbers. Separators will be left unchanged in the masked values.

About this task

In this Job, the fictitious Australian phone numbers use the XX XXXX XXXX format:
  • A two-digit area code
  • A space used as a separator
  • A first four-digit line number
  • A space used as a separator
  • A second four-digit line number

Procedure

  1. Double-click tPatternMasking to display its Basic settings view in the Component tab.
  2. If required, click Sync columns to retrieve the schema defined in the input component.
  3. Click the Edit schema button to open the schema dialog box.

    tPatternMasking adds a read-only column to the output schema.

    The ORIGINAL_MARK column labels output records:

    • Original records are labeled with the true label.
    • Substitute records are labeled with the false label.
  4. In the Modifications table, click the [+] button to add three rows.
    Each row corresponds to a masking operation for a part of the input phone numbers.
  5. In the Modifications table, configure the first row:
    1. From the Column to mask field, select the column which holds the data to be masked.
      In this example, select PhoneNumber.
    2. From the Field type field, select Enumeration as the field type the data belongs to and enter "02,03,07,08" in the Values field.
      Two-digit area code will be masked with one of those values.
  6. In the Modifications table, configure the second row:
    1. From the Column to mask field, select the column which holds the data to be masked.
      In this example, select PhoneNumber.
    2. From the Field type field, select Interval as the field type the data belongs to and enter "2000,9999" in the Range field.
      Four-digit line numbers will be masked with one of those values.
  7. In the Modifications table, configure the third row:
    1. From the Column to mask field, select the column which holds the data to be masked.
      In this example, select PhoneNumber.
    2. From the Field type field, select Interval as the field type the data belongs to and enter "0000,9999" in the Range field.
      Four-digit line numbers will be masked with a value within the defined range.
  8. Click the Advanced settings tab and select the Output the original row? check box.
    The Job will output original and substitute records.
  9. From the Method list, select FF1 with AES to generate encrypted masked values.

    When you use a FF1 method, the number of possible values that the component can generate from the input pattern must be greater than or equal to 1,000,000.

  10. In the Password for FF1 methods, enter a password to generate unique masked values.
    This password is also necessary to unmask data.