Configuring the unmasking operations - 7.3

Data privacy

Version
7.3
Language
English (United States)
Product
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Data Quality components > Data privacy components
Data Quality and Preparation > Third-party systems > Data Quality components > Data privacy components
Design and Development > Third-party systems > Data Quality components > Data privacy components

Configure one unmasking operation for each part of the input phone numbers. Separators will be left unchanged in the unmasked values.

In the Modifications table, the settings must be the same as the ones used for the masking operations performed by the tPatternMasking component.

About this task

The masked Australian phone numbers use the XX XXXX XXXX format:
  • A two-digit area code
  • A space used as a separator
  • A first four-digit line number
  • A space used as a separator
  • A second four-digit line number

Procedure

  1. Double-click tPatternUnmasking to display its Basic settings view in the Component tab.
  2. If required, click Sync columns to retrieve the schema defined in the input component.
  3. Click the Edit schema button to open the schema dialog box.

    tPatternUnmasking adds a read-only column to the output schema.

    The ORIGINAL_MARK column labels output records:

    • Original records are labeled with the true label.
    • Substitute records are labeled with the false label.
  4. In the Modifications table, click the [+] button to add three rows.
    Each row corresponds to an unmasking operation for a part of the input phone numbers.
  5. In the Modifications table, edit the first row to configure the unmasking operation for prefixes:
    1. From the Column to unmask field, select the column which holds the data to be unmasked.
      In this example, select PhoneNumber.
    2. From the Field type field, select Enumeration as the field type the data belongs to and enter "02,03,07,08" in the Values field.
  6. In the Modifications table, edit the second row to unmask the first four-digit line numbers:
    1. From the Column to unmask field, select the column which holds the data to be unmasked.
      In this example, select PhoneNumber.
    2. From the Field type field, select Interval as the field type the data belongs to and enter "2000,9999" in the Range field.
  7. In the Modifications table, configure the third row to unmask the second four-digit line numbers:
    1. From the Column to unmask field, select the column which holds the data to be unmasked.
      In this example, select PhoneNumber.
    2. From the Field type field, select Interval as the field type the data belongs to and enter "0000,99999" in the Range field.
  8. Click the Advanced settings tab and select the Output the original row? check box.
    The Job will output original and substitute records.
  9. From the Method list, select the method used when the data was masked using the tPatternMasking component.

    Example

    In this example, select FF1 with AES.

    When you use a FF1 method, the number of possible values that the component can generate from the input pattern must be greater than or equal to 1,000,000.

  10. In the Password for FF1 methods, enter the user-defined password used when the data was masked using the tPatternMasking component.

    Example

    In this example, enter "talend".