Data Masking

Talend Data Preparation User Guide

author
Talend Documentation Team
EnrichVersion
6.4
2.1
EnrichProdName
Talend MDM Platform
Talend Real-Time Big Data Platform
Talend Data Services Platform
Talend Big Data
Talend Data Management Platform
Talend Data Fabric
Talend ESB
Talend Data Integration
Talend Big Data Platform
task
Data Quality and Preparation > Cleansing data
EnrichPlatform
Talend Data Preparation

Depending on the semantic type of the column on which you use the Mask data (obfuscation) function, the effect will vary.

The table below describes the effects of the Mask data (obfuscation) function on the different semantic types that support data masking.

Semantic type

Data masking effect

ADDRESS_LINE

The street number is replaced by a randomly generated number and the other characters are replaced by X. However, the following key words are not transformed:

Rue, rue, r., strasse, Strasse, Street, street, St., St, Strae, Strada, Rua, Calle, Ave., avenue, Av., Allée, allée, alle, Avenue, Avenida, Bvd., Bd., Boulevard, boulevard, Blv., Viale, Avenida, Bulevar, Route, route, road, Road, Rd., Chemin, Way, Cour, Court, Ct., Place, place, Pl., Square, Impasse, Alle, Driveway, Auahrt, Viale, Esplanade, Esplanade, Promenade, Lungomare, Esplanada, Esplanada, Faubourg, faubourg, Suburb, Vorort, Periferia, Subrbio, Suburbio, Via, Via, industrial, area, zone, industrielle, Périphérique, Peripheral, Voie, voie, Track, Gleis, Carreggiata, Caminho, Pista, Forum, STREET, RUE, ST., AVENUE, BOULEVARD, BLV., BD, ROAD, ROUTE, RD., RTE, WAY, CHEMIN, COURT, CT., SQUARE, DRIVEWAY, ALLEE, DR., ESPLANADE, SUBURB, BANLIEUE, VIA, PERIPHERAL, PERIPHERIQUE, TRACK, VOIE, FORUM, INDUSTRIAL, AREA, ZONE, INDUSTRIELLE.

CITY

Replaces each character with a random one.

COMPANY

Generates a random but existing company name.

DECIMAL

Replaces each number with a random one.

EMAIL

Replaces everything before the @ character with X, and leaves the rest untransformed.

FIRST_NAME

Generates a random first name.

LAST_NAME

Generates a random last name.

FULL_NAME

Generates a random first name and last name.

FR_COMMUNE

Generates a random french city name

INTEGER

Replaces each number with a random one.

IPv4_ADDRESS

Generates a correct random IPv4 address

IPv6_ADDRESS

Generates a correct random IPv6 address

JOB_TITLE

Replaces each character with a random one.

LOCALIZATION

Generates random longitude and latitude coordinates.

LOCATION_COORDINATE

Generates random longitude and latitude coordinates.

MAC_ADDRESS

Generates a correct random MAC address.

ORGANIZATION

Generates a correct random organization name.

PASSPORT

Generates a correct random passport number.

US_PHONE

Generates a correct random phone number for the US.

FR_PHONE

Generates a correct random phone number for France.

UK_PHONE

Generates a correct random phone number for the UK.

DE_PHONE

Generates a correct random phone number for Germany.

US_POSTAL_CODE

Generates a correct random postal code for the US.

FR_POSTAL_CODE

Generates a correct random postal code for France.

UK_POSTAL_CODE

Generates a correct random postal code for the UK.

DE_POSTAL_CODE

Generates a correct random postal code for Germany.

BE_POSTAL_CODE

Generates a correct random postal code for Belgium.

FR_CODE_COMMUNE_INSEE

Generates a random french INSEE city code.

US_SSN

Generates a correct random Social Security Number for the US.

FR_SSN

Generates a correct random Social Security Number for France.

UK_SSN

Generates a correct random Social Security Number for the UK.

TEXT

Replaces each character with a random one.

MASTERCARD

Generates a correct random MasterCard credit card number.

US_CREDIT_CARD

Generates a correct random American Express credit card number.

VISACARD

Generates a correct random Visa credit card number.