Data Quality: new features - 7.3

Talend Big Data products Release Notes

Version
7.3
Language
English (United States)
Product
Talend Big Data
Talend Big Data Platform
Talend Open Studio for Big Data
Talend Real-Time Big Data Platform
Content
Installation and Upgrade
Release Notes

Feature

Description

Product

Explainable machine learning for data matching
  • A feature importance report can be generated when using the tMatchModel component.
  • In the tMatchPredict and tMatchIndexPredict components, a new output column, CONFIDENCE_SCORE, gives the confidence score of each prediction for a pair or cluster.
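
One way a downstream step might use the CONFIDENCE_SCORE column is to keep high-confidence predictions and set the rest aside for manual review. The following Python sketch only illustrates that idea outside of Talend Studio. The row layout, the pair_id and label fields, the 0 to 1 score range, and the 0.8 threshold are all assumptions made for the example, not documented behavior of the components.

```python
# Illustrative only: rows are plain dictionaries standing in for the output
# of a prediction step. Only the CONFIDENCE_SCORE column name comes from the
# release note; every other field and value here is hypothetical.

def split_by_confidence(rows, threshold=0.8):
    """Split rows into (accepted, to_review) using CONFIDENCE_SCORE."""
    accepted, to_review = [], []
    for row in rows:
        if row["CONFIDENCE_SCORE"] >= threshold:
            accepted.append(row)
        else:
            to_review.append(row)
    return accepted, to_review


predictions = [
    {"pair_id": 1, "label": "MATCH", "CONFIDENCE_SCORE": 0.97},
    {"pair_id": 2, "label": "MATCH", "CONFIDENCE_SCORE": 0.55},
    {"pair_id": 3, "label": "NO_MATCH", "CONFIDENCE_SCORE": 0.91},
]

accepted, to_review = split_by_confidence(predictions)
print([row["pair_id"] for row in accepted])
print([row["pair_id"] for row in to_review])
```

The threshold is a business decision: a higher value sends more pairs to review and fewer are accepted automatically.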

Talend Big Data Platform

Talend Real-Time Big Data Platform

Data masking
  • When using the tDataMasking and tPatternMasking components, data that cannot be masked can be sent to a separate output flow for invalid data.
  • Format-Preserving Encryption (FPE) is applied when masking:
    • Credit card numbers
    • IBAN and US bank account numbers
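
The two ideas above, keeping the format of a masked value and routing values that cannot be masked to a separate flow, can be sketched in Python as follows. This is a toy illustration, not the algorithm used by the Talend components: the digit substitution is a keyed, one-way replacement built on HMAC-SHA-256 rather than a real FPE scheme such as NIST FF1, and the validation rule (13 to 19 digits that pass the Luhn check), the key, and all function names are assumptions made for the example.

```python
import hashlib
import hmac

DIGITS = "0123456789"
SEPARATORS = " -"
KEY = b"demo-key-not-for-production"  # hypothetical key for the example


def luhn_valid(digits: str) -> bool:
    """Return True if the digit string passes the Luhn checksum."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:  # double every second digit from the right
            d = d * 2 - 9 if d > 4 else d * 2
        total += d
    return total % 10 == 0


def is_maskable(value: str) -> bool:
    """Assumed rule: only digits and separators, 13-19 digits, valid Luhn."""
    if any(ch not in DIGITS + SEPARATORS for ch in value):
        return False
    digits = "".join(ch for ch in value if ch in DIGITS)
    return 13 <= len(digits) <= 19 and luhn_valid(digits)


def mask_digits(value: str, key: bytes) -> str:
    """Replace each digit with a keyed pseudo-random digit, keep separators.

    Toy substitution only: it preserves length and layout but is neither
    reversible nor a standard FPE mode.
    """
    stream = hmac.new(key, value.encode("utf-8"), hashlib.sha256).digest()
    out, i = [], 0
    for ch in value:
        if ch in DIGITS:
            out.append(str((int(ch) + stream[i % len(stream)]) % 10))
            i += 1
        else:
            out.append(ch)
    return "".join(out)


def split_flows(values, key):
    """Return (main_flow, invalid_flow): masked values and rejected inputs."""
    main_flow, invalid_flow = [], []
    for value in values:
        if is_maskable(value):
            main_flow.append(mask_digits(value, key))
        else:
            invalid_flow.append(value)
    return main_flow, invalid_flow


inputs = ["4111 1111 1111 1111", "1234-5678-9012-3456", "not a card number"]
main_flow, invalid_flow = split_flows(inputs, KEY)
print(main_flow)
print(invalid_flow)
```

The masked value keeps the same length and separator positions as the input, which is the property that lets masked data pass downstream format checks. Values that fail validation are left untouched in the invalid flow so they can be inspected or corrected.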

Talend Big Data Platform

Talend Real-Time Big Data Platform

Validating and enriching contact information

A new component is available: tPersonator. It helps ensure the quality of US and Canadian contact data by checking, verifying, moving, and appending contact information.

Talend Big Data Platform

Talend Real-Time Big Data Platform

Support for additional databases (data mart)

Snowflake is now supported for the data quality data mart.

Talend Big Data Platform

Talend Real-Time Big Data Platform