Data Quality: new features - 7.3

Talend Big Data products Release Notes

author
Talend Documentation Team
EnrichVersion
7.3
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Open Studio for Big Data
Talend Real-Time Big Data Platform
task
Installation and Upgrade
Release Notes

Feature

Description

Product

Explainable machine learning for data matching
  • A feature importance report can be generated when using the tMatchModel component.
  • In the tMatchPredict and tMatchIndexPredict components, the new output column (CONFIDENCE_SCORE) indicates the confidence score of a prediction for a pair or cluster.

Talend Big Data Platform

Talend Real-Time Big Data Platform

Data masking
  • When using the tDataMasking and tPatternMasking components, the data that cannot be masked can be sent to an invalid flow output.
  • The Format-Preserving Encryption is applied when masking:
    • Credit card numbers
    • IBAN and US bank account numbers

Talend Big Data Platform

Talend Real-Time Big Data Platform

Validating and enriching contact information A new component is available: tPersonator.

It ensures the quality of a US and Canadian contact database by checking, verifying, moving and appending contact information.

Talend Big Data Platform

Talend Real-Time Big Data Platform

Support for additional databases (data mart) Snowflake is now supported for the data quality data mart.

Talend Big Data Platform

Talend Real-Time Big Data Platform