Analyzing duplicates - 7.3

Talend Open Studio User Guide

Version: 7.3
Language: English
Product: Talend Open Studio for Big Data, Talend Open Studio for Data Integration, Talend Open Studio for Data Quality, Talend Open Studio for ESB
Module: Talend Studio
Content: Design and Development
Last publication date: 2023-10-11
Available in: Open Studio for Data Quality

You can use the match analysis in the Profiling perspective of Talend Studio to compare columns in databases or delimited files and create groups of similar records using the VSR or the T-Swoosh algorithm.

This analysis gives you a simple way to create match rules, test them on a set of columns, and see the results directly in the editor.

You can also use the Profiling perspective to define match rules in a match rule editor and save them in the Talend Studio repository.
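
To give a rough idea of what "creating groups of similar records" means, the following Python sketch groups strings whose similarity to a group's first record reaches a threshold. It is only an illustration: it does not reproduce the VSR or T-Swoosh algorithms, and the function names, the 0.8 threshold, and the sample data are all hypothetical, not part of Talend Studio.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a case-insensitive similarity ratio between 0.0 and 1.0."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def group_similar(records, threshold=0.8):
    """Greedily group records that are similar to a group's first record.

    Hypothetical helper for illustration only; this is not the VSR or
    T-Swoosh algorithm used by the match analysis.
    """
    groups = []  # each group is a list; its first element acts as the representative
    for record in records:
        for group in groups:
            if similarity(record, group[0]) >= threshold:
                group.append(record)
                break
        else:
            # No existing group was similar enough, so start a new one.
            groups.append([record])
    return groups


# Made-up sample data containing near-duplicate names.
names = ["John Smith", "Jon Smith", "Mary Jones", "John Smith", "Marie Jones"]
groups = group_similar(names)
for group in groups:
    print(group)
```

In the match analysis itself, the comparison logic is defined through match rules rather than code; the threshold in this sketch only stands in for the idea that a rule decides when two records are close enough to fall into the same group.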