Using deduplication components

Some data quality components enable you to analyze columns in databases and group duplicates or match values together using matching rules or comparison algorithms. Example components are tMatchGroup, tRecordMatching, tGenKey, and tRuleSurvivorship.

For further information about managing a survivorship rule package, see Managing a survivorship rule package.

For further information and example Jobs about the deduplication components, see Data Quality components and Cleansing delimited files (CSV files).

The data quality demo project has also ready-to-use Jobs that may use deduplication components. For further information, see Importing a data quality demo project.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!

Leave your feedback here