Working principles of data quality - 6.5

Talend Open Studio for MDM User Guide

From the Profiling perspective of the studio, you can examine the data available in different data sources and collect statistics and information about this data.

A typical sequence of profiling data using the studio involves the following steps:

  1. Connecting to a data source, such as a database or a delimited file, to access the tables and columns on which you want to define and execute analyses. For more information, see Setting up connections to data sources.

  2. Defining one or more of the available data quality analyses, such as database content analysis, column analysis, table analysis, redundancy analysis or correlation analysis. These analyses carry out data profiling processes that describe the content, structure and quality of highly complex data structures. The analysis results are displayed graphically next to each analysis editor, or in more detail in the Analysis Results view.

    Note

    While you can use all analysis types to profile data in databases, you can only use Column Analysis and Column Set Analysis to profile data in delimited files.
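
To make the idea of a column analysis on a delimited file more concrete, the following Python sketch computes a few simple statistics for one column. It is only an illustration of the kind of information profiling collects, not how the studio implements its analyses. The function name `profile_column`, the sample data, the semicolon delimiter and the exact definitions of each count (for example, treating an empty field as null and a whitespace-only field as blank) are all assumptions made for this example.

```python
import csv
import io
from collections import Counter


def profile_column(lines, column, delimiter=";"):
    """Collect simple statistics for one column of delimited text.

    Hypothetical helper for illustration; not part of any Talend API.
    """
    reader = csv.DictReader(lines, delimiter=delimiter)
    values = [row[column] for row in reader]

    # Assumption: an empty or missing field counts as null,
    # and a field containing only whitespace counts as blank.
    null_count = sum(1 for v in values if v is None or v == "")
    blank_count = sum(1 for v in values if v and v.strip() == "")

    # Count occurrences of each remaining (non-null, non-blank) value.
    counts = Counter(v for v in values if v is not None and v.strip() != "")

    return {
        "row_count": len(values),
        "null_count": null_count,
        "blank_count": blank_count,
        # Number of different values present in the column.
        "distinct_count": len(counts),
        # Number of values that occur exactly once.
        "unique_count": sum(1 for n in counts.values() if n == 1),
        # Number of values that occur more than once.
        "duplicate_count": sum(1 for n in counts.values() if n > 1),
    }


# Made-up sample standing in for a delimited file.
sample = io.StringIO("id;country\n1;FR\n2;DE\n3;FR\n4;\n5;  \n6;US\n")
stats = profile_column(sample, "country")
print(stats)
```

For the sample above, the `country` column has six rows, one null, one blank, three distinct values (`FR`, `DE`, `US`), two values that occur once and one value that is duplicated.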