Working principles of data quality - Cloud - 7.3

Talend Studio User Guide

Version
Cloud
7.3
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Cloud
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Design and Development
Last publication date
2024-02-13
Available in...

Big Data Platform

Cloud API Services Platform

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Management Platform

Data Fabric

Data Management Platform

Data Services Platform

MDM Platform

Real-Time Big Data Platform

Available in:

Big Data Platform

Data Fabric

Data Management Platform

Data Services Platform

MDM Platform

Real-Time Big Data Platform

Talend DQ Portal is deprecated from Talend 7.1 onwards.

From the Profiling perspective of Talend Studio, you can examine the data available in different data sources and collect statistics and information about this data.

A typical sequence of profiling data using Talend Studio involves the following steps:

  1. Connecting to a data source including databases and delimited files in order to be able to access the tables and columns on which you want to define and execute analyses. For more information, see Creating connections to data sources.
  2. Defining any of the available data quality analyses including database content analysis, column analysis, table analysis, redundancy analysis, correlation analysis, etc. These analyses will carry out data profiling processes that will define the content, structure, and quality of highly complex data structures. The analysis results will be displayed graphically next to each of the analysis editors, or in more detail in the Analysis Results view.
    Note: While you can use all analyses types to profile data in databases, you can only use Column Analysis and Column Set Analysis to profile data in delimited files.
  3. Generating reports from different analyses and save them in a distant database. These reports allow to compare current and historical statistics to determine the improvement or degradation of data. For more information, see What are reports?.
  4. Available in:

    Big Data Platform

    Data Fabric

    Data Management Platform

    Data Services Platform

    MDM Platform

    Real-Time Big Data Platform

  5. Accessing different analytical tools that will allow you to explore and monitor the reports generated in the Studio . For more information about the Portal, see the Talend DQ Portal User and Administrator Guide. For more information about installing the Portal, see the Talend Installation and Upgrade Guide.

Talend Studio provides you with lock modes that allow you, if you are the first user to open an item, to lock that item and thus have the "read and write" rights. All other users who try to open the same item simultaneously will have a read-only access. For more information, see Lock principle.