Creating analyses from table or column names - Cloud - 8.0

Talend Studio User Guide

Version
Cloud
8.0
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Cloud
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Design and Development
Last publication date
2024-02-22
Available in...

Big Data Platform

Cloud API Services Platform

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Management Platform

Data Fabric

Data Management Platform

Data Services Platform

MDM Platform

Real-Time Big Data Platform

About this task

You can use simplified ways to create one or multiple column analyses. All what you need to do is to start from the table name or the column name under the relevant DB Connection folder in the DQ Repository tree view.

The options from the table name are different from the options from the column name.

Procedure

  1. To create a column analysis directly from the relevant table name in the DB Connection, do the following:
    1. In the DQ Repository tree view, expand Metadata > DB Connections.
    2. Browse to the table that holds the columns you want to analyze and right-click it.
    3. From the context menu, select a type of analysis.
      • Semantic-aware Analysis: analyze the selected table based on information gathered in the semantic repository.

        For further information, see Steps to use the Semantic-aware analysis.

      • Match Analysis: open the match analysis editor where you can define match rules and select the columns on which you want to use the match rules.

        For more information see Analyzing duplicates.

      • Table Analysis: analyze the selected table using SQL business rules.

        For more information on the Simple Statistics indicators, see Simple statistics.

      • Column Analysis: analyze all the columns included in the selected table using the Simple Statistics indicators.

        For more information on the Simple Statistics indicators, see Simple statistics.

      • Pattern Frequency Analysis: analyze all the columns included in the selected table using the Pattern Frequency Statistics along with the Row Count and the Null Count indicators.

        For more information, see Pattern frequency statistics.

    These steps replace the procedures outlined in Defining the columns to be analyzed and setting indicators. You can proceed following the steps outlined in Finalizing and executing the column analysis.
  2. To create a column analysis directly from the column name in the DB Connection, do the following:
    1. In the DQ Repository tree view, expand Metadata > DB Connections.
    2. Browse to the columns you want to analyze and right-click them.
    3. From the context menu, select an action to perform or a type of analysis.
      • Analyze: create an analysis for the selected column. You must later set the indicators you want to use to analyze the selected column.

        For more information on setting indicators, see Setting indicators on columns. For more information on accomplishing the column analysis, see Finalizing and executing the column analysis.

      • Nominal Value Analysis: create a column analysis on nominal data preconfigured with indicators appropriate for nominal data, namely Value Frequency, Simple Statistics and Text Statistics indicators.
      • Simple Analysis: analyze the selected column using the Simple Statistics indicators.

        For more information on the Simple Statistics indicators, see Simple statistics.

      • Pattern Frequency Analysis: analyze the selected column using the Pattern Frequency Statistics along with the Row Count and the Null Count indicators.

        For more information on the Pattern Frequency Statistics, see Pattern frequency statistics.

      • Analyze Column Set: perform an analysis on the content of a set of columns. This analysis focuses on a column set (full records) and not on separate columns as it is the case with the column analysis.

        For more information, see Creating a simple table analysis (Column Set Analysis).

      • Analyze Correlation: perform column correlation analyses between nominal and interval columns or nominal and date columns in database tables.

        For more information, see Numerical correlation analyses.

      • Semantic-aware Analysis: analyze the selected column(s) based on information gathered in the semantic repository.

        For further information, see Steps to use the Semantic-aware analysis.

      • Analyze matches: open the match analysis editor where you can define match rules and select the columns on which you want to use the match rules.

        For more information see Analyzing duplicates.

    These steps replace one of or both of the procedures outlined in Defining the columns to be analyzed and setting indicators. You can proceed following the same steps outlined in Finalizing and executing the column analysis.