Selecting the columns you want to analyze - Cloud - 8.0

Talend Studio User Guide

Version
Cloud
8.0
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Cloud
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Design and Development
Last publication date
2024-04-16
Available in...

Big Data Platform

Cloud API Services Platform

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Management Platform

Data Fabric

Data Management Platform

Data Services Platform

MDM Platform

Real-Time Big Data Platform

Procedure

  1. In the analysis editor and from the Connection list, select the database connection on which to run the analysis.
    The nominal correlation analysis is possible only on database columns for the time being. You can change your database connection by selecting another connection from the Connection list. If the analyzed columns do not exist in the new database connection you want to set, you receive a warning message that enables you to continue or cancel the operation.
  2. Click Select Columns to open the Column Selection dialog box and select the columns you want to analyze, or drag them directly from the DQ Repository tree view.
    If you select too many columns, the analysis result chart will be very difficult to read.
    You can right-click any of the listed columns in the Analyzed Columns view and select Show in DQ Repository viewto locate the selected column under the corresponding connection in the tree view.
  3. If required, click Options icon in the Indicators view to open a dialog box where you can set thresholds for each indicator.
    The indicators representing the simple statistics are by-default attached to this type of analysis.
  4. In the Data Filter view, enter an SQL WHERE clause to filter the data on which to run the analysis, if required.
  5. In the Analysis Parameter view and in the Number of connections per analysis field, set the number of concurrent connections allowed per analysis to the selected database connection, if required.
    You can set this number according to the database available resources, that is the number of concurrent connections each database can support.
  6. If you have defined context variables in the analysis editor:
    1. use the Data Filter and Analysis Parameter views to set/select context variables to filter data and to decide the number of concurrent connections per analysis respectively.
    2. In the Context Settings view, select from the list the context environment you want to use to run the analysis.
    For further information about contexts and variables, see Using context variables in analyses.
  7. Press F6 to execute the analysis.
    The editor switches to the Analysis Results tab showing the results.
    For detail explanation of the analysis results, see Exploring the results of the nominal correlation analysis.