Creating a basic analysis on a database column - Cloud - 8.0

Talend Studio User Guide

Version
Cloud
8.0
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Cloud
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Design and Development
Last publication date
2024-02-29
Available in...

Big Data Platform

Cloud API Services Platform

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Management Platform

Data Fabric

Data Management Platform

Data Services Platform

MDM Platform

Real-Time Big Data Platform

About this task

You can build your analysis from scratch, analyze the content of one or multiple columns and execute the created analyses using the Java or the SQL engine. This type of analysis provides statistics about the values within each column.

When you use the Java engine to run a column analysis, you can view the analyzed data according to parameters you set yourself.

For more information, see Using the Java or the SQL engine.

Note: When you use the Java engine to run a column analysis on big sets or on data with many problems, it is advisable to define a maximum memory size threshold in Talend Studio Preferences to execute the analysis as you may end up with a Java heap error.

You can also analyze a set of columns. This type of analysis provides statistics on the values across all the data set (full records).

For more information, see Analyzing tables in databases.

The sequence of creating a basic column analysis involves the following steps:

Procedure

  1. Defining the columns to be analyzed.
  2. Setting predefined system indicators or indicators defined by the user for the column(s).
  3. Adding the patterns against which to define the content, structure and quality of the data.