Creating a basic analysis on a database column - 7.3

Talend Open Studio User Guide

Version: 7.3
Language: English
Product: Talend Open Studio for Big Data, Talend Open Studio for Data Integration, Talend Open Studio for Data Quality, Talend Open Studio for ESB
Module: Talend Studio
Content: Design and Development
Last publication date: 2023-10-11
Available in: Open Studio for Data Quality

About this task

You can build an analysis from scratch to analyze the content of one or more columns, and then execute it using either the Java or the SQL engine. This type of analysis provides statistics about the values within each column.
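As a rough illustration of what per-column statistics can look like, the following Python sketch computes a few simple indicators for a single column. It is not Talend code: the function name `column_statistics`, the sample data, and the exact indicator definitions are assumptions made for this example.

```python
from collections import Counter

def column_statistics(values):
    """Compute a few simple statistics for one column (illustrative only)."""
    non_null = [v for v in values if v is not None]
    counts = Counter(non_null)
    return {
        "row_count": len(values),
        "null_count": len(values) - len(non_null),
        "distinct_count": len(counts),
        # Assumed definition: distinct values that occur more than once.
        "duplicate_count": sum(1 for c in counts.values() if c > 1),
    }

# Hypothetical sample column.
emails = ["a@example.com", "b@example.com", None, "a@example.com"]
stats = column_statistics(emails)
```

Each statistic here is computed from one column in isolation, which is the defining trait of a column analysis.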

When you use the Java engine to run a column analysis, you can view the analyzed data according to parameters you set yourself.

For more information, see Using the Java or the SQL engine.

Note: When you use the Java engine to run a column analysis on large data sets or on data with many problems, define a maximum memory size threshold in the Talend Studio Preferences before executing the analysis. Otherwise, you may end up with a Java heap error.

You can also analyze a set of columns. This type of analysis provides statistics on the values across the whole data set (full records).

For more information, see Analyzing tables in databases.
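To make the contrast between per-column statistics and statistics on full records concrete, the sketch below treats the selected columns of each row as one record and counts distinct, unique and duplicate records. It is a minimal illustration under assumed definitions, not Talend's implementation; `record_statistics` and the sample rows are hypothetical.

```python
from collections import Counter

def record_statistics(rows, columns):
    """Count statistics over full records built from the selected columns."""
    # Each record is the tuple of values taken from the selected columns.
    records = [tuple(row[c] for c in columns) for row in rows]
    counts = Counter(records)
    return {
        "row_count": len(records),
        "distinct_count": len(counts),
        # Assumed definitions: records seen exactly once / more than once.
        "unique_count": sum(1 for c in counts.values() if c == 1),
        "duplicate_count": sum(1 for c in counts.values() if c > 1),
    }

# Hypothetical sample rows.
rows = [
    {"first": "Ann", "last": "Lee"},
    {"first": "Ann", "last": "Ray"},
    {"first": "Ann", "last": "Lee"},
]
record_stats = record_statistics(rows, ["first", "last"])
```

In this sample, the `first` column alone holds a single distinct value, while the combined records yield two distinct records, one of which is duplicated.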

Creating a basic column analysis involves the following steps:

Procedure

  1. Defining the column(s) to be analyzed.
  2. Setting predefined system indicators or indicators defined by the user for the column(s).
  3. Adding the patterns against which to check the content, structure and quality of the data.
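The last step can be pictured as matching each value of the selected column against a regular expression and counting the results. The Python sketch below shows the idea only. The function `pattern_match_counts` and the simplified email pattern are hypothetical and are not taken from the patterns shipped with Talend Studio.

```python
import re

def pattern_match_counts(values, pattern):
    """Count non-null values that do and do not fully match a regex pattern."""
    regex = re.compile(pattern)
    non_null = [v for v in values if v is not None]
    matching = sum(1 for v in non_null if regex.fullmatch(v))
    return {"matching": matching, "not_matching": len(non_null) - matching}

# Hypothetical, deliberately simplified email pattern.
EMAIL_PATTERN = r"[^@\s]+@[^@\s]+\.[^@\s]+"

# Hypothetical sample column; null values are skipped in this sketch.
values = ["a@example.com", "not-an-email", None, "b@example.org"]
result = pattern_match_counts(values, EMAIL_PATTERN)
```

Skipping null values is a design choice of this sketch, so that missing data is not reported as a pattern violation.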