How to set system and user-defined indicators - 6.2

Talend Data Fabric Studio User Guide

After you define the columns to be analyzed, the second step is to set statistics indicators on each of these columns.

Note

You can also use Java user-defined indicators when analyzing columns in a delimited file, provided that a Java user-defined indicator has already been created. For further information, see How to define Java user-defined indicators.

Prerequisite(s): An analysis of a delimited file is open in the analysis editor in the Profiling perspective of the studio. For more information, see How to define the columns to be analyzed.

To set system indicators for the column(s) to be analyzed, do the following:

  1. Follow the procedure outlined in How to define the columns to be analyzed.

  2. From the Data preview view in the analysis editor, click Select indicators to open the [Indicator Selection] dialog box.

  3. Set the indicators using the [Indicator Selection] dialog box as outlined in How to set system or user-defined indicators.

    In this example, you set the Simple Statistics indicators on all columns, and both the Text Statistics indicators and the Soundex Frequency indicator on the first_name column.

    Note

    You can set the text statistics indicators on a column only if its data mining type is set to nominal. Otherwise, these indicators are grayed out in the [Indicator Selection] dialog box.

    The selected indicators are attached to the analyzed columns in the Analyzed Columns view.

  4. Save the analysis.
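
To make the indicators chosen in this example more concrete, the following Python sketch computes comparable figures on a small delimited sample. It is an illustration only, not the studio's implementation. The sample data, the `;` delimiter, the function names, the treatment of empty fields as null, and the use of the classic American Soundex rules are all assumptions made for this sketch.

```python
import csv
import io
from collections import Counter

# Hypothetical sample standing in for the analyzed delimited file.
SAMPLE = """first_name;last_name
Robert;Smith
Rupert;Jones
Robert;Brown
;Miller
Ann;Smith
"""

def read_column(text, column, delimiter=";"):
    """Return one column as a list; empty fields become None (assumed null)."""
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    return [row[column] if row[column] != "" else None for row in reader]

def simple_statistics(values):
    """Counts in the spirit of simple statistics (nulls excluded from distinct)."""
    non_null = [v for v in values if v is not None]
    counts = Counter(non_null)
    return {
        "row_count": len(values),
        "null_count": len(values) - len(non_null),
        "blank_count": sum(1 for v in non_null if v.strip() == ""),
        "distinct_count": len(counts),
        "unique_count": sum(1 for n in counts.values() if n == 1),
        "duplicate_count": sum(1 for n in counts.values() if n > 1),
    }

def text_statistics(values):
    """Minimal, maximal and average length of the non-null values."""
    lengths = [len(v) for v in values if v is not None]
    return {
        "min_length": min(lengths),
        "max_length": max(lengths),
        "avg_length": sum(lengths) / len(lengths),
    }

# Classic American Soundex letter groups.
_CODES = {}
for _digit, _letters in {"1": "bfpv", "2": "cgjkqsxz", "3": "dt",
                         "4": "l", "5": "mn", "6": "r"}.items():
    for _ch in _letters:
        _CODES[_ch] = _digit

def soundex(name):
    """Return the four-character Soundex code of a name."""
    letters = [c for c in name.lower() if c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = _CODES.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # vowels reset prev, so a repeated code is kept after them
    return (result + "000")[:4]

def soundex_frequency(values):
    """Count the non-null values that share each Soundex code."""
    return dict(Counter(soundex(v) for v in values if v is not None))

if __name__ == "__main__":
    first_name = read_column(SAMPLE, "first_name")
    print(simple_statistics(first_name))
    print(text_statistics(first_name))
    print(soundex_frequency(first_name))
```

In this sample, Robert and Rupert share the Soundex code R163, so a frequency computed on Soundex codes groups them together even though the raw values differ. The text statistics function operates on string lengths, which is consistent with these indicators applying to columns holding textual (nominal) data.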