How to set system and user-defined indicators - 6.5

Talend Data Fabric Studio User Guide

EnrichVersion
6.5
EnrichProdName
Talend Data Fabric
task
Data Quality and Preparation
Design and Development
EnrichPlatform
Talend Studio

The second step after defining the columns to be analyzed is to set statistics indicators for each of the defined columns.

Note

You can also use Java user-defined indicators when analyzing columns in a delimited file on the condition that a Java user-defined indicator is already created. For further information, see How to define Java user-defined indicators.

Prerequisite(s): An analysis of a delimited file is open in the analysis editor in the Profiling perspective of the studio. For more information, see How to define the columns to be analyzed.

  1. Follow the procedure outlined in How to define the columns to be analyzed.

  2. From the Data preview view in the analysis editor, click Select indicators to open the [Indicator Selection] dialog box.

  3. From the [Indicator Selection] dialog box:

    • In the Data preview section, place the cursor on a row to display the complete data. This section lists the sample data you define in the analysis editor.

    • Click in the cells next to indicators names to set indicator parameters for the analyzed columns as needed. You can assign system or user-defined indicators to the columns.

    • Select the Hide non applicable indicators check box to hide the system and user-defined indicators that are not compatible with the engine you select to execute the analysis.

    • If required, change the order of columns by dropping them with the cursor.

      The order of the columns will be changed accordingly in the analysis editor.

    In this example, you want to set the Simple Statistics indicators on all columns, the Text Statistics indicators on the first_name column and the Soundex Frequency on the first_name column as well.

    Note

    You can set the text statistics indicators on a column only if its data mining type is set to nominal. Otherwise, these indicators will be grayed out in the [Indicator Selection] dialog box.

    The selected indicators are attached to the analyzed columns in the Analyzed Columns view.

  4. Save the analysis.