Setting system indicators - 7.0

Data Quality Job and Analysis Examples

author
Talend Documentation Team
EnrichVersion
7.0
EnrichProdName
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Open Studio for Data Quality
Talend Open Studio for MDM
Talend Real-Time Big Data Platform
task
Data Quality and Preparation
EnrichPlatform
Talend Studio

Procedure

  1. From the Data preview view in the analysis editor, click Select indicators to open the Indicator Selection dialog box.
  2. Click in the cells next to indicators names to set indicator parameters for the analyzed columns and click OK.
    You want to see the row, blank and duplicate counts in all columns to see how consistent the data is. Also you want to use the Pattern Frequency Table indicator on the email and postal columns in order to compute the number of most frequent records for each distinct pattern or value.

    Indicators are added accordingly to the columns in the Analyzed Columns view.

  3. Click the option icon next to the Blank Count indicator and set 0 in the Upper threshold field.

    Defining thresholds on indicators is very helpful as it will write in red the count of the null values in the analysis results.