Identifying anomalies in data - 7.2

Talend Open Studio for Data Quality Getting Started Guide

Version
7.2
Language
English (United States)
Product
Talend Open Studio for Data Quality
Module
Talend Studio
Content
Data Quality and Preparation > Profiling data
Design and Development
Installation and Upgrade

The use case explains how to use the Profiling perspective of the studio to analyze customer email addresses and phone numbers. It uses out-of-box indicators and patterns on the columns and shows the matching and non-matching address data.

You can then use the Data Explorer perspective to browse the non-matching data.

The sequence of profiling customer data involves the following steps:

Procedure

  1. Create a column analysis on customer email addresses and phone numbers.
  2. Connect to the database which holds the customer data from the analysis editor.
  3. Add indicators to provide simple statistics on data such as row , blank and duplicate counts.
  4. Add standard patterns against which to match email addresses and phone numbers.
  5. Execute the analysis to show results in tables and charts.
  6. Access a view of the analyzed data to see invalid records.