Filtering data against patterns - Cloud - 8.0

Talend Studio User Guide

Last publication date: 2024-02-29
Available in:
  • Big Data Platform
  • Cloud API Services Platform
  • Cloud Big Data Platform
  • Cloud Data Fabric
  • Cloud Data Management Platform
  • Data Fabric
  • Data Management Platform
  • Data Services Platform
  • MDM Platform
  • Real-Time Big Data Platform

After you analyze a set of columns against a group of patterns and get the results showing which rows match or do not match all the patterns, you can filter the valid and invalid data according to the patterns used.

Before you begin

An analysis of a set of columns is open in the analysis editor in the Profiling perspective of Talend Studio.

You have used the Java engine to execute the analysis.

Procedure

  1. In the analysis editor, click the Analysis Results tab at the bottom of the editor to open the detailed result view.
  2. Click Data to open the corresponding view.
    A table lists the actual analyzed data in the analyzed columns.
    [Screenshot: Overview of the Data section in the Analysis Results tab]
  3. Click Filter Data on top of the table.
    A dialog box is displayed listing all the patterns used in the column set analysis.
    [Screenshot: List of the patterns used in the column set analysis]
  4. Select the check boxes of the patterns against which you want to filter the data.
  5. Select a display option:
    • All data shows all analyzed data.
    • matches shows only the data that matches the selected patterns.
    • non-matches shows only the data that does not match the selected patterns.
  6. Click Finish to close the dialog box.
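
The effect of the three display options can be sketched outside Talend Studio. The following Python snippet is only an illustration under stated assumptions: the sample rows, the column names, the simplified regular expressions, and the helper names `row_matches` and `filter_rows` are all hypothetical and are not part of Talend Studio. It also assumes that a row counts as a match only when it satisfies every selected pattern, and that selecting no pattern behaves like All data.

```python
import re

# Hypothetical sample rows; column names and values are invented for illustration.
ROWS = [
    {"email": "ann@example.com", "zip": "75001"},
    {"email": "bob.example.com", "zip": "75002"},
    {"email": "cy@example.org", "zip": "7500"},
]

# Assumed, simplified regexes keyed by the column they are applied to.
# They are NOT the patterns shipped with Talend Studio.
PATTERNS = {
    "email": r"[^@\s]+@[^@\s]+\.[^@\s]+",
    "zip": r"\d{5}",
}


def row_matches(row, selected):
    """Return True if the row satisfies every selected pattern (assumption)."""
    for column in selected:
        value = row.get(column)
        if value is None or re.fullmatch(PATTERNS[column], value) is None:
            return False
    return True


def filter_rows(rows, selected, option):
    """Mimic the display options: 'all', 'matches', 'non-matches'."""
    # Assumption: with no pattern selected, all rows are shown.
    if option == "all" or not selected:
        return list(rows)
    if option == "matches":
        return [r for r in rows if row_matches(r, selected)]
    if option == "non-matches":
        return [r for r in rows if not row_matches(r, selected)]
    raise ValueError(f"unknown display option: {option}")
```

For example, `filter_rows(ROWS, ["email"], "non-matches")` keeps only the row whose email value has no `@` sign, which mirrors selecting one pattern and the non-matches option in the dialog box.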

Results

In this example, data is filtered against the Email Address pattern, and only the data that does not match is displayed.
[Screenshot: List of the non-matching data in the Data section]

All email addresses that do not match the selected pattern appear in red. Any data row that has a missing value appears with a red background.
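
The distinction between a non-matching value and a missing value can be illustrated with a short Python sketch. It is a rough approximation only: the regular expression is a simplified assumption and not the Email Address pattern shipped with Talend Studio, and the function name `classify` and the sample values are hypothetical. Treating blank strings as missing is also an assumption.

```python
import re

# Assumed, simplified email regex; NOT the Email Address pattern from Talend Studio.
EMAIL_RE = re.compile(r"[^@\s]+@[^@\s]+\.[^@\s]+")


def classify(value):
    """Return 'missing', 'non-match', or 'match' for one email cell."""
    # Assumption: None and blank strings both count as missing values.
    if value is None or value.strip() == "":
        return "missing"
    return "match" if EMAIL_RE.fullmatch(value) else "non-match"


# Hypothetical values for illustration.
for value in ["ann@example.com", "bob.example.com", None]:
    print(repr(value), "->", classify(value))
```

In this sketch, a value classified as `non-match` corresponds to a cell shown in red, and a value classified as `missing` corresponds to a row shown with a red background.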

The Previous and Next buttons under the table help you navigate back and forth through the pages.

Numbered buttons that give direct access to pages are displayed under the table in the following cases:
  • You open the Data view for the first time after running the analysis.
  • You did not select a pattern in the Filter Data dialog box.
  • You selected All data as the display option in the Filter Data dialog box.