Filtering values using patterns

Talend Cloud Data Preparation User Guide

Version: Cloud
Language: English
Product: Talend Cloud
Module: Talend Data Preparation
Last publication date: 2023-10-03

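The Pattern tab of the profiling area shows a graphical representation of the patterns found in your data, that is, the type and number of characters each value is made of. You can click these patterns to filter the matching rows.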
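
To illustrate what a character-based pattern is, here is a minimal Python sketch. It is not Talend's implementation: the function name `char_pattern`, the sample email addresses, and the symbols `a`, `A`, and `9` for lowercase letters, uppercase letters, and digits are assumptions made for this example.

```python
from collections import Counter


def char_pattern(value: str) -> str:
    """Replace each character with a symbol for its class.

    Assumed convention for this sketch: digits become "9", uppercase
    letters "A", other letters "a". Any other character is kept as is.
    """
    symbols = []
    for ch in value:
        if ch.isdigit():
            symbols.append("9")
        elif ch.isalpha():
            symbols.append("A" if ch.isupper() else "a")
        else:
            symbols.append(ch)
    return "".join(symbols)


# Hypothetical sample values standing in for an Email column.
emails = ["ann@example.com", "bob42@example.org", "carl@test.net"]

# Count how many values share each pattern, as a pattern chart would.
pattern_counts = Counter(char_pattern(e) for e in emails)
for pattern, count in pattern_counts.most_common():
    print(count, pattern)
```

Each distinct pattern would correspond to one bar of the chart, and the count to the length of that bar.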

About this task

This example uses a dataset with typical customer information, such as names, email addresses, the companies the customers work for, and their subscription dates.

Procedure

  1. Select the column containing the data that you want to filter, for example Email.
  2. In the profiling area, click the Pattern tab.
    The different patterns used in this column are displayed in the form of a chart. Because this column contains text data, the chart shows the distribution of the data using word-based patterns.
  3. Switch to the character-based view by clicking the A icon.
    This view gives you another perspective from which to analyze your data.
  4. Switch back to the word-based view by clicking the Text icon.
  5. Click the top bar to apply a filter on the most common pattern.

    The preparation now displays only the rows whose value matches the [word]@[word].[word] pattern.

    You can also use Ctrl + Click or Shift + Click to select multiple values at the same time and apply a more complex filter.

  6. While holding down the Ctrl key, click the bar corresponding to the [word][number]@[word].[word] pattern to add this filter to the previous one.
    The grid now displays only the rows that match either of those two patterns.
  7. In the Functions panel, click a function to apply it to the filtered data, for example Delete these Filtered Rows.
  8. In the filter bar, click the cross on an individual filter to remove it, or click the garbage bin icon to clear all the filters and display the whole dataset again.
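
The procedure above can be approximated outside the application with a short Python sketch. It is a simplified illustration, not how Talend Data Preparation computes patterns: the function `word_pattern`, the sample rows, and the rule that any run of letters is `[word]` and any run of digits is `[number]` are assumptions, and only the two tokens that appear in the steps above are handled.

```python
import re
from collections import Counter

# A token is a run of ASCII letters or a run of ASCII digits.
_TOKEN = re.compile(r"[A-Za-z]+|[0-9]+")


def word_pattern(value: str) -> str:
    """Replace letter runs with [word] and digit runs with [number]."""
    def replace(match: re.Match) -> str:
        return "[number]" if match.group(0).isdigit() else "[word]"
    return _TOKEN.sub(replace, value)


# Hypothetical customer rows standing in for the dataset.
rows = [
    {"Name": "Ann", "Email": "ann@example.com"},
    {"Name": "Bob", "Email": "bob42@example.org"},
    {"Name": "Carl", "Email": "carl.smith@example.com"},
    {"Name": "Dee", "Email": "dee@example.net"},
]

# Steps 2 and 5: profile the column and pick the most common pattern.
counts = Counter(word_pattern(row["Email"]) for row in rows)
top_pattern, _ = counts.most_common(1)[0]

# Step 6: add a second pattern; a row matches if it fits either one.
selected = {top_pattern, "[word][number]@[word].[word]"}
filtered = [row for row in rows if word_pattern(row["Email"]) in selected]

# Step 7: deleting the filtered rows keeps only the non-matching ones.
remaining = [row for row in rows if word_pattern(row["Email"]) not in selected]

print([row["Name"] for row in filtered])
print([row["Name"] for row in remaining])
```

Combining the two selected patterns as a set membership test reflects that a value can only have one pattern, so selecting several bars widens the filter rather than narrowing it.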