Removing all the empty and invalid rows - 8.0

Talend Data Preparation User Guide

Version
8.0
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Data Preparation
Content
Data Quality and Preparation > Cleansing data
Last publication date
2024-03-26

Using the quality bar is a convenient way to filter and remove invalid rows for a given column, but this action is also available for the whole dataset.

You can apply a filter on all the invalid and empty rows from your dataset to remove them in a single action.

Let's take the example of a dataset containing customer data, where some phone numbers and email addresses entries are either invalid or empty.

Procedure

  1. Click the menu icon on the top left of the grid.
  2. Select Display rows with invalid or empty values.

    The filter has been applied and the grid now only displays the rows containing at least one empty or invalid entries.

    You can also choose to only filter the invalid or empty rows to remove them from your dataset.

  3. In the functions panel, search the Delete these Filtered Rows function and click it to apply it on you data.

    The rows have been deleted and you can now remove the filter.

  4. In the filter bar, click the cross in the filter or click the garbage bin icon to display the whole dataset again.

Results

Your dataset is now free of any invalid or empty values and the quality bar is fully green for all the columns.