Removing all the empty and invalid rows

Talend Data Preparation User Guide

author
Talend Documentation Team
EnrichVersion
6.3
2.0
EnrichProdName
Talend Data Integration
Talend Data Fabric
Talend Real-Time Big Data Platform
Talend ESB
Talend Data Services Platform
Talend Data Management Platform
Talend MDM Platform
Talend Big Data
Talend Big Data Platform
task
Data Quality and Preparation > Cleansing data
EnrichPlatform
Talend Data Preparation

Using the quality bar is a convenient way to filter and remove invalid rows for a given column, but this action is also available for the whole dataset.

You can filter on all the invalid and empty rows in your dataset, then remove them in a single action.

Let's take the example of a dataset containing customer data, where some phone number and email address entries are either invalid or empty.

Procedure

  1. Click the white arrow on the top left of the grid.
  2. Select Display rows with invalid or empty values.

    The filter is applied and the grid now displays only the rows containing at least one empty or invalid entry.

    You can also choose to filter only the invalid rows or only the empty rows, and remove just those from your dataset.

  3. In the functions panel, search for the Delete these Filtered Rows function and click it to apply it to your data.

    The rows have been deleted and you can now remove the filter.

  4. In the filter bar, click the cross in the filter or click the garbage bin icon to display the whole dataset again.

Results

Your dataset is now free of any invalid or empty values and the quality bar is fully green for all the columns.
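
For readers who want to reason about what the filter and the delete function do together, the logic can be sketched outside the tool. The Python example below is only an illustration under stated assumptions: the regular expressions, the VALIDATORS mapping, and the function names are hypothetical and do not reproduce how Talend Data Preparation actually detects invalid values. It keeps a row only if none of its cells is empty and every cell passes its column's check.

```python
import re

# Hypothetical validity rules for illustration only; the product's own
# type detection and validation are not reproduced here.
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")
PHONE_RE = re.compile(r"^\+?[0-9][0-9 .()-]{5,}$")

# Columns without an entry here are only checked for emptiness.
VALIDATORS = {
    "email": lambda value: bool(EMAIL_RE.match(value)),
    "phone": lambda value: bool(PHONE_RE.match(value)),
}


def is_empty(value):
    """A cell is empty if it is None or contains only whitespace."""
    return value is None or str(value).strip() == ""


def has_invalid_or_empty(row):
    """Return True if at least one cell is empty or fails its column's check."""
    for column, value in row.items():
        if is_empty(value):
            return True
        check = VALIDATORS.get(column)
        if check is not None and not check(str(value).strip()):
            return True
    return False


def remove_invalid_or_empty_rows(rows):
    """Keep only the rows in which every cell is filled in and valid."""
    return [row for row in rows if not has_invalid_or_empty(row)]


# Sample customer data (made up for this sketch).
customers = [
    {"name": "Ada", "email": "ada@example.com", "phone": "+33 1 23 45 67 89"},
    {"name": "Bob", "email": "bob_at_example.com", "phone": "555-0100"},
    {"name": "Cy", "email": "cy@example.com", "phone": ""},
]

cleaned = remove_invalid_or_empty_rows(customers)
print([row["name"] for row in cleaned])
```

In this sketch, has_invalid_or_empty plays the role of the filter from step 2, and remove_invalid_or_empty_rows plays the role of the delete function from step 3.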