Removing empty and invalid rows - 2.3

Talend Data Preparation User Guide

author
Talend Documentation Team
EnrichVersion
6.5
2.3
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
task
Data Quality and Preparation > Cleansing data
EnrichPlatform
Talend Data Preparation

Using the quality bar is a quick way to remove the rows containing invalid or empty records for a given column.

Let's take the example of a dataset containing some customer data. One of the column contains email addresses but some of the entries are either invalid or empty.

You are going to use the quality bar to directly delete all the rows containing empty or invalid values for this column

Procedure

  1. Click the white part of the quality bar, under the column header.
  2. In the drop-down menu, click Delete the rows with empty cells.

    The empty cells of the column have been deleted and only the invalid values, represented by the orange bar, remain.

  3. Click the orange part of the quality bar.
  4. In the drop-down menu, click Delete the rows with invalid cells.

Results

Your column is now cleaned of all invalid data or empty cells.