Deleting multiple columns

Talend Data Preparation User Guide

author
Talend Documentation Team
EnrichVersion
6.4
2.1
EnrichProdName
Talend MDM Platform
Talend Real-Time Big Data Platform
Talend Data Services Platform
Talend Big Data
Talend Data Management Platform
Talend Data Fabric
Talend ESB
Talend Data Integration
Talend Big Data Platform
task
Data Quality and Preparation > Cleansing data
EnrichPlatform
Talend Data Preparation

Several scenarios may occur where you would need to delete unnecessary columns from your datasets, in order to remove empty columns, or delete information that is not relevant anymore.

Talend Data Preparation allows you to delete multiple columns in one single action.

Let's take the example of a dataset containing some customers data, but where the information on the age, salary or marital status is missing. This leaves you with three empty columns, that you will delete in one go, with the dedicated function, to clean your dataset.

Procedure

  1. While pressing the Ctrl key, click the headers of every column you want to delete, Age, Salary_Out and MaritalStatus in this example.

    All three columns are now selected and highlighted in blue.

  2. In the Functions panel, select the Delete column function.

    The function directly applies and the three columns are deleted. Three recipe steps are created, one for each column.

    It is also possible to delete an individual column by clicking the menu icon in a column header and selecting Delete Column. However, using the dedicated function is the most efficient way to delete multiple columns at the same time.