Deleting multiple columns - 8.0

Talend Data Preparation User Guide

Version
8.0
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Data Preparation
Content
Data Quality and Preparation > Cleansing data
Last publication date
2024-03-26

Several scenarios may occur where you would need to delete unnecessary columns from your datasets, in order to remove empty columns, or delete information that is not relevant anymore.

Talend Data Preparation allows you to delete multiple columns in one single action.

Let's take the example of a dataset containing some customers data, but where the information on the age, salary or marital status is missing. This leaves you with three empty columns, that you will delete in one go, with the dedicated function, to clean your dataset.

Procedure

  1. While pressing the Ctrl key, click the headers of every column you want to delete, Age, Salary_Out and MaritalStatus in this example.

    All three columns are now selected and highlighted in blue.

  2. In the Functions panel, select the Delete column function.

    The function directly applies and the three columns are deleted. Three recipe steps are created, one for each column.

    It is also possible to delete an individual column by clicking the menu icon in a column header and selecting Delete Column. However, using the dedicated function is the most efficient way to delete multiple columns at the same time.