List of functions

Talend Data Preparation User Guide

author
Talend Documentation Team
EnrichVersion
6.3
2.0
EnrichProdName
Talend Big Data Platform
Talend Data Services Platform
Talend Data Management Platform
Talend Big Data
Talend ESB
Talend Data Integration
Talend Real-Time Big Data Platform
Talend Data Fabric
Talend MDM Platform
task
Data Quality and Preparation > Cleansing data
EnrichPlatform
Talend Data Preparation
This table lists all the functions available in Talend Data Preparation and their effects.

Name

Category

Description

Negate Value

boolean

Reverse the boolean value of cells from this column

Change Data Type

column metadata

Change type of this column (number, text, date, etc.)

Change Semantic Domain

column metadata

Change semantic domain of this column (city, zipcode, last name, etc.)

Create new Column

column metadata

Copy a column or create a brand new one

Delete Column

column metadata

Delete this column

Duplicate Column

column metadata

Create an exact copy of this column

Rename Column

column metadata

Rename this column

Concatenate with

columns

Merge the content of this column with another one, and displays it in a new column

Reorder columns

columns

Change column order

Swap columns

columns

Swap the values with an other column

Convert temperature

conversions

Convert temperature measurement units

Clear on matching value

data cleansing

Clear cells that match the value

Clear the Cells with Invalid Values

data cleansing

Clear cells that contain a value recognized as invalid

Delete Line

data cleansing

Delete this line

Delete the Lines that Match

data cleansing

Delete rows where a cell in this column has a specific value

Delete the Lines with Empty Cell

data cleansing

Delete rows that have empty cells

Delete the Lines with Invalid Cell

data cleansing

Delete rows which contain an invalid cell

Fill Cells with Value

data cleansing

Fill cells from this column with a given value

Fill Empty Cells with Text

data cleansing

Fill empty cells from this column with a given value

Fill Empty Cells with Value

data cleansing

Fill cells from this column with a given value

Make as Header

data cleansing

Cells of this line will become columns names, the line will be deleted

Remove Negative Values

data cleansing

Lines with a negative value in this column will be deleted

Mask data (obfuscation)

data masking

Mask data according to the domain information of the column (anonymisation)

Lookup

data blending

Blends columns from another dataset into this one

Calculate Time since

dates

Calculate elapsed time since a date in the desired unit (year, month, day, hour)

Calculate Timestamp to Date

dates

Given a timestamp (ellapsed time since epoch in second), create a new column with the date

Change Date Format

dates

Change the date format to use in a date column

Compare Dates

dates

Compare this column to another column or a constant

Extract Date Parts

dates

Create columns with year, month, day, hour, minute, second, etc.

Modify Dates

dates

Add or substract time unit amount

Delete these Filtered Lines

filtered

Delete only the lines that match the current filters

Keep these Filtered Lines

filtered

Keep only the lines that match the current filters

Add, Multiply, Substract or Divide

math

Perform an operation/calculation on this column with another one or with a fixed value: Add/sum (+), multiply (x), subtract (-), or divide(/)

Base 10 Logarithm

math

Compute the base 10 logarithm from a column

Calculate Absolute Value

math

Calculate the absolute value for all the numeric values in this column.

Cosine

math

Compute the trigonometric Cosine from a column

Exponential

math

Exponential of a column number

Max

math

Max with another column or a constant

Min

math

Min with another column or a constant

Natural Logarithm

math

Compute the natural logarithm from a column

Negate

math

Negate a column number

Power

math

Power with another column or a constant

Sine

math

Compute the trigonometric Sine from a column

Square root

math

Square root of a column number

Tangent

math

Compute the trigonometric Tangent from a column

Compare numbers

numbers

Compare this column to another column or a constant

Format numbers

numbers

Allow to format number (decimal, integer & scientific) in a specific format or pattern

Remove Fractional Part

numbers

Round towards zero. (3.74 -> 3) and (-3.74 -> -3)

Round Value using Ceil mode

numbers

Round up value to the nearest number, depending on the precision you set. (3.14 -> 4 if Precision is set to 0, and 3.14 -> 3.2 if Precision is set to 1)

Round value using Down mode

numbers

Round towards zero. (3.74 -> 3 and -3.74 -> -3 for a Precision set to 0)

Round Value using Floor mode

numbers

Round down value to the nearest number, depending on the precision you set. (3.74 -> 3 if Precision is set to 0, and 3.74 -> 3.7 if Precision is set to 1)

Round Value using HalfUp mode

numbers

Round value to the closest number, depending on the precision you set. (3.14 -> 3 and 3.74 -> 4 for a Precision set to 0)

Format phone number

phones

Format a phone number to standard formats

Extract Email Parts

split

Extract local and domain parts from an email

Extract number

split

Extract number from the input

Extract string parts

split

Extract string tokens based on regex groups

Extract URL Parts

split

Extract protocol, host, port, query, etc... from an URL in separated columns

Split the Text in Parts

split

Split column from separators

Calculate Length

strings

Extract the number of digits from a value (23562 -> 5)

Change Style to lower Case

strings

Converts all of the cell text in this column to lower case

Change Style to Title Case

strings

Converts the text content from this column to title case (i.e. "data prep" -> "Data Prep")

Change Style to UPPER Case

strings

Converts all of the cell text in this column to UPPER case (capitalize)

Extract Parts of the Text

strings

Extract some parts of the text (substring) and create a new column

Match similar text

strings

Create a new column with <i>true</i> or <i>false</i> regarding if the value is less or equals the Levenshtein distance of a given value

Matches Pattern

strings

Create a new column with <i>true</i> or <i>false</i> regarding if the value that matches or not a given pattern

Remove Part of the Text

strings

Remove specified text from cells in this column

Remove Whitespaces (Trailing and Leading)

strings

Remove trailing and leading spaces (i.e. trim)

Replace the Cells that Match

strings

Replace the cells that have a specific value

Add Extra Characters

strings advanced

Add extra characters (padding) on the left or on the right of the original value to match an expected size

Find and Group Similar Text

strings advanced

Replace all similar values with the right one (i.e. cluster on fuzzy matching)

Remove all non Alpha Numeric Characters

strings advanced

Cleans values that contain unexpected characters by removing all characters that are not 0-9 and Aa-Zz.

For example, T=al**en#d 20%1,7 will become Talend 2017.

Remove all non Numeric Characters

strings advanced

Cleans numeric values by removing all characters that are not 0-9. Decimal separators are kept.

For example, T=al**en#d 20%1,7 will become 201,7.

Simplify Text (remove case, accent, etc.)

strings advanced

Simplify the content of this column (ie: Fran├žois -> francois)