Automatically formatting data based on examples - Cloud

Talend Cloud Data Preparation User Guide

Talend Documentation Team
Talend Cloud
Talend Data Preparation
The Magic Fill function offers a convenient way to format data types that do not have a dedicated function, or to perform a succession of transformations with a single function.

Using a machine learning algorithm, this function identifies a pattern from a few examples that you define beforehand, and automatically applies the corresponding transformation to a whole column.

At the moment, the Magic Fill function only supports the following transformation types:

  • substring
  • addition of constants (numbers, letters, special characters)
  • case sensitivity
  • semantic transformation for countries, states, emails, URLs and months

For the function to work, you need to enter at least two examples of the transformation you want to apply. You can then add up to three more examples. The more examples you provide, the more accurately the function identifies the pattern.
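
To make the role of the examples more concrete, here is a minimal sketch in Python of how example-based inference can work in principle. It is not the algorithm used by Magic Fill, which is not described here. The candidate rules and the names `CANDIDATE_RULES`, `consistent_rules` and `fill_column` are hypothetical and chosen for illustration only. The sketch keeps the rules that reproduce every example, which shows why a second example can remove an ambiguity left by the first one.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical candidate rules, for illustration only.
# They loosely mirror the substring and case transformation types.
CANDIDATE_RULES: Dict[str, Callable[[str], str]] = {
    "upper": str.upper,
    "lower": str.lower,
    "title": str.title,
    "first_3_chars": lambda s: s[:3],
    "first_3_upper": lambda s: s[:3].upper(),
    "last_4_chars": lambda s: s[-4:],
}

Example = Tuple[str, str]  # (original value, expected result)


def consistent_rules(examples: List[Example]) -> List[str]:
    """Return the names of the rules that reproduce every example."""
    return [
        name
        for name, rule in CANDIDATE_RULES.items()
        if all(rule(source) == expected for source, expected in examples)
    ]


def fill_column(column: List[str], examples: List[Example]) -> List[str]:
    """Apply the first rule consistent with the examples to a whole column."""
    # Two to five examples: the minimum of two plus up to three more.
    if not 2 <= len(examples) <= 5:
        raise ValueError("provide between two and five examples")
    matches = consistent_rules(examples)
    if not matches:
        raise ValueError("no candidate rule matches all the examples")
    rule = CANDIDATE_RULES[matches[0]]
    return [rule(value) for value in column]


# A single example is ambiguous: two rules explain it.
print(consistent_rules([("abc", "ABC")]))
# A second example leaves only one rule.
examples = [("abc", "ABC"), ("germany", "GER")]
print(consistent_rules(examples))
print(fill_column(["france", "spain"], examples))
```

In this sketch, the single example matches both `upper` and `first_3_upper`. Adding the second example leaves only `first_3_upper`, so the column is filled with `FRA` and `SPA`.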

Data types such as dates or phone numbers have dedicated functions that can be used to easily change their format. However, full names, social security numbers or state codes, for example, do not. The following scenarios illustrate how to use the Magic Fill function to format your data in those cases.