Harmonizing the date format - 2.0

Talend Data Preparation Getting Started Guide

Author: Talend Documentation Team

Talend Data Preparation supports many different date formats, which you can harmonize to improve your data.

In the SUBDATE column, you can see that although the data matches the date semantic type, the values do not all follow the same date format. European and American day and month orders coexist, as do the - and / separators.
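To see how such a mix of formats can be spotted outside the tool, here is a minimal Python sketch that counts character patterns in a column. It is not how Talend Data Preparation computes its patterns. The `char_pattern` helper and the sample values are hypothetical and are not taken from the guide's dataset.

```python
from collections import Counter


def char_pattern(value: str) -> str:
    """Map digits to '9' and letters to 'a', keeping separators as-is."""
    out = []
    for ch in value:
        if ch.isdigit():
            out.append("9")
        elif ch.isalpha():
            out.append("a")
        else:
            out.append(ch)
    return "".join(out)


# Hypothetical sample values standing in for the SUBDATE column.
sample = ["12-Jan-2015", "03/25/2014", "25/03/2014", "07-Feb-2016", "30-Nov-2013"]

# Count how many values share each character pattern.
pattern_counts = Counter(char_pattern(v) for v in sample)

for pattern, count in pattern_counts.most_common():
    print(pattern, count)
```

Note that a purely character-based pattern such as 99/99/9999 cannot tell a European date (day first) from an American one (month first). Both fall into the same bucket.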

You are going to harmonize the SUBDATE column and set a single date format for all your data. To do so:

Procedure

  1. Click the header of the SUBDATE column to select its content.
  2. In the statistics box on the bottom right, click Pattern.

    This tab gives you a better view of the different date formats currently used. Some dates follow the European format, while others follow the American format. In any case, you can see that the dd-MMM-yyyy format is the most commonly used.

  3. To standardize the date format, click Change Date Format... in the functions list.

    A menu opens in which you can select one of the suggested date formats or enter your own.

  4. From the list of formats, select other and type dd-MMM-yyyy in the Your Format field.

    The dd-MMM-yyyy format is the best choice because it already has the most occurrences in the column.
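For readers who want to reproduce the effect of this procedure in a script, the following Python sketch converts mixed date strings to one format. It is an illustration under stated assumptions, not the implementation behind Change Date Format. The `harmonize` function and the `CANDIDATE_FORMATS` list are hypothetical. `%d-%b-%Y` is the Python `strftime` equivalent of dd-MMM-yyyy, and the month abbreviations assume the default (English) locale.

```python
from datetime import datetime

# Python strftime equivalent of the dd-MMM-yyyy pattern used in the guide.
TARGET_FORMAT = "%d-%b-%Y"

# Assumed input formats, tried in order. The American format is tried before
# the European one, so an ambiguous value such as 04/05/2014 is read as April 5.
CANDIDATE_FORMATS = ["%d-%b-%Y", "%m/%d/%Y", "%d/%m/%Y"]


def harmonize(value: str) -> str:
    """Return value rewritten in TARGET_FORMAT, or unchanged if unparseable."""
    for fmt in CANDIDATE_FORMATS:
        try:
            return datetime.strptime(value, fmt).strftime(TARGET_FORMAT)
        except ValueError:
            continue  # this format does not match; try the next one
    return value


# Hypothetical sample values standing in for the SUBDATE column.
sample = ["12-Jan-2015", "03/25/2014", "25/03/2014", "n/a"]
harmonized = [harmonize(v) for v in sample]
print(harmonized)
```

The order of `CANDIDATE_FORMATS` is a design choice. A value whose day is 12 or lower is valid under both the American and the European format, so whichever format comes first in the list decides how it is interpreted.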

Results

The SUBDATE column now follows a single date format, which makes it easier to read. The recipe also highlights your last action, and you can modify the date format again directly from the recipe.