Harmonizing the date format - 2.3

Talend Data Preparation Getting Started Guide

author
Talend Documentation Team

Talend Data Preparation supports many different date formats, which you can harmonize to improve your data.

In the SUBDATE column, the data matches the date semantic type but does not follow a single date format: European (day-first) and American (month-first) conventions coexist, as do the - and / separators.

You are going to harmonize the SUBDATE column so that all of its values use a single date format. To do so:

Procedure

  1. Click the header of the SUBDATE column to select its content.
  2. In the statistics box on the bottom right, click Pattern.

    This tab gives you a clearer view of the date formats currently in use. Some dates follow the European standard, while others follow the American one. In any case, you can see that the dd-MMM-yyyy format is the most commonly used.

  3. To standardize the date format, click Change Date Format... in the functions list.

    A menu opens, where you can specify the current date format and the desired one.

  4. In the Current format drop-down list, leave I don't know, best guess selected.
  5. In the New format drop-down list, select custom.
  6. In the Your format field, type dd-MMM-yyyy.

    The dd-MMM-yyyy format is the best choice because it already has the most occurrences in the column.
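
For readers who want to see what this kind of harmonization amounts to outside the tool, here is a minimal Python sketch. It is not how Talend Data Preparation implements its best-guess detection. The list of candidate formats, their priority order, and the function name harmonize_date are assumptions made for illustration. The sketch also assumes that the dd-MMM-yyyy pattern corresponds to Python's %d-%b-%Y, with English month abbreviations.

```python
from datetime import datetime

# Hypothetical list of input formats to try, in priority order.
# Day-first (European) formats come before month-first (American) ones,
# so an ambiguous value such as 03/04/2015 is read as 3 April.
CANDIDATE_FORMATS = [
    "%d-%b-%Y",  # 25-Dec-2015 (dd-MMM-yyyy)
    "%d/%m/%Y",  # 25/12/2015 (European, slash)
    "%d-%m-%Y",  # 25-12-2015 (European, hyphen)
    "%m/%d/%Y",  # 12/25/2015 (American, slash)
    "%m-%d-%Y",  # 12-25-2015 (American, hyphen)
    "%Y-%m-%d",  # 2015-12-25 (ISO 8601)
]

# Assumed Python equivalent of the dd-MMM-yyyy target pattern.
# %b depends on the locale; the default C locale gives English names.
TARGET_FORMAT = "%d-%b-%Y"


def harmonize_date(value):
    """Return value rewritten in TARGET_FORMAT, or unchanged if no format matches."""
    text = value.strip()
    for fmt in CANDIDATE_FORMATS:
        try:
            parsed = datetime.strptime(text, fmt)
        except ValueError:
            continue  # this format does not fit, try the next one
        return parsed.strftime(TARGET_FORMAT)
    return value  # leave unrecognized values untouched


if __name__ == "__main__":
    for raw in ["25/12/2015", "12/25/2015", "25-Dec-2015", "not a date"]:
        print(raw, "->", harmonize_date(raw))
```

The priority order matters only for ambiguous values where both the day and the month are 12 or lower. Any fixed order is a guess for those values, which is why reviewing the result after the change remains worthwhile.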

Results

The SUBDATE column now follows a single date format, which makes it easier to read. The recipe also highlights your last action, and you can modify the date format again directly from the recipe.
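
If you export the column and want to confirm the result independently, the following Python sketch counts coarse patterns in a list of values, in the spirit of the Pattern tab. The sample values are made up for illustration, and the 9/a notation is an arbitrary choice for this sketch, not necessarily how the Pattern tab displays patterns.

```python
import re
from collections import Counter


def date_pattern(value):
    """Reduce a value to a coarse pattern: ASCII digits become 9, ASCII letters become a."""
    pattern = re.sub(r"[0-9]", "9", value)
    return re.sub(r"[A-Za-z]", "a", pattern)


def pattern_counts(values):
    """Count how many values share each coarse pattern."""
    return Counter(date_pattern(v) for v in values)


# Hypothetical sample values, before and after harmonization.
before = ["25-Dec-2015", "12/25/2015", "25/12/2015", "03-Jan-2016"]
after = ["25-Dec-2015", "25-Dec-2015", "25-Dec-2015", "03-Jan-2016"]

if __name__ == "__main__":
    print(pattern_counts(before).most_common())
    print(pattern_counts(after).most_common())
```

A harmonized column yields a single pattern. Note that this coarse check cannot tell day-first from month-first values that share the same separators, so it confirms the shape of the values, not their meaning.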