Setting regular expressions and finalizing the analysis - Cloud - 8.0

Talend Studio User Guide

Version: Cloud 8.0
Language: English
Product: Talend Big Data, Talend Big Data Platform, Talend Cloud, Talend Data Fabric, Talend Data Integration, Talend Data Management Platform, Talend Data Services Platform, Talend ESB, Talend MDM Platform, Talend Real-Time Big Data Platform
Module: Talend Studio
Content: Design and Development
Last publication date: 2024-02-29
Available in...

Big Data Platform

Cloud API Services Platform

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Management Platform

Data Fabric

Data Management Platform

Data Services Platform

MDM Platform

Real-Time Big Data Platform

You can add one or more regular expressions to one or more of the analyzed columns.

Before you begin

An analysis of a delimited file is open in the analysis editor in the Profiling perspective.

Procedure

  1. Define the regular expression you want to add to the analyzed column.
    In this example, the regular expression checks for words that start with an uppercase letter: '[A-Z][a-z]*$'
    Configuration in the Pattern Definition section.
  2. In the open analysis editor, add the regular expression to the analyzed column, which is the first_name column in this example.
  3. Save the analysis and press F6 to execute it.
    If the file you are using is incorrectly formatted, an error message indicates which row causes the problem.
    The detailed analysis results view shows the graphics generated for the analyzed columns, accompanied by tables that detail the statistical results.
    The tables and graphics below, taken from the Analysis Results tab, show the results for the first_name column analyzed in the procedure outlined in Defining the columns to be analyzed in a file.
    Table and graphic showing the results of the Text Statistics indicator.
    Table and graphic showing the results of the Soundex Frequency indicator.
    Table and graphic showing the results of the Simple Statistics indicator.
    Table and graphic showing the results of the Pattern Matching indicator.
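
To get a feel for what the pattern from step 1 accepts before you run the analysis, you can try it outside the Studio. The following is a minimal Python sketch, not the Studio's own implementation: the function name and the sample values are hypothetical, and the pattern is written without the single quotes used in the Pattern Definition field. Because the pattern has no leading anchor, the outcome depends on whether the regular expression engine tests the whole value or searches inside it. This sketch shows both behaviors and makes no claim about which one the Studio applies.

```python
import re

# Pattern from step 1, without the surrounding single quotes.
PATTERN = re.compile(r"[A-Z][a-z]*$")


def pattern_matching_counts(values, whole_value=True):
    """Return (matching, non_matching) counts for a list of strings.

    whole_value=True  -> the entire value must match the pattern.
    whole_value=False -> a match anywhere in the value is enough.
    """
    test = PATTERN.fullmatch if whole_value else PATTERN.search
    matching = sum(1 for value in values if test(value))
    return matching, len(values) - matching


# Hypothetical first_name values, not taken from the guide's sample file.
sample = ["Alice", "bob", "Mary", "mcDonald", "JOHN", ""]

print(pattern_matching_counts(sample, whole_value=True))   # (2, 4)
print(pattern_matching_counts(sample, whole_value=False))  # (4, 2)
```

With whole-value matching, only Alice and Mary count as matches. With search-style matching, mcDonald and JOHN also count, because the pattern can match a trailing part of the value. If the Pattern Matching results contain unexpected matches, a leading ^ anchor in the expression may be worth trying.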