Adding a regular expression or an SQL pattern to a column analysis - Cloud

Talend Cloud API Services Platform Studio User Guide

author
Talend Documentation Team
EnrichVersion
Cloud
EnrichProdName
Talend Cloud
task
Design and Development
EnrichPlatform
Talend Management Console
Talend Studio

About this task

You can add to any column analysis one or more regular expressions or SQL patterns against which you can match the content of the column to be analyzed.

Warning: If the database you are using does not support regular expressions or if the query template is not defined in the studio, you need first to declare the user defined function and define the query template before being able to add any of the specified patterns to the column analysis. For more information, see Managing User-Defined Functions in databases.

Prerequisite(s): You have selected the Profiling perspective in the studio. A column analysis is open in the analysis editor.

To add a regular expression or an SQL pattern to a column analysis, do the following:

Procedure

  1. Follow the steps outlined in Defining the columns to be analyzed to create a column analysis.
  2. In the Analyze Columns view in the analysis editor, click the icon next to the column name to which you want to add a regular expression or an SQL pattern, the email column in this example.
    The Pattern Selector dialog box opens.
  3. Expand Patterns and browse to the regular expression or/and the SQL patterns you want to add to the column analysis.
  4. Select the check box(es) of the expression(s) or pattern(s) you want to add to the selected column.
  5. Click OK to proceed to the next step.
    The added regular expression(s) or SQL pattern(s) are displayed under the analyzed column in the Analyzed Column list.
    You can add a regular expression or an SQL pattern to a column simply by a drag and drop operation from the DQ Repository tree view onto the analyzed column.
  6. Save the analysis and press F6 to execute it.
    The editor switches to the Analysis result view. The results of the column analysis include those for pattern matching.

Results

If the regular expression you add to the column analysis is defined for a database, you will be able to generate ELT Jobs to recuperate valid and invalid rows.

If the regular expression you add to the column analysis is defined for the Java or the Default language, you will be able to generate an ETL Job to handle rows.

For further information on how to generate a Job to recuperate valid, invalid or both types of rows, see Recuperating valid and invalid rows in a column analysis. For further information on defining regular expressions, see Creating a new regular expression or SQL pattern