Big Data Platform
Cloud API Services Platform
Cloud Big Data Platform
Cloud Data Fabric
Cloud Data Management Platform
Data Management Platform
Data Services Platform
Real-Time Big Data Platform
You can add to any column analysis one or more regular expressions or SQL patterns against which you can match the content of the column to be analyzed.
If the database you are using does not support regular expressions or if the query template is not defined in the Studio, you need first to declare the user defined function and define the query template before being able to add any of the specified patterns to the column analysis.
For more information, see Managing User-Defined Functions in databases.
Before you begin
You have selected the Profiling perspective.
A column analysis is open in the analysis editor.
In the Analyze Columns
view in the analysis editor, click the icon next to the column name
to which you want to add a regular expression or an SQL pattern, the email column in this example.
The Pattern Selector dialog box opens.
- Expand Patterns and browse to the regular expression or/and the SQL patterns you want to add to the column analysis.
- Select the check box(es) of the expression(s) or pattern(s) you want to add to the selected column.
Click OK to proceed to
the next step.
The added regular expression(s) or SQL pattern(s) are displayed under the analyzed column in the Analyzed Column list.You can add a regular expression or an SQL pattern to a column simply by a drag and drop operation from the DQ Repository tree view onto the analyzed column.
Save the analysis and press F6 to execute it.
The editor switches to the Analysis result view. The results of the column analysis include those for pattern matching.
If the regular expression you add to the column analysis is defined for a database, you will be able to generate ELT Jobs to recuperate valid and invalid rows.
If the regular expression you add to the column analysis is defined for the Java or the Default language, you will be able to generate an ETL Job to handle rows.