You can generate a ready-to-use Job from the results of a column analysis. This Job retrieves the valid rows, the invalid rows, or both, and writes them to output files or databases.
Prerequisite(s): A column analysis that uses patterns has been created and executed.
To generate a Job that retrieves the valid and invalid rows of the analyzed column, do the following:
Follow the steps outlined in How to define the columns to be analyzed and How to add a regular expression or an SQL pattern to a column analysis to create a column analysis that uses a pattern.
Execute the column analysis.
In the analysis editor, click the Analysis Results tab at the bottom of the editor to open the corresponding view.
The display of the Analysis Results view depends on the parameters you set in the [Preferences] window. For more information, see Setting preferences of analysis editors and analysis results.
Click Pattern Matching under the name of the analyzed column.
The chart generated for the pattern matching is displayed along with a table that details the matching results.
Right-click the pattern line in the Pattern Matching table and select Generate Jobs.
The [Job Selector] dialog box is displayed.
When you analyze the column using a pattern that is defined for a specific database, you will be able to generate ELT Jobs.
When you analyze the column using a pattern that is defined for the Java or the Default language, you will be able to generate an ETL Job.
For further information on how to create and define regular expressions or SQL patterns, see How to create a new regular expression or SQL pattern.
In the dialog box, select:
generate an ELT job to get only valid rows: to generate a Job that uses the Extract Load Transform process to write the valid rows of the analyzed column in an output file
generate an ELT job to get only invalid rows: to generate a Job that uses the Extract Load Transform process to write the invalid rows of the analyzed column in an output file
generate an ETL job to handle rows: to generate a Job that uses the Extract Transform Load process to write the valid/invalid rows of the analyzed column in output files
In this example, we select the generate an ETL job to handle rows option to generate a Job that writes the valid and invalid email rows to two separate output files.
In the dialog box, click Finish to proceed to the next step.
The Integration perspective opens on the generated Job.
If required, use different output components to write the valid/invalid rows to different types of files or to databases.
Save your Job and press F6 to execute it.
The valid and invalid email rows of the analyzed column are written to the defined output files.
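As a rough illustration of what an ETL-style split does conceptually, the following Java sketch partitions a few sample rows into valid and invalid sets with a regular expression and writes each set to its own file. It is not the code the Studio generates: the class name EmailRowSplitter, the simplified email pattern, the sample rows and the temporary file names are all assumptions made for this example.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;
import java.util.stream.Collectors;

final class EmailRowSplitter {
    // Hypothetical, simplified email pattern; not the pattern shipped with the Studio.
    static final Pattern EMAIL = Pattern.compile("^[\\w.+-]+@[\\w-]+(\\.[\\w-]+)+$");

    // Partition rows: key true holds the matching (valid) rows, key false holds the rest.
    static Map<Boolean, List<String>> split(List<String> rows) {
        return rows.stream()
                .collect(Collectors.partitioningBy(r -> r != null && EMAIL.matcher(r).matches()));
    }

    public static void main(String[] args) throws IOException {
        // Sample data, invented for the example.
        List<String> rows = List.of(
                "ann@example.com",
                "bob@example",
                "carol.smith@mail.example.org",
                "not an email");

        Map<Boolean, List<String>> parts = split(rows);

        // Two separate outputs, one per row type (temporary files for the sketch).
        Path valid = Files.createTempFile("valid_rows", ".csv");
        Path invalid = Files.createTempFile("invalid_rows", ".csv");
        Files.write(valid, parts.get(true));
        Files.write(invalid, parts.get(false));

        System.out.println("valid rows written to " + valid);
        System.out.println("invalid rows written to " + invalid);
    }
}
```

In the generated Job, the reading, matching and writing are handled by components in the Integration perspective; the sketch only mirrors the valid/invalid routing logic.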
The results in the output files may depend on whether the Job runs in ETL or ELT mode. In ETL mode, the data is matched against Java regular expressions, while in ELT mode it is matched against the regular expressions of the relevant database. Regular expression engines work differently in Java and in each DBMS, so the results may differ, all the more so if you defined different regular expressions for each language in the pattern editor.
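One well-known way in which engines can disagree is whole-value versus partial matching: Java's Matcher.matches() succeeds only if the entire value matches, whereas some database operators (MySQL's REGEXP, for example) succeed when the pattern occurs anywhere in the value. The sketch below reproduces both behaviors in plain Java, using find() to stand in for an unanchored database match. The class name RegexModeDemo, the five-digit pattern and the sample value are assumptions for illustration, and the sketch does not claim to show which call the generated Jobs actually use.

```java
import java.util.regex.Pattern;

final class RegexModeDemo {
    // Whole-value match: the pattern must cover the entire string.
    static boolean fullMatch(String regex, String value) {
        return Pattern.compile(regex).matcher(value).matches();
    }

    // Match anywhere in the value, emulating an unanchored database regex operator.
    static boolean partialMatch(String regex, String value) {
        return Pattern.compile(regex).matcher(value).find();
    }

    public static void main(String[] args) {
        String regex = "[0-9]{5}";          // hypothetical five-digit pattern
        String value = "zip 75011 Paris";   // sample value, invented for the example

        System.out.println(fullMatch(regex, value));    // prints false
        System.out.println(partialMatch(regex, value)); // prints true

        // Anchoring the pattern explicitly makes both modes agree.
        String anchored = "^[0-9]{5}$";
        System.out.println(partialMatch(anchored, value)); // prints false
    }
}
```

Anchoring patterns with ^ and $ is one way to reduce this kind of discrepancy, although differences in supported syntax between engines can still lead to different results.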