Recuperating matching and non-matching rows - Cloud - 7.3

Talend Studio User Guide

Last publication date
2024-02-13
Available in...

Big Data Platform

Cloud API Services Platform

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Management Platform

Data Fabric

Data Management Platform

Data Services Platform

MDM Platform

Real-Time Big Data Platform

When you add patterns to an analysis of a set of columns (simple table analysis), the result chart shows the percentage of values across the analyzed columns that match all of the patterns used, not just one of them.
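
The "all patterns" rule can be illustrated with a minimal Python sketch. This is not code produced by Talend Studio: the column names, the regular expressions, and the sample rows are hypothetical, and the sketch assumes that each pattern is attached to one column and that a row counts as a match only when every patterned column matches its pattern.

```python
import re

# Hypothetical patterns, one per analyzed column (illustrative names only).
PATTERNS = {
    "email": re.compile(r"[^@\s]+@[^@\s]+\.[a-z]{2,}"),
    "zip": re.compile(r"\d{5}"),
}


def matches_all(row, patterns=PATTERNS):
    """Return True only if every patterned column matches its pattern."""
    return all(
        pattern.fullmatch(str(row.get(column, ""))) is not None
        for column, pattern in patterns.items()
    )


def all_match_percentage(rows, patterns=PATTERNS):
    """Percentage of rows that match all patterns (0.0 for an empty input)."""
    rows = list(rows)
    if not rows:
        return 0.0
    matched = sum(1 for row in rows if matches_all(row, patterns))
    return 100.0 * matched / len(rows)


# Hypothetical sample data.
SAMPLE_ROWS = [
    {"email": "a@example.com", "zip": "75001"},  # matches both patterns
    {"email": "b@example.com", "zip": "7500"},   # email only
    {"email": "not-an-email", "zip": "69002"},   # zip only
    {"email": "c@example.org", "zip": "13008"},  # matches both patterns
]

if __name__ == "__main__":
    print(all_match_percentage(SAMPLE_ROWS))
```

In this sample, the two rows that match only one of the patterns are counted as non-matching, so only two of the four rows contribute to the matching percentage.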

After you execute the analysis of a set of columns, you can generate ready-to-use Jobs that retrieve the matching and non-matching rows and write them to output files or databases.

Before you begin

An analysis of a set of columns that uses patterns has been created and executed in the Profiling perspective. For more information, see Creating an analysis of a set of columns using patterns.

Procedure

  1. Execute the analysis of the set of columns.
  2. In the Analysis Results view, click All Match to open the corresponding view.
    The generated chart is a single bar chart that covers all the patterns used. It shows the number of rows that match "all" the patterns and the number of rows that do not, and it is accompanied by a table that details the matching results.
  3. Right-click the pattern line in the All Match table and select Generate an ETL Job to handle rows. The Integration perspective opens and displays the generated Job.
    This Job uses the Extract, Transform, Load (ETL) process to write the rows of the analyzed columns to two separate output files: the valid rows, which match "all" the patterns, and the invalid rows, which do not.
  4. If required, use different output components to retrieve the valid or invalid rows in different types of files or databases.
  5. Save your Job and press F6 to execute it. The valid and invalid rows of the analyzed columns are written to the defined output files.
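
The outcome of the procedure above, valid rows in one output and invalid rows in another, can be sketched outside Talend Studio as follows. The generated Job is built from Studio components and this is not its code. The sketch only mirrors the routing logic, under the assumption that a row is valid when every patterned column matches its pattern. The patterns, the sample rows, and the file names `valid_rows.csv` and `invalid_rows.csv` are hypothetical.

```python
import csv
import re

# Hypothetical patterns, one per analyzed column (illustrative names only).
COLUMN_PATTERNS = {
    "email": re.compile(r"[^@\s]+@[^@\s]+\.[a-z]{2,}"),
    "zip": re.compile(r"\d{5}"),
}

# Hypothetical input rows.
INPUT_ROWS = [
    {"email": "a@example.com", "zip": "75001"},
    {"email": "b@example.com", "zip": "7500"},
    {"email": "not-an-email", "zip": "69002"},
]


def split_rows(rows, patterns):
    """Route each row to the valid or invalid list.

    A row is valid only if every patterned column matches its pattern.
    """
    valid, invalid = [], []
    for row in rows:
        is_valid = all(
            pattern.fullmatch(str(row.get(column, ""))) is not None
            for column, pattern in patterns.items()
        )
        (valid if is_valid else invalid).append(row)
    return valid, invalid


def write_rows(path, rows, fieldnames):
    """Write rows to a CSV file with a header line."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)


if __name__ == "__main__":
    valid_rows, invalid_rows = split_rows(INPUT_ROWS, COLUMN_PATTERNS)
    columns = list(COLUMN_PATTERNS)
    write_rows("valid_rows.csv", valid_rows, columns)      # hypothetical file name
    write_rows("invalid_rows.csv", invalid_rows, columns)  # hypothetical file name
```

Keeping the routing in `split_rows` separate from the writing in `write_rows` reflects step 4: the destination can change, for example to a database, without touching the matching logic.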