Skip to main content

Extracting only the data that corresponds to a defined pattern from a delimited file

This scenario applies only to Talend Data Management Platform, Talend Big Data Platform, Talend Real-Time Big Data Platform, Talend MDM Platform, Talend Data Services Platform, Talend MDM Platform and Talend Data Fabric.

For more technologies supported by Talend, see Talend components.

This scenario describes a four-component Job where the tExtractPattern component is used to extract only customers' email addresses (that match the Email address pattern) from a delimited file that holds different customer data. Then it writes the extracted data into another delimited file. A tFilterColumns component is used to adapt the output schema.

In this scenario, the delimited file holds names, email addresses and telephone numbers, all in a single column: Name_Telephone_Address. The following shows an extract of the input file:

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!