Creating a Job script to validate email addresses using custom code components - 6.5

Talend Job Script Reference Guide


This example shows how to write a Job script that defines a Job for validating email addresses. The Job first identifies duplicate email addresses, then validates the deduplicated addresses, and displays the deduplication and validation results on the console.

The Job will contain the following components:

  • a tFileInputDelimited component, to read the source file that contains valid, invalid, and duplicate email addresses, as shown below.
    jjohnson@company com
    jjohnson@company com
  • a tUniqRow component, to identify duplicate addresses.

  • two tJavaRow components, to validate the deduplicated addresses and output the deduplication and validation results.

  • a tLibraryLoad component, to load the Java library required to execute the Java code for email address validation.
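
The deduplicate-then-validate flow that these components implement can be sketched in plain Java, the language used inside tJavaRow components. This is only an illustration of the logic, not Job script syntax and not the code the Job actually runs. The class name, the method names, the regular expression, and the address asmith@example.com are all assumptions made for this sketch. The sketch also uses only the standard java.util.regex package, whereas the Job loads an external library through tLibraryLoad.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.regex.Pattern;

// Illustrative sketch only: plain Java, not Talend Job script syntax.
class EmailCheckSketch {

    // Assumed, deliberately simple pattern: local part, "@", one or more
    // dot-separated domain labels, and a final label of two or more letters.
    private static final Pattern EMAIL = Pattern.compile(
            "^[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(\\.[A-Za-z0-9-]+)*\\.[A-Za-z]{2,}$");

    // Keeps the first occurrence of each address and preserves input order,
    // which roughly mirrors the "uniques" output of a deduplication step.
    static List<String> deduplicate(List<String> addresses) {
        return new ArrayList<>(new LinkedHashSet<>(addresses));
    }

    // Returns true when the address matches the assumed pattern above.
    static boolean isValidEmail(String address) {
        return address != null && EMAIL.matcher(address).matches();
    }

    public static void main(String[] args) {
        // The first two entries come from the sample shown above;
        // the third is a hypothetical well-formed address.
        List<String> input = Arrays.asList(
                "jjohnson@company com",
                "jjohnson@company com",
                "asmith@example.com");

        for (String address : deduplicate(input)) {
            String verdict = isValidEmail(address) ? "valid" : "invalid";
            System.out.println(address + " -> " + verdict);
        }
    }
}
```

Run as is, the sketch prints one line per unique address: `jjohnson@company com -> invalid` followed by `asmith@example.com -> valid`. Unlike tUniqRow, which can also route the duplicate rows to a separate output, this sketch simply discards them.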