Creating a Job script to validate email addresses using custom code components - 6.5

Talend Job Script Reference Guide

author: Talend Documentation Team
EnrichVersion: 6.5
EnrichProdName: Talend Big Data, Talend Big Data Platform, Talend Data Fabric, Talend Data Integration, Talend Data Management Platform, Talend Data Services Platform, Talend ESB, Talend MDM Platform, Talend Real-Time Big Data Platform
task: Design and Development > Designing Jobs
EnrichPlatform: Talend CommandLine, Talend Studio

This example shows how to write a Job script that defines a Job to validate email addresses. The Job will first identify duplicate email addresses, then validate the deduplicated addresses, and finally display the deduplication and validation results on the console.

The Job will contain the following components:

  • a tFileInputDelimited component, to read the source file, which contains valid, invalid, and duplicate email addresses, as shown below.

    jsmith@company.com
    rmartin.company.com
    sgoodman@companycom
    gbush@company.com
    jjohnson@company com
    jsmith@company.com
    jjohnson@company com
    sgoodman@companycom

  • a tUniqRow component, to identify duplicate addresses.

  • two tJavaRow components, to validate the deduplicated addresses and output the deduplication and validation results.

  • a tLibraryLoad component, to load the Java library required to execute the Java code for email address validation.
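
The logic that these components implement together can be sketched in plain Java, outside of any Job. This is only an illustrative sketch, not the code used in the Job: the class name EmailCheckSketch, its method names, and the simplified regular expression are all assumptions made for this example, and the regular expression stands in for whatever validation library the Job actually loads through tLibraryLoad. The sketch separates first occurrences from repeats (the role of tUniqRow), then checks each unique address and prints the results (the role of the tJavaRow components).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Pattern;

public class EmailCheckSketch {

    // Assumption: a deliberately simple pattern (local part, "@", and a domain
    // containing at least one dot). It is a stand-in for a real validation library.
    private static final Pattern SIMPLE_EMAIL =
            Pattern.compile("^[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(\\.[A-Za-z0-9-]+)+$");

    // Returns true if the address matches the simplified pattern above.
    public static boolean isValid(String address) {
        return address != null && SIMPLE_EMAIL.matcher(address).matches();
    }

    // Returns the first occurrence of each address, in input order.
    // Every repeated occurrence is appended to the 'duplicates' list.
    public static List<String> deduplicate(List<String> input, List<String> duplicates) {
        Set<String> seen = new LinkedHashSet<>();
        for (String address : input) {
            if (!seen.add(address)) {
                duplicates.add(address);
            }
        }
        return new ArrayList<>(seen);
    }

    public static void main(String[] args) {
        // The same sample addresses as in the source file shown above.
        List<String> source = Arrays.asList(
                "jsmith@company.com",
                "rmartin.company.com",
                "sgoodman@companycom",
                "gbush@company.com",
                "jjohnson@company com",
                "jsmith@company.com",
                "jjohnson@company com",
                "sgoodman@companycom");

        List<String> duplicates = new ArrayList<>();
        List<String> uniques = deduplicate(source, duplicates);

        for (String address : duplicates) {
            System.out.println("duplicate: " + address);
        }
        for (String address : uniques) {
            System.out.println(address + " -> " + (isValid(address) ? "valid" : "invalid"));
        }
    }
}
```

With the sample data, this sketch reports three duplicates and five unique addresses, of which only jsmith@company.com and gbush@company.com pass the simplified check. A real validation library may apply stricter or more lenient rules than this pattern.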