Scenario: Editing addresses and giving the verification status - 6.1

Talend Components Reference Guide

EnrichVersion
6.1
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for Data Quality
Talend Open Studio for ESB
Talend Open Studio for MDM
Talend Real-Time Big Data Platform
EnrichPlatform
Talend Studio
task
Data Governance
Data Quality and Preparation
Design and Development

Below is a five-component Job created in Talend Studio.

This Job:

  • reads an input csv file that holds some client-related information,

  • uses the tMap component to concatenate the three fields Address, Postal, and City from the incoming data flow in one output column: Edit_Address,

  • uses the tQASAddressRow component to analyze the output column Edit_Address and gives the verification status of all edited addresses,

  • uses a tFilterRow component to output only the addresses which status is not equal to None.

  • and finally displays the correct formatted address along with their verification status on the console.

In this scenario, we have already stored the input schema of the input file in the Repository. For more information about storing schema metadata in the Repository tree view, see Talend Studio User Guide.

Setting up the Job

  1. In the Repository tree view, expand Metadata and the file node where you have stored the input schemas and drop the relevant file onto the design workspace.

    The [Components] dialog box displays.

  2. Select tFileInputDelimited from the list and click OK to close the dialog box.

    The tFileInputDelimited component displays on the workspace. The input file used in this scenario is called address_template, which is a csv file that holds some French client personal information.

  3. Drop the following components from the Palette onto the design workspace: tMap, tQASAddressRow, tFilterRow, and tLogRow.

  4. Connect tFileInputDelimited to tMap and tQASAddressRow to tFilterRow using Main links, tMap to tQASAddressRow using the New Output link, and tFilterRow to tLogRow using the Filter link.

Configuring the components

  1. Double-click the tMap component to open the Map Editor and concatenate the Address, Postal, and City fields from the incoming data flow in one output column: Edit_Address.

    When done, click OK to close the Map Editor and propagate the changes to the next component.

  2. Double-click the tQASAddressRow component to display its Basic settings and define its properties.

  3. In the QAS WSDL url field, enter the URL for the QuickAdress XML document (provided by Experian QAS).

  4. On the Country list, select the country corresponding to your input addresses, France in this example.

  5. If needed, click Edit schema to view the input and output data flow. The output schema should include the Edit_Address column that hold the Address, Postal, and City initial input columns.

    The output schema of any of the QuickAddress components depends on the selected country in the Country list since every country has different address norms.

    Click OK to close the dialog box.

  6. On the Column to analyze list, select Edit_Address.

  7. Double-click the tFilterRow component to display its Basic settings view and define its properties.

  8. In the Conditions area, click the plus button to add one condition to the output flow and in the corresponding table cells:

    -select the input column you want to operate on,

    -select the needed function on the list,

    -select the operator to bind the input column with the value,

    -type in between the quotes the address value to be filtered.

    In this example, we want to exclude the addresses which status is equal to None.

  9. Double-click the tLogRow component to display its Basic settings and define its properties.

    In this example, and for clarity purposes, we want the result to display on the console in a separate key/value tabular list for each row.

Executing the Job

  • Save your Job an press F6 to execute it and display the result on the console.

    In the above result samples, the tQASAddressRow reads the input rows, corrects and formats the addresses, gives the result in the Edit_Address row, and gives the verification status in the Status row.