Scenario: Transforming data flow to a list - 6.1

Talend Components Reference Guide

EnrichVersion
6.1
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for Data Quality
Talend Open Studio for ESB
Talend Open Studio for MDM
Talend Real-Time Big Data Platform
task
Data Governance
Data Quality and Preparation
Design and Development
EnrichPlatform
Talend Studio

The following scenario describes a Job that reads a list of files from a defined input file, iterates on each of the files and displays their content row by row on the Run console.

Setting up the Job

  1. Drop the following components from the Palette onto the design workspace: two tFileInputDelimited components, a tFlowToIterate, and a tLogRow.

  2. Connect the first tFileInputDelimited to tFlowToIterate using a Row > Main link, tFlowToIterate to the second tFileInputDelimited using an Iterate link, and the second tFileInputDelimited to tLogRow using a Row > Main link.

Configuring the Components

  1. Double-click the first tFileInputDelimited to display its Basic settings view.

  2. Click the [...] button next to the File Name field to select the path to the input file.

    Note

    The File Name field is mandatory.

    The input file used in this scenario is Customers.txt. It is a text file that contains a list of names of three other simple text files: Name.txt, E-mail.txt and Address.txt. The first text file, Name.txt, is made of one column holding customers' names. The second text file, E-mail.txt, is made of one column holding customers' e-mail addresses. The third text file, Address.txt, is made of one column holding customers' postal addresses.

    Fill in all other fields as needed. For more information, see tFileInputDelimited properties. In this scenario, the header and the footer are not set and there is no limit for the number of processed rows.

  3. Click Edit schema to describe the data structure of this input file. In this scenario, the schema is made of one column, FileName.

  4. Double-click tFlowToIterate to display its Basic settings view.

    Click the plus button to add new parameter lines and define your variables, and click in the key cell to enter the variable name as desired. In this scenario, one variable is defined: "Name_of_File".

    Alternatively, you can select the Use the default (key, value) in global variables check box to use the default in global variables.

  5. Double-click the second tFileInputDelimited to display its Basic settings view.

    In the File name field, enter the directory of the files to be read, and then press Ctrl+Space to select the global variable "Name_of_File". In this scenario, the syntax is as follows:

    "C:/scenario/flow_to_iterate/"+((String)globalMap.get("Name_of_File"))

    Click Edit schema to define the schema column name. In this scenario, it is RowContent.

    Fill in all other fields as needed. For more information, see tFileInputDelimited properties.

  6. In the design workspace, select the last component, tLogRow, and click the Component tab to define its basic settings.

    Define your settings as needed. For more information, see tLogRow properties.

Saving and executing the Job

  1. Save your Job by pressing Ctrl+S.

  2. Execute the Job by pressing F6 or clicking Run on the Run tab.

Customers' names, customers' e-mails, and customers' postal addresses appear on the console preceded by the schema column name.