Scenario 1: Writing data in a delimited file - 6.1

Talend Components Reference Guide

This scenario describes a three-component Job that extracts certain data from a file holding customer information and then writes the extracted data to a delimited file.

In the following example, the input schema is already stored under the Metadata node in the Repository tree view. For more information about storing schema metadata in the Repository, see the Talend Studio User Guide.

Dropping and linking components

  1. In the Repository tree view, expand Metadata and then File delimited, browse to your input schema, customers, and drop it on the design workspace. A dialog box appears where you can select the component type you want to use.

  2. Click tFileInputDelimited and then OK to close the dialog box. A tFileInputDelimited component holding the name of your input schema appears on the design workspace.

  3. Drop a tMap component and a tFileOutputDelimited component from the Palette to the design workspace.

  4. Link the components together using Row > Main connections.

Configuring the components

Configuring the input component

  1. Double-click tFileInputDelimited to open its Basic settings view. All its property fields are filled in automatically because the input file is already defined in the Repository.

  2. If you have not defined your input file in the Repository tree view, select Built-in in the Property type list and fill in the details manually.

  3. Click the [...] button next to the File Name field and browse to the input file, customer.csv in this example.

    If the file path contains accented characters, you will get an error message when executing your Job. For the procedure to follow when support for accented characters is missing, see the Talend Installation Guide.

  4. In the Row Separator and Field Separator fields, enter "\n" and ";" respectively as the row and field separators.

  5. If needed, set the number of lines used as header and the number of lines used as footer in the corresponding fields and then set a limit for the number of processed rows.

    In this example, Header is set to 6 while Footer and Limit are not set.

  6. In the Schema field, the schema type is automatically set to Repository and the schema is already defined, because the input file is stored in the Repository in this example. Otherwise, select Built-in, click the [...] button next to Edit Schema to open the [Schema] dialog box, define the input schema, and then click OK to close the dialog box.
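To make the effect of these settings concrete, the following is a minimal plain-Java sketch of how a row separator, a field separator, and a header count shape the rows that are read. It is not the code Talend generates for tFileInputDelimited; the class name `DelimitedInputSketch`, the `parse` method, and the sample data are all hypothetical and exist only for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper, NOT Talend-generated code. It only illustrates what
// the Row Separator, Field Separator, and Header settings mean.
class DelimitedInputSketch {

    // Splits raw text into rows and fields, skipping the first `header` rows.
    static List<String[]> parse(String content, String rowSeparator,
                                String fieldSeparator, int header) {
        List<String[]> rows = new ArrayList<>();
        // Pattern.quote makes the separators literal text rather than regex.
        String[] lines = content.split(Pattern.quote(rowSeparator));
        for (int i = header; i < lines.length; i++) {
            if (lines[i].isEmpty()) {
                continue; // skip blank lines
            }
            // A limit of -1 keeps trailing empty fields such as "2;Bob;".
            rows.add(lines[i].split(Pattern.quote(fieldSeparator), -1));
        }
        return rows;
    }

    public static void main(String[] args) {
        // Hypothetical sample: six header lines followed by two data rows.
        String content = "h1\nh2\nh3\nh4\nh5\nh6\n"
                + "1;Ann;12 Main St\n"
                + "2;Bob;34 Oak Ave\n";
        for (String[] fields : parse(content, "\n", ";", 6)) {
            System.out.println(String.join(" | ", fields));
        }
    }
}
```

With Header set to 6, as in this scenario, the first six lines of the file are skipped and only the lines after them are treated as data.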

Configuring the mapping component

  1. In the design workspace, double-click tMap to open its editor.

  2. In the tMap editor, click the [+] button at the top of the panel on the right to open the [Add a new output table] dialog box.

  3. Enter a name for the table you want to create, row2 in this example.

  4. Click OK to validate your changes and close the dialog box.

  5. In the table to the left, row1, select the first three lines (Id, CustomerName and CustomerAddress) and drop them onto the table to the right.

  6. In the Schema editor view situated in the lower left corner of the tMap editor, change the type of RegisterTime to String in the table to the right.

  7. Click OK to save your changes and close the editor.
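Conceptually, the mapping configured above is a column projection: each input row keeps only its leading columns. The sketch below shows that idea in plain Java under the assumption that rows are held as `String[]` arrays. `MappingSketch` and `selectLeadingColumns` are hypothetical names, and the sample row is invented. tMap itself generates its own code and offers far more than this.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration of a column projection, NOT how tMap is implemented.
class MappingSketch {

    // Returns new rows that keep only the first `count` columns of each input row.
    // Rows shorter than `count` are padded with null by Arrays.copyOf.
    static List<String[]> selectLeadingColumns(List<String[]> input, int count) {
        List<String[]> output = new ArrayList<>();
        for (String[] row : input) {
            output.add(Arrays.copyOf(row, count));
        }
        return output;
    }

    public static void main(String[] args) {
        // Hypothetical input row with four columns; only the first three are kept.
        List<String[]> row1 = new ArrayList<>();
        row1.add(new String[] {"1", "Ann", "12 Main St", "2015-03-01"});
        for (String[] row : selectLeadingColumns(row1, 3)) {
            System.out.println(String.join(";", row));
        }
    }
}
```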

Configuring the output component

  1. In the design workspace, double-click tFileOutputDelimited to open its Basic settings view and define the component properties.

  2. In the Property Type field, set the type to Built-in and fill in the fields that follow manually.

  3. Click the [...] button next to the File Name field and browse to the output file you want to write data to, customerselection.txt in this example.

  4. In the Row Separator and Field Separator fields, set "\n" and ";" respectively as row and field separators.

  5. Select the Include Header check box if you want to output the column headers as well.

  6. Click Edit schema to open the schema dialog box and verify that the retrieved schema corresponds to the input schema. If not, click Sync Columns to retrieve the schema from the preceding component.
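As a rough illustration of what the output settings above produce, the following plain-Java sketch joins fields with the field separator, ends each row with the row separator, and optionally writes a header row first. It is an assumption-laden sketch, not the code behind tFileOutputDelimited: `DelimitedOutputSketch`, `format`, and the sample row are hypothetical, and the result is built as a string rather than written to customerselection.txt.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of delimited output, NOT Talend-generated code.
class DelimitedOutputSketch {

    // Builds the delimited text; the header row is written only if requested.
    static String format(String[] columnNames, List<String[]> rows,
                         String rowSeparator, String fieldSeparator,
                         boolean includeHeader) {
        StringBuilder out = new StringBuilder();
        if (includeHeader) {
            out.append(String.join(fieldSeparator, columnNames)).append(rowSeparator);
        }
        for (String[] row : rows) {
            out.append(String.join(fieldSeparator, row)).append(rowSeparator);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        String[] columns = {"Id", "CustomerName", "CustomerAddress"};
        // Hypothetical data row for illustration only.
        List<String[]> rows = new ArrayList<>();
        rows.add(new String[] {"1", "Ann", "12 Main St"});
        System.out.print(format(columns, rows, "\n", ";", true));
    }
}
```

Passing `false` for `includeHeader` corresponds to leaving the Include Header check box cleared, so only the data rows are emitted.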

Saving and executing the Job

  1. Press Ctrl+S to save your Job.

  2. Press F6 or click Run on the Run tab to execute the Job.

    The three specified columns Id, CustomerName and CustomerAddress are output in the defined output file.

For an example of how to use dynamic schemas with tFileOutputDelimited, see Scenario 4: Writing dynamic columns from a MySQL database to an output file.