Step 3: Reference file definition, remapping, inner join mode selection - 7.3

Data Integration Job Examples

Version: 7.3
Language: English
Product: Talend Big Data, Talend Big Data Platform, Talend Data Fabric, Talend Data Integration, Talend Data Management Platform, Talend Data Services Platform, Talend ESB, Talend MDM Platform, Talend Real-Time Big Data Platform
Module: Talend Studio
Content: Design and Development > Designing Jobs
Last publication date: 2024-02-13

Procedure

  1. Using the wizard, define the metadata for the LosAngelesandOrangeCounties.txt file as described in Step 6 of Step 1: Job creation, input definition, file reading.

    At Step 1 of the wizard, name this metadata entry LA_Orange_cities.

  2. Drop the newly created metadata entry onto the top of the design area to automatically create a reading component that points to this metadata.
  3. Link this component to the tMap component.
  4. Double-click the tMap component again to open its interface. Note that the reference input table (row2), which corresponds to the LA and Orange county file, appears on the left of the window, directly under the main input (row1).
  5. Define the join between the main flow and the reference flow.

    Drop the City column from the main input table (row1) onto the City column of the reference table (row2) to establish a join between the two files. A violet link appears to represent this join.

  6. Drop the County column from the left section (row2) onto the Expression field of the County column in the right section (out1).
  7. Click the OK button to validate your changes and run the new Job.

    The Job output is displayed on the console.

    In the last column, some records contain Los Angeles or Orange while others are empty. This is because tMap uses a left outer join by default. To display only the lines for which tMap finds a match, open the tMap again, click the tMap settings button, and select Inner Join in the Join Model list of the reference table (row2).
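
The difference between the two join modes can be sketched outside Talend Studio. The following Python snippet is an illustration only, not the code that tMap generates: the function name join_on_city, the sample rows, and the assumption that each city appears only once in the reference file are all hypothetical. It joins a main flow to a reference flow on the City column, first as a left outer join and then as an inner join.

```python
# Illustrative sketch only: plain Python, not code generated by Talend Studio.
# The function name and the sample rows are hypothetical.

def join_on_city(main_rows, lookup_rows, inner=False):
    """Join main_rows to lookup_rows on the City column.

    inner=False behaves like a left outer join: main rows without a
    match are kept with an empty County.
    inner=True behaves like an inner join: main rows without a match
    are dropped.
    """
    # Index the reference flow by its join key.
    # Assumption: each city appears only once in the reference flow.
    county_by_city = {row["City"]: row["County"] for row in lookup_rows}

    joined = []
    for row in main_rows:
        county = county_by_city.get(row["City"])
        if county is None and inner:
            continue  # inner join: skip rows without a match
        joined.append({**row, "County": county or ""})
    return joined


# Main flow (stands in for row1) and reference flow (stands in for row2).
main_rows = [
    {"Name": "Customer A", "City": "Anaheim"},
    {"Name": "Customer B", "City": "Sacramento"},
    {"Name": "Customer C", "City": "Long Beach"},
]
lookup_rows = [
    {"City": "Anaheim", "County": "Orange"},
    {"City": "Long Beach", "County": "Los Angeles"},
]

left_outer = join_on_city(main_rows, lookup_rows)              # 3 rows, one with an empty County
inner_only = join_on_city(main_rows, lookup_rows, inner=True)  # 2 rows, all with a County

for row in left_outer:
    print(row)
```

In this sketch, the left outer join keeps the Sacramento row with an empty County, which mirrors the empty last column seen in the console output, while the inner join returns only the rows whose city is found in the reference flow.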