Example Job implementing the different match models - 7.1

tMap

Version
7.1
Language
English (United States)
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for ESB
Talend Open Studio for MDM
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Processing components (Integration) > tMap
Data Quality and Preparation > Third-party systems > Processing components (Integration) > tMap
Design and Development > Third-party systems > Processing components (Integration) > tMap

Source data

The main source reads like:

ID Name
1 Shong
2 Elisa
3 Sabrina

The Lookup source reads as follows:

ID Email
1 Shong1@talend.com
1 Shong2@talend.com
2 Elisa@talend.com
3 Sabrina@talend.com

Now we plan to perform an inner join between the main source and the lookup source, and to produce the data structure as below based on the two sources.

ID Name Email

The result varies depending on the match model to be used.

Creating the Job

We use a tFixedFlowInput component to generate the main source.

And use a second tFixedFlowInput component to generate the lookup source.

Use tMap to perform the inner join, and output the result to a tLogRow component (with Table mode) that prints the result on the console.

Using the match models to generate different results

Unique match: this is the default option for the JOIN operation. It outputs the last matching record of the lookup source.

The result of the JOIN by the Unique match model reads as follows:

Starting job tMap_Match_modes at 17:46 25/09/2013.

[statistics] connecting to socket on port 3367
[statistics] connected
.--+-------+-------------------.
|          tLogRow_2           |
|=-+-------+------------------=|
|ID|Name   |Email              |
|=-+-------+------------------=|
|1 |Shong  |Shong2@talend.com  |
|2 |Elisa  |Elisa@talend.com   |
|3 |Sabrina|Sabrina1@talend.com|
'--+-------+-------------------'
[statistics] disconnected
Job tMap_Match_modes ended at 17:46 25/09/2013. [exit code=0]

First match: it outputs the first matching record of the lookup source.

The result of the JOIN by the First match model reads as follows:

Starting job tMap_Match_modes at 17:51 25/09/2013.

[statistics] connecting to socket on port 3942
[statistics] connected
.--+-------+-------------------.
|          tLogRow_2           |
|=-+-------+------------------=|
|ID|Name   |Email              |
|=-+-------+------------------=|
|1 |Shong  |Shong1@talend.com  |
|2 |Elisa  |Elisa@talend.com   |
|3 |Sabrina|Sabrina1@talend.com|
'--+-------+-------------------'
[statistics] disconnected
Job tMap_Match_modes ended at 17:51 25/09/2013. [exit code=0]

All match: it outputs all matching records of the lookup source.

The result of the JOIN by the All match model reads as follows:

Starting job tMap_Match_modes at 17:58 25/09/2013.

[statistics] connecting to socket on port 3381
[statistics] connected
.--+-------+-------------------.
|          tLogRow_2           |
|=-+-------+------------------=|
|ID|Name   |Email              |
|=-+-------+------------------=|
|1 |Shong  |Shong1@talend.com  |
|1 |Shong  |Shong2@talend.com  |
|2 |Elisa  |Elisa@talend.com   |
|3 |Sabrina|Sabrina1@talend.com|
'--+-------+-------------------'
[statistics] disconnected
Job tMap_Match_modes ended at 17:58 25/09/2013. [exit code=0]