Configuring the inner join - Cloud - 8.0


Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for ESB
Talend Real-Time Big Data Platform
Talend Studio
Data Governance > Third-party systems > Processing components (Integration) > tMap
Data Quality and Preparation > Third-party systems > Processing components (Integration) > tMap
Design and Development > Third-party systems > Processing components (Integration) > tMap


  1. Double-click the Resellers component to display its Basic settings view.
  2. Select Repository from the Property type list and select the component's schema, resellers in this scenario, from the Repository Content dialog box. The rest fields are automatically filled.

    In this scenario, the input schemas are stored in the Metadata node of the Repository tree view for easy retrieval. For further information regarding metadata creation in the Repository, see Talend Studio User Guide.

  3. Double-click the tMap component to open the Map Editor.
    Note that the schema of the new input component is already added in the Input area.
  4. Create a join between the main input flow and the new input flow by dropping the ID_Reseller column of the Cars_data table to the ID_Reseller column of the Resellers_data table.
  5. Click the tMap settings button at the top of the Resellers_data table and set Join Model to Inner Join.
  6. Drag all the columns except ID_Reseller of the Resellers_data table to the main output table, Insured.

    When two inner joins are defined, you either need to define two different inner join reject tables to differentiate the two rejections or, if there is only one inner join reject output, both inner join rejections will be stored in the same output.

  7. Click the [+] button at the top of the output area to add a new output table, and name this new output table Reject_ResellerID.
  8. Drag all the columns of the Cars_data table to the Reject_ResellerID table.
  9. Click the tMap settings button and select Catch lookup inner join reject to true to define this new output table as an inner join reject output.
    If the defined inner join cannot be established, the information about the relevant cars will be gathered through this output flow.
  10. Now apply filters on the two Inner Join reject outputs, in order for to distinguish the two types of rejection.
    In the first Inner Join output table, Reject_OwnerID, click the plus arrow button to add a filter line and fill it with the following formula to gather only owner ID related rejection: Owners_data.ID_Owner==null
  11. In the second Inner Join output table, Reject_ResellerID, repeat the same operation using the following formula: Resellers_data.ID_Reseller==null
    Click OK to validate the map settings and close the Mapper Editor.
  12. Double-click the No_Reseller_ID component to display its Basic settings view.
    Specify the output file path and select the Include Header check box, and leave the other parameters as they are.
  13. To demonstrate the work of the Mapper, in this example, remove reseller IDs 5 and 8 from the input file Resellers.csv.