Scenario 1: Populating a data warehouse - 6.1

Talend Components Reference Guide

Version
6.1
Language
English (United States)
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for Data Quality
Talend Open Studio for ESB
Talend Open Studio for MDM
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance
Data Quality and Preparation
Design and Development

The following Java scenario creates a three-component Job that populates a data warehouse. A tOracleInput component reads your client data stored in the CLIENT table. A tMap component allows you to modify this data and the modifications are transmitted to the Leadfact table in the database through a tOracleOutput component.

  1. Drop the following components from the Palette onto the design workspace: tOracleInput, tMap, and tOracleOutput.

  2. Connect the three components using Row Main links. Click the link between tMap and tOracleOutput and name it leadfact.

  3. Double-click tOracleInput to display its Basic settings view and define its properties.

  4. Set Property Type to Repository and then select the connection to the TALEND database that holds the information about your clients. The connection details will display automatically in the corresponding fields.

    Note

    If you have not stored the DB connection details in the Metadata entry in the Repository, select Built-in in the property type list and set the connection details manually.

  5. Set Schema to Repository and click the three-dot button to select the schema of the CLIENT table stored in the Metadata folder.

    Related topics: see Talend Studio User Guide.

  6. In the Table Name field, enter the name of the table holding the information you want to modify, CLIENT in this example.

  7. Click Guess Query to retrieve all data from your table.

  8. Double-click the tMap component to open the Map Editor. Notice that the input area to the left is already filled with the metadata of the input component.

  9. Drag the fields in the input zone to the fields in the leadfact table in the output zone. For more information regarding data mapping, see Talend Studio User Guide.

  10. Click OK to validate the operation.

  11. In the design workspace, double-click tOracleOutput to display its Basic settings view and define its properties.

  12. Set Property Type to Repository and then select the cdc_data connection to the data warehouse. The connection details will display automatically in the corresponding fields

    Note

    If you have not stored the data warehouse connection details in the Metadata folder in the Repository tree view, select Built-in in the property type list and set the connection details manually.

    Related topics: see Talend Studio User Guide.

  13. In the Table Name field, enter the name of the table you want to populate with modified data, LEADFACT in this example.

  14. Save your job and press F6 to create and populate the LEADFACT table in the data warehouse.