Mapping the schemas - Cloud - 8.0

ELT Hive


Procedure

  1. Click ELT Hive Map Editor to map the schemas.
  2. Click the [+] button in the upper-left corner, then, in the pop-up dialog box, select the customer table and click OK.
  3. Repeat the previous step to select the state_city table.
  4. Drag and drop the idstate column from the customer table onto the id column of the state_city table. This automatically creates an inner join.
  5. On the output side (the right side in the figure), the agg_result table is empty at first. Click at the bottom of this side to add as many columns as required and rename them to replicate the schema of the agg_result table you created earlier in Hive.
    Note:

    The type column is the partition column of the agg_result table and should not be replicated in this schema. For further information about the partition column of the Hive table, see the Hive manual.

  6. From the customer table, drag id, name, address, and sum1 and drop them onto the corresponding columns in the agg_result table.
  7. From the state_city table, drag postal, state, capital, and mostpopulouscity and drop them onto the corresponding columns in the agg_result table.

    In this scenario, context variables are not used in the Expression column in the Map editor. If you use context variables, put them in single quotation marks. For example:

  8. Click OK to validate these changes.
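
The mapping built in the steps above amounts to a SELECT over the two input tables with an inner join on customer.idstate = state_city.id. The following is a minimal Python sketch that assembles such a statement from the mapped columns so you can see its rough shape. It is an illustration only: the helper build_select is hypothetical, and the statement that Talend Studio actually generates may be formatted or structured differently. The partition column type is left out on purpose, as the note in the procedure explains.

```python
# Illustrative sketch only: approximates the SELECT that the mapping describes.
# The statement Talend Studio actually generates may differ.

# Input columns mapped to the output table, in the order given in the procedure.
MAPPED_COLUMNS = [
    ("customer", "id"),
    ("customer", "name"),
    ("customer", "address"),
    ("customer", "sum1"),
    ("state_city", "postal"),
    ("state_city", "state"),
    ("state_city", "capital"),
    ("state_city", "mostpopulouscity"),
]

# Join created by dropping customer.idstate onto state_city.id.
JOIN = ("customer", "idstate", "state_city", "id")


def build_select(columns, join):
    """Return a HiveQL-style SELECT with an inner join (hypothetical helper)."""
    left_table, left_col, right_table, right_col = join
    select_list = ", ".join(f"{table}.{col}" for table, col in columns)
    return (
        f"SELECT {select_list} "
        f"FROM {left_table} "
        f"INNER JOIN {right_table} "
        f"ON ({left_table}.{left_col} = {right_table}.{right_col})"
    )


query = build_select(MAPPED_COLUMNS, JOIN)
print(query)
```

Comparing a string like this with the query shown for the component in Talend Studio is one way to check that every column was dropped onto the intended output column.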