Defining the schema - 7.3

Hive

Version
7.3
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Database components (Integration) > Hive components
Data Quality and Preparation > Third-party systems > Database components (Integration) > Hive components
Design and Development > Third-party systems > Database components (Integration) > Hive components
Last publication date
2024-02-21

Procedure

  1. Double-click tHiveCreateTable to open its Component view.
  2. Select the Use an existing connection check box and from Component list, select the connection configured in the tHiveConnection component you are using for this Job.
  3. Click the button next to Edit schema to open the schema editor.
  4. Click the button four times to add four rows and in the Column column, rename them to Id, FirstName, LastName and Reg_date, respectively.
    Note that you cannot use the Hive reserved keywords to name the columns, such as location or date.
  5. In the Type column, select the type of the data in each column. In this scenario, Id is of the Integer type, Reg_date is of the Date type and the others are of the String type.
  6. In the DB type column, select the Hive type of each column corresponding to their data types you have defined. For example, Id is of INT and Reg_date is of TIMESTAMP.
  7. In the Data pattern column, define the pattern corresponding to that of the raw data. In this example, use the default one.
  8. Click OK to validate these changes.