Configuring a DataBase Connection to Hive - 7.3

Hive

Version
7.3
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Database components (Integration) > Hive components
Data Quality and Preparation > Third-party systems > Database components (Integration) > Hive components
Design and Development > Third-party systems > Database components (Integration) > Hive components
Last publication date
2024-02-21

About this task

This example uses version 3.6 of Azure HDInsight.

Procedure

  1. In the Repository view, extend the Metadata drop-down menu.
  2. Click Db Connections, and then right-click Create Connection .
  3. Give a name to your connection.
  4. Click Next.
  5. Set up the connection configuration similarly to the following table:

    DB Type

    Select Hive.

    Hadoop Cluster

    Select None.

    Distribution

    Select Horton Works.

    HDInsight is leveraging Horton Works distribution on the backend. This will allow you to use Horton Works libraries to connect to HDInsighs.

    DB Type

    Select Hive.

    Version

    Select Hortonworks Data Platform V2.6.0.3-8 [Built in].

    Hive Model

    Select Standalone.

    Login

    Password

    Server

    Fill in the fields as required.

    Port

    Input 443.

    You will be able to communicate through the proxy port since the HDInsight cluster sits behind a proxy by default.

    DataBase

    Leave default.

    Additional JDBC Setting

    Input transportMode=http;ssl=true;httpPath=/hive2, where:
    • transportMode=http sets the transport mode to HTTP instead of the default Hive JDBC transport mode.
    • SSL=true enables SSL.
    • httpPath=/hive2 sets the HTTP endpoint.
  6. Click Test Connection to ensure the Talend Studio connects successfully to the cluster.