Setting up Hortonworks High Availability in Talend Studio

author
Talend Documentation Team
EnrichVersion
6.4
6.3
6.2
6.1
6.0
EnrichProdName
Talend Real-Time Big Data Platform
Talend Data Services Platform
Talend Data Management Platform
Talend MDM Platform
Talend Data Fabric
Talend Big Data Platform
task
Design and Development > Designing Jobs > Hadoop distributions
EnrichPlatform
Talend Studio

Setting up Hortonworks High Availability in Talend Studio

This article describes how to enable High Availability (HA) for a specific Hortonworks connection defined in the Studio.
Prerequisites
  • You are using a Talend solution with Big Data
  • You have High Availability properly configured in your Hortonworks cluster
Resolution

The easiest way to enable High Availability is to import the HA configuration from the following Hadoop configuration files into the metadata of the Hortonworks connection:

  • core-site.xml
  • hdfs-site.xml
  • hive-site.xml
  • mapred-site.xml
  • yarn-site.xml

You can ask the administrator of your cluster for these files and store them in a folder.

Proceed as follows:

  1. In the Repository , from the Hortonworks connection metadata you have created under the Hadoop cluster node, open the [Hadoop Cluster Connection] wizard.
  2. Select the Use custom Hadoop configurations check box and click the [...] button next to it to open the [Hadoop Configuration Import Wizard].

    Note: The above image is only an example.

  3. Follow the instructions in this wizard to import the *-site.xml files. If you need help using this wizard, see Import Hadoop configuration.