Enabling the HDFS High Availability feature in the Studio
The HDFS High Availability feature addresses the single point of failure issue of a typical Hadoop cluster.
This article describes how to enable your Talend Studio with Big Data to use the HDFS High Availability (HA) feature.
Environment:
The Studio can be any of the Talend solutions with Big Data.
The Hadoop cluster you are using along with the Studio must support the HDFS HA feature. For further information, see the documentation of that Hadoop distribution you are using.
In the cluster to be used, the properties required by the HDFS High Availability must have been set in the hdfs-site.xml file by the Administrator.
Finding the properties to be set
Procedure
Results
All the properties to be replicated have been found and you need to set them in the Hadoop properties table provided in the Studio.
Setting properties in the Studio
The Hadoop properties table is provided along with many different components, Hadoop configuration view or metadata wizards that create connection to a Hadoop cluster.
In this article, we take tHDFSConnection as example to demonstrate how to set the properties mentioned above.
Procedure
Results
Then these properties will be taken into account at runtime when this component is used to connect to a Hadoop cluster.