Amazon EMR - Updating Cluster Connection Metadata
This article shows how to update Amazon EMR cluster connection metadata in the Talend Studio.
This example uses these licensed products provided by Amazon:
- Amazon EC2
- Amazon EMR
For more information about how to launch an Amazon EMR cluster from the Talend Studio, see Amazon EMR - Getting Started.
Updating cluster connection metadata
Before you begin
Each time you start a new cluster, you must update the cluster connection metadata in the Talend Studio Repository.
Once your new cluster has been started from the Amazon EMR web interface, look up the new private IP address and private DNS name of the cluster master node. Both values are needed to update the hosts file.
- On a Windows instance, navigate to C:\Windows\System32\drivers\etc\ and open the hosts file.
- On a Linux instance, open the /etc/hosts file.
Then add a line that maps the private IP address of the master node to its private DNS name.
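A hosts file entry consists of an IP address followed by one or more host names, separated by whitespace, with one mapping per line. As a minimal sketch, assuming a hypothetical master node with the private IP 10.0.0.12 and the private DNS name ip-10-0-0-12.ec2.internal, the following Python helper builds such an entry and avoids duplicating it when run again. The function name and the sample values are illustrative only and are not part of the Talend Studio or Amazon EMR.

```python
def upsert_hosts_entry(hosts_text: str, ip: str, hostname: str) -> str:
    """Return hosts_text with exactly one line mapping hostname to ip."""
    kept = []
    for line in hosts_text.splitlines():
        # Ignore trailing comments when looking for host names on the line.
        fields = line.split("#", 1)[0].split()
        # fields[0] is the address; the remaining fields are host names.
        if hostname in fields[1:]:
            # Drop the whole existing line for this host name so that
            # rerunning the helper does not create a duplicate entry.
            continue
        kept.append(line)
    kept.append(f"{ip}\t{hostname}")
    return "\n".join(kept) + "\n"


# Hypothetical values: replace them with the private IP and private DNS
# shown for the master node in the Amazon EMR web interface.
MASTER_IP = "10.0.0.12"
MASTER_DNS = "ip-10-0-0-12.ec2.internal"

if __name__ == "__main__":
    sample = "127.0.0.1\tlocalhost\n"
    print(upsert_hosts_entry(sample, MASTER_IP, MASTER_DNS), end="")
```

The helper only transforms text. Writing the result back to C:\Windows\System32\drivers\etc\hosts or /etc/hosts typically requires administrator or root privileges, so that step is left to the reader.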
To update the Hadoop cluster metadata in the Talend Studio with the connection information of the new cluster, do the following: