Setting up context-smart Hadoop connections

Setting up a connection to a given Hadoop distribution in Repository allows you to avoid configuring that connection each time when you need to use it in your Jobs.

When defining this connection, you can contextualize the connection parameters using values from different Hadoop environments such as from a test and a production environments, in order to adjust the connection and the Jobs using the connection to the proper environments at runtime by only one click.

The security configuration such as the Kerberos parameters cannot be contextualized. Therefore, ensure that the values you use for security work in all the involved environments among which the Hadoop connection switches.

If available in your Studio, the advanced Spark properties and the advanced Hadoop properties you define cannot be contextualized either. For this reason, ensure that these properties are valid for all the involved environments among which the Hadoop connection switches.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!

Leave your feedback here