How user credentials work in Hive embedded mode in Talend Studio

author
Talend Documentation Team
EnrichVersion
6.5
EnrichProdName
Talend Open Studio for Big Data
Talend Real-Time Big Data Platform
Talend Data Fabric
Talend Big Data
task
Design and Development > Third-party systems > Database components
Data Governance > Third-party systems > Database components
Data Quality and Preparation > Third-party systems > Database components
EnrichPlatform
Talend Studio

How user credentials work in Hive embedded mode in Talend Studio

Apache Hadoop user group information employs the JDK security framework which uses a static "Subject" instance across the application scope. There is no way to change it in runtime. Due to this restriction, and before the Hive SQL query execution in embedded mode, the user must be correctly specified.

The Hive SQL query execution will obtain a user name in this order from Talend Studio:

  1. Find the user name from the system property of HADOOP_USER_NAME.
  2. If the user name is not found, then find it from the system property of HADOOP_PROXY_USER.
  3. If the user name is not found, then find it from OS environment variables.
  4. If the user name is not found, then use the OS login user name.