Managing Hadoop metadata - 6.1

Talend Data Fabric Studio User Guide

EnrichVersion
6.1
EnrichProdName
Talend Data Fabric
task
Data Governance
Data Quality and Preparation
Design and Development
EnrichPlatform
Talend Studio

Click Metadata in the Repository tree view to expand the relevant folder. Each of the connection nodes will gather the various connections and schemas you have set up. Among these connection nodes is theHadoop cluster node.

The following sections explain in detail how to use the Hadoop cluster node to set up:

  • an HBase connection,

  • an HCatalog connection,

  • an HDFS file schema,

  • a Hive connection, and

  • an Oozie connection.

If you need to create a connection to Cloudera's analytic database, Impala, you need to use the DB connection node under the Metadata node of the Repository. Its configuration is similar to that of a Hive connection but less complicated than the latter.

For further information about this DB connection node, see Managing Metadata for data integration.