Scenario: HCatalog table management on Hortonworks Data Platform

HCatalog

author: Talend Documentation Team
EnrichVersion: 6.4
EnrichProdName: Talend Data Fabric; Talend Real-Time Big Data Platform; Talend Big Data Platform; Talend Open Studio for Big Data; Talend Big Data
task: Data Quality and Preparation > Third-party systems > Database tools > HCatalog components; Design and Development > Third-party systems > Database tools > HCatalog components; Data Governance > Third-party systems > Database tools > HCatalog components
EnrichPlatform: Talend Studio

This scenario applies only to a Talend solution with Big Data.

For more technologies supported by Talend, see Talend components.

This scenario describes a six-component Job that performs the common HCatalog table management operations on Hortonworks Data Platform. The sub-sections in this scenario cover the following database operations:

  • Creating a table in the database in HDFS;

  • Writing data to the HCatalog managed table;

  • Writing data to the partitioned table using tHCatalogLoad;

  • Reading data from the HCatalog managed table;

  • Outputting the data read from the table in HDFS.

Note:

Knowledge of Hive Data Definition Language and HCatalog Data Definition Language is required. For further information about Hive Data Definition Language, see https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL. For further information about HCatalog Data Definition Language, see https://cwiki.apache.org/confluence/display/HCATALOG/Design+Document+-+Java+APIs+for+HCatalog+DDL+Commands.
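As a reminder of the kind of Hive DDL involved, the following minimal Python sketch composes a CREATE TABLE statement for a partitioned, delimited text table. The helper function and the table, column, and partition names are hypothetical and chosen only for illustration; they are not the schema used in this scenario.

```python
def build_create_table_ddl(table, columns, partition_columns, field_delimiter=","):
    """Compose a Hive-style CREATE TABLE statement for a partitioned text table.

    All names passed in are illustrative assumptions, not values from the scenario.
    """
    # Regular columns: "name TYPE" pairs separated by commas.
    cols = ", ".join(f"{name} {dtype}" for name, dtype in columns)
    # Partition columns are declared separately in the PARTITIONED BY clause.
    parts = ", ".join(f"{name} {dtype}" for name, dtype in partition_columns)
    return (
        f"CREATE TABLE IF NOT EXISTS {table} ({cols}) "
        f"PARTITIONED BY ({parts}) "
        f"ROW FORMAT DELIMITED FIELDS TERMINATED BY '{field_delimiter}' "
        f"STORED AS TEXTFILE"
    )


# Hypothetical table with two data columns and one partition column.
ddl = build_create_table_ddl(
    "customer",
    [("id", "INT"), ("name", "STRING")],
    [("match_age", "INT")],
)
print(ddl)
```

In Hive DDL, partition columns appear only in the PARTITIONED BY clause and not in the regular column list, which is why the sketch takes them as a separate argument.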