Creating a table in HDFS

HCatalog

author
Talend Documentation Team
EnrichVersion
6.4
EnrichProdName
Talend Data Fabric
Talend Real-Time Big Data Platform
Talend Big Data Platform
Talend Open Studio for Big Data
Talend Big Data
task
Data Quality and Preparation > Third-party systems > Database tools > HCatalog components
Design and Development > Third-party systems > Database tools > HCatalog components
Data Governance > Third-party systems > Database tools > HCatalog components
EnrichPlatform
Talend Studio

Procedure

  1. Double-click tHCatalogOperation to open its Basic settings view.
  2. Click Edit schema to define the schema for the table to be created.
  3. Click [+] to add at least one column to the schema and click OK when you finish setting the schema. In this scenario, the columns added to the schema are: name, country and age.
  4. Fill the Templeton hostname field with the URL of the Templeton web service you are using. In this scenario, fill this field with "192.168.0.131".
  5. Fill the Templeton port field with the port of the Templeton web service. The default value for this field is "50111".
  6. Select Table from the Operation on list, then select Drop if exist and create from the Operation list, to create a table in HDFS.
  7. Fill the Database field with an existing database name in HDFS. In this scenario, the database name is "talend".
  8. Fill the Table field with the name of the table to be created. In this scenario, the table name is "Customer".
  9. Fill the Username field with the username for database authentication.
  10. Select the Set the user group to use check box to specify the user group. The default user group is "root"; change this value to match your environment.
  11. Select the Set the permissions to use check box to specify the user permissions. The default value for this field is "rwxrwxr-x".
  12. Select the Set partitions check box to enable the partition schema.
  13. Click the Edit schema button next to the Set partitions check box to define the partition schema.
  14. Click [+] to add one column to the schema and click OK when you finish setting the schema. In this scenario, the column added to the partition schema is: match_age.
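
For orientation, the settings above map roughly onto the WebHCat (Templeton) REST API for tables. The following Python sketch only builds the requests that an equivalent drop-and-create operation might send; it does not contact a server, and it is not a description of what tHCatalogOperation does internally. The username "talend_user", the column types and the helper name build_table_requests are assumptions made for illustration, as the scenario does not specify them.

```python
import json
from urllib.parse import urlencode


def build_table_requests(host, port, database, table, user,
                         columns, partitions, group, permissions):
    """Return (method, url, body) tuples for a drop-then-create sequence.

    Hypothetical helper: it only assembles the requests, nothing is sent.
    """
    base = (f"http://{host}:{port}/templeton/v1"
            f"/ddl/database/{database}/table/{table}")

    # "Drop if exist": DELETE with ifExists=true, so a missing table is not an error.
    drop_url = base + "?" + urlencode({"user.name": user, "ifExists": "true"})
    # "Create": PUT with a JSON body describing the table.
    create_url = base + "?" + urlencode({"user.name": user})

    body = {
        "columns": [{"name": n, "type": t} for n, t in columns],
        "partitionedBy": [{"name": n, "type": t} for n, t in partitions],
        "group": group,
        "permissions": permissions,
    }
    return [("DELETE", drop_url, None), ("PUT", create_url, json.dumps(body))]


if __name__ == "__main__":
    requests_to_send = build_table_requests(
        host="192.168.0.131", port=50111,
        database="talend", table="Customer",
        user="talend_user",  # assumed; the scenario gives no username
        # Column types are assumed; the scenario lists only the names.
        columns=[("name", "string"), ("country", "string"), ("age", "int")],
        partitions=[("match_age", "string")],
        group="root", permissions="rwxrwxr-x",
    )
    for method, url, body in requests_to_send:
        print(method, url)
        if body:
            print(body)
```

Keeping the request construction separate from any HTTP call makes it easy to inspect the URL and payload before pointing them at a real cluster.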