Defining the schema

Procedure

  1. Double-click tHiveCreateTable to open its Component view.
  2. Select the Use an existing connection check box and, from the Component list, select the connection configured in the tHiveConnection component you are using for this Job.
  3. Click the button next to Edit schema to open the schema editor.
  4. Click the [+] button four times to add four rows and, in the Column column, rename them to Id, FirstName, LastName and Reg_date, respectively.
    Note that you cannot use Hive reserved keywords, such as location or date, as column names.
  5. In the Type column, select the type of the data in each column. In this scenario, Id is of the Integer type, Reg_date is of the Date type and the others are of the String type.
  6. In the DB type column, select the Hive type that corresponds to the data type you defined for each column. For example, Id is of INT and Reg_date is of TIMESTAMP.
  7. In the Data pattern column, define the pattern that matches the raw data. In this example, keep the default pattern.
  8. Click OK to validate these changes.
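
To make the resulting schema easier to picture, the following minimal Python sketch builds a HiveQL statement that is roughly equivalent to the columns defined above. It is an illustration only, not the statement the component generates: the table name employees and the helper build_create_table are hypothetical, the STRING DB type for FirstName and LastName is an assumption (the steps name only INT and TIMESTAMP), and the reserved keyword set contains just the two examples mentioned in step 4.

```python
# Illustrative subset only: the two reserved keywords named in step 4.
# Hive's full reserved keyword list is much longer.
RESERVED_EXAMPLES = {"location", "date"}

# (column name, Hive DB type) pairs from the schema editor.
# STRING for FirstName and LastName is an assumption.
SCHEMA = [
    ("Id", "INT"),
    ("FirstName", "STRING"),
    ("LastName", "STRING"),
    ("Reg_date", "TIMESTAMP"),
]


def build_create_table(table_name, schema):
    """Return a HiveQL CREATE TABLE statement for the given columns."""
    for name, _ in schema:
        # Reject column names that collide with a reserved keyword.
        if name.lower() in RESERVED_EXAMPLES:
            raise ValueError(f"{name!r} is a Hive reserved keyword")
    columns = ",\n".join(f"  {name} {hive_type}" for name, hive_type in schema)
    return f"CREATE TABLE IF NOT EXISTS {table_name} (\n{columns}\n)"


# "employees" is a hypothetical table name used for illustration.
DDL = build_create_table("employees", SCHEMA)
print(DDL)
```

Naming the date column Reg_date rather than date is what lets the keyword check pass, which mirrors the note in step 4.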
