Building the Job - Cloud - 8.0

Hive

Version
Cloud
8.0
Language
English (United States)
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for ESB
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Database components > Hive components
Data Quality and Preparation > Third-party systems > Database components > Hive components
Design and Development > Third-party systems > Database components > Hive components

Procedure

  1. From the Repository view of the Talend Studio, right-click Job Designs, and then click Create Standard Job.
  2. Give a name to your Job.
  3. Click Finish.
  4. Add a tPreJob component to your workspace.
  5. Add a tHiveConnection component to your workspace.
  6. Double-click the tHiveConnection component and choose Repository as the Property Type and the Database Connection created above.
  7. Right-click the tPreJob component.
  8. Select Trigger > On Component Ok and connect the tPreJob to the tHiveConnection.
  9. Add a tHiveInput component to your workspace.
  10. Select it and check the box Use an existing connection, then select the tHiveConnection component in the Component List drop-down menu.
  11. In the Query field, input show tables to run a query displaying the available tables in the database.
  12. Add a tLogRow component to your workspace.
  13. Right-click the tHiveInput component and select Row > Main.
  14. Click the tLogRow component to connect both components. They will display the information from the query above.
  15. From the Component tab of the tLogRow, select Table (print values in celles of a table).
  16. Add a tPostJob component to your workspace.
  17. Add a tHiveClose component to your workspace.
  18. Connect the tPostJob component to the tHiveClose component using an On Component Ok connection to close the connection opened.
  19. From the Run tab, click Run to run the Job and ensure of a successful connection to Hive on HDInsight and of the readability of the table data.