Building the Job - Cloud - 8.0
Procedure
-
In the Repository view of Talend Studio, right-click Job Designs, and then click Create Standard Job.
-
Give a name to your Job.
-
Click Finish.
-
Add a tPreJob component to your workspace.
-
Add a tHiveConnection component to your workspace.
-
Double-click the tHiveConnection component, select Repository as the Property Type, and then select the database connection you created earlier.
-
Right-click the tPreJob component.
-
Select and connect the tPreJob to the tHiveConnection.
-
Add a tHiveInput component to your workspace.
-
Select the tHiveInput component, select the Use an existing connection check box, and then select the tHiveConnection component from the Component List drop-down list.
-
In the Query field, enter "show tables" to run a query that lists the tables available in the database.
-
Add a tLogRow component to your workspace.
-
Right-click the tHiveInput component and select .
-
Click the tLogRow component to connect the two components. The tLogRow component will display the results of the query above.
-
From the Component tab of the tLogRow component, select Table (print values in cells of a table).
-
Add a tPostJob component to your workspace.
-
Add a tHiveClose component to your workspace.
-
Connect the tPostJob component to the tHiveClose component using an On Component Ok connection, so that the Hive connection opened at the start of the Job is closed.
-
From the Run tab, click Run to run the Job and verify that the connection to Hive on HDInsight succeeds and that the table data can be read.
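
For readers who find it easier to reason about the Job as code, the procedure above can be approximated by a short hand-written script. The Python sketch below is not what Talend Studio produces (Studio generates Java code from the Job design). It only mirrors the three phases of the Job: open the connection (tPreJob and tHiveConnection), run the query and print the rows (tHiveInput and tLogRow), and close the connection (tPostJob and tHiveClose). The names `run_job`, `format_table`, and `open_connection` are hypothetical, the connection object is assumed to follow the Python DB-API 2.0 interface, and the table layout is illustrative rather than the exact tLogRow output.

```python
from contextlib import closing


def format_table(headers, rows):
    """Render headers and rows as a plain-text table (illustrative layout)."""
    table = [[str(h) for h in headers]] + [[str(v) for v in row] for row in rows]
    # Each column is as wide as its longest cell, header included.
    widths = [max(len(line[i]) for line in table) for i in range(len(headers))]
    rule = "+" + "+".join("-" * (w + 2) for w in widths) + "+"
    lines = [rule]
    for index, line in enumerate(table):
        cells = "|".join(f" {cell.ljust(w)} " for cell, w in zip(line, widths))
        lines.append("|" + cells + "|")
        if index == 0:
            lines.append(rule)  # separator under the header row
    lines.append(rule)
    return "\n".join(lines)


def run_job(open_connection, query="show tables"):
    """Run the three phases of the Job against a DB-API style connection.

    open_connection is a hypothetical zero-argument callable that returns an
    object exposing cursor() and close().
    """
    # Pre-job phase (tPreJob -> tHiveConnection): open the shared connection.
    connection = open_connection()
    try:
        # Main subjob (tHiveInput -> tLogRow): run the query, print the rows.
        with closing(connection.cursor()) as cursor:
            cursor.execute(query)
            headers = [column[0] for column in cursor.description]
            rows = cursor.fetchall()
        output = format_table(headers, rows)
        print(output)
        return output
    finally:
        # Post-job phase (tPostJob -> tHiveClose): close the connection.
        # Using finally is a choice made for this sketch, not a statement
        # about how Talend schedules tPostJob.
        connection.close()
```

To try the sketch, pass `run_job` any callable that returns a DB-API style connection to your Hive server from the client library you use. As in the Job, the connection is opened once, reused by the query step, and closed at the end.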