Configuring the extraction of data from HBase

HBase

author
Talend Documentation Team
EnrichVersion
6.5
EnrichProdName
Talend Data Fabric
Talend Big Data Platform
Talend Big Data
Talend Real-Time Big Data Platform
Talend Open Studio for Big Data
task
Design and Development > Third-party systems > Database components > HBase components
Data Governance > Third-party systems > Database components > HBase components
Data Quality and Preparation > Third-party systems > Database components > HBase components
EnrichPlatform
Talend Studio

About this task

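To configure the tHBaseInput and tHBaseClose components to extract data from HBase and then close the connection, perform the following operations: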

Procedure

  1. Double-click tHBaseInput to open its Component view.
  2. Select the Use an existing connection check box and then select the connection you configured earlier. In this example, it is tHBaseConnection_1.
  3. Click the three-dot button next to Edit schema to open the schema editor.
  4. Click the plus button three times to add three rows and, in the Column column, rename them id, name and age respectively. These are the three columns to be extracted from HBase.
  5. Select the type for each of the three columns. In this example, select Integer for id and age, and String for name.
  6. Click OK to validate these changes and accept the propagation prompted by the pop-up dialog box.
  7. In the Table name field, enter the name of the table from which to extract the columns of interest. In this scenario, the table is customer.
  8. In the Mapping table, the Column column is already filled in automatically because the schema has been defined. For each column, enter the name of the column family that contains it in the Column family column.
  9. Double-click tHBaseClose to open its Component view.
  10. In the Component List field, select the connection you need to close. In this example, this connection is tHBaseConnection_1.
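
The schema and mapping defined in steps 4 to 8 can be pictured as a lookup from each schema column to a column family and qualifier pair. The following Python sketch illustrates that idea only; it is not the code that Talend Studio generates and it does not connect to HBase. The family names (family1, family2), the extract_row helper and the sample row are hypothetical, and the sketch assumes that cell values are stored as UTF-8 text.

```python
# Schema from steps 4 and 5: column name -> Python type standing in
# for the Talend type (Integer -> int, String -> str).
SCHEMA = {"id": int, "name": str, "age": int}

# Mapping table from step 8: column -> column family.
# These family names are hypothetical placeholders.
MAPPING = {"id": "family1", "name": "family1", "age": "family2"}


def extract_row(cells, schema=SCHEMA, mapping=MAPPING):
    """Pick the schema columns out of one simplified HBase-style row.

    `cells` maps b"family:qualifier" keys to raw byte values. This is a
    simplified stand-in for a row; values are assumed to be UTF-8 text.
    """
    record = {}
    for column, column_type in schema.items():
        key = f"{mapping[column]}:{column}".encode("utf-8")
        raw = cells.get(key)
        # A column that is absent from the row is returned as None.
        record[column] = None if raw is None else column_type(raw.decode("utf-8"))
    return record


# Made-up row, for illustration only.
sample_row = {
    b"family1:id": b"1",
    b"family1:name": b"Albert",
    b"family2:age": b"23",
    b"family2:city": b"ignored",  # not in the schema, so not extracted
}

print(extract_row(sample_row))  # {'id': 1, 'name': 'Albert', 'age': 23}
```

Only the cells whose family and qualifier match an entry in the mapping are returned, which mirrors why every schema column needs its column family entered in the Mapping table.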