Scanning data from Kudu - Cloud - 8.0

Kudu

Version
Cloud
8.0
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Database components (Integration) > Kudu components
Data Quality and Preparation > Third-party systems > Database components (Integration) > Kudu components
Design and Development > Third-party systems > Database components (Integration) > Kudu components
Last publication date
2024-02-20

Procedure

  1. Double-click tKuduInput to open its Component view.

    Example

  2. Click the [...] button next to Edit schema to open the schema editor.
  3. Click the [+] button to add the schema columns for output as shown in this image.

    Example

  4. In the Type field, select Integer as data type for the age column.
  5. In the Key column, select the check box for the age column because this is the primary key column.
  6. Click OK to validate these changes and accept the propagation prompted by the pop-up dialog box.
  7. In the Table name field, enter the name of the table from which you need to read data. In this scenario, it is ychen_kudu.
  8. In the Query mode area, select the Use scan radio button to read all the data from the Kudu table.
  9. Double-click tLogRow to open its Component view and select the Table radio button to present the result in a table.
  10. Press F6 to run this Job.

Results

Once done, in the console of the Run view, you can check the data read from the Kudu table.

The record 04;tom;60 is not written in the table because it is out of the partition boundaries.

In the real-world practice, upon the success of the execution, you can deploy and launch your Job on a Talend JobServer if you have one.