Scanning data from Kudu - 7.3

Kudu

Version
7.3
Language
English (United States)
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Database components > Kudu components
Data Quality and Preparation > Third-party systems > Database components > Kudu components
Design and Development > Third-party systems > Database components > Kudu components

Procedure

  1. Double-click tKuduInput to open its Component view.

    Example

  2. Click the [...] button next to Edit schema to open the schema editor.
  3. Click the [+] button to add the schema columns for output as shown in this image.

    Example

  4. In the Type field, select Integer as data type for the age column.
  5. In the Key column, select the check box for the age column because this is the primary key column.
  6. Click OK to validate these changes and accept the propagation prompted by the pop-up dialog box.
  7. In the Table name field, enter the name of the table from which you need to read data. In this scenario, it is ychen_kudu.
  8. In the Query mode area, select the Use scan radio button to read all the data from the Kudu table.
  9. Double-click tLogRow to open its Component view and select the Table radio button to present the result in a table.
  10. Press F6 to run this Job.

Results

Once done, in the console of the Run view, you can check the data read from the Kudu table.

The record 04;tom;60 is not written in the table because it is out of the partition boundaries.

In the real-world practice, upon the success of the execution, you could deploy and launch your Job on a Talend JobServer if you have one.

For related information, search for running a Job remotely on Talend Help Center (https://help.talend.com).