Skip to main content Skip to complementary content

Scanning data from Kudu

Procedure

  1. Double-click tKuduInput to open its Component view.

    Example

  2. Click the [...] button next to Edit schema to open the schema editor.
  3. Click the [+] button to add the schema columns for output as shown in this image.

    Example

  4. In the Type field, select Integer as data type for the age column.
  5. In the Key column, select the check box for the age column because this is the primary key column.
  6. Click OK to validate these changes and accept the propagation prompted by the pop-up dialog box.
  7. In the Table name field, enter the name of the table from which you need to read data. In this scenario, it is ychen_kudu.
  8. In the Query mode area, select the Use scan radio button to read all the data from the Kudu table.
  9. Double-click tLogRow to open its Component view and select the Table radio button to present the result in a table.
  10. Press F6 to run this Job.

Results

Once done, in the console of the Run view, you can check the data read from the Kudu table.

The record 04;tom;60 is not written in the table because it is out of the partition boundaries.

In the real-world practice, upon the success of the execution, you could deploy and launch your Job on a Talend JobServer if you have one.

For related information, search for running a Job remotely on Talend Help Center (https://help.talend.com).

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!