This article demonstrates how to get started with Hortonworks 2.4.
- You have installed and configured Hortonworks 2.4 cluster (HDP).
You can also use Hortonworks (sandbox), a downloadable virtual machine (VM).
- You have installed Talend Studio.
- The dataset used (pearsonData.csv) in this article is called Pearson’s Height Data,
named for its creator Karl Pearson who, in the early 1900’s, founded the Mathematical
You can download the Pearson dataset here. Feel free to use your own data, being mindful that aspects of this article will need to be adjusted.