Creating a Hadoop cluster for machine learning - 7.3

Machine Learning

Version
7.3
Language
English (United States)
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Machine Learning components
Data Quality and Preparation > Third-party systems > Machine Learning components
Design and Development > Third-party systems > Machine Learning components
This sections explains how to create a Hadoop cluster to develop a machine learning routine.

Procedure

  1. Expand Metadata.
  2. Right-click Hadoop Cluster and create a new cluster.
  3. Specify a Linux OS user on the cluster.

    Here, the user puccini was already created.

    Training and test data used in this article have been slightly modified from the original source and pre-loaded into HDFS. Those data sets can be downloaded below.

  4. Configure the HDFS connection as follows.