Importing a Hadoop cluster metadata definition - 8.0

First steps using Big Data in Talend Studio

Version: 8.0
Language: English
Product: Talend Big Data, Talend Big Data Platform, Talend Data Fabric, Talend Open Studio for Big Data, Talend Real-Time Big Data Platform
Module: Talend Studio
Content: Design and Development > Designing Jobs > Hadoop distributions
Last publication date: 2024-02-06

You can import your Hadoop cluster configuration to create a Hadoop cluster metadata definition, which you can then reuse to configure components quickly. Talend Studio also allows you to create a cluster metadata definition from scratch.

Before you begin

  • This tutorial requires a Hadoop cluster. Make sure that you have access to one.
  • Select the Integration perspective (Window > Perspective > Integration).

Procedure

  1. In the Repository, expand Metadata, right-click Hadoop Cluster and click Create Hadoop Cluster.
  2. In the Name field, enter a name.

    Example

    MyHadoopCluster_files
  3. Optional: In the Purpose field, enter a purpose.

    Example

    Cluster connection metadata
  4. Optional: In the Description field, enter a description.

    Example

    Metadata to connect to a Cloudera CDH cluster
    Tip: Enter a Purpose and Description to stay organized.
  5. Click Next.
  6. Select a Distribution.

    Example

    Select Cloudera.
  7. Select a Version.

    Example

    Select Cloudera CDH6.1.1 [Built in].
  8. Select Import configuration from local files.
  9. Click Next.
  10. Under Location, select the file of your choice in the File Explorer.
  11. Select your modules.

    Example

    Select HDFS or YARN.
  12. Click Finish.

    Example

    The Hadoop Cluster Connection window opens with the connection details already filled in.
  13. Optional: Click Check Services to verify that Talend Studio can connect to the cluster services.
  14. Click Finish.
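
To see which values the pre-filled connection details are likely to come from, you can inspect the local configuration files yourself before you import them. The following is a minimal sketch, not part of Talend Studio. It assumes that the files are standard Hadoop client files (core-site.xml, hdfs-site.xml, yarn-site.xml, mapred-site.xml). The function names, the sample host name, and the sample port are hypothetical, and the exact set of files and properties that the wizard reads is not specified here.

```python
import xml.etree.ElementTree as ET
from pathlib import Path

# Standard Hadoop client configuration file names (assumed to be the
# kind of files selected in the import step).
SITE_FILES = ("core-site.xml", "hdfs-site.xml", "yarn-site.xml", "mapred-site.xml")


def properties_from_root(root):
    """Collect name/value pairs from a parsed <configuration> element."""
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        value = prop.findtext("value")
        if name is not None:
            props[name.strip()] = (value or "").strip()
    return props


def summarize_config_dir(config_dir):
    """Merge the properties of every known site file found in config_dir."""
    merged = {}
    for file_name in SITE_FILES:
        path = Path(config_dir) / file_name
        if path.is_file():
            merged.update(properties_from_root(ET.parse(path).getroot()))
    return merged


# Hypothetical sample content; the host name and port are placeholders.
SAMPLE_CORE_SITE = """<?xml version="1.0"?>
<configuration>
  <property>
    <name>fs.defaultFS</name>
    <value>hdfs://namenode.example.com:8020</value>
  </property>
</configuration>
"""

sample_props = properties_from_root(ET.fromstring(SAMPLE_CORE_SITE))
print(sample_props["fs.defaultFS"])
```

Calling summarize_config_dir on the folder that holds your configuration files returns a single dictionary, which you can compare with the values shown in the Hadoop Cluster Connection window.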

Results

The Hadoop cluster metadata definition appears in the Repository.
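
If the optional Check Services step fails, a basic network test from the machine that runs Talend Studio can help you tell a connectivity problem from a configuration problem. The following is a rough sketch only. It is not how Talend Studio performs its check, it only tests that a TCP port accepts connections, and the function names, host name, and default port are hypothetical placeholders.

```python
import socket
from urllib.parse import urlparse


def endpoint_from_uri(uri, default_port):
    """Split a URI such as hdfs://host:port into (host, port)."""
    parsed = urlparse(uri)
    return parsed.hostname, parsed.port or default_port


def is_reachable(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution errors.
        return False
```

For example, is_reachable(*endpoint_from_uri("hdfs://namenode.example.com:8020", 8020)) returns True only if the placeholder NameNode address accepts a TCP connection. A successful connection does not prove that the service behind the port is healthy.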