Skip to main content Skip to complementary content

Importing a Hadoop cluster metadata definition

You can import your Hadoop cluster configuration to create a Hadoop cluster metadata definition to be able to quickly configure components with its information. Talend Studio also allows you to create a cluster metadata definition from scratch.

Before you begin

  • This tutorial makes use of a Hadoop cluster. You must have a Hadoop cluster available to you.
  • Select the Integration perspective (Window > Perspective > Integration).

Procedure

  1. In the Repository, expand Metadata, right-click Hadoop Cluster and click Create Hadoop Cluster.
  2. In the Name field, enter a name.

    Example

    MyHadoopCluster_files
  3. Optional: In the Purpose field, enter a purpose.

    Example

    Cluster connection metadata
  4. Optional: In the Description field, enter a description.

    Example

    Metadata to connect to a Cloudera CDH cluster
    Information noteTip: Enter a Purpose and Description to stay organized.
  5. Click Next.
  6. Select a Distribution.

    Example

    Select Cloudera.
  7. Select a Version.

    Example

    Select Cloudera CDH6.1.1 [Built in].
  8. Select Import configuration from local files.
  9. Click Next.
  10. Under Location, select the file of your choice in the File Explorer.
  11. Select your modules.

    Example

    Select HDFS or YARN.
  12. Click Finish.

    Example

    You are brought to the Hadoop Cluster Connection window, and your Connection details have been entered already.
  13. Optional: Click Check Services.
  14. Click Finish.

Results

The Hadoop cluster metadata definition appears in the Repository.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!