Creating the MapReduce Job

Talend Big Data Getting Started Guide

author
Talend Documentation Team
EnrichVersion
6.4
EnrichProdName
Talend Big Data
task
Installation and Upgrade
Design and Development
A Talend MapReduce Job allows you to access and use the Talend MapReduce components to visually design MapReduce programs to read, transform or write data.

Before you begin

  • You have launched your Talend Studio and opened the Integration perspective.

Procedure

  1. In the Repository tree view, expand the Job Designs node, right-click the Big Data Batch node and select Create folder from the contextual menu.
  2. In the New Folder wizard, name your Job folder getting_started and click Finish to create your folder.
  3. Right-click the getting_started folder and select Create folder again.
  4. In the New Folder wizard, name the new folder to mapreduce and click Finish to create the folder.
  5. Right-click the mapreduce folder and select Create Big Data Batch Job.
  6. In the New Big Data Batch Job wizard, select MapReduce from the Framework drop-down list.
  7. Enter a name for this MapReduce Job and other useful information.

    For example, enter aggregate_movie_director_mr in the Name field.

Results

The MapReduce component Palette is now available in the Studio. You can start to design the Job by leveraging this Palette and the Metadata node in the Repository.