Converting the Job

Talend Big Data Getting Started Guide

author
Talend Documentation Team
EnrichVersion
6.4
EnrichProdName
Talend Big Data
task
Design and Development
Installation and Upgrade
Converting the existing MapReduce Job to a Spark Batch Job allows you to make full use of existing assets to easily create Spark Jobs.

Before you begin

Procedure

  1. In the Repository tree view, expand the Job Designs node, the Big Data Batch node and then the getting_started folder and the mapreduce folder.
  2. Right-click the aggregate_movie_director_mr Job and from the contextual menu, select Duplicate.

    The Duplicate window is opened.

  3. In the Input new name field, name this duplicate to aggregate_movie_director_spark_batch.
  4. From the Framework list, select Spark and click OK to validate the changes.

    The aggregate_movie_director_spark_batch Job is displayed in the mapreduce folder in the Repository.

  5. Right-click the getting_started folder and select Create folder from the contextual menu.
  6. In the New Folder wizard, name the new folder to spark_batch and click Finish to create the folder.
  7. Drop the aggregate_movie_director_spark_batch Job into this spark_batch folder.

Results

This new Spark Batch Job is now ready for further editing.