Converting the Job - 6.5

Talend Real-Time Big Data Platform Getting Started Guide

English (United States)
Talend Real-Time Big Data Platform
Data Quality and Preparation > Cleansing data
Data Quality and Preparation > Profiling data
Design and Development
Installation and Upgrade
Converting the existing MapReduce Job to a Spark Batch Job allows you to make full use of existing assets to easily create Spark Jobs.

Before you begin


  1. In the Repository tree view, expand the Job Designs node, the Big Data Batch node and then the getting_started folder and the mapreduce folder.
  2. Right-click the aggregate_movie_director_mr Job and from the contextual menu, select Duplicate.

    The Duplicate window is opened.

  3. In the Input new name field, name this duplicate to aggregate_movie_director_spark_batch.
  4. From the Framework list, select Spark and click OK to validate the changes.

    The aggregate_movie_director_spark_batch Job is displayed in the mapreduce folder in the Repository.

  5. Right-click the getting_started folder and select Create folder from the contextual menu.
  6. In the New Folder wizard, name the new folder to spark_batch and click Finish to create the folder.
  7. Drop the aggregate_movie_director_spark_batch Job into this spark_batch folder.


This new Spark Batch Job is now ready for further editing.