Using parallelization to optimize Job performance - Cloud - 8.0

Talend Studio User Guide

Version
Cloud
8.0
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Cloud
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Design and Development
Last publication date
2024-02-29

Parallelization in terms of Talend Jobs means to accomplish technical processes through parallel executions. When properly designed, a parallelization-enabled technical process can be completed within a shorter time frame.

Talend Studio allows you to implement different types of parallelization depending on ranging circumstances. These circumstances could be:

  1. Parallel executions of multiple subJobs. For further information, see Executing multiple subJobs in parallel.

  2. Parallel iterations for reading data. For further information, see Launching parallel iterations to read data.

  3. Orchestrating executions of subJobs. For further information, see Orchestrating parallel executions of subJobs.

  4. Speeding-up data writing into a database. For further information, see Writing data in parallel.

  5. Speeding-up processing of a data flow. For further information, see Enabling parallelization of data flows.

Parallelization is an advanced feature and requires basic knowledge about a Talend Job such as how to design and execute a Job or a subJob, how to use components and how to use the different types of connections that link components or Jobs. If you feel that you need to acquire this kind of knowledge, see What is a Job design?.