tPartitioner - 7.3

Orchestration (Integration)

Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Talend Studio
Data Governance > Third-party systems > Orchestration components (Integration)
Data Quality and Preparation > Third-party systems > Orchestration components (Integration)
Design and Development > Third-party systems > Orchestration components (Integration)
Last publication date

Partitions the input data before tCollector can transfer them to the parallel execution processes.

The tPartitioner splits the input records into a given number of threads which tCollector sends for parallel execution.

Note that Talend Studio also enables the automatic implementation of parallelization across a Job without use of the parallelization components and we recommend using that approach. For further information, see the section describing how to enable parallelization of data flows of the Talend Studio User Guide. However, if you need to understand how to use these specific parallelization components, bear in mind that the parallelization components work closely with each other to accomplish parallel execution on given processes: the tPartitioner component dispatches the input records into a specific number of threads; the tCollector component sends these threads to its following components for parallel execution; the tDepartitioner component regroups the outputs of the processed parallel threads; the tRecollector component captures the output of a given tDepartitioner component and sends the captured data to the next component.

For more technologies supported by Talend, see Talend components.