How to set advanced execution settings

Talend Data Management Platform Studio User Guide

EnrichVersion: 6.2
EnrichProdName: Talend Data Management Platform
task: Data Quality and Preparation; Design and Development
EnrichPlatform: Talend Studio

Several advanced execution settings are available to make Job execution more convenient:

How to display Statistics

The Statistics feature displays the performance rate of each component under the flow links on the design workspace.

It shows the number of rows processed and the processing speed in rows per second, so you can immediately spot any bottleneck in the data processing flow.
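As a rough illustration of how a rows-per-second figure relates to a row count and an elapsed time, here is a minimal Java sketch. The class and method names are hypothetical and are not part of Talend Studio; the sketch only shows the arithmetic, not how the Studio computes its statistics internally.

```java
// Hypothetical helper, not part of Talend Studio: shows how a rows-per-second
// rate follows from a row count and an elapsed time.
class ThroughputSketch {

    /** Returns the processing rate in rows per second. */
    static double rowsPerSecond(long rows, long elapsedMillis) {
        if (elapsedMillis <= 0) {
            return 0.0; // no meaningful rate before any time has elapsed
        }
        return rows * 1000.0 / elapsedMillis;
    }

    public static void main(String[] args) {
        // Two flows carrying the same number of rows: the lower rate
        // points at the slower part of the data flow.
        System.out.println(rowsPerSecond(50_000, 2_000));  // prints 25000.0
        System.out.println(rowsPerSecond(50_000, 20_000)); // prints 2500.0
    }
}
```

Comparing the rates of adjacent flows in this way is what makes a bottleneck visible: the flow with the markedly lower rate is the one to investigate.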

For trigger links such as OnComponentOK, OnComponentError, OnSubjobOK, OnSubjobError and If, the Statistics option displays the state of the trigger while your Job is running: Ok or Error, or True or False.

Note

External components are an exception: they cannot offer this feature unless their design includes it.

In the Run view, click the Advanced settings tab and select the Statistics check box to activate the Statistics feature. Clear the check box to disable it.

The calculation starts only when the Job execution is launched and stops when the execution ends.

To remove the statistics currently displayed, click the Clear button in the Basic Run or Debug Run view. To reset the statistics before each execution, select the Clear before Run check box.

Note

The statistics thread slows down Job execution because the Job must send the statistics data to the design workspace for display.

You can also have your Job saved before the execution starts by selecting the corresponding check box.

How to display the execution time and other options

To display the total execution time of the Job once it has finished, select the Exec time check box in the Advanced settings tab of the Run view before running the Job.

This lets you check how long your Job takes to run before sending it to production.
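For comparison, the same kind of wall-clock measurement can be sketched in plain Java. This is only an illustration under stated assumptions: the class name, the method name and the placeholder workload are hypothetical, and the sketch does not reflect how the Studio itself measures Job execution time.

```java
import java.util.concurrent.TimeUnit;

// Hypothetical sketch, not Talend code: measures the wall-clock time of a task.
class ExecTimeSketch {

    /** Runs the task and returns the elapsed wall-clock time in milliseconds. */
    static long timeMillis(Runnable task) {
        long start = System.nanoTime(); // monotonic clock, suited to intervals
        task.run();
        return TimeUnit.NANOSECONDS.toMillis(System.nanoTime() - start);
    }

    public static void main(String[] args) {
        long elapsed = timeMillis(() -> {
            // Placeholder workload standing in for a Job run.
            long sum = 0;
            for (int i = 0; i < 1_000_000; i++) {
                sum += i;
            }
        });
        System.out.println("Elapsed: " + elapsed + " ms");
    }
}
```

The sketch uses `System.nanoTime()` rather than `System.currentTimeMillis()` because the former is not affected by system clock adjustments during the run.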

You can also clear the design workspace before each Job execution by selecting the Clear before Run check box.

To have your Job saved before the execution starts, select the corresponding check box.

How to specify the number of MB used in each streaming chunk by Talend Data Mapper

When you run Jobs that contain maps created using Talend Data Mapper and that stream data, you can specify the number of MB used in each streaming chunk. The default is 10 MB, but you can increase this amount if you have more memory to devote to the transformation.

To specify the number of MB used in each streaming chunk:

  1. In the Run view, in the Advanced settings tab, select the Use specific JVM arguments check box.

  2. Click the New button and then, in the [Set the VM Argument] dialog box that opens, enter the argument to use.

    For example, use -DTDM_STREAM_MEMORY_LIMIT=20 to stream in chunks of 20 MB.

  3. Click OK to add the argument.
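
An argument of the form -Dname=value is a standard JVM option that sets a Java system property for the process. The following minimal sketch shows how such a property can be read in Java in general. The class name, the method name and the fallback behavior for missing or invalid values are assumptions made for illustration; only the property name and the 10 MB default come from the text above, and how Talend Data Mapper actually reads and validates the value is not described here.

```java
// Hypothetical sketch, not Talend code: reads a -D system property
// and falls back to a default when it is absent or invalid.
class StreamLimitSketch {

    static final String PROPERTY = "TDM_STREAM_MEMORY_LIMIT"; // name from the example above
    static final int DEFAULT_MB = 10;                         // default stated in the text

    /** Parses a chunk size in MB; the fallback rules are an assumption. */
    static int chunkSizeMb(String raw) {
        if (raw == null) {
            return DEFAULT_MB; // property not set on the command line
        }
        try {
            int mb = Integer.parseInt(raw.trim());
            return mb > 0 ? mb : DEFAULT_MB;
        } catch (NumberFormatException e) {
            return DEFAULT_MB; // not a whole number
        }
    }

    public static void main(String[] args) {
        // Started with -DTDM_STREAM_MEMORY_LIMIT=20, this reports 20 MB;
        // started without the argument, it reports the 10 MB default.
        int mb = chunkSizeMb(System.getProperty(PROPERTY));
        System.out.println("Chunk size: " + mb + " MB");
    }
}
```

Note that the value in the example is a plain number without a unit suffix.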