How to set advanced execution settings - 6.5

Talend Data Fabric Studio User Guide

EnrichVersion
6.5
EnrichProdName
Talend Data Fabric
task
Data Quality and Preparation
Design and Development
EnrichPlatform
Talend Studio

In the Advanced settings tab of the Run view, several advanced execution settings are available to make the execution of the Jobs or Route handier:

  • Statistics, this feature displays processing performance rate. For more information, see How to display Statistics.

  • Exec time, this feature displays the execution time in the console at the end of the execution. For more information, see How to display the execution time and other options.

  • Save Job before execution, this feature allows to automatically save the Job or Route before its execution.

  • Clear before run, this feature clears all the results of a previous execution before re-executing the Job or Route.

  • log4jLevel, this feature allows you to change the output level at runtime for log4j loggers activated in components in the Job or Route. For more information, see How to customize log4j output level at runtime.

  • JVM Setting, this feature allows you to define the parameters of your JVM according to your needs. For an example of how this can be used, see How to display special characters in the console.

How to display Statistics

The Statistics feature displays each component performance rate, under the flow links on the design workspace.

It shows the number of rows processed and the processing time in row per second, allowing you to spot straight away any bottleneck in the data processing flow.

For trigger links like OnComponentOK, OnComponentError, OnSubjobOK, OnSubjobError and If, the Statistics option displays the state of this trigger during the execution time of your Job: Ok or Error and True or False.

Note

Exception is made for external components which cannot offer this feature if their design does not include it.

In the Run view, click the Advanced settings tab and select the Statistics check box to activate the Stats feature and clear the box to disable it.

The calculation only starts when the Job or Route execution is launched, and stops at the end of it.

Click the Clear button from the Basic or Debug Run views to remove the calculated stats displayed. Select the Clear before Run check box to reset the Stats feature before each execution.

Note

The statistics thread slows down Job or Route execution as the Job or Route must send these stats data to the design workspace in order to be displayed.

You can also save your Job or Route before the execution starts. Select the relevant option check box.

How to display the execution time and other options

To display the total execution time, select in the Advanced settings tab of the Run view the Exec time check box before running the Job or Route.

This way you can test your Job or Route before going to production.

You can also clear the design workspace before each Job or Route execution by selecting the check box Clear before Run.

You can also save your Job or Route before the execution starts. Select the relevant option check box.

How to display special characters in the console

Talend Studio can display special characters in the console. To enable the display of Chinese, Japanese or Korean characters, for example, proceed as follows before executing the Job or Route:

  1. Select the Advanced settings tab.

  2. In the JVM settings area of the tab view, select the Use specific JVM arguments check box to activate the Argument table.

  3. Next to the Argument table, click the New... button to pop up the [Set the VM argument] dialog box.

  4. In the dialog box, type in -Dfile.encoding=UTF-8.

  5. Click OK to close the dialog box.

    This argument can be applied for all of your Job or Route executions in Talend Studio. For further information about how to apply this JVM argument for all of the Job executions, see Debug and Job execution preferences (Talend > Run/Debug).

How to specify the number of MB used in each streaming chunk by Talend Data Mapper

When running Jobs which contain maps created using Talend Data Mapper and which stream data, it is possible to specify the number of MB used in each streaming chunk. The default is 10MB, but you can increase this amount if you have more memory to devote to the transformation.

To specify the number of MB used in each streaming chunk:

  1. In the Run view, in the Advanced settings tab, select the Use specific JVM arguments checkbox.

  2. Click the New button and then, in the [Set the VM Argument] dialog box that opens, enter the argument to use.

    For example, use -DTDM_STREAM_MEMORY_LIMIT=20 to stream in chunks of 20MB.

  3. Click OK to add the argument.