Defining the maximum memory size threshold - 6.1

Talend Data Fabric Studio User Guide

EnrichVersion
6.1
EnrichProdName
Talend Data Fabric
task
Data Governance
Data Quality and Preparation
Design and Development
EnrichPlatform
Talend Studio

From the studio, you can control memory usage when using the Java engine to run two types of analyses: column analysis and the analysis of a set of columns.

Why would you like to set a memory limit when running such analyses? If you use column analysis or column set analysis to profile very big sets of data or data with many problems, you may run out of memory and end up with a Java heap error. By defining the maximum memory size threshold for these analyses, the Studio will stop the analysis execution when the memory limit size is reached and provide you with the analysis results that were measured on the data before the analysis execution was terminated by the memory limit size.

Prerequisite(s): You have selected the Profiling perspective of the studio.

To define the maximum memory size threshold, do the following:

  1. On the menu bar, select Window > Preferences to display the [Preferences] window.

  2. Either:

    • expand Talend > Profiling and select Analysis tuning, or,

    • start typing analysis tuning in the dynamic filter field.

    The Analysis tuning view is displayed.

  3. In the Memory area, select the Enable analysis thread memory control check box.

  4. Move the slider to the right to define the memory limit at which the analysis execution will be stopped.

The execution of any column analysis or column set analysis will be stopped if it exceeds the allocated memory size. The analysis results given in the Studio will cover the data analyzed before the interruption of the analysis execution.