
Defining the dataset sample size

To ensure optimal performance, Talend Data Preparation limits the number of rows displayed in the grid to 10,000 by default.

This means that even if you import a 50,000-row dataset, for example, only a sample made up of the first 10,000 rows is displayed in the application. This limit applies to all dataset types. However, the value is not hard-coded: you can change it by editing the Talend Data Preparation configuration file.
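As an illustration only, the effect of the limit can be pictured as taking the first rows of the dataset and ignoring the rest. The short Python sketch below is not Talend code; the head_sample function and the stand-in dataset are hypothetical names used to show what "a sample of the first 10,000 rows" means.

```python
from itertools import islice


def head_sample(rows, limit=10000):
    """Return at most `limit` rows, taken from the start of `rows`."""
    return list(islice(rows, limit))


# Stand-in for a 50,000-row dataset (hypothetical data, one integer per row).
dataset = range(50000)
sample = head_sample(dataset)
print(len(sample))
```

A dataset smaller than the limit is returned in full, since there are fewer rows than the limit allows.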

Procedure

  1. To change the maximum number of rows that can be displayed for your datasets, open the <Data_Preparation_Path>/config/application.properties file.
  2. Set the dataset.records.limit parameter to the desired value.
    The default value is 10000. To raise the limit to 30,000 rows, for example, set dataset.records.limit=30000.
  3. Save the file and restart your Talend Data Preparation instance.
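If you prefer to script the edit in step 2 rather than make it by hand, the following Python sketch shows one possible approach. It is a minimal example under stated assumptions, not a Talend utility: the set_property helper is hypothetical, the file is assumed to use plain key=value lines readable as UTF-8, and the demonstration runs against a temporary copy rather than your real <Data_Preparation_Path>/config/application.properties file. You still need to restart Talend Data Preparation afterwards, as described in step 3.

```python
import tempfile
from pathlib import Path


def set_property(path, key, value):
    """Set key=value in a .properties file, leaving all other lines unchanged."""
    file = Path(path)
    lines = file.read_text(encoding="utf-8").splitlines()
    updated = False
    for i, line in enumerate(lines):
        stripped = line.strip()
        # Leave comment lines alone.
        if stripped.startswith(("#", "!")):
            continue
        name, sep, _ = stripped.partition("=")
        # Replace only lines that assign this exact key.
        if sep and name.strip() == key:
            lines[i] = f"{key}={value}"
            updated = True
    # Append the property if the file did not define it yet.
    if not updated:
        lines.append(f"{key}={value}")
    file.write_text("\n".join(lines) + "\n", encoding="utf-8")


# Demonstration on a temporary file (hypothetical content, not a real Talend config).
with tempfile.TemporaryDirectory() as tmp:
    config = Path(tmp) / "application.properties"
    config.write_text("# sample file\ndataset.records.limit=10000\n", encoding="utf-8")
    set_property(config, "dataset.records.limit", 30000)
    print(config.read_text(encoding="utf-8"))
```

Back up the original configuration file before running anything like this against a real installation.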

Results

From now on, when you open a dataset in Talend Data Preparation, a sample of up to 30,000 rows is displayed in the grid.

Datasets that were cached before the configuration file update will keep their previous setting. For this reason, it is recommended to empty your cache after this operation.
