Working on large datasets - Cloud

Talend Cloud Data Preparation User Guide

Version
Cloud
Language
English
Product
Talend Cloud
Module
Talend Data Preparation
Content
Administration and Monitoring > Managing connections
Data Quality and Preparation > Cleansing data
Data Quality and Preparation > Managing datasets
Last publication date
2024-04-15
By default, a dataset that exceeds 10,000 rows in Talend Cloud Data Preparation is considered a large dataset.

Even if there is no limitation regarding the size of the dataset that you can create, the export settings and the display of large datasets are different than usual. You will be able to work on a sample displaying the first 10,000 rows, but your preparation can also be applied to the rest of your dataset. The following scenario will illustrate the example of a dataset containing 50,000 rows.