Data sampling properties - Cloud

Talend Cloud Pipeline Designer Processors Guide

Version: Cloud
Language: English
Product: Talend Cloud
Module: Talend Pipeline Designer
Content: Design and Development > Designing Pipelines
Last publication date: 2024-02-26

Properties to randomly extract a subset of your data.

Configuration
Property / Configuration

Ratio sampling
Enable this toggle to specify the percentage of records to extract from your input dataset to create your output subset of data.
Note: This percentage is an approximate value. For example, if the proportion is set to 20%, the processor might output 15% or 28% of the records. The larger the total number of records, the more precise the approximation.

Number of records
Enter the fixed number of records to extract from your input dataset to create your output subset of data.
Note: If you are designing a Streaming pipeline and you want to specify a fixed number of records, you need to add a Window processor before this processor.

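
The difference between the two modes can be illustrated with a minimal Python sketch. This is not Talend's implementation: the function names `ratio_sample` and `fixed_sample` are hypothetical, and the sketch assumes that ratio sampling keeps each record independently with the configured probability, which is one common way to obtain the approximate behavior described in the note above.

```python
import random


def ratio_sample(records, ratio, seed=None):
    """Keep each record independently with probability `ratio`.

    Assumption: per-record (Bernoulli) sampling, so the share of
    records kept is only approximately equal to `ratio`.
    """
    rng = random.Random(seed)
    return [r for r in records if rng.random() < ratio]


def fixed_sample(records, count, seed=None):
    """Return exactly `count` randomly chosen records.

    If the input holds fewer than `count` records, all of them are returned.
    The input must be finite, because it is materialized into a list first.
    """
    rng = random.Random(seed)
    records = list(records)
    return rng.sample(records, min(count, len(records)))


if __name__ == "__main__":
    # The observed share drifts further from 20% on small inputs.
    for total in (50, 5_000, 500_000):
        kept = ratio_sample(range(total), 0.20, seed=1)
        print(f"{total:>7} records in, {len(kept) / total:.1%} kept")

    subset = fixed_sample(range(1_000), 100, seed=1)
    print(len(subset), "records from fixed-count sampling")
```

In this sketch, `fixed_sample` only works on a finite collection. That is consistent with the note about Streaming pipelines, where a Window processor would first have to split the unbounded stream into bounded groups of records.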
Info
Property / Configuration

Description (optional)
Enter a description for the processor.

To rename the processor, click the Rename icon next to the default name of the processor.