tReservoirSampling - 7.0

Sampling

EnrichVersion
7.0
EnrichProdName
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
EnrichPlatform
Talend Studio

Extracts a random data sample from a large data set.

tReservoirSampling extracts a sample from the input data set in such a way that profiling results on the sample data are consistent with the profiling results on the full data set.
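
The component name suggests the classic reservoir sampling technique, which draws a fixed-size uniform sample in a single pass without knowing the input size in advance. The sketch below illustrates that general technique (often called Algorithm R) in Python; it is an assumption about the underlying idea, not the component's actual implementation, and the function name `reservoir_sample` and its parameters are hypothetical.

```python
import random
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")


def reservoir_sample(rows: Iterable[T], k: int, seed: Optional[int] = None) -> List[T]:
    """Return up to k rows chosen uniformly at random from rows in one pass.

    Hypothetical helper for illustration only; not part of any Talend API.
    """
    if k < 0:
        raise ValueError("k must be non-negative")

    rng = random.Random(seed)  # seeded generator so a run can be reproduced
    reservoir: List[T] = []

    for i, row in enumerate(rows):
        if i < k:
            # Fill the reservoir with the first k rows.
            reservoir.append(row)
        else:
            # Pick a slot in [0, i]; the new row replaces an existing one
            # only if the slot falls inside the reservoir (probability k / (i + 1)).
            j = rng.randint(0, i)
            if j < k:
                reservoir[j] = row

    return reservoir


if __name__ == "__main__":
    sample = reservoir_sample(range(1_000_000), k=5, seed=42)
    print(sample)
```

Because every row ends up in the reservoir with the same probability, statistics computed on the sample tend to approximate those of the full data set, which is the property the description above relies on. If the input holds fewer than `k` rows, the sketch simply returns all of them.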

For more technologies supported by Talend, see Talend components.