Configuring the sample data - 7.2

Sampling

Version
7.2
Language
English (United States)
Product
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio

Procedure

  1. Double-click tReservoirSampling to display the Basic settings view and define the component properties.
  2. Click the Edit schema button to view the input and output columns and, if needed, modify the output schema.
  3. In the Sample Size field, enter the number of rows you want to extract from the input flow, 24 in this example.
  4. Click the Advanced settings tab and enter a number of your choice in the Seed for random generator field.
    With a fixed value in this field, the Job extracts the same sample each time it is executed. Change the value if you want to extract a different sample.
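
The effect of the Sample Size and Seed for random generator settings can be illustrated outside Talend Studio. The following is a minimal sketch, not the component's actual implementation: it assumes a classic reservoir sampling scheme (Algorithm R), which the component name suggests but this procedure does not confirm. The function name reservoir_sample and its parameters are hypothetical and chosen only for illustration.

```python
import random
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")


def reservoir_sample(rows: Iterable[T], sample_size: int,
                     seed: Optional[int] = None) -> List[T]:
    """Return up to sample_size rows drawn uniformly from rows.

    Hypothetical helper: assumes Algorithm R, which may differ from
    what tReservoirSampling does internally.
    """
    # A private generator: the same seed always yields the same draws.
    rng = random.Random(seed)
    reservoir: List[T] = []
    for index, row in enumerate(rows):
        if index < sample_size:
            # Fill the reservoir with the first sample_size rows.
            reservoir.append(row)
        else:
            # Pick a slot in [0, index]; replace only if it falls
            # inside the reservoir, so every row has equal probability.
            slot = rng.randint(0, index)
            if slot < sample_size:
                reservoir[slot] = row
    return reservoir


# Stand-in for the input flow: 1000 rows identified by their index.
input_rows = range(1000)

# Sample size 24, as in this example; the seed values are arbitrary.
first = reservoir_sample(input_rows, 24, seed=12345)
second = reservoir_sample(input_rows, 24, seed=12345)
other = reservoir_sample(input_rows, 24, seed=54321)

print(first == second)  # same seed, same sample
print(len(other))
```

In this sketch, running the sampler twice with the same seed returns the identical list of 24 rows, which mirrors the behavior described in step 4. Passing a different seed changes the sequence of random draws, so the sample is generally different.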