Aggregating the sorted data - 7.1

Processing (Integration)

Version
7.1
Language
English (United States)
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for ESB
Talend Open Studio for MDM
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Processing components (Integration)
Data Quality and Preparation > Third-party systems > Processing components (Integration)
Design and Development > Third-party systems > Processing components (Integration)

Procedure

  1. Double-click tAggregateSortedRow to open its Basic settings view.
  2. Click the [...] button next to Edit schema and in the pop-up window define the schema by adding five columns: AggTeam of String type, AggCount, MinAge, MaxAge, and AvgAge of Integer type.
    Click OK to close the schema editor and accept the propagation prompted by the pop-up dialog box.
  3. In the Input rows count field, enter the exact number of rows of the input data. In this example, it is 8.
  4. Click the [+] button below the Group by table to add as many rows as required and specify the aggregation set in the table. In this example, the data will be aggregated based on the input column Team.
  5. Click the [+] button below the Operations table to add as many rows as required and specify the operation to be carried out and the corresponding input column from which the data will be taken for each output column. In this example, we want to calculate the number of the input entries, the minimum age, the maximum age, and the average age for each team.
  6. Double-click the second tLogRow to open its Basic settings view.
  7. In the Mode area, select Table (print values in cells of a table) for better readability of the sorting result.