Selecting the salary records above the average using a Map/Reduce Job - 7.3

Global variable

Version
7.3
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for ESB
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Custom code components (Integration) > Global variable components
Data Quality and Preparation > Third-party systems > Custom code components (Integration) > Global variable components
Design and Development > Third-party systems > Custom code components (Integration) > Global variable components

This scenario applies only to subscription-based Talend products with Big Data.

For more technologies supported by Talend, see Talend components.

In this scenario, a six-component Job is created to calculate the average salary of a set of sample data and select the salaries above the average.

The sample data to be used is already stored in the HDFS system to be used and read as follows:
1	Lyndon	1200	
2	Ronald	3500	
3	Ulysses	5000	
4	Harry	2000	
5	Garfield	1800	
6	James	3300	
7	Chester	4200	
8	Dwight	2200	
9	Jimmy	2800	
10	Herbert	3500

You can read that the separator between the fields is /t and the three columns of the sample data are id, name and salary.

You can use the tHDFSOutput component to write the sample data in the HDFS system to be used. For further information, see tHDFSOutput.