Counting words using custom map and reduce code - 6.4

Java custom code for Map Reduce

author
Talend Documentation Team
EnrichVersion
6.4
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Real-Time Big Data Platform
EnrichPlatform
Talend Studio

This scenario applies only to subscription-based Talend products with Big Data.

For more technologies supported by Talend, see Talend components.

Inspired by the WordCount MapReduce example in the Apache Hadoop wiki at http://wiki.apache.org/hadoop/WordCount, this scenario demonstrates how to use tJavaMR to create a MapReduce program that counts words.

This scenario uses the following sample data:
Hello world goodbye world
Hello hadoop bye Hadoop
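
To make the map and reduce phases of a word count concrete, the following is a minimal plain-Java sketch run against the sample data above. It is an illustration only, not the code that tJavaMR generates or expects: the class name WordCountSketch and the methods map, reduce and countWords are hypothetical, no Hadoop or Talend API is used, and the sketch assumes that words are separated by whitespace and compared case-sensitively (so "hadoop" and "Hadoop" are counted separately).

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustration only: hypothetical names, no Hadoop or Talend APIs involved.
class WordCountSketch {

    // Map phase: emit a (word, 1) pair for every whitespace-separated token.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String token : line.trim().split("\\s+")) {
            if (!token.isEmpty()) {
                pairs.add(new AbstractMap.SimpleEntry<>(token, 1));
            }
        }
        return pairs;
    }

    // Reduce phase: sum the counts emitted for each distinct word.
    // A LinkedHashMap keeps words in the order they were first seen.
    static Map<String, Integer> reduce(List<Map.Entry<String, Integer>> pairs) {
        Map<String, Integer> totals = new LinkedHashMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            totals.merge(pair.getKey(), pair.getValue(), Integer::sum);
        }
        return totals;
    }

    // Runs the map phase on every line, then reduces all emitted pairs.
    static Map<String, Integer> countWords(List<String> lines) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String line : lines) {
            pairs.addAll(map(line));
        }
        return reduce(pairs);
    }

    public static void main(String[] args) {
        List<String> sample = Arrays.asList(
                "Hello world goodbye world",
                "Hello hadoop bye Hadoop");
        // Prints: {Hello=2, world=2, goodbye=1, hadoop=1, bye=1, Hadoop=1}
        System.out.println(countWords(sample));
    }
}
```

In a real MapReduce Job the pairs emitted by the map phase are grouped by key across the cluster before the reduce phase runs; the sketch collapses that shuffle step into a single in-memory list for brevity.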

Before replicating this scenario, ensure that you have the appropriate rights and permissions to access the Hadoop distribution you will use. Then proceed as follows: