Executing the Job - 7.1

Kafka

author
Talend Documentation Team
EnrichVersion
7.1
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Open Studio for Big Data
Talend Real-Time Big Data Platform
task
Data Governance > Third-party systems > Messaging components (Integration) > Kafka components
Data Quality and Preparation > Third-party systems > Messaging components (Integration) > Kafka components
Design and Development > Third-party systems > Messaging components (Integration) > Kafka components
EnrichPlatform
Talend Studio

About this task

Then you can run this Job.

The tLogRow component is used to present the execution result of the Job.

Procedure

  1. Ensure that your Twitter streaming program is still running and keep writing the received Tweets into the given topic.
  2. Press F6 to run this Job.

Results

Leave the Job running a while and then in the console of the Run view, you can read the Job is listing the 5 most used hashtags in each batch of Tweets mentioning Paris. According to the configuration of the size of each micro batch and the Spark window, each of these Tweet batches contains the last 20 seconds' worth of Tweets received at the end of each 15-second interval.

Note that you can manage the level of the execution information to be outputted in this console by selecting the log4jLevel check box in the Advanced settings tab and then selecting the level of the information you want to display.

For more information on the log4j logging levels, see the Apache documentation at http://logging.apache.org/log4j/1.2/apidocs/org/apache/log4j/Level.html.