Architecture of Talend Data Preparation in cluster mode - 7.1

Talend Big Data Platform Installation Guide for Linux

English (United States)
Talend Big Data Platform
Talend Activity Monitoring Console
Talend Administration Center
Talend Artifact Repository
Talend CommandLine
Talend Data Preparation
Talend Data Stewardship
Talend DQ Portal
Talend Identity and Access Management
Talend Installer
Talend JobServer
Talend Log Server
Talend Repository Manager
Talend Runtime
Talend SAP RFC Server
Talend Studio
Installation and Upgrade

The following diagram illustrates the architecture behind Talend Data Preparation and Talend Dictionary Service when set up in cluster mode.

This architecture is composed of four main functional blocks:

  • A Load Balancer, that distributes the workload from the different users accessing the Talend Data Preparation Web application at the same time, as well as the Talend Dictionary Service server.
    Note: The same Load Balancer can be used for Talend Data Preparation, Talend Data Stewardship and Talend Dictionary Service. In addition, the Load Balancer can be either physical or logical.
  • The Talend Data Preparation instances, connected by a Network File System or any shared folder available to all the Talend Data Preparation instances.
  • The Talend Dictionary Service instances, that you can optionally install if you want to add, remove, or edit the semantic types used on data in Talend Data Preparation.
  • A block containing the various componants necessary for Talend Data Preparation and Talend Dictionary Service to work, namely several instances of MongoDB for storage, Kafka and Zookeeper for messaging, and an instance of Talend Administration Center to manage authorizations.