Architecture of the Talend products - 6.3

Talend Big Data Platform Installation Guide for Mac

Talend Big Data Platform
Talend Studio
Installation and Upgrade

The operating principles of the Talend products could be summarized as briefly as the following topics:

  • building technical or business-related processes and data profiling analyses,

  • administrating users, projects, access rights and processes and their dependencies,

  • deploying and executing technical processes,

  • monitoring the execution of technical processes.


Depending on your license, some of the functional blocks may not be available to you.

Each of the above topics can be isolated in different functional blocks and the different types of blocks and their interoperability can be described as in the following architecture diagram :

Building and administrating

  • The CLIENTS block includes one or more Talend Studio APIs and Web browsers that could be on the same or on different machines.

    From the Studio API, end-users can carry out technical processes and data profiling analyses and reports regardless of data volume and process complexity. The Talend Studio allows the user to work on any project for which he has authorization. For more information, see the Talend Studio User Guide.

    From a Web browser, end-users connect to the remotely based Administration Center through a secured HTTP protocol. The end-user category in this description may include developers, project managers, administrators and any other person involved in building data flows, Each of these end-users will use either the Studio or the Administration Center or both of them depending on the company policy.

    Additionally, end-users also use a Web browser to connect to Talend Data Quality Portal that plugs to the data quality data mart and extends the dynamic reporting and monitoring capabilities of the Talend Studio, and to Talend Data Stewardship Console to resolve the data issues included in the listed tasks to reach the golden data record (if they subscribed to this option).

  • The TALEND SERVERS and DATABASES blocks and the Git/SVN grey circle include a web-based Administration Center (application server) connected to two shared repositories: one based on a Git or SVN server and one based on a database server (Admin).

    The Administration Center enables the management and administration of all projects. Administration metadata (user accounts, access rights and project authorization for example) is stored in the database server and project metadata (Jobs, Business Models, Routines for example) is stored in the Git or SVN server (to easily share them between the different end-users).

    The Administration Center also enables to configure the tasks that handle job executions and triggers. It also looks after the job generation and deployment to the execution servers. For more information, see the Talend Administration Center User Guide.

    Talend Data Preparation Web application allows you to import your data, from local files or other sources, and cleanse or enrich it by creating new preparations on this data.

    Talend Data Stewardship Web application allows campaign owners and data stewards to manage campaigns and tasks.

    Talend Dictionary Service allows you to add, remove or edit the semantic types used on data in the web applications.

Deploying and executing

  • The Nexus grey circle represents the artifact repository that stores all the:

    • Software Updates available for download.

  • The TALEND EXECUTION SERVERS block represents the execution servers that run technical processes according to the execution scheduling set up in the Talend Administration Center Web application. Those execution servers can be of:

    • One or more Talend Runtimes (execution container) deployed inside your information system. The Talend Runtime deploys and executes the technical processes according to the set up defined in the Talend Administration Center Web application. Those processes are Jobs built from the Studio and centralized on the Git or SVN server. .

    • One or more JobServers deployed inside your information system that run technical processes (Jobs) according to scheduled time, date or event set in the Talend Administration Center Web application.

      The end-user can transfer technical processes to a remote execution server directly from the Studio (distant run).


      You must install the JobServer files ("Agent"), delivered by Talend, on each of the execution servers to become operational.


  • The Monitoring circle represents the monitoring: the Activity Monitoring Console.

    The Activity Monitoring Console allows end-users to monitor the execution of technical processes. It provides detailed monitoring capabilities that can be used to consolidate log information collected, understand the interaction between underlying data flows, prevent faults that could be unexpectedly generated and support system management decisions. For more information on the Activity Monitoring Console, see the Talend Activity Monitoring Console User Guide.