Talend Data Stewardship architecture - 7.1

Talend Data Stewardship User Guide

English (United States)
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Talend Data Stewardship
Administration and Monitoring > Managing users
Data Governance > Assigning tasks
Data Governance > Managing campaigns
Data Governance > Managing data models
Data Quality and Preparation > Handling tasks
Data Quality and Preparation > Managing semantic types
This architecture diagram identifies the functional blocks of Talend Data Stewardship, and the interactions among them.

Several functional blocks are defined:

  • The Clients block, with a web browser and a Talend Studio.

    From the web browser, you access the Talend Data Stewardship Web application. This is where campaign owners and data stewards manage campaigns and tasks.

    From Talend Studio, campaign owners can use Jobs to load, retrieve or delete tasks from the campaigns created in the web application. tDataStewarshipTaskInput, tDataStewardshipTaskOutput, and tDataStewardshipTaskDelete are the three dedicated component to work with Talend Data Stewardship.

    From the command line or a user interface, you can optionally access the Talend Dictionary Service server and add, remove or edit the semantic types used on data in the web application.

  • The Servers block includes the Talend Data Stewardship application server, connected to Talend Administration Center and optionally to the Talend Dictionary Service server.

    The Talend Identity and Access Management Service is used to enable Single Sign-On.

    Talend Administration Center allows administrators to manage licenses, users and roles. Assigning one or more of the predefined roles to users grants them specific rights to what can they access or perform in Talend Data Stewardship. For further information, see Talend Administration Center User Guide.

    Talend Dictionary Service allows campaign owners to manage the semantic categories used on data in the web application. It communicates with the Kafka server.

    The Kafka server is used to exchange messages on every single event done in the web application such as modifying a data model, using a data model in a campaign, auditing actions performed on tasks and managing semantic categories using Talend Dictionary Service.

  • The Databases block contains the databases used with Talend Administration Center and a MongoDB database.

    The Administration database is used to manage user accounts and rights, the Audit database is used to evaluate different aspects of the Jobs implemented in the Studio and the Monitoring database is used to monitor the execution of technical processes and service calls.

    One MongoDB is used to store campaigns and tasks and another MongoDB is used to store semantic types. Nothing is saved directly on your computer.

    However, you can share one MongoDB between Talend Data Stewardship and Talend Dictionary Service. In such a case you need to have two databases on the MongoDB instance.