Talend Cloud Data Inventory architecture - Cloud

Talend Cloud Data Inventory User Guide

Talend Cloud
Talend Data Inventory
Administration and Monitoring > Managing connections
Data Governance
Data Quality and Preparation > Enriching data
Data Quality and Preparation > Identifying data
Data Quality and Preparation > Managing datasets

This architecture diagram identifies the functional blocks of Talend Cloud Data Inventory.

This diagram is divided into two main parts: the local network and the cloud infrastructure.

Local network

The local network includes a web browser that is used to access and manage your data assets in Talend Cloud Data Inventory and a Remote Engine Gen2 to run objects from the other Talend Cloud applications, as well as creating connections and fetching data samples.

Cloud infrastructure

The cloud infrastructure includes the cloud applications, that rely on the Dataset service for some of them, and the Cloud Engine for Design.

  • The Dataset service is what provides the unified dataset list within Talend Cloud.
  • Talend Cloud Data Inventory is the central place where you access and maintain your dataset collection. You will be able to quickly search your data, assess its quality, rate, document or share it with other data consumers.
  • Talend Cloud Data Preparation and Talend Cloud Pipeline Designer are the two other applications that benefit from the common dataset inventory, and allow you to cleanse or transform your data.
  • In Talend Cloud Management Console, you can administrate roles, users, projects, and licences. You can create new users for the cloud applications and assign them to custom groups. You can then define roles and assign them to your users.
  • The Cloud Engine for Design is used to run artifacts, tasks, preparations and pipelines in the cloud, as well as creating connections and fetching data samples.