Dataset overview - Cloud

Talend Cloud Data Inventory User Guide

Version: Cloud
Language: English
Product: Talend Cloud
Module: Talend Data Inventory
Content: Administration and Monitoring > Managing connections; Data Governance; Data Quality and Preparation > Enriching data; Data Quality and Preparation > Identifying data; Data Quality and Preparation > Managing datasets
Last publication date: 2024-02-28

When you select a dataset from the list, the dataset overview panel opens and displays information and metadata about that dataset.

Note: This feature becomes available for Talend Cloud Pipeline Designer and Talend Cloud Data Preparation users when Talend Cloud Data Inventory is enabled for the account.
The information that you can find at a glance is organized into tiles:
  • Talend Trust Score™: Visualize the Talend Trust Score™ of your dataset along five metric axes and learn how to improve its overall trustworthiness.
  • Data quality: Get a quick look at the quality of your data with dedicated bar charts that show the distribution of empty, invalid, and valid values across the entire dataset.
  • Data quality rules: List of rules applied to this dataset. Each compliance bar shows the distribution of invalid, non-applicable, and valid values.
  • Schema: See the list of columns that make up the structure of your dataset, as well as the semantic type and quality for each column.
  • Preparations: List of preparations that use this dataset as a source, as well as a list of preparations that are compatible with this dataset and can be applied directly.
  • Pipelines: List of pipelines that use this dataset as a source or destination.
  • Rating: This tile lets you apply or edit your individual rating and see the global rating of the dataset.
  • Description: The optional description that you entered when creating the dataset can be found here. You can also edit it to include any other context information you want to share about this dataset.
  • Custom attributes: All the custom attribute definitions that have been created for the tenant are grouped in this tile. From there, you can apply a value to any of the categories or modify an existing one to complete the dataset metadata.
  • Tags: Easily apply tags to better document your dataset and improve its searchability.
  • API: This tile is visible for compatible datasets. It lets you enable an API so that consumers can retrieve the dataset information, and lets you monitor its activity.
  • Details: This tile groups the basic information about the dataset: its creator, the creation and last modification dates, and who last modified it.
Figure: Dataset overview panel showing Talend Trust Score™ information, data quality, data quality rules, and the schema of a dataset named 'Scholarship'.
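
To make the Data quality tile more concrete, the following minimal Python sketch shows one way a distribution of empty, invalid, and valid values could be computed for a single column. It is only an illustration and not how Talend Cloud Data Inventory computes quality: the function names `classify` and `quality_distribution`, the sample values, and the use of a regular expression as a stand-in for a semantic type check are all assumptions made for this example.

```python
import re
from collections import Counter

def classify(value, pattern):
    """Classify one cell as 'empty', 'valid', or 'invalid'.

    Assumption: a value is valid when it fully matches `pattern`,
    a regular expression standing in for a semantic type check.
    """
    if value is None or str(value).strip() == "":
        return "empty"
    text = str(value).strip()
    return "valid" if re.fullmatch(pattern, text) else "invalid"

def quality_distribution(values, pattern):
    """Return the share of empty, invalid, and valid values in a column."""
    counts = Counter(classify(v, pattern) for v in values)
    total = len(values)
    return {
        key: (counts[key] / total if total else 0.0)
        for key in ("empty", "invalid", "valid")
    }

# Hypothetical sample column and a deliberately simple email pattern.
emails = ["a@example.com", "", "not-an-email", None, "b@example.org"]
EMAIL_PATTERN = r"[^@\s]+@[^@\s]+\.[^@\s]+"

dist = quality_distribution(emails, EMAIL_PATTERN)
print(dist)
```

Each share in the returned dictionary corresponds to one segment of a quality bar. In this sample, two of the five values are empty, one is invalid, and two are valid.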