tDataQualityRules - Cloud - 8.0

Validation (Integration)

Version
Cloud
8.0
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Data Quality components > Validation components (Integration)
Data Quality and Preparation > Third-party systems > Data Quality components > Validation components (Integration)
Design and Development > Third-party systems > Data Quality components > Validation components (Integration)
Last publication date
2024-02-20

Validates data using data quality rules by connecting Talend Studio to:

  • Talend Cloud Data Stewardship. For more information, see What is a data quality rule? in the Talend Cloud Data Stewardship User Guide.
  • The hybrid version of Talend Data Stewardship 8.0 R2022-07 and greater. For more information, see the What is a data quality rule? in the Talend Data Stewardship User Guide. For more information on the hybrid version, see What is hybrid for Talend Cloud?.
  • Talend Cloud Data Inventory. For more information, see the What is a data quality rule? in the Talend Cloud Data Inventory User Guide. You can connect to this app in Talend Studio 8.0 R2023-06 and greater.

A data quality rule is a set of business requirements which defines the values your data must comply with.

To retrieve the data quality rules into a JAR file, tDataQualityRules connects Talend Studio to one of the app listed above. Talend Studio uses the retrieved library so you can apply data quality rules to your data. For more information on how the component works, see tDataQualityRules local and Cloud/hybrid process.

The component returns the data into categories and subcategories:
  • Valid:
    • Valid: The data fulfill the condition and the validation expression or the data fulfill the alternative validation expression only.
    • Not applicable (NA): The data do not fulfill the condition. The rule cannot be applied to the data.

    These data follow the Main flow.

  • Invalid:
    • Invalid: The data fulfill the condition but not the validation expression.
    • Not executable (NE): The rule cannot be executed on the data.

    These data follow the Reject flow.

This component is not shipped with your Talend Studio by default. You need to install it using the Feature Manager. For more information, see Installing features using the Feature Manager.

tDataQualityRules supports:
  • In local mode, Apache Spark 3.0 and greater.
  • Cloudera Data Engineering service with Apache Spark 3.1 or 3.2.

For more technologies supported by Talend, see Talend components.