Ensures the data quality of any source data against a reference data source.
tSchemaComplianceCheck validates all input rows against a reference schema or check types, nullability, length of rows against reference values. The validation can be carried out in full or partly.
This component is not shipped with your Talend Studio by default. You need to install it using the Feature Manager. For more information, see Installing features using the Feature Manager.
For more technologies supported by Talend, see Talend components.
Depending on the Talend product you are using, this component can be used in one, some or all of the following Job frameworks:
-
Standard: see tSchemaComplianceCheck Standard properties.
The component in this framework is available in all Talend products.
-
Spark Batch: see tSchemaComplianceCheck for Apache Spark Batch.
The component in this framework is available in all subscription-based Talend products with Big Data and Talend Data Fabric.
-
Spark Streaming: see tSchemaComplianceCheck for Apache Spark Streaming.
This component is available in Talend Real Time Big Data Platform and Talend Data Fabric.