tXSDValidator Properties - 6.3

Talend Open Studio for Big Data Components Reference Guide

Component family

XML

 

Basic settings

Mode

Select the validation mode from the drop-down list.

  • File Mode: to validate an input file.

  • Flow Mode: to validate an input flow.

 

Schema and Edit schema

A schema is a row description. It defines the number of fields to be processed and passed on to the next component.

Note that when File Mode is selected from the Mode list, the schema of this component is read-only and it contains standard information regarding the file validation.

 

XSD file

Specify the path to the XSD reference file. An HTTP URL is also supported, for example http://localhost:8080/book.xsd.

This field is available only when File Mode is selected from the Mode drop-down list.

 

XML file

Specify the path to the XML file to be validated.

This field is available only when File Mode is selected from the Mode drop-down list.

 

If XML is valid, display

Type in the message to be displayed on the console if the XML file is valid.

This field is available only when File Mode is selected from the Mode drop-down list.

 

If XML is invalid, display

Type in the message to be displayed on the console if the XML file is invalid.

This field is available only when File Mode is selected from the Mode drop-down list.

 

Print to console

Select this check box to display the validation message on the console.

This check box is available only when File Mode is selected from the Mode drop-down list.
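
The File Mode settings above can be pictured with a minimal Java sketch built on the standard javax.xml.validation API. It is not the code that the component generates. The class name, the inline XSD and XML strings, and the two console messages are hypothetical stand-ins for the XSD file, XML file, and display fields.

```java
import java.io.StringReader;
import javax.xml.XMLConstants;
import javax.xml.transform.stream.StreamSource;
import javax.xml.validation.Schema;
import javax.xml.validation.SchemaFactory;
import javax.xml.validation.Validator;

class XsdFileModeSketch {

    // Hypothetical schema standing in for the "XSD file" field.
    // With real files, new StreamSource(new java.io.File(path)) could be used instead.
    static final String XSD =
        "<xs:schema xmlns:xs=\"http://www.w3.org/2001/XMLSchema\">"
      + "<xs:element name=\"book\">"
      + "<xs:complexType><xs:sequence>"
      + "<xs:element name=\"title\" type=\"xs:string\"/>"
      + "</xs:sequence></xs:complexType>"
      + "</xs:element></xs:schema>";

    // Returns validMsg when the document conforms to the schema, invalidMsg otherwise.
    static String validate(String xsd, String xml, String validMsg, String invalidMsg) {
        try {
            SchemaFactory factory = SchemaFactory.newInstance(XMLConstants.W3C_XML_SCHEMA_NS_URI);
            Schema schema = factory.newSchema(new StreamSource(new StringReader(xsd)));
            Validator validator = schema.newValidator();
            // With no error handler set, validation errors are thrown as exceptions.
            validator.validate(new StreamSource(new StringReader(xml)));
            return validMsg;
        } catch (Exception e) {
            // Covers schema errors, validation errors, and I/O errors alike.
            return invalidMsg;
        }
    }

    public static void main(String[] args) {
        String xml = "<book><title>XML Basics</title></book>";
        // Equivalent of selecting "Print to console" with the two display messages.
        System.out.println(validate(XSD, xml, "File is valid", "File is invalid"));
    }
}
```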

 

Allocate

Click the [+] button to add as many rows as needed, and in each row set the value of the following columns:

  • Input Column: click the cell and select a column to be validated.

  • XSD File: enter the path to the corresponding XSD reference file.

This table is available only when Flow Mode is selected from the Mode drop-down list.
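
As a rough illustration of Flow Mode, the following Java sketch validates one XML column value per row against a schema that is compiled once and reused. It assumes the standard javax.xml.validation API and does not reproduce the component's internals. The class, the Result type, and the sample rows are hypothetical; the two result fields only mirror the VALID and XSD_ERROR_MESSAGE variables described under Global Variables.

```java
import java.io.StringReader;
import java.util.Arrays;
import java.util.List;
import javax.xml.XMLConstants;
import javax.xml.transform.stream.StreamSource;
import javax.xml.validation.Schema;
import javax.xml.validation.SchemaFactory;

class XsdFlowModeSketch {

    // Hypothetical schema standing in for one "XSD File" cell of the Allocate table.
    static final String XSD =
        "<xs:schema xmlns:xs=\"http://www.w3.org/2001/XMLSchema\">"
      + "<xs:element name=\"book\">"
      + "<xs:complexType><xs:sequence>"
      + "<xs:element name=\"title\" type=\"xs:string\"/>"
      + "</xs:sequence></xs:complexType>"
      + "</xs:element></xs:schema>";

    // Per-row outcome: a validity flag and the validator's error text, if any.
    static final class Result {
        final boolean valid;
        final String xsdErrorMessage;

        Result(boolean valid, String xsdErrorMessage) {
            this.valid = valid;
            this.xsdErrorMessage = xsdErrorMessage;
        }
    }

    // Compiles the schema once; a Schema object can be shared across rows.
    static Schema compile(String xsd) {
        try {
            return SchemaFactory.newInstance(XMLConstants.W3C_XML_SCHEMA_NS_URI)
                    .newSchema(new StreamSource(new StringReader(xsd)));
        } catch (org.xml.sax.SAXException e) {
            throw new IllegalArgumentException("XSD could not be compiled", e);
        }
    }

    // Validates the content of one input column for one row.
    static Result validateColumn(Schema schema, String xmlValue) {
        try {
            // A Validator is not thread-safe, so a fresh one is created per value.
            schema.newValidator().validate(new StreamSource(new StringReader(xmlValue)));
            return new Result(true, "");
        } catch (Exception e) {
            return new Result(false, String.valueOf(e.getMessage()));
        }
    }

    public static void main(String[] args) {
        Schema schema = compile(XSD);
        List<String> rows = Arrays.asList(
            "<book><title>XML Basics</title></book>",
            "<book/>");
        for (String row : rows) {
            Result r = validateColumn(schema, row);
            System.out.println(r.valid + " " + r.xsdErrorMessage);
        }
    }
}
```

Compiling the schema once and creating a validator per value keeps the per-row cost low while avoiding shared mutable state.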

Advanced settings

Enable Features

Click the [+] button to add as many rows as needed, and in each row enter the feature to be enabled on the underlying parser between double quotation marks, for example, "http://apache.org/xml/features/honour-all-schemaLocations".

For more information about the features, see https://xerces.apache.org/xerces2-j/features.html.
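
The effect of a row in the Enable Features table can be approximated by the following Java sketch, which switches a feature on through the standard SchemaFactory.setFeature method. It is an illustration under assumptions, not the component's generated code: the class and helper names are hypothetical, and whether a given Xerces feature URI is recognized depends on the schema factory implementation found at run time.

```java
import javax.xml.XMLConstants;
import javax.xml.validation.SchemaFactory;
import org.xml.sax.SAXNotRecognizedException;
import org.xml.sax.SAXNotSupportedException;

class XsdFeatureSketch {

    // Tries to enable one feature; returns false if the factory rejects it.
    static boolean enableFeature(SchemaFactory factory, String feature) {
        try {
            factory.setFeature(feature, true);
            return true;
        } catch (SAXNotRecognizedException | SAXNotSupportedException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        SchemaFactory factory = SchemaFactory.newInstance(XMLConstants.W3C_XML_SCHEMA_NS_URI);
        // Feature URI taken from the example above; acceptance is implementation-dependent.
        String feature = "http://apache.org/xml/features/honour-all-schemaLocations";
        System.out.println(feature + " enabled: " + enableFeature(factory, feature));
    }
}
```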

 

Encoding

Enter the encoding type between double quotation marks.

tStatCatcher Statistics

Select this check box to gather the Job processing metadata at the Job level as well as at each component level.

Global Variables

ERROR_MESSAGE: the error message generated by the component when an error occurs. This is an After variable and it returns a string. If the component has a Die on error check box, this variable functions only when that check box is cleared.

DIFFERENCE: the result of the validation. This is a Flow variable and it returns a string.

VALID: the validation result. This is a Flow variable and it returns a boolean.

XSD_ERROR_MESSAGE: the XSD error message generated by the component. This is a Flow variable and it returns a string.

A Flow variable functions during the execution of a component while an After variable functions after the execution of the component.

To fill in a field or expression with a variable, press Ctrl + Space to access the variable list and choose the variable to use from it.

For further information about variables, see the Talend Studio User Guide.
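
The variables above are typically read in Java expressions through the Job's globalMap, with the component label prefixed to the variable name. The sketch below assumes that key convention and a hypothetical component label, tXSDValidator_1. A plain HashMap stands in for the globalMap that the Studio provides inside a Job.

```java
import java.util.HashMap;
import java.util.Map;

class GlobalVarSketch {

    // Builds a short status line from the VALID and XSD_ERROR_MESSAGE variables.
    static String describe(Map<String, Object> globalMap, String componentName) {
        Boolean valid = (Boolean) globalMap.get(componentName + "_VALID");
        String xsdError = (String) globalMap.get(componentName + "_XSD_ERROR_MESSAGE");
        if (Boolean.TRUE.equals(valid)) {
            return "valid";
        }
        return "invalid: " + xsdError;
    }

    public static void main(String[] args) {
        // Stand-in for the globalMap of a running Job; values are made up.
        Map<String, Object> globalMap = new HashMap<>();
        globalMap.put("tXSDValidator_1_VALID", Boolean.FALSE);
        globalMap.put("tXSDValidator_1_XSD_ERROR_MESSAGE", "sample error text");
        System.out.println(describe(globalMap, "tXSDValidator_1"));
    }
}
```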

Usage

When File Mode is selected, this component can be used as a standalone component, but it is usually linked to an output component to gather the log data.

Limitation

n/a