Setting and managing parser rules

Talend Data Management Platform Studio User Guide

EnrichVersion
6.2
EnrichProdName
Talend Data Management Platform
task
Data Quality and Preparation
Design and Development
EnrichPlatform
Talend Studio

Parser rules describe and define the expected composition of a data string to be analyzed using the ANTLR symbols. Using parser rules this way allows you to normalize and standardize the analyzed data according to your expectation. Parser rules can also be called standardization rules.

For example, a parser rule reading <INT><WORD+><StreetType> defines a standard address data format like 12 main street.

You can create a set of parser rules either form the DQ Repository tree view in the Profiling perspective of the studio or from the Basic settings view of the tStandardizeRow component. However, when you need to test, store and manage the sets of parser rules for purposes such as sharing, reusing, or adapting any given set of rules, you must use the DQ Repository tree view.

For further information about how to create and use parser rules to analyze, normalize and standardize data using tStandardizeRow, see Talend Components Reference Guide.

For further information about the ANTLR parser, go to: http://www.antlr.org/.

The following sections present how to create and manage parser rules from the DQ Repository tree view.