Default schema columns - 6.5

Synonym index

author
Talend Documentation Team
EnrichVersion
6.5
EnrichProdName
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
task
Data Governance > Third-party systems > Data Quality components > Standardization components > Synonym index components
Data Quality and Preparation > Third-party systems > Data Quality components > Standardization components > Synonym index components
Design and Development > Third-party systems > Data Quality components > Standardization components > Synonym index components
EnrichPlatform
Talend Studio

This section presents the detailed information about the default schema columns provided natively with the tSynonymSearch component.

Tip: In addition to the matching-related information presented in the default schema columns, you need to define more columns in order to output the input data and their matched reference entries.

Columns

Description

GID

Group IDs. These IDs are created automatically at runtime to index the input data groups recognized by this component.

GRP_SIZE

Numbers of the matched reference entries for each group of the input data. This size is limited by the number you set in the Limit of each group field and presents always the entries scored the highest.

SCORE

Lucene score used to measure in total the match degree between the selected input columns and their matched reference entries. The Lucene score is a numerical value that starts from 0 and is not bounded. Good matches will usually score higher than 1, but there is no definite rule to choose what is a good match and what is a bad match.

SCORES

Lucene scores used to measure the match degree between each input column you have selected and its matching reference entries.

NB_MATCHED _FIELDS

Number of the input columns you have selected for the matching operation.