Address verification codes in tLoqateAddressRow - 6.1

Talend Components Reference Guide

EnrichVersion
6.1
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for Data Quality
Talend Open Studio for ESB
Talend Open Studio for MDM
Talend Real-Time Big Data Platform
task
Data Governance
Data Quality and Preparation
Design and Development
EnrichPlatform
Talend Studio

The tLoqateAddressRow component outputs an ACCURACYCODE column. This column holds the verification codes for processed addresses.

The verification code is made up of the following values:

Verification code values

Description

The verification status

used to specify the full mailing address in the relevant country.

The post-processed verification match level.

used to specify input data for the address line in the relevant country, split into individual address lines.

The pre-processed verification match level

used to specify the full address including line breaks without the Organization, Locality, AdministrativeArea and PostalCode fields.

The parsing status

used to specify the individual lines contained within the DeliveryAddress field.

The lexicon identification match level

used to supply the country name or code.

The context identification match level

used to supply the ISO 3166 official country name.

The postcode status

used to supply the ISO 3166 2-character country code.

The matchscore

used to supply the ISO 3166 3-character country code.

For example, the V44-I44-P3-100 verification code implies:

  • Verification status = V (verified): a complete match was made between the input address and a single record from the available reference data.

  • Post-processed verification match level = 4 (premises): the level to which the input data matches the available reference data once all changes and additions performed during the verification process have been taken into account.

  • Pre-processed verification match level = 4 (premises): the level to which the input data matches the available reference data prior to any changes or additions performed during the verification process.

  • Parsing status = I (identified and parsed): all components of the input data have been able to be identified and placed into output fields.

  • Lexicon identification match level = 4 (premises): using pattern matching, a numeric value or word has been identified as a premise number or name.

  • Context identification match level = 4 (premises): using a least accurate form of matching, a numeric value or word has been identified as a premises number or name.

  • Postcode Status = P3 (added): the primary postal code for the country has been added.

  • Match score = 100 (complete similarity): the input data and closest reference data match completely.

The following sections explain in more details all segments of the verification code.

Verification status

The verification status can be one of the followings:

Status

Description

V (Verified)

the address was parsed and an exact match in the reference data was found for all the address components.

P (Partially Verified)

the reference data has more detail than the input data for the address. The address was parsed and most of the components of the address were matched against the reference data.

U (Unverified)

the input data could not be parsed. The output fields will contain the input data.

A (Ambiguous)

more than one item in the reference data match the input data.

C (Conflict)

individual address components are valid, but the address is not valid when combining the components together.

R (Reverted)

the address was parsed and verified but a minimum acceptable level of verification was not reached. The output fields will contain the input data.

Post-processed verification match level

The post-processed verification match level gives the level to which the input data matches the available reference data once all changes and additions performed during the verification process have been taken into account.

Match level

Description

5

delivery point (PostBox or SubBuilding).

4

premises (Premises or Building).

3

thoroughfare.

2

locality.

1

administrative area.

0

none.

Pre-processed verification match level

The pre-processed verification match level gives the level to which the input data matches the available reference data prior to any changes or additions performed during the verification process.

Match level

Description

5

delivery point (PostBox or SubBuilding).

4

premises (Premises or Building).

3

thoroughfare.

2

locality.

1

administrative area.

0

none.

Parsing status

The parsing status can be one of the followings:

  • I (identified and parsed): all input data was identified and placed into different address fields.

  • U (unable to parse): not all input data was identified and parsed.

Lexicon identification match level

The lexicon identification match level gives the level to which the input data has some recognized form, through the use of:

  • pattern matching, for example a numeric value could be a premises number, and

  • lexicon matching, for example rd could be a Thoroughfare type (road) and London could be a Locality.

Match level

Description

5

delivery point (PostBox or SubBuilding).

4

premises (Premises or Building).

3

thoroughfare.

2

locality.

1

administrative area.

0

none.

Context identification match level

The context identification match level gives the level to which the input data can be recognized based on the context in which it appears.

This is the least accurate form of matching and is based on identifying a word as, for instance, a Thoroughfare based on it being preceded by something that could be a Premise, and followed by something that could be a Locality, the latter items being identified through a match against the reference data or the lexicon.

Match level

Description

5

delivery point (PostBox or SubBuilding).

4

premises (Premise or Building).

3

thoroughfare.

2

locality.

1

administrative area.

0

none.

Postcode status

The postal code status can be of the following values:

Status

Description

P8

PostalCodePrimary and PostalCodeSecondary are verified.

P7

PostalCodePrimary is verified and PostalCodeSecondary is added or changed.

P6

PostalCodePrimary is verified.

P5

PostalCodePrimary is verified with small change.

P4

PostalCodePrimary is verified with large change.

P3

PostalCodePrimary is added.

P2

PostalCodePrimary is identified by lexicon.

P1

PostalCodePrimary is identified by context.

P0

PostalCodePrimary is empty.

Match score

The match score gives the similarity between the input data and closest reference data match as a percentage between 0 and 100. 100% means complete similarity.