Configuring the output component and executing the Job - 7.1

Address standardization

author
Talend Documentation Team
EnrichVersion
7.1
EnrichProdName
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
task
Data Governance > Third-party systems > Data Quality components > Standardization components > Address standardization components
Data Quality and Preparation > Third-party systems > Data Quality components > Standardization components > Address standardization components
Design and Development > Third-party systems > Data Quality components > Standardization components > Address standardization components
EnrichPlatform
Talend Studio

Procedure

  1. Double-click the tFileOutputExcel component to display the Basic settings view and define the component properties.
  2. Set the destination file name as well as the sheet name and then select the Define all columns auto size check box.
  3. Save your Job and press F6 to execute it.
    The tAddressRowCloud component uploads data to the cloud, retrieves the corrected data and writes the result in the output file.
  4. Right-click the output component and select Data Viewer to display the formatted address data.
    tAddressRowCloud matches input address data against the Loqate repository.
    The all_info and Geo_info columns retrieve additional address information from the Raw_Response and GeoAccuracy columns respectively in the Loqate repository. The Raw_Response column provides you with all address information from the provider repository without any formatting. if you want this information to be more readable, you must parse it using json or xml.
    The STATUS output column returns the OK status for all address rows. This means that the verification process of all address rows could be completed successfully by the component. For further information about process status, see Process status in tLoqateAddressRow.
    The VerificationLevel output column provides you with a verification status of the processed addresses. For further information, see Verification status.
    The AddressVerificationCode output column returns a verification code for each of the processed address rows. For example, the first verification code V44-I45-P3-100 means:
    • Verification status = V (verified): a complete match was made between the input address and a single record from the available reference data.

    • Post-processed verification match level = 4 (premises): the level to which the input data matches the available reference data once all changes and additions performed during the verification process have been taken into account.

    • Pre-processed verification match level = 4 (premises): the level to which the input data matches the available reference data prior to any changes or additions performed during the verification process.

    • Parsing status = I (identified and parsed): all components of the input data have been able to be identified and placed into output fields.

    • Lexicon identification match level = 4 (premises): using pattern matching, a numeric value or word has been identified as a premises number or name.

    • Context identification match level = 5 (delivery point, PostBox or SubBuilding): a numeric value or word has been identified as a post box number or sub building name.

    • Postcode Status = P3 (added): the primary postal code for the country has been added.

    • Match score = 100 (complete similarity): the input data and closest reference data match completely.

    For further information about what values this code is made up of and the implications of each segment, see Address verification codes in tLoqateAddressRow.