Parsing addresses against Google - 7.1

Address standardization

author
Talend Documentation Team
EnrichVersion
7.1
EnrichProdName
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
EnrichPlatform
Talend Studio

About this task

To run the Parsing addresses against reference data in the Cloud Job against the Google Places API, do the following:

Procedure

  1. In the Basic settings view of tFixedFlowInput, click the Edit Schema button to create the schema.
    In the dialog box that opens, click the [+] button and add one column to hold the input address, in this example: address.
  2. Click OK.
    An address column is created in the Inline Table.
  3. In the Inline Table, enter the address data you want to analyze, for example:
    "1211 AVENUE OF AMERICAS FL 8 10036 NEW YORK USA"
    "B69 2lt 9kings   United Kingdom ave"
    "1729号 黄兴路 China, 200433"
    "15 Rue Nelaton   Paris PARIS 92800 France"
    "1211 AVENUE OF AMERICAS FL 8 10036 NEW YORK"
    "1 Rue de l'Abbaye, Paris"
    "1 Chemin de l'Abbaye, Paris"
    "1 Place de l'Abbaye basset, Paris"
    "8000 Cummings Hall,Hanover,New Hampshire,03755,"
    
  4. In the Basic settings view of tAddressRowCloud, select Google from the Address Provider list.
  5. In the License/API key field, enter the API key you generated in the Google Developer Console at https://developers.google.com/console/help/new/.
  6. In the Mapping table, click the [+] button to add a line and then select Address.
    The component maps the values of this field to the input column you set in this table.
  7. If required, select the Use Additional Output check box and use the Output Mapping table to retrieve additional address information from the provider repository.
    For further information, see Defining additional address fields.
  8. In the Advanced settings view, set Output Script to FRENCH and leave the other parameters unchanged.
  9. Save your Job and press F6 to execute it.
    The tAddressRowCloud component uploads data to the cloud, retrieves the corrected data, and writes the result to the output file.
  10. Right-click the output component and select Data Viewer to display the formatted address data.
    tAddressRowCloud matches the input address data against the Google Places API and writes the formatted addresses to the output file.
    The VerificationLevel output column provides you with a verification status of the processed addresses. For further information, see Address verification levels in tAddressRowCloud.
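
For readers who want a feel for what an address lookup against the Google Places API can look like outside the Studio, the following Java sketch builds a request URL for the public Find Place from Text endpoint from one raw address string. It is only an illustration under stated assumptions: this page does not describe the requests tAddressRowCloud actually sends, the class and method names (PlacesQuerySketch, buildUrl) are hypothetical, and the use of the language parameter as a rough counterpart of the Output Script setting is an assumption. The sketch does not call the network; it only prints the URLs. It assumes Java 10 or later.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;
import java.util.List;

// Hypothetical helper, not part of Talend or of the tAddressRowCloud component.
final class PlacesQuerySketch {

    // Public Places API "Find Place from Text" endpoint (JSON output).
    private static final String ENDPOINT =
            "https://maps.googleapis.com/maps/api/place/findplacefromtext/json";

    // Builds a request URL asking for the formatted address of one raw input string.
    // The language argument is an assumption about how an output language could be requested.
    static String buildUrl(String address, String apiKey, String language) {
        return ENDPOINT
                + "?input=" + encode(address)
                + "&inputtype=textquery"
                + "&fields=formatted_address"
                + "&language=" + encode(language)
                + "&key=" + encode(apiKey);
    }

    // Form-encodes a value so that spaces, commas, and non-ASCII characters are URL-safe.
    private static String encode(String value) {
        return URLEncoder.encode(value, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Two of the sample rows entered in the Inline Table above.
        List<String> addresses = List.of(
                "1 Rue de l'Abbaye, Paris",
                "8000 Cummings Hall,Hanover,New Hampshire,03755,");
        for (String address : addresses) {
            // YOUR_API_KEY is a placeholder for the key entered in the License/API key field.
            System.out.println(buildUrl(address, "YOUR_API_KEY", "fr"));
        }
    }
}
```

Building the URL separately from sending it keeps the sketch easy to inspect: you can paste a printed URL, with a valid key, into a browser to compare the provider's raw response with the formatted addresses shown in the Data Viewer.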