Scenario: Identifying a real-world geographic location of an IP - 6.3

Talend Open Studio for Big Data Components Reference Guide

The following scenario creates a three-component Job that associates an IP address with a geographical location. It obtains a site visitor's geographical location based on the visitor's IP address.

Dropping and linking components

  1. Drop the following components from the Palette onto the design workspace: tFixedFlowInput, tAddLocationFromIP, and tLogRow.

  2. Connect the three components using Row > Main links.

Configuring the components

  1. In the design workspace, select tFixedFlowInput, and click the Component tab to define the basic settings for tFixedFlowInput.

  2. Click the [...] button next to Edit Schema to define the structure of the data you want to use as input. In this scenario, the schema is made of one column, ip, that holds an IP address.

  3. Click OK to close the dialog box, and accept propagating the changes when prompted by the system. The defined column is displayed in the Values panel of the Basic settings view.

  4. In the Number of rows field, enter the number of rows to be generated. Then click the Value cell and enter the IP address.

  5. In the design workspace, select tAddLocationFromIP and click the Component tab to define the basic settings for tAddLocationFromIP.

  6. Click the Sync columns button to synchronize the schema with the input schema set with tFixedFlowInput.

  7. Browse to the GeoIP.dat file to set its path in the Database filepath field.

    Note

    Make sure to download the latest version of the IP address lookup database file from the relevant site, as indicated in the Basic settings view of tAddLocationFromIP.

  8. In the Input parameters panel, set your input parameters as needed. In this scenario, the input column is the ip column defined earlier that holds an IP address.

  9. In the Location type panel, set the location type as needed. In this scenario, we want to display the country name.

  10. In the design workspace, select tLogRow, click the Component tab, and define the basic settings for tLogRow as needed. In this scenario, we want to display values in cells of a table.
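
To make the lookup step more concrete, the following Java sketch shows one plausible way an IP address can be resolved to a country name: convert the dotted address to a number, then find the range that contains it. This is only an illustration of the idea and not how tAddLocationFromIP or the GeoIP.dat file works internally. The class name, the method names, and the range table are all hypothetical; the ranges are the reserved documentation blocks from RFC 5737, and the country labels are placeholders rather than real geolocation data.

```java
import java.util.Map;
import java.util.TreeMap;

public class IpCountryLookupSketch {

    /** One entry of the hypothetical range table: inclusive end and a label. */
    private static final class Range {
        final long end;
        final String country;

        Range(long end, String country) {
            this.end = end;
            this.country = country;
        }
    }

    // Hypothetical table keyed by the numeric start of each range.
    // The labels are placeholders, not real geolocation data.
    private static final TreeMap<Long, Range> RANGES = new TreeMap<>();

    static {
        RANGES.put(toLong("192.0.2.0"),
                new Range(toLong("192.0.2.255"), "Country A (placeholder)"));
        RANGES.put(toLong("198.51.100.0"),
                new Range(toLong("198.51.100.255"), "Country B (placeholder)"));
    }

    /** Converts a dotted IPv4 address such as "192.0.2.10" to its numeric value. */
    public static long toLong(String ip) {
        String[] parts = ip.trim().split("\\.");
        if (parts.length != 4) {
            throw new IllegalArgumentException("Not an IPv4 address: " + ip);
        }
        long value = 0;
        for (String part : parts) {
            int octet = Integer.parseInt(part);
            if (octet < 0 || octet > 255) {
                throw new IllegalArgumentException("Octet out of range: " + ip);
            }
            value = value * 256 + octet;
        }
        return value;
    }

    /** Returns the label of the range containing the address, or "unknown". */
    public static String lookup(String ip) {
        long numeric = toLong(ip);
        // Closest range that starts at or before the address.
        Map.Entry<Long, Range> candidate = RANGES.floorEntry(numeric);
        if (candidate != null && numeric <= candidate.getValue().end) {
            return candidate.getValue().country;
        }
        return "unknown";
    }

    public static void main(String[] args) {
        // Plays the role of the single ip column fed by the input component.
        String[] inputRows = {"192.0.2.10", "198.51.100.7", "10.0.0.1"};
        for (String ip : inputRows) {
            System.out.println(ip + " -> " + lookup(ip));
        }
    }
}
```

In the Job itself, the input rows come from tFixedFlowInput, the lookup is performed by tAddLocationFromIP against the database file, and tLogRow takes the place of the print statement.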

Saving and executing the Job

  1. Press Ctrl+S to save your Job.

  2. Press F6 or click Run in the Run tab to execute the Job.

One row is generated, displaying the country name associated with the IP address you set.