Loading the event data

Pig

author
Talend Documentation Team
EnrichVersion
6.5
EnrichProdName
Talend Real-Time Big Data Platform
Talend Open Studio for Big Data
Talend Big Data Platform
Talend Big Data
Talend Data Fabric
task
Data Governance > Third-party systems > Processing components (Integration) > Pig components
Data Quality and Preparation > Third-party systems > Processing components (Integration) > Pig components
Design and Development > Third-party systems > Processing components (Integration) > Pig components
EnrichPlatform
Talend Studio

Procedure

  1. Double-click the tPigLoad labeled event to open its Component view.
  2. Click the button next to Edit schema to open the schema editor.
  3. Click the button three times to add three rows and in the Column column, rename them as date, street and event, respectively.
  4. Click OK to validate these changes.
  5. In the Mode area, select Map/Reduce.
    As you have configured the connection to the given Hadoop distribution in that first tPigLoad component, traffic, this event component reuses that connection and therefore, the corresponding options in the Distribution and the Version lists have been automatically selected.
  6. In the Load function field, select the PigStorage function to read the source data.
  7. In the Input file URI field, enter the directory where the event data is stored. As explained earlier, the directory in this example is "/user/ychen/tpigmap/date&event".