Loading the main input file

Pig

author
Talend Documentation Team
EnrichVersion
6.5
EnrichProdName
Talend Real-Time Big Data Platform
Talend Open Studio for Big Data
Talend Big Data
Talend Data Fabric
Talend Big Data Platform
EnrichPlatform
Talend Studio

Procedure

  1. Double-click tPigLoad to open its Basic settings view.
  2. Click the [...] button next to Edit schema to open the [Schema] dialog box.
  3. Click the [+] button to add columns, then name them and define their types according to the structure of the input file. In this example, the input schema has five columns: id (integer), firstName (string), lastName (string), groupId (integer), and salary (double).
    Then click OK to validate the settings and close the dialog box.
  4. Click Local in the Mode area.
  5. Select PigStorage from the Load function list.
  6. Fill in the Input file URI field with the full path to the input file, and leave the rest of the settings as they are.
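
For orientation, the settings above correspond roughly to a Pig Latin LOAD statement like the one sketched below. This is an illustrative approximation, not the exact script the Studio generates: the alias employees, the file path, and the semicolon field separator are assumptions and should be replaced with the values used in your Job. The Talend types integer, string, and double are shown as the Pig types int, chararray, and double.

```pig
-- Sketch only: the alias, path, and ';' separator are hypothetical placeholders.
-- Run in local mode (for example, pig -x local) to match the Local setting in step 4.
employees = LOAD '/path/to/input_file.txt'
    USING PigStorage(';')
    AS (id:int, firstName:chararray, lastName:chararray, groupId:int, salary:double);

-- Print the loaded tuples to check that the schema matches the file.
DUMP employees;
```

If a column in the file does not match its declared type, Pig typically loads that field as null rather than failing, so inspecting a few rows with DUMP is a quick way to confirm the schema defined in step 3.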