Procedure
-
Double-click the new Map/Reduce Job to open it in the workspace.
The Map/Reduce component Palette is opened.
- Delete tMysqlInput in this scenario as it is not a Map/Reduce component and use tRowGenerator in its place. Link it to tGenKey with a Row > Main link.
-
Double-click tRowGenerator to open its
editor.
- Define the schema you want to use to write data in Hadoop.
- Click OK to validate your schema and close the editor.
- Leave the settings of the other components as you defined initially in the standard version of the Job.