Procedure
- In the Integration perspective of Talend Studio, create an empty Spark Batch Job from the Job Designs node in the Repository tree view.
For further information about how to create a Spark Batch Job, see the Talend Big Data Getting Started Guide.
- Drop the following components from the Palette onto the design workspace: tHDFSConfiguration, tFixedFlowInput, tDataprepRun and tLogRow.
- Connect the last three components (tFixedFlowInput, tDataprepRun and tLogRow) using Row > Main links.