Procedure
- In the Integration perspective of the Studio, create an empty Job from the Job Designs node in the Repository tree view.
For further information about how to create a Job, see Talend Open Studio for Big Data Getting Started Guide.
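- In the workspace, enter the name of each component to be used and select it from the list that appears. This Job uses tHDFSConfiguration, tFileInputDelimited, tReplicate, tModelEncoder, tKMeansModel, tPredict and tFileOutputDelimited.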
- Connect tFileInputDelimited to tReplicate using the Row > Main link.
- Do the same to connect tReplicate to tModelEncoder and then tModelEncoder to tKMeansModel.
- Repeat the operations to connect tReplicate to tPredict and then tPredict to tFileOutputDelimited.
- Leave tHDFSConfiguration as it is.
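
The links described above form two branches that both start at tReplicate: one that trains the model and one that applies it. As a rough illustration only, the layout can be pictured as a small directed graph. The sketch below is plain Python, not Talend code or any Talend API; the names LINKS, STANDALONE, build_graph and branches are hypothetical, and treating "as it is" as "not linked to any other component" is an assumption.

```python
# Illustrative sketch only: plain Python, not Talend code or a Talend API.
from collections import defaultdict

# Row > Main links from the procedure, as (source, target) pairs.
LINKS = [
    ("tFileInputDelimited", "tReplicate"),
    ("tReplicate", "tModelEncoder"),
    ("tModelEncoder", "tKMeansModel"),
    ("tReplicate", "tPredict"),
    ("tPredict", "tFileOutputDelimited"),
]

# Assumption: "leave as it is" means the component stays unlinked in the Job.
STANDALONE = ["tHDFSConfiguration"]


def build_graph(links):
    """Map each component to the list of components it feeds."""
    graph = defaultdict(list)
    for source, target in links:
        graph[source].append(target)
    return graph


def branches(graph, start):
    """Return every path from start to a component with no outgoing link."""
    targets = graph.get(start, [])
    if not targets:
        return [[start]]
    return [[start] + rest for nxt in targets for rest in branches(graph, nxt)]


if __name__ == "__main__":
    for path in branches(build_graph(LINKS), "tFileInputDelimited"):
        print(" -> ".join(path))
```

Running the script prints one line per branch, which can be checked against the links drawn in the workspace.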