Arranging data flow for the KMeans Job
Procedure
- In the Integration perspective of Talend Studio, create an empty Job from the Job Designs node in the Repository tree view.
- In the workspace, enter the name of the component to be used and select this component from the list that appears.
- Connect tFileInputDelimited to tReplicate using the Row > Main link.
- Do the same to connect tReplicate to tModelEncoder and then tModelEncoder to tKMeansModel.
- Repeat the operations to connect tReplicate to tPredict and then tPredict to tFileOutputDelimited.
- Leave tHDFSConfiguration as it is.
Did this page help you?
If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!