Skip to main content

tMahoutClustering (deprecated)

Groups unlabeled numerical data into clusters that can reveal interesting patterns or helps identifying abnormal data items in the data set.

tMahoutClustering groups data together into clusters based on some similarities. The component offers several similarity methods that can be used in different clustering algorithms.

tMahoutClustering uses clustering algorithms from Mahout libraries. All processes are run in a given distributed file system.

Information noteNote:

Currently, the studio supports Mahout 0.9.

For more technologies supported by Talend, see Talend components.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!