tMahoutClustering (deprecated) - 7.3

Machine Learning

Version
7.3
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Machine Learning components
Data Quality and Preparation > Third-party systems > Machine Learning components
Design and Development > Third-party systems > Machine Learning components
Last publication date
2024-02-21

Groups unlabeled numerical data into clusters that can reveal interesting patterns or helps identifying abnormal data items in the data set.

tMahoutClustering groups data together into clusters based on some similarities. The component offers several similarity methods that can be used in different clustering algorithms.

tMahoutClustering uses clustering algorithms from Mahout libraries. All processes are run in a given distributed file system.

Note:

Currently, the studio supports Mahout 0.9.

For more technologies supported by Talend, see Talend components.