Trevor Grant created MAHOUT-1976:
------------------------------------

             Summary: Add Canopy Clustering Algorithm
                 Key: MAHOUT-1976
                 URL: https://issues.apache.org/jira/browse/MAHOUT-1976
             Project: Mahout
          Issue Type: Bug
          Components: Algorithms
    Affects Versions: 0.13.2
            Reporter: Trevor Grant

Primarily, we need to lay out the clustering section of the Algorithms Framework.

The Canopy Clustering Algorithm is very simple, yet very useful as a preprocessing step for more advanced clustering algorithms such as KMeans and Hierarchical Clustering.

https://en.wikipedia.org/wiki/Canopy_clustering_algorithm

The majority of the "work" on this PR will be creating the framework. Canopy clustering is also one of the Legacy MR algorithms that would be nice to port.

--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
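
For readers unfamiliar with the Canopy Clustering Algorithm named in the description, here is a minimal sketch of the procedure as outlined on the linked Wikipedia page. It is plain Python for illustration only and is not Mahout code or the proposed framework API. The function name `canopy_clusters`, the parameters `t1` (loose distance) and `t2` (tight distance), the use of Euclidean distance, and the choice to always take the first remaining point as the next center are all assumptions made for this example.

```python
import math


def canopy_clusters(points, t1, t2):
    """Group points into possibly overlapping canopies.

    points: list of equal-length numeric tuples (hypothetical input format).
    t1: loose distance threshold; t2: tight distance threshold (t1 > t2 > 0).
    Returns a list of (center, members) pairs.
    """
    if not t1 > t2 > 0:
        raise ValueError("thresholds must satisfy t1 > t2 > 0")

    remaining = list(points)  # points still eligible to start a canopy
    canopies = []
    while remaining:
        # Assumption: take the first remaining point as the new center.
        # A random choice is another common option.
        center = remaining.pop(0)
        members = [center]
        still_remaining = []
        for p in remaining:
            d = math.dist(center, p)  # Euclidean distance (Python 3.8+)
            if d < t1:
                members.append(p)  # inside the loose radius: joins this canopy
            if d >= t2:
                still_remaining.append(p)  # outside the tight radius: stays eligible
        remaining = still_remaining
        canopies.append((center, members))
    return canopies


if __name__ == "__main__":
    sample = [(0.0, 0.0), (0.5, 0.0), (2.0, 0.0), (10.0, 10.0), (10.5, 10.0)]
    for center, members in canopy_clusters(sample, t1=3.0, t2=1.0):
        print(center, members)
```

A point that falls between the tight and loose distances of a center joins that canopy but remains eligible for later ones, so canopies may overlap. The resulting centers can then seed a more expensive algorithm such as KMeans.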