Trevor Grant created MAHOUT-1976:
------------------------------------

             Summary: Add Canopy Clustering Algorithm
                 Key: MAHOUT-1976
                 URL: https://issues.apache.org/jira/browse/MAHOUT-1976
             Project: Mahout
          Issue Type: Bug
          Components: Algorithms
    Affects Versions: 0.13.2
            Reporter: Trevor Grant


Primarily, we need to lay out the clustering section of the Algorithms
Framework.

The Canopy Clustering Algorithm is very simple and yet very useful as a 
preprocessing step for more advanced clustering algorithms such as KMeans and 
Hierarchical Clustering. 

https://en.wikipedia.org/wiki/Canopy_clustering_algorithm
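
For reference, here is a minimal sketch of the algorithm as described on the
Wikipedia page above. It is written in plain Python for illustration only and
is not the Mahout API: the names canopy_clusters, t1 (loose threshold), and
t2 (tight threshold) are hypothetical. It also assumes Euclidean distance and
picks the first remaining point as each canopy center rather than a random
one, to keep the output deterministic.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length numeric sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def canopy_clusters(points, t1, t2, distance=euclidean):
    """Group points into (possibly overlapping) canopies.

    t1 is the loose threshold and t2 the tight threshold, with t1 > t2.
    Returns a list of (center, members) tuples.
    """
    if t1 <= t2:
        raise ValueError("t1 (loose) must be greater than t2 (tight)")

    remaining = list(points)
    canopies = []
    while remaining:
        # Assumption: take the first remaining point as the canopy center.
        center = remaining.pop(0)
        members = [center]
        still_remaining = []
        for p in remaining:
            d = distance(center, p)
            if d < t1:
                # Within the loose threshold: p belongs to this canopy.
                members.append(p)
            if d >= t2:
                # Outside the tight threshold: p may still seed or join
                # other canopies, so it stays in the candidate set.
                still_remaining.append(p)
        remaining = still_remaining
        canopies.append((center, members))
    return canopies


if __name__ == "__main__":
    pts = [(0, 0), (0.5, 0), (1.5, 0), (10, 10), (10.4, 10)]
    for center, members in canopy_clusters(pts, t1=2.0, t2=1.0):
        print(center, members)
```

The resulting canopy centers could then be used as initial centroids for
KMeans, which is the preprocessing use mentioned above.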

The majority of the "work" on this PR will be creating the framework. 

Canopy is also one of the legacy MapReduce (MR) algorithms that would be nice
to port.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
