Re: Canopy and other clustering approaches

2011-11-02 Thread Grant Ingersoll
Cool. On Nov 1, 2011, at 11:46 PM, Paritosh Ranjan wrote: > Hi Grant, > > I have been working on Top Down Clustering. > https://issues.apache.org/jira/browse/MAHOUT-843 > > In this, the top level clustering algorithm ( for eg. Canopy ) can run with > big t1,t2 values. And then any other clus

Re: Canopy and other clustering approaches

2011-11-01 Thread Paritosh Ranjan
Hi Grant, I have been working on Top Down Clustering. https://issues.apache.org/jira/browse/MAHOUT-843 In this, the top level clustering algorithm ( for eg. Canopy ) can run with big t1,t2 values. And then any other clustering algorithm (selected by user) is executed on clusters produced by

Canopy and other clustering approaches

2011-11-01 Thread Grant Ingersoll
In reviewing clustering for upcoming training, I'm wondering about something w/ Canopy clustering that we claim, but wanted to check here first. In the lectures, etc. I've seen on it, the idea is to run Canopy first and then some other more expensive algorithm, such as k-means, etc. with the id