Hello Suneel, I got it. Next step to canopy is to feed these centroids to kmeans and cluster.
However I want is to use centroids from these clusters and do clustering on them so as to find related clusters. Thanks Rajesh On Thu, May 30, 2013 at 8:38 PM, Suneel Marthi <suneel_mar...@yahoo.com>wrote: > The input to canopy is your vectors from seq2sparse and not cluster > centroids (as u had it), hence the error message u r seeing. > > The output of canopy could be fed into kmeans as input centroids. > > > > > ________________________________ > From: Rajesh Nikam <rajeshni...@gmail.com> > To: "user@mahout.apache.org" <user@mahout.apache.org> > Sent: Thursday, May 30, 2013 10:56 AM > Subject: bottom up clustering > > > Hi, > > I want to do bottom up clustering (rather hierarchical clustering) rather > than top-down as mentioned in > > https://cwiki.apache.org/MAHOUT/top-down-clustering.html > kmeans->clusterdump->clusterpp and then kmeans on each cluster > > How to use centroid from first phase of canopy and use them for next level > of course with correct t1 and t2. > > I have tried using 'canopy' which give centroids as output. How to apply > one more level of clustering on these centroids ? > > /user/hadoop/t/canopy-centroids/clusters-0-final is output of first level > of canopy. > > mahout canopy -i /user/hadoop/t/canopy-centroids/clusters-0-final -o > /user/hadoop/t/hclust -dm > org.apache.mahout.common.distance.TanimotoDistanceMeasure -t1 0.01 -t2 0.02 > -ow > > It gave following error: > > 13/05/30 20:21:38 INFO mapred.JobClient: Task Id : > attempt_201305231030_0519_m_000000_0, Status : FAILED > java.lang.ClassCastException: > org.apache.mahout.clustering.iterator.ClusterWritable cannot be cast to > org.apache.mahout.math.VectorWritable > > Thanks > Rajesh >