Hello Suneel,

I got it. Next step to canopy is to feed these centroids to kmeans and
cluster.

However I want is to use centroids from these clusters and do clustering on
them so as to find related clusters.

Thanks
Rajesh


On Thu, May 30, 2013 at 8:38 PM, Suneel Marthi <suneel_mar...@yahoo.com>wrote:

> The input to canopy is your vectors from seq2sparse and not cluster
> centroids (as u had it), hence the error message u r seeing.
>
> The output of canopy could be fed into kmeans as input centroids.
>
>
>
>
> ________________________________
>  From: Rajesh Nikam <rajeshni...@gmail.com>
> To: "user@mahout.apache.org" <user@mahout.apache.org>
> Sent: Thursday, May 30, 2013 10:56 AM
> Subject: bottom up clustering
>
>
> Hi,
>
> I want to do bottom up clustering (rather hierarchical clustering) rather
> than top-down as mentioned in
>
> https://cwiki.apache.org/MAHOUT/top-down-clustering.html
> kmeans->clusterdump->clusterpp and then kmeans on each cluster
>
> How to use centroid from first phase of canopy and use them for next level
> of course with correct t1 and t2.
>
> I have tried using 'canopy' which give centroids as output. How to apply
> one more level of clustering on these centroids ?
>
> /user/hadoop/t/canopy-centroids/clusters-0-final is output of first level
> of canopy.
>
> mahout canopy -i /user/hadoop/t/canopy-centroids/clusters-0-final -o
> /user/hadoop/t/hclust -dm
> org.apache.mahout.common.distance.TanimotoDistanceMeasure -t1 0.01 -t2 0.02
> -ow
>
> It gave following error:
>
>   13/05/30 20:21:38 INFO mapred.JobClient: Task Id :
> attempt_201305231030_0519_m_000000_0, Status : FAILED
> java.lang.ClassCastException:
> org.apache.mahout.clustering.iterator.ClusterWritable cannot be cast to
> org.apache.mahout.math.VectorWritable
>
> Thanks
> Rajesh
>

Reply via email to