Hi Everyone,

I have a question related to cluster dump in Mahout 0.9 while doing text 
clustering -

https://mahout.apache.org/users/clustering/clusteringyourdata.html

  *   One case of cluster dump is to output the top k kerms and for that you 
don't specify the parameter p (pointsDir).
  *   Second case of cluster dump is where you specify the parameter p 
(pointsDir) and you get points associated with a cluster.

Both the outputs have same exact cluster id but the number of records shown in 
Case 1 - Where Top Terms are displayed is different than the number of records 
appearing in Case 2 - Where you get points associated with a cluster.

Why does this happen? I means its bizzare to see different # of points 
associated with a specific cluster and not sure which one is correct?

Has anyone seen this happening?

Thank you in advance!


Cheers,
Viral

Reply via email to