Hi Everyone, I have a question related to cluster dump in Mahout 0.9 while doing text clustering -
https://mahout.apache.org/users/clustering/clusteringyourdata.html * One case of cluster dump is to output the top k kerms and for that you don't specify the parameter p (pointsDir). * Second case of cluster dump is where you specify the parameter p (pointsDir) and you get points associated with a cluster. Both the outputs have same exact cluster id but the number of records shown in Case 1 - Where Top Terms are displayed is different than the number of records appearing in Case 2 - Where you get points associated with a cluster. Why does this happen? I means its bizzare to see different # of points associated with a specific cluster and not sure which one is correct? Has anyone seen this happening? Thank you in advance! Cheers, Viral