Each iteration of k-means clustering will produce a cluster-X file. In this case, there were 7 iterations prior to the clusters converging. The final cluster data can be found in clusters-6.
There is a utility in mahout-util, o.a.m.utils.clustering.ClusterDumper that can be used to dump the data from clusters-6 and points into a json-like format. You could use that code as a starting point for discovering how to get at the data you're interested in. On Thu, Jan 7, 2010 at 3:23 PM, diveman <[email protected]> wrote: > > I'm new to Mahout. Installed 0.3 in a 4-node cluster and run mahout kmean > example with syntheticcontrol data. I got outputs like the following: > > output/canopies > output/clusters-0 > output/clusters-1 > output/clusters-2 > output/clusters-3 > output/clusters-4 > output/clusters-5 > output/clusters-6 > output/data > output/points > > by which I understand in the points folder, each point is labeled with a > cluster id. I'm wondering where I can find the cluster center, radius info, > etc. And what's in clusters-0~6? BTW, the sample data has 6 groups and the > result has 7 clusters, any clue? > > Thanks! > -- > View this message in context: > http://old.nabble.com/Kmeans-clustering-tp27066415p27066415.html > Sent from the Mahout User List mailing list archive at Nabble.com. > >
