I had a similar issue before. I added the --namedVector parameter to the command that creates the vectors, so that they are written as named vectors. With that, I was able to identify which documents belonged to a given cluster using the same clusterdump command. Hope this helps.
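
For reference, here is a minimal sketch of what that might look like. It assumes the vectors were built with seq2sparse (the original post does not say which command was used), and the input and output paths are hypothetical placeholders, not the ones from this thread.

```shell
# Hypothetical locations; replace with your own sequence-file and vector directories.
SEQ_DIR="newsClusters/seqfiles"
VEC_DIR="newsClusters/vectors"

# Assumption: vectors are created with seq2sparse. --namedVector (short form -nv)
# stores each document's key as the vector name, so the name travels with the
# vector through clustering.
vectorize_cmd="mahout seq2sparse -i $SEQ_DIR -o $VEC_DIR --namedVector"

# Print the command for review; set RUN=1 to actually execute it.
# (Unquoted expansion is intentional and assumes the paths contain no spaces.)
echo "$vectorize_cmd"
if [ "${RUN:-0}" = "1" ]; then
  $vectorize_cmd
fi
```

After regenerating the vectors, clustering has to be run again on them before clusterdump with -p will show the document names next to the points.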

On Fri, Apr 8, 2011 at 9:23 PM, sarath pr <[email protected]> wrote:
> A text file created using the clusterdump utility has been attached
> here. Can anyone tell me how to identify the document IDs belonging to
> the cluster.?
>
> mahout clusterdump -s
> /home/sarathpr/NetBeansProjects/SNACK1/newsClusters/clusters/clusters-1
> -o /home/sarathpr/Desktop/readable/out4.txt -b 100 -n 50 -p
> /home/sarathpr/NetBeansProjects/SNACK1/newsClusters/clusters/clusteredPoints
> -d /home/sarathpr/NetBeansProjects/SNACK1/newsClusters/dictionary.file-0
> -dt sequencefile
>
> --
> Thank You..!!
> Sarath Ramachandran
> [email protected]
> +919995024287

--
Everything we hear is an opinion, not a fact. Everything we see is perspective, not the truth.
