Thanks, Madhusudan, for your response. It's OK, I have solved the issue by changing the last boolean argument of the call below to true.
DictionaryVectorizer.createTermFrequencyVectors(tokenizedPath, new Path(outputDir), conf, minSupport, maxNGramSize, minLLRValue, 2, true, reduceTasks, chunkSize, sequentialAccessOutput, false);

On 4/11/11, Madhusudan Joshi <[email protected]> wrote:
> I had similar issue before. I added the parameter --namedVector to the
> command to create named vectors. With that I was able to identify the which
> documents belonged to a given cluster using the same clusterdump command.
> Hope this helps.
>
> On Fri, Apr 8, 2011 at 9:23 PM, sarath pr <[email protected]> wrote:
>
>> A text file created using the clusterdump utility has been attached
>> here. Can anyone tell me how to identify the document IDs belonging to
>> the cluster.?
>>
>> mahout clusterdump -s
>> /home/sarathpr/NetBeansProjects/SNACK1/newsClusters/clusters/clusters-1
>> -o /home/sarathpr/Desktop/readable/out4.txt -b 100 -n 50 -p
>>
>> /home/sarathpr/NetBeansProjects/SNACK1/newsClusters/clusters/clusteredPoints
>> -d /home/sarathpr/NetBeansProjects/SNACK1/newsClusters/dictionary.file-0
>> -dt sequencefile
>>
>>
>> --
>> Thank You..!!
>> Sarath Ramachandran
>> [email protected]
>> +919995024287
>>
>
>
>
> --
> Everything we hear is an opinion, not a fact.
> Everything we see is perspective, not the truth.
>

--
Thank You..!!
Sarath Ramachandran
[email protected]
+919995024287
