Thanks Madhusudan for your response. Its OK, i have solved the issue
by turning the last boolean argument of the below code to true.


DictionaryVectorizer.createTermFrequencyVectors(tokenizedPath, new
Path(outputDir), conf, minSupport, maxNGramSize, minLLRValue, 2, true,
reduceTasks,chunkSize, sequentialAccessOutput, false);


On 4/11/11, Madhusudan Joshi <[email protected]> wrote:
> I had similar issue before. I added the parameter --namedVector to the
> command to create named vectors. With that I was able to identify the which
> documents belonged to a given cluster using the same clusterdump command.
> Hope this helps.
>
> On Fri, Apr 8, 2011 at 9:23 PM, sarath pr <[email protected]> wrote:
>
>> A text file created using the clusterdump utility has been attached
>> here. Can anyone tell me how to identify the document IDs belonging to
>> the cluster.?
>>
>> mahout clusterdump -s
>> /home/sarathpr/NetBeansProjects/SNACK1/newsClusters/clusters/clusters-1
>> -o /home/sarathpr/Desktop/readable/out4.txt -b 100 -n 50 -p
>>
>> /home/sarathpr/NetBeansProjects/SNACK1/newsClusters/clusters/clusteredPoints
>> -d /home/sarathpr/NetBeansProjects/SNACK1/newsClusters/dictionary.file-0
>> -dt sequencefile
>>
>>
>> --
>> Thank You..!!
>> Sarath Ramachandran
>> [email protected]
>> +919995024287
>>
>
>
>
> --
> Everything we hear is an opinion, not a fact.
> Everything we see is perspective, not the truth.
>


-- 
Thank You..!!
Sarath Ramachandran
[email protected]
+919995024287

Reply via email to