In the clusters folder that I used, there is this file: part-randomSeed

created by the command:

bin/mahout kmeans -i /home/vuvvo/reuters-out-seqdir-sparse/tfidf-vectors/ -c 
/home/vuvvo/clusters -o /home/vuvvo/reuters-kmeans -k 3 --maxIter 5

Do I need to use the files in the reuters-kmeans folder? Inside it there are
some sub-directories called cluster-x, where x goes from 1 to 5.

I tried giving cluster-5 as input, and inside finalOutput I got a file of about
1.4 MB, but it is very hard to open, even with 4 GB of RAM on a 64-bit machine ^^

I can read the first and the second row:
CL-21551 {n=1855 c =[1:0.011,2:0.005...to 31:0.012
and the second row:
VL-21560{n=19722 c[0:0.012 etc etc...

Has it now converged, and is it correct?

Is there a more comfortable way to read this file? After that I need to convert
it into data for Weka.
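
To make clearer what I am after, here is a minimal Python sketch of the kind of
reader I have in mind. It is only a guess based on the two rows quoted above: it
assumes the file is plain text with one cluster per line, and that the centroid
is a list of index:weight pairs. The function names (parse_cluster_line,
read_clusters, to_sparse_arff) and the attribute names t0, t1, ... are made up
for illustration and are not part of Mahout or Weka.

```python
import re

# Assumed line shape, inferred only from the two rows quoted in this post:
#   <PREFIX>-<id>{n=<count> c=[<index>:<weight>, ...] ...
LINE_RE = re.compile(
    r"^\s*([A-Z]+)-(\d+)\s*\{\s*n\s*=\s*(\d+)\s+c\s*=?\s*\[([^\]]*)\]?"
)
PAIR_RE = re.compile(r"(\d+)\s*:\s*(-?\d+(?:\.\d+)?(?:[eE][-+]?\d+)?)")


def parse_cluster_line(line):
    """Return a dict for one cluster line, or None if the line does not match."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    prefix, cluster_id, n, body = m.groups()
    # Keep only index:weight pairs; anything else in the body is ignored.
    centroid = {int(i): float(w) for i, w in PAIR_RE.findall(body)}
    return {"prefix": prefix, "id": int(cluster_id), "n": int(n),
            "centroid": centroid}


def read_clusters(path):
    """Stream the file line by line, so it never has to be opened in an editor."""
    with open(path, encoding="utf-8", errors="replace") as fh:
        for line in fh:
            cluster = parse_cluster_line(line)
            if cluster is not None:
                yield cluster


def to_sparse_arff(clusters, relation="centroids"):
    """Write the centroids as sparse ARFF rows, one row per cluster."""
    dims = 1 + max((i for c in clusters for i in c["centroid"]), default=-1)
    lines = ["@relation " + relation, ""]
    lines += ["@attribute t%d numeric" % i for i in range(dims)]
    lines += ["", "@data"]
    for c in clusters:
        pairs = ", ".join("%d %r" % (i, w)
                          for i, w in sorted(c["centroid"].items()))
        lines.append("{" + pairs + "}")
    return "\n".join(lines) + "\n"
```

With something like this I could call to_sparse_arff(list(read_clusters(path)))
on the dump and load the result in Weka, but I do not know whether the real
file always follows the shape I assumed.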

Tonight I will try to convert an ARFF data file into an SGM file to see what I
get from it.


