Thanks for help, I will modified the kmeans code to work with .arff files of weka directly. About the bugs is just a sensation,also because I founded an old topic talking about it. This afternoon i will try to clean and install again mahout to do some test on reuters sgm files. I will let you know if it works or if i will have the same problems.
I have just one more question for you: Mahout works on hadoop, I have already installed hadoop, so mahout uses an indipendent hadoop inside it or I need to attach mahout to mine hadoop?
