Ahem, KMeans is not busted. It is being maintained by me, at least. The build-reuters.sh script runs only in local mode on 0.5 and fails in both KMeans and LDA when run on a cluster. The MIA examples are not always correct. Most of this has been reported before.
-----Original Message-----
From: Sean Owen [mailto:[email protected]]
Sent: Thursday, June 09, 2011 12:29 AM
To: [email protected]
Subject: Re: Problems running examples

(Assuming you are on HEAD,) I think KMeans is busted -- this has come up before. I don't know if it is being maintained. Anyone who's willing to step up and fix it is also welcome to overhaul it IMHO.

On Thu, Jun 9, 2011 at 12:03 AM, Hector Yee <[email protected]> wrote:

> I got a slightly different error on the next line of KMeansDriver.java
> (running on OS X Snow Leopard)
>
> 11/06/08 16:02:12 INFO compress.CodecPool: Got brand-new compressor
> Exception in thread "main" java.lang.ClassCastException:
> org.apache.hadoop.io.IntWritable cannot be cast to
> org.apache.mahout.math.VectorWritable
> at org.apache.mahout.clustering.kmeans.RandomSeedGenerator.buildRandom(RandomSeedGenerator.java:90)
> at org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:102)
>
> On Sun, Jun 5, 2011 at 9:31 PM, Jeff Eastman <[email protected]> wrote:
>
> > IIRC, Reuters used to run on a cluster but no longer does due to some
> > obscure Lucene changes. In 0.5 it only works in local mode. I really hope
> > this can be repaired by 0.6 as Reuters is a key entry point into Mahout
> > clustering for many users.
