Re: KMeans Clustering Issues

2011-02-06 Thread Chris McConnell
iles. So make sure you are pointing to >> the output of seq2sparse, which would be something like: path/tfidf-vectors >> or path/tf-vectors >> Cheers, >> Tim >> On Wed, Feb 2, 2011 at 1:21 PM, Jeff Eastman wrote: >> > Sounds like you might not be using the mahout-

Re: KMeans Clustering Issues

2011-02-03 Thread Lokendra Singh
don't run on Hadoop 0.20.1, only 20.2. Finally, trunk always has the > > latest and greatest patches in it and the clustering stuff is quite > stable > > there. > > > > Jeff > > > > -Original Message- > > From: McConnell, Christopher (GE Glo

Re: KMeans Clustering Issues

2011-02-03 Thread Chris McConnell
> there. > > Jeff > > -Original Message- > From: McConnell, Christopher (GE Global Research) [mailto:mccon...@ge.com] > Sent: Wednesday, February 02, 2011 11:35 AM > To: user@mahout.apache.org > Subject: KMeans Clustering Issues > > All, > > I'

Re: KMeans Clustering Issues

2011-02-02 Thread Timothy Potter
tches in it and the clustering stuff is quite stable > there. > > Jeff > > -Original Message- > From: McConnell, Christopher (GE Global Research) [mailto:mccon...@ge.com] > Sent: Wednesday, February 02, 2011 11:35 AM > To: user@mahout.apache.org > Subject: KMeans Clu

RE: KMeans Clustering Issues

2011-02-02 Thread Jeff Eastman
r (GE Global Research) [mailto:mccon...@ge.com] Sent: Wednesday, February 02, 2011 11:35 AM To: user@mahout.apache.org Subject: KMeans Clustering Issues All, I've begun to look into Mahout on top of Hadoop, specifically for large scale cluster analysis. I am running into an issue however,

KMeans Clustering Issues

2011-02-02 Thread McConnell, Christopher (GE Global Research)
All, I've begun to look into Mahout on top of Hadoop, specifically for large scale cluster analysis. I am running into an issue however, attempting to run the KMeansDriver.run(Configuration, Path, Path, Path, DistanceMeasure, double, int, Boolean, Boolean) with the last (runSequential) false w