Shan, All of Mahout implementations use Hadoop API, but if u r trying to run kmeans in sequential (non-MapReduce) mode; pass in runSequential = true instead of false as the last parameter to KMeansDriver.run() or Amit run them in LOCAL_MODE as pointed out earlier by Amit.
On Sunday, December 1, 2013 10:28 PM, Shan Lu <shanlu...@gmail.com> wrote: Thanks for your reply. In the example code, they run the k-means algorithm using org.apache.hadoop.conf.Configuration, org.apache.hadoop.fs.FileSystem, and org.apache.hadoop.fs.Path parameters. Is there any algorithm that doesn't need any Configuration and Path parameter, just use the data in memory? I mean, can I run the k-means algorithm without using the hadoop api, just using java? Thanks. On Sun, Dec 1, 2013 at 9:58 PM, Amit Nithian <anith...@gmail.com> wrote: > When you say without hadoop does that include local mode? You can run these > examples in local mode that doesn't require a cluster for testing and > poking around. Everything then runs in a single jvm. > On Dec 1, 2013 9:18 PM, "Shan Lu" <shanlu...@gmail.com> wrote: > > > Hi, > > > > I am working on a very simple k-means clustering example. Is there a way > to > > run clustering algorithms in mahout without using Hadoop? I am reading > the > > book "Mahout in Action". In chapter 7, the hello world clustering code > > example, they use > > == > > > > KMeansDriver.run(conf, new Path("testdata/points"), new > > Path("testdata/clusters"), > > new Path("output"), new EuclideanDistanceMeasure(), 0.001, 10, > > true, false); > > > > == > > to run the k-means algorithm. How can I run the k-means algorithm without > > Hadoop? > > > > Thanks! > > > > Shan > > > -- Shan Lu ECE Dept., NEU, Boston, MA 02115