Hi,
I am working on a very simple k-means clustering example. Is there a way to
run clustering algorithms in mahout without using Hadoop? I am reading the
book Mahout in Action. In chapter 7, the hello world clustering code
example, they use
==
KMeansDriver.run(conf, new Path(testdata/points),
When you say without hadoop does that include local mode? You can run these
examples in local mode that doesn't require a cluster for testing and
poking around. Everything then runs in a single jvm.
On Dec 1, 2013 9:18 PM, Shan Lu shanlu...@gmail.com wrote:
Hi,
I am working on a very simple
Thanks for your reply. In the example code, they run the k-means algorithm
using org.apache.hadoop.conf.Configuration,
org.apache.hadoop.fs.FileSystem, and org.apache.hadoop.fs.Path parameters.
Is there any algorithm that doesn't need any Configuration and Path
parameter, just use the data in
Shan,
All of Mahout implementations use Hadoop API, but if u r trying to run kmeans
in sequential (non-MapReduce) mode; pass inĀ runSequential = true instead of
false as the last parameter to KMeansDriver.run() or Amit run them in
LOCAL_MODE as pointed out earlier by Amit.
On Sunday,
Thanks, Suneel, I'll try this way.
In this recommender example:
https://github.com/ManuelB/facebook-recommender-demo/blob/master/src/main/java/de/apaxo/bedcon/AnimalFoodRecommender.java#L142
,
they only use mahout api. So I am thinking that can I do the clustering
similarly.
On Sun, Dec 1,
The new Ball k-means and streaming k-means implementations have non-Hadoop
versions. The streaming k-means implementation also has a threaded
implementation that runs without Hadoop.
The threaded streaming k-means implementation should be pretty fast.
On Sun, Dec 1, 2013 at 7:55 PM, Shan Lu
Thanks, Ted. I went through some introductions of Ball k-means and
streaming k-means, but still not clear how to implement the algorithm
without hadoop. Do you know any hello world example code using non-Hadoop
version streaming k-means? Thanks.
On Sun, Dec 1, 2013 at 11:12 PM, Ted Dunning
Hi Florents,
it just became different but still works without hdfs, i also had trouble
getting the right classes together but here is something that will
hopefully work correctly:
DistanceMeasure measure = new CosineDistanceMeasure();
// ClusterUtils is no mahout class
ListCluster