Re: How KMeans clustering works in Mahout 0.8?

2014-01-30 Thread Pat Ferrel
Is there any qualitative difference sequential v MR?


On Jan 28, 2014, at 10:11 PM, Suneel Marthi suneel_mar...@yahoo.com wrote:

All of Mahout's clustering algos can be run in both MR and non-MR mode.
By default its the MR mode that's executed unless the user chooses the non-MR 
mode by specifiying '-xm sequential' while invoking the driver.





On Wednesday, January 29, 2014 1:09 AM, Saeed Adel Mehraban 
s.ade...@gmail.com wrote:

Thank you for the details.
So KMeans could be run in both map-reduce and non-mapreduce version and the
decision will be made in driver, yes?



On Tue, Jan 28, 2014 at 11:12 PM, Yoonmin Nam rony...@dgist.ac.kr wrote:

 K-Means clustering works by using mahout.clustering.iterator package.
 
 In the iterator package, there is a classes called CIMapper and
 CIReducer.
 
 Both of  them are used when we choose a method (-xm) as a mapreduce.
 
 Then, this mapper and reducer works for you!
 
 Thanks.
 
 -Original Message-
 From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com]
 Sent: Wednesday, January 29, 2014 4:35 AM
 To: user@mahout.apache.org
 Subject: Re: How KMeans clustering works in Mahout 0.8?
 
 I see the package, but I couldn't find anything related to map-reduce. I
 wonder why!
 
 
 On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman 
 andrew.mussel...@gmail.com wrote:
 
 In the source code you could take a look in the
 org.apache.mahout.clustering.kmeans package to get a start, if you
 want to understand the implementation.
 
 If you just want to run some clustering, take a look at
 examples/bin/cluster-reuters.sh which has an option to run kmeans.
 
 
 On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban
 s.ade...@gmail.com
 wrote:
 
 I read Mahout KMeans Design of implementation and it seems to be
 clear
 wrt
 map-reduce paradigm. But when I refer to source code, I can not find
 the mapper, reducer, combiner or almost anything mentioned in the
 official website. What happened here and what I need to do to
 understand KMeans implementation of Mahout?
 
 
 
 
 
 
 



Re: How KMeans clustering works in Mahout 0.8?

2014-01-30 Thread Suneel Marthi
No

Sent from my iPhone

 On Jan 30, 2014, at 10:57 AM, Pat Ferrel p...@occamsmachete.com wrote:
 
 Is there any qualitative difference sequential v MR?
 
 
 On Jan 28, 2014, at 10:11 PM, Suneel Marthi suneel_mar...@yahoo.com wrote:
 
 All of Mahout's clustering algos can be run in both MR and non-MR mode.
 By default its the MR mode that's executed unless the user chooses the non-MR 
 mode by specifiying '-xm sequential' while invoking the driver.
 
 
 
 
 
 On Wednesday, January 29, 2014 1:09 AM, Saeed Adel Mehraban 
 s.ade...@gmail.com wrote:
 
 Thank you for the details.
 So KMeans could be run in both map-reduce and non-mapreduce version and the
 decision will be made in driver, yes?
 
 
 
 On Tue, Jan 28, 2014 at 11:12 PM, Yoonmin Nam rony...@dgist.ac.kr wrote:
 
 K-Means clustering works by using mahout.clustering.iterator package.
 
 In the iterator package, there is a classes called CIMapper and
 CIReducer.
 
 Both of  them are used when we choose a method (-xm) as a mapreduce.
 
 Then, this mapper and reducer works for you!
 
 Thanks.
 
 -Original Message-
 From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com]
 Sent: Wednesday, January 29, 2014 4:35 AM
 To: user@mahout.apache.org
 Subject: Re: How KMeans clustering works in Mahout 0.8?
 
 I see the package, but I couldn't find anything related to map-reduce. I
 wonder why!
 
 
 On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman 
 andrew.mussel...@gmail.com wrote:
 
 In the source code you could take a look in the
 org.apache.mahout.clustering.kmeans package to get a start, if you
 want to understand the implementation.
 
 If you just want to run some clustering, take a look at
 examples/bin/cluster-reuters.sh which has an option to run kmeans.
 
 
 On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban
 s.ade...@gmail.com
 wrote:
 
 I read Mahout KMeans Design of implementation and it seems to be
 clear
 wrt
 map-reduce paradigm. But when I refer to source code, I can not find
 the mapper, reducer, combiner or almost anything mentioned in the
 official website. What happened here and what I need to do to
 understand KMeans implementation of Mahout?
 


Re: How KMeans clustering works in Mahout 0.8?

2014-01-28 Thread Saeed Adel Mehraban
I see the package, but I couldn't find anything related to map-reduce. I
wonder why!


On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman 
andrew.mussel...@gmail.com wrote:

 In the source code you could take a look in the
 org.apache.mahout.clustering.kmeans package to get a start, if you want to
 understand the implementation.

 If you just want to run some clustering, take a look at
 examples/bin/cluster-reuters.sh which has an option to run kmeans.


 On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban s.ade...@gmail.com
 wrote:

  I read Mahout KMeans Design of implementation and it seems to be clear
 wrt
  map-reduce paradigm. But when I refer to source code, I can not find the
  mapper, reducer, combiner or almost anything mentioned in the official
  website. What happened here and what I need to do to understand KMeans
  implementation of Mahout?
 



Re: How KMeans clustering works in Mahout 0.8?

2014-01-28 Thread Suneel Marthi
Look at KMeansDriver.java in the specified package and trace thru the code.
You should see both MR and non-MR versions of kmeans impl.





On Tuesday, January 28, 2014 2:35 PM, Saeed Adel Mehraban s.ade...@gmail.com 
wrote:
 
I see the package, but I couldn't find anything related to map-reduce. I
wonder why!



On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman 
andrew.mussel...@gmail.com wrote:

 In the source code you could take a look in the
 org.apache.mahout.clustering.kmeans package to get a start, if you want to
 understand the implementation.

 If you just want to run some clustering, take a look at
 examples/bin/cluster-reuters.sh which has an option to run kmeans.


 On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban s.ade...@gmail.com
 wrote:

  I read Mahout KMeans Design of implementation and it seems to be clear
 wrt
  map-reduce paradigm. But when I refer to source code, I can not find the
  mapper, reducer, combiner or almost anything mentioned in the official
  website. What happened here and what I need to do to understand KMeans
  implementation of Mahout?
 


RE: How KMeans clustering works in Mahout 0.8?

2014-01-28 Thread Yoonmin Nam
K-Means clustering works by using mahout.clustering.iterator package.

In the iterator package, there is a classes called CIMapper and CIReducer.

Both of  them are used when we choose a method (-xm) as a mapreduce.

Then, this mapper and reducer works for you!

Thanks.

-Original Message-
From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com] 
Sent: Wednesday, January 29, 2014 4:35 AM
To: user@mahout.apache.org
Subject: Re: How KMeans clustering works in Mahout 0.8?

I see the package, but I couldn't find anything related to map-reduce. I
wonder why!


On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman 
andrew.mussel...@gmail.com wrote:

 In the source code you could take a look in the 
 org.apache.mahout.clustering.kmeans package to get a start, if you 
 want to understand the implementation.

 If you just want to run some clustering, take a look at 
 examples/bin/cluster-reuters.sh which has an option to run kmeans.


 On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban 
 s.ade...@gmail.com
 wrote:

  I read Mahout KMeans Design of implementation and it seems to be 
  clear
 wrt
  map-reduce paradigm. But when I refer to source code, I can not find 
  the mapper, reducer, combiner or almost anything mentioned in the 
  official website. What happened here and what I need to do to 
  understand KMeans implementation of Mahout?
 








Re: How KMeans clustering works in Mahout 0.8?

2014-01-28 Thread Saeed Adel Mehraban
Thank you for the details.
So KMeans could be run in both map-reduce and non-mapreduce version and the
decision will be made in driver, yes?


On Tue, Jan 28, 2014 at 11:12 PM, Yoonmin Nam rony...@dgist.ac.kr wrote:

 K-Means clustering works by using mahout.clustering.iterator package.

 In the iterator package, there is a classes called CIMapper and
 CIReducer.

 Both of  them are used when we choose a method (-xm) as a mapreduce.

 Then, this mapper and reducer works for you!

 Thanks.

 -Original Message-
 From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com]
 Sent: Wednesday, January 29, 2014 4:35 AM
 To: user@mahout.apache.org
 Subject: Re: How KMeans clustering works in Mahout 0.8?

 I see the package, but I couldn't find anything related to map-reduce. I
 wonder why!


 On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman 
 andrew.mussel...@gmail.com wrote:

  In the source code you could take a look in the
  org.apache.mahout.clustering.kmeans package to get a start, if you
  want to understand the implementation.
 
  If you just want to run some clustering, take a look at
  examples/bin/cluster-reuters.sh which has an option to run kmeans.
 
 
  On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban
  s.ade...@gmail.com
  wrote:
 
   I read Mahout KMeans Design of implementation and it seems to be
   clear
  wrt
   map-reduce paradigm. But when I refer to source code, I can not find
   the mapper, reducer, combiner or almost anything mentioned in the
   official website. What happened here and what I need to do to
   understand KMeans implementation of Mahout?
  
 








How KMeans clustering works in Mahout 0.8?

2014-01-27 Thread Saeed Adel Mehraban
I read Mahout KMeans Design of implementation and it seems to be clear wrt
map-reduce paradigm. But when I refer to source code, I can not find the
mapper, reducer, combiner or almost anything mentioned in the official
website. What happened here and what I need to do to understand KMeans
implementation of Mahout?


Re: How KMeans clustering works in Mahout 0.8?

2014-01-27 Thread Andrew Musselman
In the source code you could take a look in the
org.apache.mahout.clustering.kmeans package to get a start, if you want to
understand the implementation.

If you just want to run some clustering, take a look at
examples/bin/cluster-reuters.sh which has an option to run kmeans.


On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban s.ade...@gmail.comwrote:

 I read Mahout KMeans Design of implementation and it seems to be clear wrt
 map-reduce paradigm. But when I refer to source code, I can not find the
 mapper, reducer, combiner or almost anything mentioned in the official
 website. What happened here and what I need to do to understand KMeans
 implementation of Mahout?