Re: How KMeans clustering works in Mahout 0.8?

2014-01-30 Thread Suneel Marthi
No

Sent from my iPhone

> On Jan 30, 2014, at 10:57 AM, Pat Ferrel  wrote:
> 
> Is there any qualitative difference sequential v MR?
> 
> 
> On Jan 28, 2014, at 10:11 PM, Suneel Marthi  wrote:
> 
> All of Mahout's clustering algos can be run in both MR and non-MR mode.
> By default its the MR mode that's executed unless the user chooses the non-MR 
> mode by specifiying '-xm sequential' while invoking the driver.
> 
> 
> 
> 
> 
> On Wednesday, January 29, 2014 1:09 AM, Saeed Adel Mehraban 
>  wrote:
> 
> Thank you for the details.
> So KMeans could be run in both map-reduce and non-mapreduce version and the
> decision will be made in driver, yes?
> 
> 
> 
>> On Tue, Jan 28, 2014 at 11:12 PM, Yoonmin Nam  wrote:
>> 
>> K-Means clustering works by using mahout.clustering.iterator package.
>> 
>> In the iterator package, there is a classes called "CIMapper and
>> CIReducer."
>> 
>> Both of  them are used when we choose a method (-xm) as a mapreduce.
>> 
>> Then, this mapper and reducer works for you!
>> 
>> Thanks.
>> 
>> -Original Message-
>> From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com]
>> Sent: Wednesday, January 29, 2014 4:35 AM
>> To: user@mahout.apache.org
>> Subject: Re: How KMeans clustering works in Mahout 0.8?
>> 
>> I see the package, but I couldn't find anything related to map-reduce. I
>> wonder why!
>> 
>> 
>> On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman <
>> andrew.mussel...@gmail.com> wrote:
>> 
>>> In the source code you could take a look in the
>>> org.apache.mahout.clustering.kmeans package to get a start, if you
>>> want to understand the implementation.
>>> 
>>> If you just want to run some clustering, take a look at
>>> examples/bin/cluster-reuters.sh which has an option to run kmeans.
>>> 
>>> 
>>> On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban
>>> >>> wrote:
>>> 
>>>> I read Mahout KMeans Design of implementation and it seems to be
>>>> clear
>>> wrt
>>>> map-reduce paradigm. But when I refer to source code, I can not find
>>>> the mapper, reducer, combiner or almost anything mentioned in the
>>>> official website. What happened here and what I need to do to
>>>> understand KMeans implementation of Mahout?
> 


Re: How KMeans clustering works in Mahout 0.8?

2014-01-30 Thread Pat Ferrel
Is there any qualitative difference sequential v MR?


On Jan 28, 2014, at 10:11 PM, Suneel Marthi  wrote:

All of Mahout's clustering algos can be run in both MR and non-MR mode.
By default its the MR mode that's executed unless the user chooses the non-MR 
mode by specifiying '-xm sequential' while invoking the driver.





On Wednesday, January 29, 2014 1:09 AM, Saeed Adel Mehraban 
 wrote:

Thank you for the details.
So KMeans could be run in both map-reduce and non-mapreduce version and the
decision will be made in driver, yes?



On Tue, Jan 28, 2014 at 11:12 PM, Yoonmin Nam  wrote:

> K-Means clustering works by using mahout.clustering.iterator package.
> 
> In the iterator package, there is a classes called "CIMapper and
> CIReducer."
> 
> Both of  them are used when we choose a method (-xm) as a mapreduce.
> 
> Then, this mapper and reducer works for you!
> 
> Thanks.
> 
> -Original Message-
> From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com]
> Sent: Wednesday, January 29, 2014 4:35 AM
> To: user@mahout.apache.org
> Subject: Re: How KMeans clustering works in Mahout 0.8?
> 
> I see the package, but I couldn't find anything related to map-reduce. I
> wonder why!
> 
> 
> On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman <
> andrew.mussel...@gmail.com> wrote:
> 
>> In the source code you could take a look in the
>> org.apache.mahout.clustering.kmeans package to get a start, if you
>> want to understand the implementation.
>> 
>> If you just want to run some clustering, take a look at
>> examples/bin/cluster-reuters.sh which has an option to run kmeans.
>> 
>> 
>> On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban
>> >> wrote:
>> 
>>> I read Mahout KMeans Design of implementation and it seems to be
>>> clear
>> wrt
>>> map-reduce paradigm. But when I refer to source code, I can not find
>>> the mapper, reducer, combiner or almost anything mentioned in the
>>> official website. What happened here and what I need to do to
>>> understand KMeans implementation of Mahout?
>>> 
>> 
> 
> 
> 
> 
> 



Re: How KMeans clustering works in Mahout 0.8?

2014-01-28 Thread Suneel Marthi
All of Mahout's clustering algos can be run in both MR and non-MR mode.
By default its the MR mode that's executed unless the user chooses the non-MR 
mode by specifiying '-xm sequential' while invoking the driver.





On Wednesday, January 29, 2014 1:09 AM, Saeed Adel Mehraban 
 wrote:
 
Thank you for the details.
So KMeans could be run in both map-reduce and non-mapreduce version and the
decision will be made in driver, yes?



On Tue, Jan 28, 2014 at 11:12 PM, Yoonmin Nam  wrote:

> K-Means clustering works by using mahout.clustering.iterator package.
>
> In the iterator package, there is a classes called "CIMapper and
> CIReducer."
>
> Both of  them are used when we choose a method (-xm) as a mapreduce.
>
> Then, this mapper and reducer works for you!
>
> Thanks.
>
> -Original Message-
> From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com]
> Sent: Wednesday, January 29, 2014 4:35 AM
> To: user@mahout.apache.org
> Subject: Re: How KMeans clustering works in Mahout 0.8?
>
> I see the package, but I couldn't find anything related to map-reduce. I
> wonder why!
>
>
> On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman <
> andrew.mussel...@gmail.com> wrote:
>
> > In the source code you could take a look in the
> > org.apache.mahout.clustering.kmeans package to get a start, if you
> > want to understand the implementation.
> >
> > If you just want to run some clustering, take a look at
> > examples/bin/cluster-reuters.sh which has an option to run kmeans.
> >
> >
> > On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban
> >  > >wrote:
> >
> > > I read Mahout KMeans Design of implementation and it seems to be
> > > clear
> > wrt
> > > map-reduce paradigm. But when I refer to source code, I can not find
> > > the mapper, reducer, combiner or almost anything mentioned in the
> > > official website. What happened here and what I need to do to
> > > understand KMeans implementation of Mahout?
> > >
> >
>
>
>
>
>
>

Re: How KMeans clustering works in Mahout 0.8?

2014-01-28 Thread Saeed Adel Mehraban
Thank you for the details.
So KMeans could be run in both map-reduce and non-mapreduce version and the
decision will be made in driver, yes?


On Tue, Jan 28, 2014 at 11:12 PM, Yoonmin Nam  wrote:

> K-Means clustering works by using mahout.clustering.iterator package.
>
> In the iterator package, there is a classes called "CIMapper and
> CIReducer."
>
> Both of  them are used when we choose a method (-xm) as a mapreduce.
>
> Then, this mapper and reducer works for you!
>
> Thanks.
>
> -Original Message-
> From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com]
> Sent: Wednesday, January 29, 2014 4:35 AM
> To: user@mahout.apache.org
> Subject: Re: How KMeans clustering works in Mahout 0.8?
>
> I see the package, but I couldn't find anything related to map-reduce. I
> wonder why!
>
>
> On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman <
> andrew.mussel...@gmail.com> wrote:
>
> > In the source code you could take a look in the
> > org.apache.mahout.clustering.kmeans package to get a start, if you
> > want to understand the implementation.
> >
> > If you just want to run some clustering, take a look at
> > examples/bin/cluster-reuters.sh which has an option to run kmeans.
> >
> >
> > On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban
> >  > >wrote:
> >
> > > I read Mahout KMeans Design of implementation and it seems to be
> > > clear
> > wrt
> > > map-reduce paradigm. But when I refer to source code, I can not find
> > > the mapper, reducer, combiner or almost anything mentioned in the
> > > official website. What happened here and what I need to do to
> > > understand KMeans implementation of Mahout?
> > >
> >
>
>
>
>
>
>


RE: How KMeans clustering works in Mahout 0.8?

2014-01-28 Thread Yoonmin Nam
K-Means clustering works by using mahout.clustering.iterator package.

In the iterator package, there is a classes called "CIMapper and CIReducer."

Both of  them are used when we choose a method (-xm) as a mapreduce.

Then, this mapper and reducer works for you!

Thanks.

-Original Message-
From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com] 
Sent: Wednesday, January 29, 2014 4:35 AM
To: user@mahout.apache.org
Subject: Re: How KMeans clustering works in Mahout 0.8?

I see the package, but I couldn't find anything related to map-reduce. I
wonder why!


On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman <
andrew.mussel...@gmail.com> wrote:

> In the source code you could take a look in the 
> org.apache.mahout.clustering.kmeans package to get a start, if you 
> want to understand the implementation.
>
> If you just want to run some clustering, take a look at 
> examples/bin/cluster-reuters.sh which has an option to run kmeans.
>
>
> On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban 
>  >wrote:
>
> > I read Mahout KMeans Design of implementation and it seems to be 
> > clear
> wrt
> > map-reduce paradigm. But when I refer to source code, I can not find 
> > the mapper, reducer, combiner or almost anything mentioned in the 
> > official website. What happened here and what I need to do to 
> > understand KMeans implementation of Mahout?
> >
>







Re: How KMeans clustering works in Mahout 0.8?

2014-01-28 Thread Suneel Marthi
Look at KMeansDriver.java in the specified package and trace thru the code.
You should see both MR and non-MR versions of kmeans impl.





On Tuesday, January 28, 2014 2:35 PM, Saeed Adel Mehraban  
wrote:
 
I see the package, but I couldn't find anything related to map-reduce. I
wonder why!



On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman <
andrew.mussel...@gmail.com> wrote:

> In the source code you could take a look in the
> org.apache.mahout.clustering.kmeans package to get a start, if you want to
> understand the implementation.
>
> If you just want to run some clustering, take a look at
> examples/bin/cluster-reuters.sh which has an option to run kmeans.
>
>
> On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban  >wrote:
>
> > I read Mahout KMeans Design of implementation and it seems to be clear
> wrt
> > map-reduce paradigm. But when I refer to source code, I can not find the
> > mapper, reducer, combiner or almost anything mentioned in the official
> > website. What happened here and what I need to do to understand KMeans
> > implementation of Mahout?
> >
>

Re: How KMeans clustering works in Mahout 0.8?

2014-01-28 Thread Saeed Adel Mehraban
I see the package, but I couldn't find anything related to map-reduce. I
wonder why!


On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman <
andrew.mussel...@gmail.com> wrote:

> In the source code you could take a look in the
> org.apache.mahout.clustering.kmeans package to get a start, if you want to
> understand the implementation.
>
> If you just want to run some clustering, take a look at
> examples/bin/cluster-reuters.sh which has an option to run kmeans.
>
>
> On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban  >wrote:
>
> > I read Mahout KMeans Design of implementation and it seems to be clear
> wrt
> > map-reduce paradigm. But when I refer to source code, I can not find the
> > mapper, reducer, combiner or almost anything mentioned in the official
> > website. What happened here and what I need to do to understand KMeans
> > implementation of Mahout?
> >
>


Re: How KMeans clustering works in Mahout 0.8?

2014-01-27 Thread Andrew Musselman
In the source code you could take a look in the
org.apache.mahout.clustering.kmeans package to get a start, if you want to
understand the implementation.

If you just want to run some clustering, take a look at
examples/bin/cluster-reuters.sh which has an option to run kmeans.


On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban wrote:

> I read Mahout KMeans Design of implementation and it seems to be clear wrt
> map-reduce paradigm. But when I refer to source code, I can not find the
> mapper, reducer, combiner or almost anything mentioned in the official
> website. What happened here and what I need to do to understand KMeans
> implementation of Mahout?
>