Re: How KMeans clustering works in Mahout 0.8?
No Sent from my iPhone > On Jan 30, 2014, at 10:57 AM, Pat Ferrel wrote: > > Is there any qualitative difference sequential v MR? > > > On Jan 28, 2014, at 10:11 PM, Suneel Marthi wrote: > > All of Mahout's clustering algos can be run in both MR and non-MR mode. > By default its the MR mode that's executed unless the user chooses the non-MR > mode by specifiying '-xm sequential' while invoking the driver. > > > > > > On Wednesday, January 29, 2014 1:09 AM, Saeed Adel Mehraban > wrote: > > Thank you for the details. > So KMeans could be run in both map-reduce and non-mapreduce version and the > decision will be made in driver, yes? > > > >> On Tue, Jan 28, 2014 at 11:12 PM, Yoonmin Nam wrote: >> >> K-Means clustering works by using mahout.clustering.iterator package. >> >> In the iterator package, there is a classes called "CIMapper and >> CIReducer." >> >> Both of them are used when we choose a method (-xm) as a mapreduce. >> >> Then, this mapper and reducer works for you! >> >> Thanks. >> >> -Original Message- >> From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com] >> Sent: Wednesday, January 29, 2014 4:35 AM >> To: user@mahout.apache.org >> Subject: Re: How KMeans clustering works in Mahout 0.8? >> >> I see the package, but I couldn't find anything related to map-reduce. I >> wonder why! >> >> >> On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman < >> andrew.mussel...@gmail.com> wrote: >> >>> In the source code you could take a look in the >>> org.apache.mahout.clustering.kmeans package to get a start, if you >>> want to understand the implementation. >>> >>> If you just want to run some clustering, take a look at >>> examples/bin/cluster-reuters.sh which has an option to run kmeans. >>> >>> >>> On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban >>> >>> wrote: >>> >>>> I read Mahout KMeans Design of implementation and it seems to be >>>> clear >>> wrt >>>> map-reduce paradigm. But when I refer to source code, I can not find >>>> the mapper, reducer, combiner or almost anything mentioned in the >>>> official website. What happened here and what I need to do to >>>> understand KMeans implementation of Mahout? >
Re: How KMeans clustering works in Mahout 0.8?
Is there any qualitative difference sequential v MR? On Jan 28, 2014, at 10:11 PM, Suneel Marthi wrote: All of Mahout's clustering algos can be run in both MR and non-MR mode. By default its the MR mode that's executed unless the user chooses the non-MR mode by specifiying '-xm sequential' while invoking the driver. On Wednesday, January 29, 2014 1:09 AM, Saeed Adel Mehraban wrote: Thank you for the details. So KMeans could be run in both map-reduce and non-mapreduce version and the decision will be made in driver, yes? On Tue, Jan 28, 2014 at 11:12 PM, Yoonmin Nam wrote: > K-Means clustering works by using mahout.clustering.iterator package. > > In the iterator package, there is a classes called "CIMapper and > CIReducer." > > Both of them are used when we choose a method (-xm) as a mapreduce. > > Then, this mapper and reducer works for you! > > Thanks. > > -Original Message- > From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com] > Sent: Wednesday, January 29, 2014 4:35 AM > To: user@mahout.apache.org > Subject: Re: How KMeans clustering works in Mahout 0.8? > > I see the package, but I couldn't find anything related to map-reduce. I > wonder why! > > > On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman < > andrew.mussel...@gmail.com> wrote: > >> In the source code you could take a look in the >> org.apache.mahout.clustering.kmeans package to get a start, if you >> want to understand the implementation. >> >> If you just want to run some clustering, take a look at >> examples/bin/cluster-reuters.sh which has an option to run kmeans. >> >> >> On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban >> >> wrote: >> >>> I read Mahout KMeans Design of implementation and it seems to be >>> clear >> wrt >>> map-reduce paradigm. But when I refer to source code, I can not find >>> the mapper, reducer, combiner or almost anything mentioned in the >>> official website. What happened here and what I need to do to >>> understand KMeans implementation of Mahout? >>> >> > > > > >
Re: How KMeans clustering works in Mahout 0.8?
All of Mahout's clustering algos can be run in both MR and non-MR mode. By default its the MR mode that's executed unless the user chooses the non-MR mode by specifiying '-xm sequential' while invoking the driver. On Wednesday, January 29, 2014 1:09 AM, Saeed Adel Mehraban wrote: Thank you for the details. So KMeans could be run in both map-reduce and non-mapreduce version and the decision will be made in driver, yes? On Tue, Jan 28, 2014 at 11:12 PM, Yoonmin Nam wrote: > K-Means clustering works by using mahout.clustering.iterator package. > > In the iterator package, there is a classes called "CIMapper and > CIReducer." > > Both of them are used when we choose a method (-xm) as a mapreduce. > > Then, this mapper and reducer works for you! > > Thanks. > > -Original Message- > From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com] > Sent: Wednesday, January 29, 2014 4:35 AM > To: user@mahout.apache.org > Subject: Re: How KMeans clustering works in Mahout 0.8? > > I see the package, but I couldn't find anything related to map-reduce. I > wonder why! > > > On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman < > andrew.mussel...@gmail.com> wrote: > > > In the source code you could take a look in the > > org.apache.mahout.clustering.kmeans package to get a start, if you > > want to understand the implementation. > > > > If you just want to run some clustering, take a look at > > examples/bin/cluster-reuters.sh which has an option to run kmeans. > > > > > > On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban > > > >wrote: > > > > > I read Mahout KMeans Design of implementation and it seems to be > > > clear > > wrt > > > map-reduce paradigm. But when I refer to source code, I can not find > > > the mapper, reducer, combiner or almost anything mentioned in the > > > official website. What happened here and what I need to do to > > > understand KMeans implementation of Mahout? > > > > > > > > > > >
Re: How KMeans clustering works in Mahout 0.8?
Thank you for the details. So KMeans could be run in both map-reduce and non-mapreduce version and the decision will be made in driver, yes? On Tue, Jan 28, 2014 at 11:12 PM, Yoonmin Nam wrote: > K-Means clustering works by using mahout.clustering.iterator package. > > In the iterator package, there is a classes called "CIMapper and > CIReducer." > > Both of them are used when we choose a method (-xm) as a mapreduce. > > Then, this mapper and reducer works for you! > > Thanks. > > -Original Message- > From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com] > Sent: Wednesday, January 29, 2014 4:35 AM > To: user@mahout.apache.org > Subject: Re: How KMeans clustering works in Mahout 0.8? > > I see the package, but I couldn't find anything related to map-reduce. I > wonder why! > > > On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman < > andrew.mussel...@gmail.com> wrote: > > > In the source code you could take a look in the > > org.apache.mahout.clustering.kmeans package to get a start, if you > > want to understand the implementation. > > > > If you just want to run some clustering, take a look at > > examples/bin/cluster-reuters.sh which has an option to run kmeans. > > > > > > On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban > > > >wrote: > > > > > I read Mahout KMeans Design of implementation and it seems to be > > > clear > > wrt > > > map-reduce paradigm. But when I refer to source code, I can not find > > > the mapper, reducer, combiner or almost anything mentioned in the > > > official website. What happened here and what I need to do to > > > understand KMeans implementation of Mahout? > > > > > > > > > > >
RE: How KMeans clustering works in Mahout 0.8?
K-Means clustering works by using mahout.clustering.iterator package. In the iterator package, there is a classes called "CIMapper and CIReducer." Both of them are used when we choose a method (-xm) as a mapreduce. Then, this mapper and reducer works for you! Thanks. -Original Message- From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com] Sent: Wednesday, January 29, 2014 4:35 AM To: user@mahout.apache.org Subject: Re: How KMeans clustering works in Mahout 0.8? I see the package, but I couldn't find anything related to map-reduce. I wonder why! On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman < andrew.mussel...@gmail.com> wrote: > In the source code you could take a look in the > org.apache.mahout.clustering.kmeans package to get a start, if you > want to understand the implementation. > > If you just want to run some clustering, take a look at > examples/bin/cluster-reuters.sh which has an option to run kmeans. > > > On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban > >wrote: > > > I read Mahout KMeans Design of implementation and it seems to be > > clear > wrt > > map-reduce paradigm. But when I refer to source code, I can not find > > the mapper, reducer, combiner or almost anything mentioned in the > > official website. What happened here and what I need to do to > > understand KMeans implementation of Mahout? > > >
Re: How KMeans clustering works in Mahout 0.8?
Look at KMeansDriver.java in the specified package and trace thru the code. You should see both MR and non-MR versions of kmeans impl. On Tuesday, January 28, 2014 2:35 PM, Saeed Adel Mehraban wrote: I see the package, but I couldn't find anything related to map-reduce. I wonder why! On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman < andrew.mussel...@gmail.com> wrote: > In the source code you could take a look in the > org.apache.mahout.clustering.kmeans package to get a start, if you want to > understand the implementation. > > If you just want to run some clustering, take a look at > examples/bin/cluster-reuters.sh which has an option to run kmeans. > > > On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban >wrote: > > > I read Mahout KMeans Design of implementation and it seems to be clear > wrt > > map-reduce paradigm. But when I refer to source code, I can not find the > > mapper, reducer, combiner or almost anything mentioned in the official > > website. What happened here and what I need to do to understand KMeans > > implementation of Mahout? > > >
Re: How KMeans clustering works in Mahout 0.8?
I see the package, but I couldn't find anything related to map-reduce. I wonder why! On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman < andrew.mussel...@gmail.com> wrote: > In the source code you could take a look in the > org.apache.mahout.clustering.kmeans package to get a start, if you want to > understand the implementation. > > If you just want to run some clustering, take a look at > examples/bin/cluster-reuters.sh which has an option to run kmeans. > > > On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban >wrote: > > > I read Mahout KMeans Design of implementation and it seems to be clear > wrt > > map-reduce paradigm. But when I refer to source code, I can not find the > > mapper, reducer, combiner or almost anything mentioned in the official > > website. What happened here and what I need to do to understand KMeans > > implementation of Mahout? > > >
Re: How KMeans clustering works in Mahout 0.8?
In the source code you could take a look in the org.apache.mahout.clustering.kmeans package to get a start, if you want to understand the implementation. If you just want to run some clustering, take a look at examples/bin/cluster-reuters.sh which has an option to run kmeans. On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban wrote: > I read Mahout KMeans Design of implementation and it seems to be clear wrt > map-reduce paradigm. But when I refer to source code, I can not find the > mapper, reducer, combiner or almost anything mentioned in the official > website. What happened here and what I need to do to understand KMeans > implementation of Mahout? >