Re: How KMeans clustering works in Mahout 0.8?
Is there any qualitative difference sequential v MR? On Jan 28, 2014, at 10:11 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: All of Mahout's clustering algos can be run in both MR and non-MR mode. By default its the MR mode that's executed unless the user chooses the non-MR mode by specifiying '-xm sequential' while invoking the driver. On Wednesday, January 29, 2014 1:09 AM, Saeed Adel Mehraban s.ade...@gmail.com wrote: Thank you for the details. So KMeans could be run in both map-reduce and non-mapreduce version and the decision will be made in driver, yes? On Tue, Jan 28, 2014 at 11:12 PM, Yoonmin Nam rony...@dgist.ac.kr wrote: K-Means clustering works by using mahout.clustering.iterator package. In the iterator package, there is a classes called CIMapper and CIReducer. Both of them are used when we choose a method (-xm) as a mapreduce. Then, this mapper and reducer works for you! Thanks. -Original Message- From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com] Sent: Wednesday, January 29, 2014 4:35 AM To: user@mahout.apache.org Subject: Re: How KMeans clustering works in Mahout 0.8? I see the package, but I couldn't find anything related to map-reduce. I wonder why! On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: In the source code you could take a look in the org.apache.mahout.clustering.kmeans package to get a start, if you want to understand the implementation. If you just want to run some clustering, take a look at examples/bin/cluster-reuters.sh which has an option to run kmeans. On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban s.ade...@gmail.com wrote: I read Mahout KMeans Design of implementation and it seems to be clear wrt map-reduce paradigm. But when I refer to source code, I can not find the mapper, reducer, combiner or almost anything mentioned in the official website. What happened here and what I need to do to understand KMeans implementation of Mahout?
Re: How KMeans clustering works in Mahout 0.8?
No Sent from my iPhone On Jan 30, 2014, at 10:57 AM, Pat Ferrel p...@occamsmachete.com wrote: Is there any qualitative difference sequential v MR? On Jan 28, 2014, at 10:11 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: All of Mahout's clustering algos can be run in both MR and non-MR mode. By default its the MR mode that's executed unless the user chooses the non-MR mode by specifiying '-xm sequential' while invoking the driver. On Wednesday, January 29, 2014 1:09 AM, Saeed Adel Mehraban s.ade...@gmail.com wrote: Thank you for the details. So KMeans could be run in both map-reduce and non-mapreduce version and the decision will be made in driver, yes? On Tue, Jan 28, 2014 at 11:12 PM, Yoonmin Nam rony...@dgist.ac.kr wrote: K-Means clustering works by using mahout.clustering.iterator package. In the iterator package, there is a classes called CIMapper and CIReducer. Both of them are used when we choose a method (-xm) as a mapreduce. Then, this mapper and reducer works for you! Thanks. -Original Message- From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com] Sent: Wednesday, January 29, 2014 4:35 AM To: user@mahout.apache.org Subject: Re: How KMeans clustering works in Mahout 0.8? I see the package, but I couldn't find anything related to map-reduce. I wonder why! On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: In the source code you could take a look in the org.apache.mahout.clustering.kmeans package to get a start, if you want to understand the implementation. If you just want to run some clustering, take a look at examples/bin/cluster-reuters.sh which has an option to run kmeans. On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban s.ade...@gmail.com wrote: I read Mahout KMeans Design of implementation and it seems to be clear wrt map-reduce paradigm. But when I refer to source code, I can not find the mapper, reducer, combiner or almost anything mentioned in the official website. What happened here and what I need to do to understand KMeans implementation of Mahout?
Re: How KMeans clustering works in Mahout 0.8?
I see the package, but I couldn't find anything related to map-reduce. I wonder why! On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: In the source code you could take a look in the org.apache.mahout.clustering.kmeans package to get a start, if you want to understand the implementation. If you just want to run some clustering, take a look at examples/bin/cluster-reuters.sh which has an option to run kmeans. On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban s.ade...@gmail.com wrote: I read Mahout KMeans Design of implementation and it seems to be clear wrt map-reduce paradigm. But when I refer to source code, I can not find the mapper, reducer, combiner or almost anything mentioned in the official website. What happened here and what I need to do to understand KMeans implementation of Mahout?
Re: How KMeans clustering works in Mahout 0.8?
Look at KMeansDriver.java in the specified package and trace thru the code. You should see both MR and non-MR versions of kmeans impl. On Tuesday, January 28, 2014 2:35 PM, Saeed Adel Mehraban s.ade...@gmail.com wrote: I see the package, but I couldn't find anything related to map-reduce. I wonder why! On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: In the source code you could take a look in the org.apache.mahout.clustering.kmeans package to get a start, if you want to understand the implementation. If you just want to run some clustering, take a look at examples/bin/cluster-reuters.sh which has an option to run kmeans. On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban s.ade...@gmail.com wrote: I read Mahout KMeans Design of implementation and it seems to be clear wrt map-reduce paradigm. But when I refer to source code, I can not find the mapper, reducer, combiner or almost anything mentioned in the official website. What happened here and what I need to do to understand KMeans implementation of Mahout?
RE: How KMeans clustering works in Mahout 0.8?
K-Means clustering works by using mahout.clustering.iterator package. In the iterator package, there is a classes called CIMapper and CIReducer. Both of them are used when we choose a method (-xm) as a mapreduce. Then, this mapper and reducer works for you! Thanks. -Original Message- From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com] Sent: Wednesday, January 29, 2014 4:35 AM To: user@mahout.apache.org Subject: Re: How KMeans clustering works in Mahout 0.8? I see the package, but I couldn't find anything related to map-reduce. I wonder why! On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: In the source code you could take a look in the org.apache.mahout.clustering.kmeans package to get a start, if you want to understand the implementation. If you just want to run some clustering, take a look at examples/bin/cluster-reuters.sh which has an option to run kmeans. On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban s.ade...@gmail.com wrote: I read Mahout KMeans Design of implementation and it seems to be clear wrt map-reduce paradigm. But when I refer to source code, I can not find the mapper, reducer, combiner or almost anything mentioned in the official website. What happened here and what I need to do to understand KMeans implementation of Mahout?
Re: How KMeans clustering works in Mahout 0.8?
Thank you for the details. So KMeans could be run in both map-reduce and non-mapreduce version and the decision will be made in driver, yes? On Tue, Jan 28, 2014 at 11:12 PM, Yoonmin Nam rony...@dgist.ac.kr wrote: K-Means clustering works by using mahout.clustering.iterator package. In the iterator package, there is a classes called CIMapper and CIReducer. Both of them are used when we choose a method (-xm) as a mapreduce. Then, this mapper and reducer works for you! Thanks. -Original Message- From: Saeed Adel Mehraban [mailto:s.ade...@gmail.com] Sent: Wednesday, January 29, 2014 4:35 AM To: user@mahout.apache.org Subject: Re: How KMeans clustering works in Mahout 0.8? I see the package, but I couldn't find anything related to map-reduce. I wonder why! On Tue, Jan 28, 2014 at 4:14 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: In the source code you could take a look in the org.apache.mahout.clustering.kmeans package to get a start, if you want to understand the implementation. If you just want to run some clustering, take a look at examples/bin/cluster-reuters.sh which has an option to run kmeans. On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban s.ade...@gmail.com wrote: I read Mahout KMeans Design of implementation and it seems to be clear wrt map-reduce paradigm. But when I refer to source code, I can not find the mapper, reducer, combiner or almost anything mentioned in the official website. What happened here and what I need to do to understand KMeans implementation of Mahout?
Re: How KMeans clustering works in Mahout 0.8?
In the source code you could take a look in the org.apache.mahout.clustering.kmeans package to get a start, if you want to understand the implementation. If you just want to run some clustering, take a look at examples/bin/cluster-reuters.sh which has an option to run kmeans. On Mon, Jan 27, 2014 at 5:51 AM, Saeed Adel Mehraban s.ade...@gmail.comwrote: I read Mahout KMeans Design of implementation and it seems to be clear wrt map-reduce paradigm. But when I refer to source code, I can not find the mapper, reducer, combiner or almost anything mentioned in the official website. What happened here and what I need to do to understand KMeans implementation of Mahout?