Re: Supported Lucene Index Version

2016-09-13 Thread Suneel Marthi
, 2016 at 7:14 AM, Raviteja Lokineni < > raviteja.lokin...@gmail.com> wrote: > >> FYI, the versions quoted are for SNAPSHOT. They will be available in 13.0 >> probably, as per the below ticket. >> >> https://issues.apache.org/jira/browse/MAHOUT-1876 >>

Re: Supported Lucene Index Version

2016-09-12 Thread Suneel Marthi
It's Lucene 5.5.2. Solr 6.0 and above mandate Java 8. On Tue, Sep 13, 2016 at 12:04 AM, Reth RM wrote: > What is the latest lucene index version that is supported? > > trying to generate lucene vectors, index created using solr 4.10.2 and solr > 6.0 apis. > > command > > >

Re: AbstractJob class not found exception

2016-08-16 Thread Suneel Marthi
Which Mahout version are u running? On Tue, Aug 16, 2016 at 7:10 AM, Lee S wrote: > I try to run local mahout job in my main function, > > but when execute it come out with exception: > > java.lang.NoClassDefFoundError: org/apache/mahout/common/AbstractJob > at

Re: Text clustering how to?

2016-07-27 Thread Suneel Marthi
You did get a reply via jira, please stop spamming Mahout and OpenNLP mailing lists with the same question. The book u r looking at 'Taming Text' is from 2011-12, and both OpenNLP and Mahout projects have long diverged from the book. If u r following the book for ur learning, u may be better off

[ANNOUNCE] Apache Mahout 0.12.2 Release

2016-06-13 Thread Suneel Marthi
The Apache Mahout PMC is pleased to announce the release of Mahout 0.12.2 which is a minor release following 0.12.1 in May 2016. Mahout's goal is to create an environment for quickly creating machine learning applications that scale and run on the highest performance parallel computation engines

Re: [VOTE] Mahout 0.12.2 Release Candidate 2

2016-06-13 Thread Suneel Marthi
che.org > Cc: mahout <d...@mahout.apache.org> > Subject: Re: [VOTE] Mahout 0.12.2 Release Candidate 2 > > Signatures and hashes are correct; +1 (binding). > > On Fri, Jun 10, 2016 at 6:05 PM, Suneel Marthi <smar...@apache.org> wrote: > > > Verified {bin} * {zip,tar} - ran tests, tests pass > > > > >

Re: [VOTE] Mahout 0.12.2 Release Candidate 2

2016-06-10 Thread Suneel Marthi
Verified {bin} * {zip,tar} - ran tests, tests pass Verified {src} * {zip,tar} - ran tests, tests pass Here's my +1 (binding) On Fri, Jun 10, 2016 at 8:59 PM, Suneel Marthi <smar...@apache.org> wrote: > This is the vote for release 0.12.2 of Apache Mahout. > > The vot

[VOTE] Mahout 0.12.2 Release Candidate 2

2016-06-10 Thread Suneel Marthi
This is the vote for release 0.12.2 of Apache Mahout. The vote will be going for at least 72 hours and will be closed on Sunday, June 12th, 2016 or once there are at least 3 PMC +1 binding votes (whichever occurs earlier). Please download, test and vote with [ ] +1, accept RC as the official

Re: [VOTE] Apache Mahout 0.12.2 Release Candidate

2016-06-10 Thread Suneel Marthi
l and all tests > pass. > > +1 (binding) > > On Fri, Jun 10, 2016 at 2:25 PM, Suneel Marthi <smar...@apache.org> wrote: > > > Verified {bin} * {zip,tar} - ran tests, tests pass > > > > Verified {src} * {zip,tar} - ran tests, tests pass > > > >

Re: [VOTE] Apache Mahout 0.12.2 Release Candidate

2016-06-10 Thread Suneel Marthi
Verified {bin} * {zip,tar} - ran tests, tests pass Verified {src} * {zip,tar} - ran tests, tests pass Here's my +1 (binding) On Fri, Jun 10, 2016 at 5:12 PM, Suneel Marthi <smar...@apache.org> wrote: > This is the vote for release 0.12.2 of Apache Mahout. > > The vot

Re: Stickers

2016-06-02 Thread Suneel Marthi
Jun 2, 2016 at 6:24 PM, Andrew Musselman < > > andrew.mussel...@gmail.com> wrote: > > > >> Ordered a hundred; will post the proof when it's ready. > >> > >> On Thu, Jun 2, 2016 at 6:19 PM, Andrew Musselman < > >> andrew.mussel...@gmail.com

Re: Welcome Trevor Grant as a new Mahout Committer

2016-05-24 Thread Suneel Marthi
Welcome Trevor !!! Kokanee Cheers !! On Mon, May 23, 2016 at 8:39 PM, Andrew Palumbo wrote: > In recognition of Trevor Grant's contributions to the Mahout project > notably his Zeppelin Integration work, the PMC has invited and is pleased > to announce that he has accepted

[ANNOUNCE] Apache Mahout 0.12.1 Release

2016-05-18 Thread Suneel Marthi
The Apache Mahout PMC is pleased to announce the release of Mahout 0.12.1 which is a minor release following 0.12.0 release on April 11, 2016. Mahout's goal is to create an environment for quickly creating machine learning applications that scale and run on the highest performance parallel

Re: [VOTE] Apache Mahout 0.12.1 Release

2016-05-18 Thread Suneel Marthi
3:53 PM, Andrew Palumbo <ap@outlook.com> > wrote: > > > +1 (binding) tested a clean source build. > > > > ________ > > From: Suneel Marthi <smar...@apache.org> > > Sent: Wednesday, May 18, 2016 6:23:57 PM > > To:

Re: [VOTE] Apache Mahout 0.12.1 Release

2016-05-18 Thread Suneel Marthi
Verified {src} * {tar, zip} Ran a clean build and tests and see no issues +1 (binding) On Wed, May 18, 2016 at 6:07 PM, Suneel Marthi <smar...@apache.org> wrote: > This is the vote for release 0.12.1 of Apache Mahout. > > The vote will be going for at least 72 hours and

[VOTE] Apache Mahout 0.12.1 Release

2016-05-18 Thread Suneel Marthi
This is the vote for release 0.12.1 of Apache Mahout. The vote will be going for at least 72 hours and will be closed on Wednesday, May 21st, 2016. Please download, test and vote with [ ] +1, accept RC as the official 0.12.1 release of Apache Mahout [ ] +0, I don't care either way, [ ] -1, do

Re: About reuters-fkmeans-centroids

2016-04-28 Thread Suneel Marthi
That's correct, deprecated as of Feb 2014 and will be completely purged in one of the upcoming releases (0.13.0) On Thu, Apr 28, 2016 at 2:10 PM, Dmitriy Lyubimov wrote: > Prakash, > > if you are using any Mahout Mapreduce algorithm for research, please make > sure to make

Re: About reuters-fkmeans-centroids

2016-04-28 Thread Suneel Marthi
Yes, the entire MapReduce code (which includes the fuzzy clustering that u r looking at) is not supported anymore as of Mahout 0.10.0 (suggest reading the release notes on mahout.apache.org) On Thu, Apr 28, 2016 at 2:05 PM, Prakash Poudyal wrote: > Hi! Ted, > > You

Re: About reuters-fkmeans-centroids

2016-04-28 Thread Suneel Marthi
t's being done. feel free to pose more questions. > Thank you so much. I was being stuck since last two days. Hope you will > reply me sooner. > > Prakash > > > On Thu, Apr 28, 2016 at 6:26 PM, Suneel Marthi <smar...@apache.org> wrote: > > > First thing, mo

Re: About reuters-fkmeans-centroids

2016-04-28 Thread Suneel Marthi
First thing, most of this code is legacy MapReduce and is not supported anymore. Hence you r not seeing answers. Back to ur question: -c specifies the folder for the initial centroids that r randomly generated. IIRC, the centroids are generated when u execute the Clustering Driver. On Wed, Apr

Re: Mahout with Hadoop 2.2.0

2016-04-25 Thread Suneel Marthi
On Mon, Apr 25, 2016 at 9:16 AM, Nantia Makrynioti wrote: > Thank for your replies! > > Mahout 0.12.0 from command line worked great. > > However, if I want to develop the same example with Java API, what version > of mahout-core should I use in the pom.xml? > > I tried

Re: Mahout with Hadoop 2.2.0

2016-04-25 Thread Suneel Marthi
Mahout 0.9 is not supported anymore and u shouldn't be using it. The issue u r seeing is due to Mahout 0.9 not being compatible with hadoop 2.x. Suggest you try this example with Mahout 0.11.0 or 0.12.0. On Mon, Apr 25, 2016 at 6:27 AM, Nantia Makrynioti wrote: >

Congratulations to our new Chair

2016-04-20 Thread Suneel Marthi
Please join me in congratulating Andrew Palumbo on becoming our new Project Chair. As for me, it was a pleasure to serve as Chair starting with the Mahout 0.10.0 release and ending with the recent 0.12.0 release, and perhaps we will do it again someday. Congrats again, Andy!

[ANNOUNCE] Apache Mahout 0.12.0 Release

2016-04-11 Thread Suneel Marthi
The Apache Mahout PMC is pleased to announce the release of Mahout 0.12.0. Mahout's goal is to create an environment for quickly creating machine learning applications that scale and run on the highest performance parallel computation engines available. Mahout comprises an interactive environment

Re: [VOTE] Apache Mahout 0.12.0 Release Candidate

2016-04-11 Thread Suneel Marthi
er@mahout.apache.org > > Subject: Re: [VOTE] Apache Mahout 0.12.0 Release Candidate > > > > Sigs and hashes are correct, running a build and examples next. > > > > On Mon, Apr 11, 2016 at 8:38 AM, Suneel Marthi <smar...@apache.org> > wrote: > > >

Re: [VOTE] Apache Mahout 0.12.0 Release Candidate

2016-04-11 Thread Suneel Marthi
Ran a complete build on {src} * {zip, tar} and verified that all tests pass. Tested Spark Shell All Flink tests pass +1 (binding) On Mon, Apr 11, 2016 at 8:44 AM, Suneel Marthi <smar...@apache.org> wrote: > Correction to previous message > -- >

Re: [VOTE] Apache Mahout 0.12.0 Release Candidate

2016-04-11 Thread Suneel Marthi
/ The git tag to be voted upon is mahout-0.12.0 On Mon, Apr 11, 2016 at 8:41 AM, Suneel Marthi <smar...@apache.org> wrote: > This is a vote for release 0.12.0 of Apache Mahout that adds Apache Flink > as an execution engine to the Samsara Linear Algebra framework. > > The vo

[VOTE] Apache Mahout 0.12.0 Release Candidate

2016-04-11 Thread Suneel Marthi
This is a vote for release 0.12.0 of Apache Mahout that adds Apache Flink as an execution engine to the Samsara Linear Algebra framework. The vote will run for 24 hours and will be closed on Monday, April 12th, 2016. Please download, test and vote with [ ] +1, accept RC as the official 0.12.0

Re: [VOTE] Apache Mahout 0.12.0 Release Candidate

2016-04-10 Thread Suneel Marthi
Rolling back the Release Candidate, will put up a new RC in an hour. On Mon, Apr 11, 2016 at 1:15 AM, Andrew Musselman < andrew.mussel...@gmail.com> wrote: > -1 Problem found during testing. > > On Sun, Apr 10, 2016 at 7:29 PM, Suneel Marthi <smar...@apache.org> wrote:

[VOTE] Apache Mahout 0.12.0 Release Candidate

2016-04-10 Thread Suneel Marthi
This is the vote for release 0.12.0 of Apache Mahout that adds Apache Flink as an execution engine to the Samsara Linear Algebra framework. The vote will run for 24 hours and will be closed on Monday, April 12th, 2016. Please download, test and vote with [ ] +1, accept RC as the official 0.12.0

Re: Removing MAHOUT_LOCAL option

2016-03-21 Thread Suneel Marthi
ch 20, 2016, Pat Ferrel <p...@occamsmachete.com> wrote: > >>>>> > >>>>>> Are we just talking about Hadoop Mapreduce? I thought it was ignored > >>>> when > >>>>>> using S

Re: Removing MAHOUT_LOCAL option

2016-03-19 Thread Suneel Marthi
+1 to remove this Sent from my iPhone > On Mar 20, 2016, at 12:01 AM, Andrew Musselman > wrote: > > We're discussing removing the MAHOUT_LOCAL option in order to trim artifact > sizes. > > If you think keeping the option to use MAHOUT_LOCAL for testing with the >

[ANNOUNCE] Apache Mahout 0.11.2 Release

2016-03-12 Thread Suneel Marthi
Apache Mahout 0.11.2 Release Notes The Apache Mahout PMC is pleased to announce the release of Mahout 0.11.2. Mahout's goal is to create an environment for quickly creating machine learning applications that scale and run on the highest performance parallel computation engines available. Mahout

Re: [VOTE] Apache Mahout 0.11.2 Release Candidate

2016-03-11 Thread Suneel Marthi
ssifier.mscala script. All without issue > (aside from the URL changes mentioned in the release notes). > > +1 > > ________ > From: Suneel Marthi <smar...@apache.org> > Sent: Friday, March 11, 2016 6:03 PM > To: user@mahout.apache.org

[VOTE] Apache Mahout 0.11.2 Release Candidate

2016-03-11 Thread Suneel Marthi
This is the vote for release 0.11.2 of Apache Mahout. The vote will be going for 24 hours and will be closed on Sunday, March 12th, 2016. Please download, test and vote with [ ] +1, accept RC as the official 0.11.2 release of Apache Mahout [ ] +0, I don't care either way, [ ] -1, do not accept

Re: New Mahout "Samsara" Book

2016-02-25 Thread Suneel Marthi
leaves a lot of recommendation code that needs to be > covered. There is room for another book. > > > > > > On Thu, Feb 25, 2016 at 9:32 AM, Suneel Marthi <smar...@apache.org> wrote: > > > The Mahout project has diverged from 'Mahout in Action' since Mahout 0.7 >

Re: New Mahout "Samsara" Book

2016-02-25 Thread Suneel Marthi
The Mahout project has diverged from 'Mahout in Action' since Mahout 0.7 release in 2012. On Thu, Feb 25, 2016 at 12:30 PM, FRANCISCO XAVIER SUMBA TORAL < xavier.sumb...@ucuenca.ec> wrote: > Awesome!! This book it’s mostly about Samsara. I’ll buy it. BTW do you > know if there is an update of

Re: New Mahout "Samsara" Book

2016-02-25 Thread Suneel Marthi
on the first Monday of every month. We have almost > 600 members with average attendance somewhere north of 50 per event (High > of 110 and low of 25). > > Cheers, > > SCott > > > On Feb 25, 2016, at 8:56 AM, Suneel Marthi <smar...@apache.org> wrote: > > >

Re: New Mahout "Samsara" Book

2016-02-25 Thread Suneel Marthi
It does give u TOC when u 'Look Inside'. On Thu, Feb 25, 2016 at 10:16 AM, Pavan K Narayanan < pavan.naraya...@gmail.com> wrote: > I checked both links, they have only front and back cover of the book. No > table of contents > On Feb 25, 2016 9:57 AM, "Suneel Marthi"

Re: New Mahout "Samsara" Book

2016-02-25 Thread Suneel Marthi
You can see the TOC on Amazon http://www.amazon.com/Apache-Mahout-MapReduce-Dmitriy-Lyubimov/dp/1523775785 On Thu, Feb 25, 2016 at 9:55 AM, Pavan K Narayanan < pavan.naraya...@gmail.com> wrote: > Andrew, can you please attach table of contents if you don't mind. > On Feb 25, 2016 8:05 AM,

Re: Confusion regarding Samsara's configuration

2016-02-02 Thread Suneel Marthi
Are u working off of Mahout 0.11.1 ? 0.11.1 has been certified for Spark 1.5 but compatible with 1.6. On Tue, Feb 2, 2016 at 12:10 PM, BahaaEddin AlAila wrote: > Thank you very much for your reply. > As I mentioned earlier, I am using mahoutSparkContext, and MAHOUT_HOME

Re: Some test results

2015-12-30 Thread Suneel Marthi
 On Wed, Dec 30, 2015 at 2:57 PM, Dmitriy Lyubimov wrote: > Nice! > On Dec 30, 2015 11:51 AM, "Pat Ferrel" wrote: > > > As many of you know Mahout-Samsara includes an interesting and important > > extension to cooccurrence similarity, which supports

Re: Updated books

2015-12-11 Thread Suneel Marthi
All of the below mentioned books are still based on Mahout 0.10 and cover the old MapReduce algorithms - all of which have been long deprecated/retired/"to be purged". There's no book on Mahout out there today that deals with the new Mahout, it's best you bring up ur questions on the email lists

Re: Mahout item based recommender help documentation

2015-12-02 Thread Suneel Marthi
Mahout 0.9 isn't supported anymore, suggest that you upgrade to Mahout 0.11.0 which is Spark 1.3+ compatible. On Wed, Dec 2, 2015 at 7:22 PM, Weiqing Jin wrote: > Hi, I am new to Mahout. I am using Mahout on Cloudera CDH5.3. I believe it > has version 0.9. Wondering

Re: [VOTE] Apache Mahout 0.11.1 Release Candidate

2015-11-06 Thread Suneel Marthi
; wrote: > > > > > 1. Downloaded and built {src} {tar}- all tests passed. > > > 2. Started shell from {src} {bin} *{tar} distro and ran some > distributed > > > algebra and I/O tests- no problems. > > > 3. Ran MR Wikipedia example. > > > 4. R

Re: [VOTE] Apache Mahout 0.11.1 Release Candidate

2015-11-06 Thread Suneel Marthi
} distro and ran some distributed > > algebra and I/O tests- no problems. > > 3. Ran MR Wikipedia example. > > 4. Ran Spark CLI naive bayes examples. > > > > +1 (binding) > > > > > > > > From:

Re: [VOTE] Apache Mahout 0.11.1 Release Candidate

2015-11-06 Thread Suneel Marthi
ct > > > On Fri, Nov 6, 2015 at 5:17 PM, Suneel Marthi <smar...@apache.org> wrote: > > > D, was there a JIRA for this? Have a vague recollection that we may have > > addressed a similar thing during summer on one of the branches (most > likely > > 0.10.x). No ?

[ANNOUNCE] Apache Mahout 0.11.1 Release

2015-11-06 Thread Suneel Marthi
The Apache Mahout PMC is pleased to announce the release of Mahout 0.11.1. Mahout's goal is to create an environment for quickly creating machine learning applications that scale and run on the highest performance parallel computation engines available. Mahout comprises an interactive

Re: [VOTE] Apache Mahout 0.11.1 Release Candidate

2015-11-06 Thread Suneel Marthi
This Vote is cancelled, a new Release Candidate will be put out sometime today. On Fri, Nov 6, 2015 at 1:54 AM, Suneel Marthi <smar...@apache.org> wrote: > Please vote on releasing the following candidate as Apache Mahout version > 0.11.1: > > Branch: > release-0.11.1

[VOTE] Apache Mahout 0.11.1 Release Candidate

2015-11-06 Thread Suneel Marthi
Please vote on releasing the following candidate as Apache Mahout version 0.11.1: Branch: release-0.11.1 (see https://git1-us-west.apache.org/repos/asf?p=mahout.git) The release artifacts to be voted on can be found at:

Re: [VOTE] Apache Mahout 0.11.1 Release Candidate

2015-11-06 Thread Suneel Marthi
, tests passed Here's my +1 (binding) On Fri, Nov 6, 2015 at 2:41 PM, Suneel Marthi <suneel.mar...@gmail.com> wrote: > Please vote on releasing the following candidate as Apache Mahout version > 0.11.1: > > Branch: > release-0.11.1 > (see https://git1-us-west.apache.or

[VOTE] Apache Mahout 0.11.1 Release Candidate

2015-11-05 Thread Suneel Marthi
Please vote on releasing the following candidate as Apache Mahout version 0.11.1: Branch: release-0.11.1 (see https://git1-us-west.apache.org/repos/asf?p=mahout.git) The release artifacts to be voted on can be found at:

Re: Haters get Love too

2015-11-03 Thread Suneel Marthi
Thanks Pat, very interesting indeed. On Tue, Nov 3, 2015 at 6:20 PM, Pat Ferrel wrote: > A colleague of mine just build a MAP@k precision evaluator for the Mahout > based cooccurrence recommender we’ve been working on and we ran some data > scraped from

Re: Is Mahout obsolete now?

2015-10-19 Thread Suneel Marthi
Thanks Sean. Samsara is the new distributed linear algebra DSL that is engine agnostic and presently supports Spark and H2O (Flink is in the works). We do have Recommenders built on top of Samsara today. On Mon, Oct 19, 2015 at 3:42 PM, Sean Owen wrote: > No, this is pretty

Re: Is Mahout obsolete now?

2015-10-19 Thread Suneel Marthi
This is so inaccurate and not true. You obviously have not been following the Mahout project. Mahout has long moved away from MapReduce and presently supports Spark, H2O and in future Flink as execution engines. I would suggest you look at the recent Mahout 0.11.0 and see where the project is before

Re: Is Mahout obsolete now?

2015-10-19 Thread Suneel Marthi
Hi Prasad, As Sean has explained in an earlier posting on this thread, Mahout 0.9 and earlier which were MapReduce based are not supported anymore. We do have recommenders in Mahout 0.11.0 that have been built on the new Samsara Math DSL. Definitely would suggest that you check out the latest

Re: examples/bin/cluster-reuters.sh fails on k-means: Option (1)

2015-09-16 Thread Suneel Marthi
try running /bin/cluster-reuters.sh --help to see the list of expected input options. On Wed, Sep 16, 2015 at 8:54 PM, Disa Mhembere wrote: > Hello all, > > I'm running mahout 0.11.1 on an ubuntu 14.04 server. I compiled the library > with maven 3.3.3 under java 1.7 and

[RESULT] [VOTE] Apache Mahout 0.10.2 Release

2015-08-06 Thread Suneel Marthi
We had 3 +1 PMC votes and no -1s, the release has passed and the voting is now closed.

[ANNOUNCE] Apache Mahout 0.10.2 Release

2015-08-06 Thread Suneel Marthi
The Apache Mahout PMC is pleased to announce the release of Mahout 0.10.2. Mahout's goal is to create an environment for quickly creating machine learning applications that scale and run on the highest performance parallel computation engines available. Mahout comprises an interactive environment

Re: [VOTE] Apache Mahout 0.10.2 Release Candidate

2015-08-05 Thread Suneel Marthi
Cancelling the 0.10.2 Release, discovered a missing artifact which prevents the release from going thru. On Tue, Aug 4, 2015 at 9:31 PM, Suneel Marthi smar...@apache.org wrote: Thanks for the votes. The Voting for 0.10.2 is officially closed, we had 4 +1 votes and no objections, will send

Re: [VOTE] Apache Mahout 0.11.0 Release Candidate

2015-08-05 Thread Suneel Marthi
Tested {src} * {zip,tar} and all tests pass. +1 (binding) On Thu, Aug 6, 2015 at 1:13 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: +1, already tested On Wed, Aug 5, 2015 at 9:44 PM, Suneel Marthi smar...@apache.org wrote: This is the vote for release 0.11.0 of Apache Mahout

[VOTE] Apache Mahout 0.11.0 Release Candidate

2015-08-05 Thread Suneel Marthi
This is the vote for release 0.11.0 of Apache Mahout. The vote will be going for at least 72 hours and will be closed on Thursday, August 6th, 2015. Please download, test and vote with [ ] +1, accept RC as the official 0.11.0 release of Apache Mahout [ ] +0, I don't care either way, [ ] -1, do

[VOTE] Apache Mahout 0.10.2 Release

2015-08-05 Thread Suneel Marthi
This is the vote for release 0.10.2 of Apache Mahout. The vote will be going for at least 72 hours and will be closed on Thursday, August 6th, 2015. Please download, test and vote with [ ] +1, accept RC as the official 0.10.2 release of Apache Mahout [ ] +0, I don't care either way, [ ] -1, do

Re: [VOTE] Apache Mahout 0.10.2 Release

2015-08-05 Thread Suneel Marthi
Tested the examples from {src, bin} in pseudo-cluster mode and all tests pass. Here's my +1 (binding) On Wed, Aug 5, 2015 at 8:02 PM, Suneel Marthi smar...@apache.org wrote: This is the vote for release 0.10.2 of Apache Mahout. The vote will be going for at least 72 hours and will be closed

Re: [VOTE] Apache Mahout 0.10.2 Release Candidate

2015-08-04 Thread Suneel Marthi
at 11:58 AM, Suneel Marthi smar...@apache.org wrote: If u folks have not read the email from last friday that talks about both 0.10.2 and 0.11.0 releases this week, I would suggest that you please do. The plan is to release both 0.10.2 and 0.11.0 this week. Seems like we have some

[VOTE] Apache Mahout 0.11.0 Release Candidate

2015-08-03 Thread Suneel Marthi
This is the vote for release 0.11.0 of Apache Mahout. The vote will be going for at least 72 hours and will be closed on Wednesday, August 6th, 2015. Please download, test and vote with [ ] +1, accept RC as the official 0.11.0 release of Apache Mahout [ ] +0, I don't care either way, [ ] -1, do

Re: [VOTE] Apache Mahout 0.10.2 Release Candidate

2015-08-02 Thread Suneel Marthi
the spark-shell in both .zip and .tar.gz binaries. +1 (binding) On 08/01/2015 12:44 AM, Suneel Marthi wrote: Verified {src} * {bin, tar} and all tests pass. +1 (binding) On Fri, Jul 31, 2015 at 11:56 PM, Suneel Marthi smar...@apache.org wrote

[VOTE] Apache Mahout 0.11.0 Release candidate

2015-08-02 Thread Suneel Marthi
This is the vote for release 0.11.0 of Apache Mahout. The vote will be going for at least 72 hours and will be closed on Wednesday, August 5th, 2015. Please download, test and vote with [ ] +1, accept RC as the official 0.11.0 release of Apache Mahout [ ] +0, I don't care either way, [ ] -1, do

Re: [VOTE] Apache Mahout 0.10.2 Release Candidate

2015-07-31 Thread Suneel Marthi
Verified {src} * {bin, tar} and all tests pass. +1 (binding) On Fri, Jul 31, 2015 at 11:56 PM, Suneel Marthi smar...@apache.org wrote: This is a call for Votes for Mahout 0.10.2 Release candidate available at https://repository.apache.org/content/repositories/orgapachemahout-1011 Need

[VOTE] Apache Mahout 0.10.2 Release Candidate

2015-07-31 Thread Suneel Marthi
This is a call for Votes for Mahout 0.10.2 Release candidate available at https://repository.apache.org/content/repositories/orgapachemahout-1011 Need at least 3 PMC +1 votes for the RC to pass. Voting runs until Sunday Aug 2, 2015. Please verify the following: 1. Sigs and Hashes of Release

Re: deprecation of lucene2seq

2015-07-03 Thread Suneel Marthi
Please also note that MultilayerPerceptron and ConcatenateVectorsJob that were marked as deprecated in 0.10.0 would be purged in the upcoming 0.10.2 release planned for July 10. On Thu, Jul 2, 2015 at 4:13 PM, Andrew Palumbo ap@outlook.com wrote: Please note that mahout lucene2seq and all

Re: FP-Growth deprecated

2015-06-24 Thread Suneel Marthi
Fp growth has been deprecated-removed-deprecated since 0.8. It will be removed completely in the subsequent release as it's not been maintained for the past 5 releases. Yes u would have to use 0.8 if u r still looking to use it but it's not supported anymore as the project has moved away

Re: Streaming K-means

2015-06-17 Thread Suneel Marthi
Dmitriy is correct in that the Streaming KMeans in MLlib is a wrong name for something that was meant to convey Spark Streaming + KMeans. The Mahout Streaming KMeans is an implementation of the Meyerson paper that's been referred to in Dmitriy's email. I have had folks wrongly misconstrue

Re: Cannot Compile Mahout Math Module

2015-06-17 Thread Suneel Marthi
U should be using Java 7 Sent from my iPhone On Jun 17, 2015, at 10:47 AM, Prasad Priyadarshana Fernando bpp...@gmail.com wrote: Hi, Mahout Math module has compilation issues. Does anyone know the root cause? Thanks Information:Using javac 1.8.0_45 to compile java sources

Re: Updated AMI for EMR

2015-06-01 Thread Suneel Marthi
Highly likely that there will be another 0.10.x out by July, will they be pulling off the latest ? On Mon, Jun 1, 2015 at 2:18 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: AWS will be releasing a new AMI in July that will include our 0.10.1 release.

Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-31 Thread Suneel Marthi
. On 05/31/2015 12:09 PM, Pat Ferrel wrote: +1 (binding) Verified on Spark 1.3 pseudo-clustered HDFS 2.4 There are some cleanup of example data issues that can wait for next release. On May 30, 2015, at 8:16 PM, Suneel Marthi smar...@apache.org wrote: Verified

[ANNOUNCE] Apache Mahout 0.10.1 Released

2015-05-31 Thread Suneel Marthi
The Apache Mahout PMC is pleased to announce the release of Mahout 0.10.1. Mahout's goal is to create an environment for quickly creating machine learning applications that scale and run on the highest performance parallel computation engines available. Mahout comprises an interactive environment

[VOTE] Mahout 0.10.1 Release Candidate

2015-05-30 Thread Suneel Marthi
This is a call for VOTE to pass Mahout 0.10.1 release candidate that's available at https://repository.apache.org/content/repositories/orgapachemahout-1008/org/apache/mahout/mahout-distribution/0.10.1/ Need at least 3 PMC +1 (binding) votes to cut the release Below are the tasks breakdown for

Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-30 Thread Suneel Marthi
Andrew Palumbo / Dmitriy: Please also verify the various scenarios as described in M-1693 On Sat, May 30, 2015 at 10:32 PM, Suneel Marthi smar...@apache.org wrote: Here's the new 0.10.1 Release Candidate https://repository.apache.org/content/repositories/orgapachemahout-1009/org/apache

Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-30 Thread Suneel Marthi
Verified local build and tests for {source} * {zip, tar}. No issues found. +1 (binding) On Sat, May 30, 2015 at 11:14 PM, Suneel Marthi smar...@apache.org wrote: Andrew Palumbo / Dmitriy: Please also verify the various scenarios as described in M-1693 On Sat, May 30, 2015 at 10:32 PM

Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-30 Thread Suneel Marthi
Please hold ur votes, will be refreshing staging with another build in the next hour On Sat, May 30, 2015 at 8:31 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Likewise source zip and tarballs build and pass tests. On Sat, May 30, 2015 at 3:23 PM, Suneel Marthi smar...@apache.org

Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-30 Thread Suneel Marthi
- {source} * {zip, tar} The LICENSE and NOTICE files have not been updated this time and will be addressed in future releases. On Sat, May 30, 2015 at 8:32 PM, Suneel Marthi suneel.mar...@gmail.com wrote: Please hold ur votes, will be refreshing staging with another build in the next hour

Re: seq2sparse dropping tokens

2015-05-29 Thread Suneel Marthi
Allen, could u please file a JIRA for this? On Fri, May 29, 2015 at 8:58 AM, Allen McIntosh amcint...@appcomsci.com wrote: This shows up with Mahout 0.10.0 (the distribution archive) and Hadoop 2.2.0 When I run seq2sparse on a document containing the following tokens: cash cash equival

Re: Row Similarity

2015-05-14 Thread Suneel Marthi
: Thanks, guys. Can you recommend any resources that show an example of these steps? A google search returns very little information. Now I know what to do, but I can't find anything that tells me how to do it. On Wed, May 13, 2015 at 11:56 PM, Suneel Marthi smar...@apache.org wrote: Hi

Re: Row Similarity

2015-05-13 Thread Suneel Marthi
Hi Jonathan, Here's what u gotta do to run RowSimilarity on ur CSV formatted data. You would have to use the MapReduce version since the Spark version only supports LLR. 1. Convert CSV to Vectors - use CSVIterator and store the vectors as SequenceFiles 2. Run RowIDJob on the SequenceFile

Re: Replacement for DefaultAnalyzer

2015-05-09 Thread Suneel Marthi
Not sure how this was used in 0.7 (it's 3 yrs legacy). But I am guessing this would have been required for Lucene 3x back then and must have been dropped for the Lucene 4x upgrade for 0.8 (circa late 2012). On Fri, May 8, 2015 at 8:03 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote:

Re: Replacement for DefaultAnalyzer

2015-05-09 Thread Suneel Marthi
as opposed to deprecated. Thanks again for any help. Lewis On Saturday, May 9, 2015, Suneel Marthi smar...@apache.org wrote: Not sure how this was used in 0.7 (it's 3 yrs legacy). But I am guessing this would have been required for Lucene 3x back then and must have been dropped for the Lucene

Re: Spectral Clustering

2015-05-07 Thread Suneel Marthi
@ShannonQuinn ?? On Thu, May 7, 2015 at 1:45 PM, sugam bahl sugamb...@yahoo.co.in wrote: Hi Team, I am new to Mahout and working on a project where I need to cluster json documents. I went through the documentation but didn't get enough insights about this. Could you please help me on how

Re: Spectral Clustering

2015-05-07 Thread Suneel Marthi
Shannon would be the right guy to answer this. On Thu, May 7, 2015 at 1:52 PM, sugam bahl sugamb...@yahoo.co.in wrote: What do we mean by ShannonQuinn?? Thanks, Sugam On Thursday, 7 May 2015 10:49 AM, Suneel Marthi smar...@apache.org wrote: @ShannonQuinn ?? On Thu, May 7, 2015 at 1

Re: SparseVectorsFromSequenceFiles tfidf fail

2015-04-21 Thread Suneel Marthi
What's the Mahout Version# u r running with? On Tue, Apr 21, 2015 at 6:37 AM, mw m...@plista.com wrote: Hello, I am trying to get tfidf vectors from a corpus of 100k documents. I noticed that tfidf sequence file is empty, while the tf vectors are not. Here is the log from

Apache Mahout 0.10.0 Released

2015-04-12 Thread Suneel Marthi
The Apache Mahout PMC is pleased to announce the release of Mahout 0.10.0. Mahout's goal is to create an environment for quickly creating machine learning applications that scale and run on the highest performance parallel computation engines available. Mahout comprises an interactive environment

Re: [VOTE] Apache Mahout 0.10.0 Release

2015-04-11 Thread Suneel Marthi
with this release. +1 (binding) On 04/11/2015 11:45 AM, Suneel Marthi wrote: After checking the {source} * {tar,zip} and running a few tests locally, I am fine with this release. +1 (binding) On Sat, Apr 11, 2015 at 11:43 AM, Andrew Musselman andrew.mussel...@gmail.com wrote

Re: [VOTE] Apache Mahout 0.10.0 Release

2015-04-10 Thread Suneel Marthi
on {binary,source} x {zip,tar} + pom. All were correct. One thing that I worry a little about is that the name of the artifact doesn't include apache. Not sure that is a hard requirement, but it seems a good thing to do. On Fri, Apr 10, 2015 at 8:16 PM, Suneel Marthi suneel.mar

Re: [VOTE] Apache Mahout 0.10.0 Release

2015-04-10 Thread Suneel Marthi
the staged repo instead of my local .m2 cache. This means the Scala classes were resolved correctly from the artifacts. Hope someone can actually run it on a cluster On Apr 9, 2015, at 2:42 PM, Suneel Marthi suneel.mar...@gmail.com wrote: Please find the Mahout 0.10.0 release candidate

Re: Error running HMM model

2015-04-07 Thread Suneel Marthi
From $MAHOUT_HOME try running ./bin/mahout and see if that works. On Wed, Apr 8, 2015 at 1:22 AM, Raghuveer alwaysra...@yahoo.com.invalid wrote: I am learning mahout usage and as suggested here am trying to run my sample but i get the below error, kindly suggest Error: Could not find or

Re: Error running HMM model

2015-04-07 Thread Suneel Marthi
class ..bin.mahout On Wednesday, April 8, 2015 10:55 AM, Suneel Marthi suneel.mar...@gmail.com wrote: From $MAHOUT_HOME try running ./bin/mahout and see if that works. On Wed, Apr 8, 2015 at 1:22 AM, Raghuveer alwaysra...@yahoo.com.invalid wrote: I am learning mahout usage

Re: fast performance way of writing preferences to file?

2015-04-06 Thread Suneel Marthi
FYI, adding to Pat's reply below Slope-One has been long deprecated. On Mon, Apr 6, 2015 at 5:00 PM, Pat Ferrel p...@occamsmachete.com wrote: Sorry, we are trying to get a release out. You can look at a custom similarity measure. Look at where SIMILARITY_COSINE leads you and customize that

Re: How to change /tmp directory for mahout usage of map-reduce?

2015-04-01 Thread Suneel Marthi
If u r running Spectral KMeans via Command Line, u should be able to set the parameter -tempDir to point to a different path On Wed, Apr 1, 2015 at 1:55 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Can you let us know which code/scripts you're using? On Tuesday, March 31, 2015, Vikas

Re: How to change /tmp directory for mahout usage of map-reduce?

2015-04-01 Thread Suneel Marthi
from : Path tmp = new Path(tmp) to Path tmp = new Path(CHOSEN DIRECTORY); Silly mistake. Thanks for the clue :) -Vikas On Wed, Apr 1, 2015 at 1:34 AM, Suneel Marthi suneel.mar...@gmail.com wrote: If u running Spectral KMeans via Command Line, u should

Re: Text clustering with SVD

2015-03-30 Thread Suneel Marthi
Here are the steps if u r using Mahout-mrlegacy in the present Mahout trunk: 1. Generate tfidf vectors from the input corpus using seq2sparse (I am assuming you had done this before and hence avoiding the details) 2. Run SSVD on the generated tfidf vectors from (1) ./bin/mahout ssvd -i
