Re: Tackling the "legacy dilemma"

2014-04-15 Thread Manoj Awasthi
Ok - that makes sense. Thanks. On Wed, Apr 16, 2014 at 8:29 AM, Suneel Marthi wrote: > The plan is to replace the existing Random Forests impl with a spark based > Streaming Random Forests. > As ssc had already mentioned the plan is not to entertain any new MR impls > but accept bug fixes for e

Re: Tackling the "legacy dilemma"

2014-04-15 Thread Suneel Marthi
The plan is to replace the existing Random Forests impl with a spark based Streaming Random Forests. As ssc had already mentioned the plan is not to entertain any new MR impls but accept bug fixes for existing ones. The consensus is to do away with existing MapReduce RF once the Spark based Strea

Re: Tackling the "legacy dilemma"

2014-04-15 Thread Ted Dunning
Manoj, Sounds like a fair trade there. Hopefully, you would consider upgrading if we get Andy's code ported to the DSL or if we incorporate the h2o random forest implementation. On Tue, Apr 15, 2014 at 7:51 PM, Manoj Awasthi wrote: > > * remove Random Forest as we cannot even answer questio

Re: Tackling the "legacy dilemma"

2014-04-15 Thread Manoj Awasthi
> * remove Random Forest as we cannot even answer questions to the > implementation on the mailinglist > -1 to removing present Random Forests. I think it is being used - we (at Adobe) are playing around with it a bit. If the reason for removal is that there is no active maintainer that can be

Re: Mahout without a CLI?

2014-04-15 Thread Ted Dunning
Well... I think it is an issue that has to do with figuring out how to *avoid* import and export as much as possible. On Tue, Apr 15, 2014 at 6:36 PM, Pat Ferrel wrote: > Which is why it’s an import/export issue. > > On Apr 15, 2014, at 5:48 PM, Ted Dunning wrote: > > On Tue, Apr 15, 2014 at 1

Re: Mahout without a CLI?

2014-04-15 Thread Pat Ferrel
Which is why it’s an import/export issue. On Apr 15, 2014, at 5:48 PM, Ted Dunning wrote: On Tue, Apr 15, 2014 at 10:58 AM, Pat Ferrel wrote: > As to the statement "There is not, nor do i think there will be a way to > run this stuff with CLI” seems unduly misleading. Really, does anyone > s

Re: Mahout without a CLI?

2014-04-15 Thread Ted Dunning
On Tue, Apr 15, 2014 at 10:58 AM, Pat Ferrel wrote: > As to the statement "There is not, nor do i think there will be a way to > run this stuff with CLI” seems unduly misleading. Really, does anyone > second this? > > There will be Scala scripts to drive this stuff and yes even from the CLI. > Do

[jira] [Commented] (MAHOUT-1510) Goodbye MapReduce

2014-04-15 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13970281#comment-13970281 ] Ted Dunning commented on MAHOUT-1510: - [~yxjiang] Hopefully we can be much more sele

[jira] [Commented] (MAHOUT-1265) Add Multilayer Perceptron

2014-04-15 Thread Yexi Jiang (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13970006#comment-13970006 ] Yexi Jiang commented on MAHOUT-1265: Hi, [~barsik], according to [MAHOUT-1510|https:

[jira] [Commented] (MAHOUT-1265) Add Multilayer Perceptron

2014-04-15 Thread Mark Yakushev (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969990#comment-13969990 ] Mark Yakushev commented on MAHOUT-1265: --- Hi Yexi and Ted, Is there anything new on

[jira] [Commented] (MAHOUT-1464) Cooccurrence Analysis on Spark

2014-04-15 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969961#comment-13969961 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- bq. where -cp is what `mahout classp

Re: Mahout without a CLI?

2014-04-15 Thread Pat Ferrel
Quite happy to have you live in the shell and do the arcane math that most end users don’t want to be required to know. That’s why Apache pays you the big bucks ;-) In my experience the customizing pipelines problem is one of import and export. That and having to write Java to do it. Pig+UDFs i

Build failed in Jenkins: Mahout-Examples-Cluster-Reuters-II #815

2014-04-15 Thread Apache Jenkins Server
See Changes: [smarthi] MAHOUT-1504: minor fix to reference relevant section of the Rennie paper [ssc] MAHOUT-1504 fixed minor style issues [ssc] MAHOUT-1445 Create an intro for item based recommender [smarthi] MAHOUT

Re: Mahout without a CLI?

2014-04-15 Thread Dmitriy Lyubimov
Finally, the whole point of an ML environment is to enable pipeline customization. The major criticism of Mahout is mostly that -- "we can't integrate and customize pipelines using Mahout's methods because Mahout throws "us" into a bash environment (only) to do that, and that's silly". So the question is a

Re: Mahout without a CLI?

2014-04-15 Thread Dmitriy Lyubimov
On Tue, Apr 15, 2014 at 10:58 AM, Pat Ferrel wrote: > Sorry you are sick. Thanks for the tip. Spark has a client launcher method > "spark-class …Client launch ..." but I’m not having much success with that. > This will not work because you need Mahout's classpath too. And Spark's. The complexity

Re: Mahout without a CLI?

2014-04-15 Thread Dmitriy Lyubimov
Well, let's put it this way: I am not planning to work on CLIs. I do only what I need, and I don't need it. If you insist on expanding the argument to other audiences, R users seem to be pretty happy with R and RScript. They don't have a CLI (meaning parameters passed through CLI option flags) for an

Mahout without a CLI?

2014-04-15 Thread Pat Ferrel
Sorry you are sick. Thanks for the tip. Spark has a client launcher method "spark-class …Client launch ..." but I’m not having much success with that. As to the statement "There is not, nor do i think there will be a way to run this stuff with CLI” seems unduly misleading. Really, does anyone se

[jira] [Commented] (MAHOUT-1510) Goodbye MapReduce

2014-04-15 Thread Yexi Jiang (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969777#comment-13969777 ] Yexi Jiang commented on MAHOUT-1510: Great, is it necessary to port all of the old al

[jira] [Comment Edited] (MAHOUT-1464) Cooccurrence Analysis on Spark

2014-04-15 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969775#comment-13969775 ] Dmitriy Lyubimov edited comment on MAHOUT-1464 at 4/15/14 5:24 PM:

[jira] [Commented] (MAHOUT-1464) Cooccurrence Analysis on Spark

2014-04-15 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969775#comment-13969775 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- bq. Running from my dev machine in I

[jira] [Commented] (MAHOUT-1464) Cooccurrence Analysis on Spark

2014-04-15 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969766#comment-13969766 ] Pat Ferrel commented on MAHOUT-1464: To sum up, Spark Cooccurrence seems to complete

[jira] [Commented] (MAHOUT-1464) Cooccurrence Analysis on Spark

2014-04-15 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969763#comment-13969763 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- [My] Silence indicates I've been pret

[jira] [Commented] (MAHOUT-1464) Cooccurrence Analysis on Spark

2014-04-15 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969759#comment-13969759 ] Pat Ferrel commented on MAHOUT-1464: Running from my dev machine in IDEA against a re

[jira] [Commented] (MAHOUT-1510) Goodbye MapReduce

2014-04-15 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969755#comment-13969755 ] Dmitriy Lyubimov commented on MAHOUT-1510: -- :) Wow. Never thought i'd live to se

[jira] [Comment Edited] (MAHOUT-1464) Cooccurrence Analysis on Spark

2014-04-15 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969722#comment-13969722 ] Pat Ferrel edited comment on MAHOUT-1464 at 4/15/14 4:51 PM: -

[jira] [Commented] (MAHOUT-1464) Cooccurrence Analysis on Spark

2014-04-15 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969722#comment-13969722 ] Pat Ferrel commented on MAHOUT-1464: Silence indicates: you don't know how to? it can

[jira] [Commented] (MAHOUT-1510) Goodbye MapReduce

2014-04-15 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969616#comment-13969616 ] Sebastian Schelter commented on MAHOUT-1510: Ideally, ports of algorithms tha

[jira] [Commented] (MAHOUT-1510) Goodbye MapReduce

2014-04-15 Thread Yexi Jiang (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969557#comment-13969557 ] Yexi Jiang commented on MAHOUT-1510: What kind of algorithms are acceptable in the fu

[jira] [Commented] (MAHOUT-1504) Enable/fix thetaSummer job in TrainNaiveBayesJob

2014-04-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969499#comment-13969499 ] Hudson commented on MAHOUT-1504: SUCCESS: Integrated in Mahout-Quality #2571 (See [https

[jira] [Commented] (MAHOUT-1355) Frequent Pattern Mining algorithms for Mahout

2014-04-15 Thread Emin Aksehirli (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969361#comment-13969361 ] Emin Aksehirli commented on MAHOUT-1355: Hello, yes, it doesn't make sense to add

Re: Tackling the "legacy dilemma"

2014-04-15 Thread Sebastian Schelter
On 04/15/2014 11:07 AM, Suneel Marthi wrote: On Tue, Apr 15, 2014 at 12:57 AM, Sebastian Schelter wrote: Hi, From reading the thread, I have the impression that we agree on the following actions: * reject any future MR algorithm contributions, prominently state this on the website and in

Re: Tackling the "legacy dilemma"

2014-04-15 Thread Suneel Marthi
On Tue, Apr 15, 2014 at 12:57 AM, Sebastian Schelter wrote: > Hi, > > From reading the thread, I have the impression that we agree on the > following actions: > > > * reject any future MR algorithm contributions, prominently state this > on the website and in talks > * make all existing algorit

[jira] [Commented] (MAHOUT-1504) Enable/fix thetaSummer job in TrainNaiveBayesJob

2014-04-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969332#comment-13969332 ] Hudson commented on MAHOUT-1504: SUCCESS: Integrated in Mahout-Quality #2570 (See [https

[jira] [Commented] (MAHOUT-1445) Create an intro for item based recommender

2014-04-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969331#comment-13969331 ] Hudson commented on MAHOUT-1445: SUCCESS: Integrated in Mahout-Quality #2570 (See [https