[jira] [Commented] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13123393#comment-13123393 ] Paritosh Ranjan commented on MAHOUT-825: I agree that the features for all cluster

Re: MAHOUT-232 status?

2011-10-07 Thread Robin Anil
That patch still needs some cleanup work and refactoring. The original author hadnt made much progress on that front. The functionality is there. So in case you want to use it, you can patch it in and get it running. On Sat, Oct 8, 2011 at 10:53 AM, Josh Patterson wrote: > Whats the status on S

MAHOUT-232 status?

2011-10-07 Thread Josh Patterson
Whats the status on SVM support in Mahout? I was reading through this JIRA and it looked like this implementation stalled out. Is anyone still working on this? JP -- Twitter: @jpatanooga Solution Architect @ Cloudera hadoop: http://www.cloudera.com

Any plan to support markov chain based recommender ?

2011-10-07 Thread Colin
I haven't find any MC-based open source recommender. Does Mahout have any plan to provide some? Thank you, -- Colin Wang Skype : colin.bin.wang

[jira] [Commented] (MAHOUT-831) @Experimental annotation to indicate which implementations are not intended for production use

2011-10-07 Thread Dan Brickley (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13123352#comment-13123352 ] Dan Brickley commented on MAHOUT-831: - To Dmitriy's point re 'explicitly certified as

[jira] [Issue Comment Edited] (MAHOUT-831) @Experimental annotation to indicate which implementations are not intended for production use

2011-10-07 Thread Dan Brickley (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13123350#comment-13123350 ] Dan Brickley edited comment on MAHOUT-831 at 10/8/11 2:25 AM: --

[jira] [Commented] (MAHOUT-831) @Experimental annotation to indicate which implementations are not intended for production use

2011-10-07 Thread Dan Brickley (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13123350#comment-13123350 ] Dan Brickley commented on MAHOUT-831: - Nice to have in the Java for sure, but where e

Re: [jira] [Commented] (MAHOUT-831) @Experimental annotation to indicate which implementations are not intended for production use

2011-10-07 Thread Dmitriy Lyubimov
> Everything else is likely stable but without guarantees. > or the lack of due diligence in rating :) On Fri, Oct 7, 2011 at 6:21 PM, Ted Dunning wrote: > I think that both are good.  Stable indicates a willingness to maintain > compatibility. > > Experimental is a promise that the code will l

Re: svn commit: r1179753 - /mahout/trunk/bin/mahout

2011-10-07 Thread Ted Dunning
Use gitx (for mac, anyway) On Fri, Oct 7, 2011 at 6:01 PM, Dmitriy Lyubimov wrote: > It may get a little complicated like that at times, but commits are a > tree, so eventually it becomes easier to navigate (use something like > git log --time-order --graph to see what is happening, it is not ve

Re: [jira] [Commented] (MAHOUT-831) @Experimental annotation to indicate which implementations are not intended for production use

2011-10-07 Thread Ted Dunning
I think that both are good. Stable indicates a willingness to maintain compatibility. Experimental is a promise that the code will likely be disrupted. Everything else is likely stable but without guarantees. On Fri, Oct 7, 2011 at 5:51 PM, Dmitriy Lyubimov (Commented) (JIRA) < j...@apache.org>

Re: svn commit: r1179753 - /mahout/trunk/bin/mahout

2011-10-07 Thread Dmitriy Lyubimov
Did you do git commit locally first? One caveat is that if you don't install authors files properly, the apache svn is going to fix it in the commit metadata. The effect of that is that your local commit would have a different hash and dcommit may not necessarily immediately realize that. When thi

[jira] [Commented] (MAHOUT-831) @Experimental annotation to indicate which implementations are not intended for production use

2011-10-07 Thread Dmitriy Lyubimov (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13123328#comment-13123328 ] Dmitriy Lyubimov commented on MAHOUT-831: - My reasoning is, I am more interested i

[jira] [Commented] (MAHOUT-831) @Experimental annotation to indicate which implementations are not intended for production use

2011-10-07 Thread Dmitriy Lyubimov (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13123326#comment-13123326 ] Dmitriy Lyubimov commented on MAHOUT-831: - What about @Stable? Maybe better yet,

[jira] [Updated] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Jeff Eastman (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-825: Attachment: MAHOUT-825.patch Modified version of canopy-radius-based-outlier-elimination. Renames t

Re: Board report draft for October

2011-10-07 Thread Frank Scholten
+1 I agree with the comments below. On Oct 7, 2011, at 19:40, Jeff Eastman wrote: > I'm also +1 on the @experimental annotation idea. Different parts of Mahout > are at different levels of maturity and the annotation makes it explicit > which parts are still in motion. Trimming out things that

[jira] [Commented] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13123048#comment-13123048 ] Paritosh Ranjan commented on MAHOUT-825: Nothing wrong with the patch. Tested all

[jira] [Commented] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13123024#comment-13123024 ] Paritosh Ranjan commented on MAHOUT-825: I have found a scope of improvement in th

[jira] [Commented] (MAHOUT-798) Add Examples for the ASF Mail Archive

2011-10-07 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13123000#comment-13123000 ] Hudson commented on MAHOUT-798: --- Integrated in Mahout-Quality #1085 (See [https://builds.ap

RE: Board report draft for October

2011-10-07 Thread Jeff Eastman
I'm also +1 on the @experimental annotation idea. Different parts of Mahout are at different levels of maturity and the annotation makes it explicit which parts are still in motion. Trimming out things that are unused/unsupported is also a good idea. I do think anything we can do to improve API

[jira] [Updated] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Paritosh Ranjan (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paritosh Ranjan updated MAHOUT-825: --- Attachment: canopy-radius-based-outlier-elimination Here goes the compromise patch named cano

[jira] [Commented] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Jeff Eastman (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13122970#comment-13122970 ] Jeff Eastman commented on MAHOUT-825: - Ok, as a compromise, if you remove the generati

[jira] [Commented] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13122962#comment-13122962 ] Paritosh Ranjan commented on MAHOUT-825: hmmm, applying it in Canopy generation ph

[jira] [Commented] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Jeff Eastman (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13122959#comment-13122959 ] Jeff Eastman commented on MAHOUT-825: - -2 Sorry Paritosh, but computeParameters() rese

[jira] [Updated] (MAHOUT-831) @Experimental annotation to indicate which implementations are not intended for production use

2011-10-07 Thread Sebastian Schelter (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-831: -- Attachment: MAHOUT-831.patch A first attempt to create an annotation that aims to indic

[jira] [Updated] (MAHOUT-831) @Experimental annotation to indicate which implementations are not intended for production use

2011-10-07 Thread Sebastian Schelter (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-831: -- Summary: @Experimental annotation to indicate which implementations are not intended fo

[jira] [Created] (MAHOUT-831) @Experimental annotation to indicate which implementation are not intended for production use

2011-10-07 Thread Sebastian Schelter (Created) (JIRA)
@Experimental annotation to indicate which implementation are not intended for production use - Key: MAHOUT-831 URL: https://issues.apache.org/jira/browse/MAHOUT-831

Re: Board report draft for October

2011-10-07 Thread Grant Ingersoll
On Oct 7, 2011, at 12:00 PM, Dmitriy Lyubimov wrote: > I support (and supported before) the annotations as maturity tags. In Lucene, we use @lucene.experimental We also should probably looking at trimming back things or moving it to a sandbox. I think Watchmaker is a good first candidate, si

[jira] [Commented] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13122944#comment-13122944 ] Paritosh Ranjan commented on MAHOUT-825: I experimented on Canopy generation phase

[jira] [Issue Comment Edited] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Jeff Eastman (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13122930#comment-13122930 ] Jeff Eastman edited comment on MAHOUT-825 at 10/7/11 4:28 PM: --

[jira] [Commented] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Jeff Eastman (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13122930#comment-13122930 ] Jeff Eastman commented on MAHOUT-825: - -2 on incorporating clusterStrictness in the Ca

Re: Board report draft for October

2011-10-07 Thread Ted Dunning
On Fri, Oct 7, 2011 at 9:00 AM, Dmitriy Lyubimov wrote: > ... > AbstractJob as it currently exists is more a Tool than an individual step > in > a pipeline, perhaps historically driven by a fact that most Mahout > pipelines > are one step generic job agnstic of MR specific parameters passed to th

Re: Board report draft for October

2011-10-07 Thread Dmitriy Lyubimov
Much as i will be happy to see Mahout in cdh u2, I also support Grant in a sense that projects are community driven per asf philosophy. If corporations want to push something thru, the only way they should be able doing that is thru their empoyees' individual clout in the community per asf voting.

Re: Board report draft for October

2011-10-07 Thread Dmitriy Lyubimov
I support (and supported before) the annotations as maturity tags. Also command line API seems to be good. Maybe some solver apis could be standardized in some ways. AbstractJob as it currently exists is more a Tool than an individual step in a pipeline, perhaps historically driven by a fact that

[jira] [Resolved] (MAHOUT-798) Add Examples for the ASF Mail Archive

2011-10-07 Thread Grant Ingersoll (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll resolved MAHOUT-798. Resolution: Fixed Fix Version/s: 0.6 Added in examples of clustering, classification

Re: Board report draft for October

2011-10-07 Thread Grant Ingersoll
On Oct 6, 2011, at 7:53 PM, Lance Norskog wrote: > On that topic: if/when Cloudera adds a Mahout version, it will be much > harder to change some basic things. Why? They know how to contribute patches and how the ASF works. We move forward as a community, not based on any one company (besides

[jira] [Issue Comment Edited] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Paritosh Ranjan (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13122674#comment-13122674 ] Paritosh Ranjan edited comment on MAHOUT-825 at 10/7/11 10:03 AM: --

[jira] [Updated] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Paritosh Ranjan (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paritosh Ranjan updated MAHOUT-825: --- Attachment: canopy-outlier-elimination I have added the patch canopy-outlier-elimination. Thi

[jira] [Commented] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13122590#comment-13122590 ] Paritosh Ranjan commented on MAHOUT-825: Already did it. Thanks :).

[jira] [Commented] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13122591#comment-13122591 ] Paritosh Ranjan commented on MAHOUT-825: Already did it. Thanks :).

[jira] [Updated] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Paritosh Ranjan (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paritosh Ranjan updated MAHOUT-825: --- Comment: was deleted (was: Already did it. Thanks :).) > Canopies grouping records outsi

[jira] [Commented] (MAHOUT-825) Canopies grouping records outside t1

2011-10-07 Thread Ted Dunning (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13122589#comment-13122589 ] Ted Dunning commented on MAHOUT-825: Don't make the parameter an integer. Make it a d

Re: Board report draft for October

2011-10-07 Thread Sebastian Schelter
On 07.10.2011 01:59, Ted Dunning wrote: > On Thu, Oct 6, 2011 at 4:53 PM, Lance Norskog wrote: > >> if/when Cloudera adds a Mahout version, It's a when not an if :) > I do think that we need to make an effort here. I totally agree. Some things we could start with: I think Mahout is in a spe