Jenkins build became unstable: Mahout-Quality #1219

2011-12-02 Thread Apache Jenkins Server
See

[jira] [Commented] (MAHOUT-897) New implementation for LDA: Collapsed Variational Bayes (0th derivative approximation), with map-side model caching

2011-12-02 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162055#comment-13162055 ] Hudson commented on MAHOUT-897: --- Integrated in Mahout-Quality #1219 (See [https://builds.ap

Build failed in Jenkins: Mahout-Examples #37

2011-12-02 Thread Apache Jenkins Server
See Changes: [jmannix] fixes MAHOUT-897 New Latent Dirichlet Allocation implementation, etc. [srowen] MAHOUT-910 prelude: commit some clear wins in optimizing calls to intersectionSize() [gsingers] MAHOUT-907: move to use bash [gsinge

[jira] [Updated] (MAHOUT-911) Naive Bayes trains models that are too large to apply

2011-12-02 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-911: -- Attachment: example.wiki.categories.txt > Naive Bayes trains models that are too large to apply > -

[jira] [Created] (MAHOUT-911) Naive Bayes trains models that are too large to apply

2011-12-02 Thread tom pierce (Created) (JIRA)
Naive Bayes trains models that are too large to apply - Key: MAHOUT-911 URL: https://issues.apache.org/jira/browse/MAHOUT-911 Project: Mahout Issue Type: Bug Components: Classific

Re: [jira] [Commented] (MAHOUT-880) Add some matrix method(like addition, subtraction, norm ... etc) to DistributedRowMatrix

2011-12-02 Thread Lance Norskog
I was recently looking through code (I think in text vectors) where code merged very sparse term vectors. If there was a collision, it always picked the first one. The assumption was that they never happened, so it did not matter what it did. For symboic vectors, I can see the virtue of randomly pi

Jenkins build is back to normal : Mahout-Quality #1218

2011-12-02 Thread Apache Jenkins Server
See

[jira] [Commented] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-02 Thread Lance Norskog (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13161952#comment-13161952 ] Lance Norskog commented on MAHOUT-910: -- {code} return userIDs1.size() < userIDs2.size

[jira] [Commented] (MAHOUT-399) LDA on Mahout 0.3 does not converge to correct solution for overlapping pyramids toy problem.

2011-12-02 Thread Jake Mannix (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13161884#comment-13161884 ] Jake Mannix commented on MAHOUT-399: Ah, not sure what happened, but the current trunk

Re: [jira] [Commented] (MAHOUT-880) Add some matrix method(like addition, subtraction, norm ... etc) to DistributedRowMatrix

2011-12-02 Thread Dan Brickley
On 2 December 2011 19:31, Raphael Cendrillon wrote: > Is this something people would find useful? > > How would you like to sparsify the matrix? Using a threshold, or something > else like target number of elements per row? I can't yet swear hand-on-heart that I need this (I was thinking thresho

Re: [jira] [Commented] (MAHOUT-880) Add some matrix method(like addition, subtraction, norm ... etc) to DistributedRowMatrix

2011-12-02 Thread Raphael Cendrillon
Hi Jake, If you have a chance could you take a look at the new version of the diff at: reviews.apache.org/r/2955/ Thanks!

Re: [jira] [Commented] (MAHOUT-897) New implementation for LDA: Collapsed Variational Bayes (0th derivative approximation), with map-side model caching

2011-12-02 Thread Ted Dunning
I am going to be unable to complain today. So silent assent applies to me. On Fri, Dec 2, 2011 at 12:53 PM, Jake Mannix (Commented) (JIRA) < j...@apache.org> wrote: > >[ > https://issues.apache.org/jira/browse/MAHOUT-897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&f

[jira] [Commented] (MAHOUT-845) Make cluster top terms code more reusable

2011-12-02 Thread Frank Scholten (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13161856#comment-13161856 ] Frank Scholten commented on MAHOUT-845: --- Cool, looks good. The Google collections st

[jira] [Commented] (MAHOUT-880) Add some matrix method(like addition, subtraction, norm ... etc) to DistributedRowMatrix

2011-12-02 Thread jirapos...@reviews.apache.org (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13161854#comment-13161854 ] jirapos...@reviews.apache.org commented on MAHOUT-880: --

Re: Review Request: Matrix methods for DistributedRowMatrix

2011-12-02 Thread Raphael Cendrillon
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/2955/ --- (Updated 2011-12-02 21:04:46.828990) Review request for mahout, Ted Dunning, Jak

[jira] [Commented] (MAHOUT-897) New implementation for LDA: Collapsed Variational Bayes (0th derivative approximation), with map-side model caching

2011-12-02 Thread Jake Mannix (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13161849#comment-13161849 ] Jake Mannix commented on MAHOUT-897: I'm going to commit the latest patch later today

[jira] [Commented] (MAHOUT-845) Make cluster top terms code more reusable

2011-12-02 Thread Jake Mannix (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13161848#comment-13161848 ] Jake Mannix commented on MAHOUT-845: Ok, I added a couple of methods and options to Ve

[jira] [Commented] (MAHOUT-897) New implementation for LDA: Collapsed Variational Bayes (0th derivative approximation), with map-side model caching

2011-12-02 Thread jirapos...@reviews.apache.org (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13161847#comment-13161847 ] jirapos...@reviews.apache.org commented on MAHOUT-897: --

Re: Review Request: New implementation for LDA: Collapsed Variational Bayes (0th derivative approximation), with map-side model caching

2011-12-02 Thread Jake Mannix
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/2944/ --- (Updated 2011-12-02 20:49:52.055735) Review request for mahout and Ted Dunning.

Re: [jira] [Commented] (MAHOUT-880) Add some matrix method(like addition, subtraction, norm ... etc) to DistributedRowMatrix

2011-12-02 Thread Raphael Cendrillon
Is this something people would find useful? How would you like to sparsify the matrix? Using a threshold, or something else like target number of elements per row? On Dec 2, 2011, at 10:04 AM, Ted Dunning wrote: > No. > > On Fri, Dec 2, 2011 at 4:03 AM, Dan Brickley (Commented) (JIRA) < > j..

Re: [jira] [Commented] (MAHOUT-880) Add some matrix method(like addition, subtraction, norm ... etc) to DistributedRowMatrix

2011-12-02 Thread Ted Dunning
No. On Fri, Dec 2, 2011 at 4:03 AM, Dan Brickley (Commented) (JIRA) < j...@apache.org> wrote: > >[ > https://issues.apache.org/jira/browse/MAHOUT-880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13161562#comment-13161562] > > Dan Brickley commented on

[jira] [Commented] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-02 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13161752#comment-13161752 ] Hudson commented on MAHOUT-910: --- Integrated in Mahout-Quality #1217 (See [https://builds.ap

[jira] [Commented] (MAHOUT-907) Several Watchmaker Examples tests fail when there is a space in the path

2011-12-02 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-907?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13161751#comment-13161751 ] Hudson commented on MAHOUT-907: --- Integrated in Mahout-Quality #1217 (See [https://builds.ap

Build failed in Jenkins: Mahout-Quality #1217

2011-12-02 Thread Apache Jenkins Server
See Changes: [srowen] MAHOUT-910 prelude: commit some clear wins in optimizing calls to intersectionSize() [gsingers] MAHOUT-907: move to use bash [gsingers] MAHOUT-907: switch to use toURI() -

[jira] [Created] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-02 Thread Sean Owen (Created) (JIRA)
Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations - Key: MAHOUT-910 URL: https://issues.apache.org/jira/browse/MAHOUT-910 Project:

[jira] [Created] (MAHOUT-909) Make it so you can pass in all the answers to the questions asked in the example shell scripts

2011-12-02 Thread Grant Ingersoll (Created) (JIRA)
Make it so you can pass in all the answers to the questions asked in the example shell scripts -- Key: MAHOUT-909 URL: https://issues.apache.org/jira/browse/MAHOUT-909

[jira] [Resolved] (MAHOUT-908) Example shell scripts don't run properly on Ubuntu

2011-12-02 Thread Grant Ingersoll (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll resolved MAHOUT-908. Resolution: Fixed Assignee: Grant Ingersoll switched classify-20newsgroups.sh to use

[jira] [Commented] (MAHOUT-908) Example shell scripts don't run properly on Ubuntu

2011-12-02 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13161639#comment-13161639 ] Sean Owen commented on MAHOUT-908: -- Yeah safe to assume bash, I think. >

[jira] [Created] (MAHOUT-908) Example shell scripts don't run properly on Ubuntu

2011-12-02 Thread Grant Ingersoll (Created) (JIRA)
Example shell scripts don't run properly on Ubuntu -- Key: MAHOUT-908 URL: https://issues.apache.org/jira/browse/MAHOUT-908 Project: Mahout Issue Type: Bug Reporter: Grant Ingersoll

[jira] [Resolved] (MAHOUT-907) Several Watchmaker Examples tests fail when there is a space in the path

2011-12-02 Thread Grant Ingersoll (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll resolved MAHOUT-907. Resolution: Fixed fixed. > Several Watchmaker Examples tests fail when the

[jira] [Updated] (MAHOUT-907) Several Watchmaker Examples tests fail when there is a space in the path

2011-12-02 Thread Grant Ingersoll (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-907: --- Attachment: MAHOUT-907.patch switches to use toURI() for the wdbc path > Sev

[jira] [Created] (MAHOUT-907) Several Watchmaker Examples tests fail when there is a space in the path

2011-12-02 Thread Grant Ingersoll (Created) (JIRA)
Several Watchmaker Examples tests fail when there is a space in the path Key: MAHOUT-907 URL: https://issues.apache.org/jira/browse/MAHOUT-907 Project: Mahout Issue Typ

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-02 Thread Manuel Blechschmidt (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13161594#comment-13161594 ] Manuel Blechschmidt commented on MAHOUT-906: Actually it would be a good idea

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-02 Thread Anatoliy Kats (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13161570#comment-13161570 ] Anatoliy Kats commented on MAHOUT-906: -- OK. I'll wait for a day to see if anyone has

[jira] [Commented] (MAHOUT-880) Add some matrix method(like addition, subtraction, norm ... etc) to DistributedRowMatrix

2011-12-02 Thread Dan Brickley (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13161562#comment-13161562 ] Dan Brickley commented on MAHOUT-880: - Does Mahout yet have a method to take a large f

[jira] [Updated] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-02 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-906: - Fix Version/s: (was: 0.6) Assignee: (was: Sean Owen) Yes, I think a clean refactoring of

[jira] [Created] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-02 Thread Anatoliy Kats (Created) (JIRA)
Allow collaborative filtering evaluators to use custom logic in splitting data set -- Key: MAHOUT-906 URL: https://issues.apache.org/jira/browse/MAHOUT-906 Project: Mahou