[jira] [Updated] (MAHOUT-1368) Convert OnlineSummarizer to use the new TDigest

2013-12-01 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Dunning updated MAHOUT-1368: Attachment: MAHOUT-1368.patch Here is a patch with additional skewed test. > Convert OnlineSumma

[jira] [Created] (MAHOUT-1368) Convert OnlineSummarizer to use the new TDigest

2013-12-01 Thread Ted Dunning (JIRA)
Ted Dunning created MAHOUT-1368: --- Summary: Convert OnlineSummarizer to use the new TDigest Key: MAHOUT-1368 URL: https://issues.apache.org/jira/browse/MAHOUT-1368 Project: Mahout Issue Type: Bu

[jira] [Commented] (MAHOUT-1030) Regression: Clustered Points Should be WeightedPropertyVectorWritable not WeightedVectorWritable

2013-12-01 Thread Andrew Musselman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836280#comment-13836280 ] Andrew Musselman commented on MAHOUT-1030: -- Or output square-root of distance-sq

Re: Mahout 0.9 release

2013-12-01 Thread Andrew Musselman
Any tips on submitting to reviewboard for mahout? I tried selecting repo mahout and didn't know which base directory to use, and then used mahout-git and wasn't able to use the patch I made via subversion. On Sun, Dec 1, 2013 at 2:56 PM, Suneel Marthi wrote: > Here's the link to Reviewboard > >

[jira] [Commented] (MAHOUT-1356) Ensure unit tests fail fast when writing outside mvn target directory

2013-12-01 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836270#comment-13836270 ] Suneel Marthi commented on MAHOUT-1356: --- [~isabel] [~dweiss] Applied the patch and

Re: Mahout 0.9 release

2013-12-01 Thread Suneel Marthi
Had to upgrade Mahout's Lucene version to 4.5.1 as part of the fix for M-1345, else the Lucene tests (lucene2seq etc) after applying the patch for M-1345 were failing on Mac OS due to an issue with Lucene <= 4.3.1. Pat, not sure about the impact of this Lucene upgrade on Solr Recommender (if an

[jira] [Closed] (MAHOUT-1154) Implementing Streaming KMeans

2013-12-01 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi closed MAHOUT-1154. - > Implementing Streaming KMeans > - > > Key: MAHOUT-1154

[jira] [Commented] (MAHOUT-1345) Enable randomised testing for all Mahout modules

2013-12-01 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836172#comment-13836172 ] Suneel Marthi commented on MAHOUT-1345: --- The last fix did it. Thanks Dawid. We are

[jira] [Assigned] (MAHOUT-1349) Clusterdumper/loadTermDictionary crashes when highest index in (sparse) dictionary vector is larger than dictionary vector size?

2013-12-01 Thread Andrew Musselman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Musselman reassigned MAHOUT-1349: Assignee: Andrew Musselman > Clusterdumper/loadTermDictionary crashes when highest

[jira] [Commented] (MAHOUT-1345) Enable randomised testing for all Mahout modules

2013-12-01 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836168#comment-13836168 ] Hudson commented on MAHOUT-1345: SUCCESS: Integrated in Mahout-Quality #2347 (See [https

Jenkins build is back to normal : Mahout-Examples-Cluster-Reuters #474

2013-12-01 Thread Apache Jenkins Server
See

[jira] [Commented] (MAHOUT-1345) Enable randomised testing for all Mahout modules

2013-12-01 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836143#comment-13836143 ] Suneel Marthi commented on MAHOUT-1345: --- [~dweiss] Added the annotations per Mahout

Build failed in Jenkins: mahout-nightly #1427

2013-12-01 Thread Apache Jenkins Server
See Changes: [smarthi] MAHOUT-1345: Enable randomised testing for all Mahout modules [smarthi] MAHOUT-1345: Enable randomised testing for all Mahout modules [smarthi] MAHOUT-1345: Enable randomised testing for all Mahout modules [smar

Build failed in Jenkins: mahout-nightly » Mahout Core #1427

2013-12-01 Thread Apache Jenkins Server
See Changes: [smarthi] MAHOUT-1345: Enable randomised testing for all Mahout modules [smarthi] MAHOUT-1345: Enable randomised testing for all Mahout modules -- [...

Re: Mahout 0.9 release

2013-12-01 Thread Suneel Marthi
Here's the link to Reviewboard https://reviews.apache.org On Sunday, December 1, 2013 1:51 PM, Andrew Musselman wrote: No, just reviewboard in general; never put any patches up before. > On Dec 1, 2013, at 8:52 AM, Suneel Marthi wrote: > > For M-1349?? There's no patch for this, no

[jira] [Commented] (MAHOUT-1345) Enable randomised testing for all Mahout modules

2013-12-01 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836132#comment-13836132 ] Suneel Marthi commented on MAHOUT-1345: --- Thanks [~dweiss]. It does seem like a slo

Jenkins build is back to normal : Mahout-Quality #2346

2013-12-01 Thread Apache Jenkins Server
See

[jira] [Commented] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-12-01 Thread Gokhan Capan (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836102#comment-13836102 ] Gokhan Capan commented on MAHOUT-1286: -- Let's "Won't Fix" this issue. I think what

[jira] [Commented] (MAHOUT-1030) Regression: Clustered Points Should be WeightedPropertyVectorWritable not WeightedVectorWritable

2013-12-01 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836099#comment-13836099 ] Pat Ferrel commented on MAHOUT-1030: Thanks, I see now. That looks correct. This is f

[jira] [Commented] (MAHOUT-1345) Enable randomised testing for all Mahout modules

2013-12-01 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836092#comment-13836092 ] Dawid Weiss commented on MAHOUT-1345: - {code} Thread[id=108, name=pool-8-thread-8, st

[jira] [Commented] (MAHOUT-1030) Regression: Clustered Points Should be WeightedPropertyVectorWritable not WeightedVectorWritable

2013-12-01 Thread Andrew Musselman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836091#comment-13836091 ] Andrew Musselman commented on MAHOUT-1030: -- Okay; I did the distance calculation

Re: Mahout 0.9 release

2013-12-01 Thread Andrew Musselman
Thanks; me too > On Dec 1, 2013, at 10:53 AM, Suneel Marthi wrote: > > Sorry I am out on the streets but see M-1265 comments for a link to review > board > > Sent from my iPhone > >> On Dec 1, 2013, at 1:50 PM, Andrew Musselman >> wrote: >> >> No, just reviewboard in general; never put an

Re: Mahout 0.9 release

2013-12-01 Thread Suneel Marthi
Sorry I am out on the streets but see M-1265 comments for a link to review board Sent from my iPhone > On Dec 1, 2013, at 1:50 PM, Andrew Musselman > wrote: > > No, just reviewboard in general; never put any patches up before. > >> On Dec 1, 2013, at 8:52 AM, Suneel Marthi wrote: >> >> Fo

Re: [jira] [Commented] (MAHOUT-1030) Regression: Clustered Points Should be WeightedPropertyVectorWritable not WeightedVectorWritable

2013-12-01 Thread Andrew Musselman
Okay cool; I used distance of each vector to each centroid in the mapper. > On Dec 1, 2013, at 10:41 AM, "Pat Ferrel (JIRA)" wrote: > > >[ > https://issues.apache.org/jira/browse/MAHOUT-1030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836087#com

[jira] [Commented] (MAHOUT-1345) Enable randomised testing for all Mahout modules

2013-12-01 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836089#comment-13836089 ] Suneel Marthi commented on MAHOUT-1345: --- Following test failed in Hudson build {Co

Re: Mahout 0.9 release

2013-12-01 Thread Andrew Musselman
No, just reviewboard in general; never put any patches up before. > On Dec 1, 2013, at 8:52 AM, Suneel Marthi wrote: > > For M-1349?? There's no patch for this, no one's worked on it yet. > > > > On Sunday, December 1, 2013 11:50 AM, Andrew Musselman > wrote: > I will look at M-1349 since

[jira] [Commented] (MAHOUT-1030) Regression: Clustered Points Should be WeightedPropertyVectorWritable not WeightedVectorWritable

2013-12-01 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836087#comment-13836087 ] Pat Ferrel commented on MAHOUT-1030: I hope Jeff can answer about normalized results,

[jira] [Commented] (MAHOUT-1030) Regression: Clustered Points Should be WeightedPropertyVectorWritable not WeightedVectorWritable

2013-12-01 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836085#comment-13836085 ] Pat Ferrel commented on MAHOUT-1030: Don't have time to fully test this right now but

Build failed in Jenkins: Mahout-Quality #2345

2013-12-01 Thread Apache Jenkins Server
See Changes: [smarthi] MAHOUT-1345: Enable randomised testing for all Mahout modules [smarthi] MAHOUT-1345: Enable randomised testing for all Mahout modules [smarthi] MAHOUT-1345: Enable randomised testing for all Mahout modules -

[jira] [Commented] (MAHOUT-1345) Enable randomised testing for all Mahout modules

2013-12-01 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836084#comment-13836084 ] Hudson commented on MAHOUT-1345: FAILURE: Integrated in Mahout-Quality #2345 (See [https

Re: Mahout 0.9 release

2013-12-01 Thread Suneel Marthi
Code for M-1345 committed to trunk. On Sunday, December 1, 2013 11:56 AM, Suneel Marthi wrote: I will be committing the patch for M-1345 in a few minutes, upgrading the Lucene version to 4.5.1 as committing this patch is gonna fail 'lucene2seq' tests on Mac OS for Lucene versions < 4.4.

[jira] [Updated] (MAHOUT-1345) Enable randomised testing for all Mahout modules

2013-12-01 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi updated MAHOUT-1345: -- Resolution: Fixed Assignee: Suneel Marthi Status: Resolved (was: Patch Available

[jira] [Commented] (MAHOUT-1345) Enable randomised testing for all Mahout modules

2013-12-01 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836078#comment-13836078 ] Suneel Marthi commented on MAHOUT-1345: --- Patch committed to trunk. Had to upgrade L

[jira] [Commented] (MAHOUT-1366) Please delete old releases from mirroring system

2013-12-01 Thread Sebb (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836077#comment-13836077 ] Sebb commented on MAHOUT-1366: -- http://www.apache.org/dev/release.html#upload-ci > Please d

[jira] [Commented] (MAHOUT-1356) Ensure unit tests fail fast when writing outside mvn target directory

2013-12-01 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836066#comment-13836066 ] Dawid Weiss commented on MAHOUT-1356: - Suneel you may want to check out how Apache Lu

Re: Mahout 0.9 release

2013-12-01 Thread Suneel Marthi
I will be committing the patch for M-1345 in a few minutes, upgrading the Lucene version to 4.5.1 as committing this patch is gonna fail 'lucene2seq' tests on Mac OS for Lucene versions < 4.4. On Sunday, December 1, 2013 11:52 AM, Suneel Marthi wrote: For M-1349?? There's no patch for

Re: Mahout 0.9 release

2013-12-01 Thread Suneel Marthi
For M-1349?? There's no patch for this, no one's worked on it yet. On Sunday, December 1, 2013 11:50 AM, Andrew Musselman wrote: I will look at M-1349 since I'm in there. Where's the Reviewboard? On Sun, Dec 1, 2013 at 4:57 AM, Suneel Marthi wrote: Open JIRAs for 0.9 release :- > >W

Re: Mahout 0.9 release

2013-12-01 Thread Andrew Musselman
I will look at M-1349 since I'm in there. Where's the Reviewboard? On Sun, Dec 1, 2013 at 4:57 AM, Suneel Marthi wrote: > Open JIRAs for 0.9 release :- > > Wiki - Isabel, Sebastian and other volunteers > - > > M-1245, M-1304, M-1305, M-1307, M

[jira] [Updated] (MAHOUT-1356) Ensure unit tests fail fast when writing outside mvn target directory

2013-12-01 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi updated MAHOUT-1356: -- Affects Version/s: 0.7 0.8 Fix Version/s: 1.0 > Ensure unit tes

[jira] [Commented] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-12-01 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836052#comment-13836052 ] Sebastian Schelter commented on MAHOUT-1286: I wasn't able to load the Netfli

[jira] [Commented] (MAHOUT-1366) Please delete old releases from mirroring system

2013-12-01 Thread Isabel Drost-Fromm (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836041#comment-13836041 ] Isabel Drost-Fromm commented on MAHOUT-1366: [~s...@apache.org] As we have a

[jira] [Commented] (MAHOUT-1345) Enable randomised testing for all Mahout modules

2013-12-01 Thread Frank Scholten (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836035#comment-13836035 ] Frank Scholten commented on MAHOUT-1345: The current patch does not give me any b

[jira] [Commented] (MAHOUT-1345) Enable randomised testing for all Mahout modules

2013-12-01 Thread Frank Scholten (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836034#comment-13836034 ] Frank Scholten commented on MAHOUT-1345: [~smarthi] No but I will create one. >

[jira] [Commented] (MAHOUT-1367) WikipediaXmlSplitter --> Exception in thread "main" java.lang.NullPointerException

2013-12-01 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836028#comment-13836028 ] Hudson commented on MAHOUT-1367: SUCCESS: Integrated in Mahout-Quality #2344 (See [https

[jira] [Commented] (MAHOUT-1345) Enable randomised testing for all Mahout modules

2013-12-01 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836023#comment-13836023 ] Suneel Marthi commented on MAHOUT-1345: --- [~frankscholten] Is there a more recent p

Re: Mahout 0.9 release

2013-12-01 Thread Suneel Marthi
Open JIRAs for 0.9 release :- Wiki - Isabel, Sebastian and other volunteers - M-1245, M-1304, M-1305, M-1307, M-1326 Suneel --- M-1319, M-1328 Pat --- M-1288 Solr Recommender Sebastian, Peng M-1286 Yexi, Sune

[jira] [Updated] (MAHOUT-1242) No key redistribution function for associative maps

2013-12-01 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi updated MAHOUT-1242: -- Assignee: (was: Benson Margulies) > No key redistribution function for associative maps >

[jira] [Updated] (MAHOUT-1366) Please delete old releases from mirroring system

2013-12-01 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi updated MAHOUT-1366: -- Affects Version/s: 0.7 0.8 Fix Version/s: 0.9 Assig

[jira] [Updated] (MAHOUT-1367) WikipediaXmlSplitter --> Exception in thread "main" java.lang.NullPointerException

2013-12-01 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi updated MAHOUT-1367: -- Resolution: Fixed Status: Resolved (was: Patch Available) Marking this as resolved. W

[jira] [Comment Edited] (MAHOUT-1367) WikipediaXmlSplitter --> Exception in thread "main" java.lang.NullPointerException

2013-12-01 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836014#comment-13836014 ] Suneel Marthi edited comment on MAHOUT-1367 at 12/1/13 12:24 PM: --

[jira] [Commented] (MAHOUT-1367) WikipediaXmlSplitter --> Exception in thread "main" java.lang.NullPointerException

2013-12-01 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836010#comment-13836010 ] Suneel Marthi commented on MAHOUT-1367: --- The original issue that's been reported co