[jira] [Updated] (MAHOUT-1200) Mahout tests depend on writing to /tmp/hadoop-$user

2013-05-03 Thread Isabel Drost-Fromm (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost-Fromm updated MAHOUT-1200: --- Attachment: MAHOUT-1200.patch Added some Maven configuration to store test local dat

[jira] [Resolved] (MAHOUT-1180) Multinomial throws ConcurrentModificationException when iterating and setting probabilities

2013-05-03 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Filimon resolved MAHOUT-1180. - Resolution: Fixed Fix Version/s: 0.8 Committed revision 1478723. > Multi

[jira] [Resolved] (MAHOUT-1189) CosineDistanceMeasure doesn't return 0 for two 0 vectors

2013-05-03 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Filimon resolved MAHOUT-1189. - Resolution: Fixed Fix Version/s: 0.8 Committed revision 1478733. > Cosin

[jira] [Commented] (MAHOUT-1117) Vectors are not hashable

2013-05-03 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13648348#comment-13648348 ] Dan Filimon commented on MAHOUT-1117: - About this, how would you go about creating a

[jira] [Created] (MAHOUT-1202) Speed up Vector operations

2013-05-03 Thread Dan Filimon (JIRA)
Dan Filimon created MAHOUT-1202: --- Summary: Speed up Vector operations Key: MAHOUT-1202 URL: https://issues.apache.org/jira/browse/MAHOUT-1202 Project: Mahout Issue Type: Improvement C

[jira] [Resolved] (MAHOUT-1190) SequentialAccessSparseVector function assignment is very slow and other iterator woes

2013-05-03 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Filimon resolved MAHOUT-1190. - Resolution: Duplicate Moved into this other issue as it grew in scope: https://issues.apache.org

Re: [jira] [Commented] (MAHOUT-1177) GSOC 2013: Reform and simplify the clustering APIs

2013-05-03 Thread 姜页希
Is there other comments about this issue? 2013/5/2 Shannon Quinn > This sounds excellent. I'd be happy to assist in unifying the interfaces > of the spectral methods in particular. > > > On 5/2/13 3:54 PM, Yu Lee (JIRA) wrote: > >> [ https://issues.apache.org/**jira/browse/MAHOUT-1177?pag

Re: Committing to mahout-git?

2013-05-03 Thread Dan Filimon
Thanks, I can't directly use the github mirror but applying the formatted patch worked fine! On Thu, May 2, 2013 at 8:57 PM, Robin Anil wrote: > diffs from git can be applied on svn using > > patch -P1 < patch.file > > I tried this with your patches. I dont know much about the apache mahout >

[jira] [Resolved] (MAHOUT-1135) Unify decorated vectors in DecoratedVector

2013-05-03 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Filimon resolved MAHOUT-1135. - Resolution: Won't Fix Experimented with this a while back, but for the most important use case

[jira] [Updated] (MAHOUT-1155) Make MatrixSlice a Vector and fix Centroid cloning

2013-05-03 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Filimon updated MAHOUT-1155: Description: There are two changes in this issue: - making MatrixSlice a Vector by extending Deleg

[jira] [Updated] (MAHOUT-1155) Make MatrixSlice a Vector (and fix Centroid cloning; MAHOUT-1202)

2013-05-03 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Filimon updated MAHOUT-1155: Summary: Make MatrixSlice a Vector (and fix Centroid cloning; MAHOUT-1202) (was: Make MatrixSlice

[jira] [Resolved] (MAHOUT-1155) Make MatrixSlice a Vector (and fix Centroid cloning; MAHOUT-1202)

2013-05-03 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Filimon resolved MAHOUT-1155. - Resolution: Fixed Fix Version/s: 0.8 Committed revision 1478836. > Make

Re: Really long running tests

2013-05-03 Thread Robin Anil
QRDecompositionTest: I saw this from time to time. Sometimes it runs in 0.2 seconds sometimes 100s. Seed related? On Fri, May 3, 2013 at 9:59 AM, Dan Filimon wrote: > QRDecompositionTest.fasterThanBefore() and most of the tests in > fpm.pfpgrowth take a really long time to run (FPGrowthSyntheti

[jira] [Created] (MAHOUT-1203) Problem in PhD Topic

2013-05-03 Thread saeed iqbal (JIRA)
saeed iqbal created MAHOUT-1203: Summary: Problem in PhD Topic Key: MAHOUT-1203 URL: https://issues.apache.org/jira/browse/MAHOUT-1203 Project: Mahout Issue Type: Wish Components:

[jira] [Updated] (MAHOUT-1203) Problem in PhD Topic

2013-05-03 Thread saeed iqbal (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] saeed iqbal updated MAHOUT-1203: - Issue Type: Bug (was: Wish) > Problem in PhD Topic > > >

[jira] [Resolved] (MAHOUT-1203) Problem in PhD Topic

2013-05-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-1203. --- Resolution: Invalid This not an issue report, let alone a bug. > Problem in PhD Top

[jira] [Commented] (MAHOUT-1203) Problem in PhD Topic

2013-05-03 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13648531#comment-13648531 ] Dan Filimon commented on MAHOUT-1203: - Saeed, It sounds like you're having trouble d

[jira] [Closed] (MAHOUT-1203) Problem in PhD Topic

2013-05-03 Thread saeed iqbal (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] saeed iqbal closed MAHOUT-1203. Thanks > Problem in PhD Topic > > > Key: MAHOUT-

[jira] [Updated] (MAHOUT-1202) Speed up Vector operations

2013-05-03 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Filimon updated MAHOUT-1202: Description: Vector assign() and aggregate() can be significantly improved in some conditions tak

[jira] [Commented] (MAHOUT-1203) Problem in PhD Topic

2013-05-03 Thread saeed iqbal (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13648536#comment-13648536 ] saeed iqbal commented on MAHOUT-1203: -- thanks, Owen, and Filimon

Re: Really long running tests

2013-05-03 Thread Ted Dunning
Shouldn't depend on seed. Very odd. Sent from my iPhone On May 3, 2013, at 8:24, Robin Anil wrote: > QRDecompositionTest: I saw this from time to time. Sometimes it runs in 0.2 > seconds sometimes 100s. Seed related? > > > > On Fri, May 3, 2013 at 9:59 AM, Dan Filimon > wrote: > >> QR

Re: Really long running tests

2013-05-03 Thread Dan Filimon
I think I found out why, for the QR test. First off, it's stable and not seed dependent (on my machine anyway, haven't looked too closely). Trunk takes about 2 minutes and my new vector branch takes more than 3. >From what I've seen the problem is twofold: - norm1 is still slower in the new code

Re: [jira] [Commented] (MAHOUT-1177) GSOC 2013: Reform and simplify the clustering APIs

2013-05-03 Thread yu lee
Co-ask. Shannon: we'd be happy if you are going to help us! Ted: what do you think about our (Yexi's and my) ideas? Shall we move on to the proposal? On Fri, May 3, 2013 at 8:10 AM, 姜页希 wrote: > Is there other comments about this issue? > > > > 2013/5/2 Shannon Quinn > > > This sounds excell

Re: Really long running tests

2013-05-03 Thread Dan Filimon
After the updates I mentioned in the last e-mail this happens: [trunk] [~2m 17s] new 10 7704.7 1.52101e-14 0.0 1.52101e-14 new 30 9395.2 1.52101e-14 0.0 1.52101e-14 new 80 9400.5 1.52101e-14 0.0 1.52101e-14 new 180 12842.9 1.52101e-14 0.0 1.52101e-14 new 380 5654.4 1.52101e-14 0.00

Design Document Wiki : First Steps

2013-05-03 Thread satyam sinha
Browsing through the Mailing-list archives for the past few months, I sensed the need to document and provide a wiki for the design document of Mahout project. There have been some very productive discussions in recent times and I heve started working on the initial draft of the wiki. It is open f

[jira] [Resolved] (MAHOUT-1202) Speed up Vector operations

2013-05-03 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Filimon resolved MAHOUT-1202. - Resolution: Fixed Committed revision 1478958. > Speed up Vector operations > --

[jira] [Closed] (MAHOUT-1202) Speed up Vector operations

2013-05-03 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Filimon closed MAHOUT-1202. --- > Speed up Vector operations > -- > > Key: MAHOUT-1202 >

[jira] [Closed] (MAHOUT-1155) Make MatrixSlice a Vector (and fix Centroid cloning; MAHOUT-1202)

2013-05-03 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Filimon closed MAHOUT-1155. --- > Make MatrixSlice a Vector (and fix Centroid cloning; MAHOUT-1202) > --

[jira] [Closed] (MAHOUT-1135) Unify decorated vectors in DecoratedVector

2013-05-03 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Filimon closed MAHOUT-1135. --- > Unify decorated vectors in DecoratedVector > - > >