[jira] [Commented] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-08-12 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13737606#comment-13737606 ] Ted Dunning commented on MAHOUT-1286: - Recommendation as search is just one model. I

Re: apache-math dependency

2013-08-12 Thread Ted Dunning
Your advice isn't so bad even so. No need to reimplement interesting capabilities. On Mon, Aug 12, 2013 at 4:56 PM, Peng Cheng wrote: > Apologies, I mistook apache-math for mahout-math and didn't know what I was > talking about :) > > > On 13-08-12 07:08 PM, Ted Dunning wrote: > >> Yes. Apache

Re: apache-math dependency

2013-08-12 Thread Peng Cheng
Apologies, I mistook apache-math for mahout-math and didn't know what I was talking about :) On 13-08-12 07:08 PM, Ted Dunning wrote: Yes. Apache Math linear algebra is very difficult for us to use because their matrices are non-extensible. But there is actually quite a lot of code to do with r

[jira] [Commented] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-08-12 Thread Peng Cheng (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13737563#comment-13737563 ] Peng Cheng commented on MAHOUT-1286: Also, please note that the first patch is st

[jira] [Commented] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-08-12 Thread Peng Cheng (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13737553#comment-13737553 ] Peng Cheng commented on MAHOUT-1286: Hi Gentlemen, Thanks a lot for proving my point

Build failed in Jenkins: mahout-nightly #1320

2013-08-12 Thread Apache Jenkins Server
See -- [...truncated 1497 lines...] Running org.apache.mahout.cf.taste.impl.recommender.svd.FilePersistenceStrategyTest Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.061 sec - in org.

Re: apache-math dependency

2013-08-12 Thread Ted Dunning
Yes. Apache Math linear algebra is very difficult for us to use because their matrices are non-extensible. But there is actually quite a lot of code to do with random distributions, optimization and quadrature. Those are much more likely to be useful to us. On Mon, Aug 12, 2013 at 3:26 PM, Dmit

Jenkins build is back to normal : Mahout-Quality #2194

2013-08-12 Thread Apache Jenkins Server
See

Re: apache-math dependency

2013-08-12 Thread Dmitriy Lyubimov
Sorry. typo. it should read " Larger part of *apache-math* is linear algebra, which is currently broken for sparse part of the equation and which we don't use at all." On Mon, Aug 12, 2013 at 3:26 PM, Dmitriy Lyubimov wrote: > Larger part of apache-math is linear algebra, which is currently br

Re: apache-math dependency

2013-08-12 Thread Dmitriy Lyubimov
Larger part of mahout-math is linear algebra, which is currently broken for sparse part of the equation and which we don't use at all. One part of the problem is that our use for that library is always a fringe case, and as far as i can tell, will always continue to be such. Another part of the p

Re: apache-math dependency

2013-08-12 Thread Peng Cheng
seriously, I would prefer the dependency as a good architectural pattern. It encourages other people to use/contribute to it to avoid repetitive work. On 13-08-12 06:16 PM, Ted Dunning wrote: I am fine with it staying. On Mon, Aug 12, 2013 at 3:14 PM, Dmitriy Lyubimov wrote: So you are ok

Re: apache-math dependency

2013-08-12 Thread Ted Dunning
I am fine with it staying. On Mon, Aug 12, 2013 at 3:14 PM, Dmitriy Lyubimov wrote: > So you are ok with apache-math dependency to stay? > > > On Mon, Aug 12, 2013 at 3:09 PM, Ted Dunning > wrote: > > > So I checked on these. The non-trivial issues with replacing Commons > Math > > include: >

Re: apache-math dependency

2013-08-12 Thread Ted Dunning
On Mon, Aug 12, 2013 at 3:09 PM, Dmitriy Lyubimov wrote: > > The files that are involved are: > > > >3 EigenSolverWrapper.java > > > this file should have been deleted on trunk... let me know if not, it was > part of "wean SSVD off apache.math dep" commit > Not an issue. I ran the count agai

Re: apache-math dependency

2013-08-12 Thread Dmitriy Lyubimov
So you are ok with apache-math dependency to stay? On Mon, Aug 12, 2013 at 3:09 PM, Ted Dunning wrote: > So I checked on these. The non-trivial issues with replacing Commons Math > include: > > - Poisson and negative binomial distributions. This would be several hours > work to write and test

[jira] [Created] (MAHOUT-1312) LocalitySensitiveHashSearch.search does not respect search result limit

2013-08-12 Thread Stevo Slavic (JIRA)
Stevo Slavic created MAHOUT-1312: Summary: LocalitySensitiveHashSearch.search does not respect search result limit Key: MAHOUT-1312 URL: https://issues.apache.org/jira/browse/MAHOUT-1312 Project: Maho

Re: apache-math dependency

2013-08-12 Thread Ted Dunning
So I checked on these. The non-trivial issues with replacing Commons Math include: - Poisson and negative binomial distributions. This would be several hours work to write and test (we have Colt-inherited negative binomial distribution, but it takes no longer to write a new one than to test an o

Re: apache-math dependency

2013-08-12 Thread Dmitriy Lyubimov
On Mon, Aug 12, 2013 at 2:53 PM, Ted Dunning wrote: > Here are the uses [1] > > Counting the unique usages, we have these: > >2 distribution.PoissonDistribution; >2 distribution.PascalDistribution; >2 distribution.NormalDistribution; >1 util.FastMath; >1 random.RandomGenerator

Re: apache-math dependency

2013-08-12 Thread Ted Dunning
Here are the uses [1] Counting the unique usages, we have these: 2 distribution.PoissonDistribution; 2 distribution.PascalDistribution; 2 distribution.NormalDistribution; 1 util.FastMath; 1 random.RandomGenerator; 1 random.MersenneTwister; 1 primes.Primes; 1 linear.RealMat
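A tally like the one above can be produced by counting import statements that start with the `org.apache.commons.math3` prefix. The following is only a minimal sketch of that idea, not the command actually used in the thread: the class name `ImportTally`, the method `tally`, and the sample input lines are all hypothetical, and the sketch works on an in-memory list of source lines rather than walking the Mahout tree.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper: counts how often each Commons Math class is imported.
public class ImportTally {
    // Prefix taken from the search string mentioned in the thread.
    static final String PREFIX = "import org.apache.commons.math3.";

    // Returns a sorted map from the import suffix (e.g. "util.FastMath;")
    // to the number of lines that import it.
    static Map<String, Integer> tally(List<String> sourceLines) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String line : sourceLines) {
            String trimmed = line.trim();
            if (trimmed.startsWith(PREFIX)) {
                String name = trimmed.substring(PREFIX.length());
                counts.merge(name, 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        // Made-up sample lines standing in for imports gathered from several files.
        List<String> lines = Arrays.asList(
            "import org.apache.commons.math3.distribution.PoissonDistribution;",
            "import org.apache.commons.math3.util.FastMath;",
            "import java.util.List;",
            "import org.apache.commons.math3.distribution.PoissonDistribution;");
        tally(lines).forEach((name, n) -> System.out.println(n + " " + name));
    }
}
```

In practice the input lines would come from reading every `.java` file under the source root; that part is left out here to keep the sketch self-contained.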

Re: apache-math dependency

2013-08-12 Thread Dmitriy Lyubimov
hm. so it crept in. As far back as i can recollect, we tried to minimize those and the only limiting factor for us were decompositions. Now all decompositions are available natively to Mahout, so perhaps it is time to review these occurrences. On Mon, Aug 12, 2013 at 2:03 PM, Stevo Slavić wrote

Re: apache-math dependency

2013-08-12 Thread Stevo Slavić
There are 30 matches when searching for "org.apache.commons.math3" in Mahout java files. On Fri, Aug 9, 2013 at 8:34 PM, Dmitriy Lyubimov wrote: > FYI SSVD does not have that dependency anymore (thanks to fixes to > EigenSolver in Mahout). If there are no more methods using it, it can be > dele

[jira] [Commented] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-08-12 Thread Gokhan Capan (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13737267#comment-13737267 ] Gokhan Capan commented on MAHOUT-1286: -- Peng, With a SparseRowMatrix, column access

Re: Mahout ported to windows

2013-08-12 Thread Ted Dunning
Also, it looks from your discussion like you are hard-coding changes to path names and line delimiters that are windows specific. It is not acceptable to make Mahout run on windows at the cost of breaking it on all other platforms. Also, you reference Jenkins builds. Can you provide a pointer?
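The concern about hard-coded path names and line delimiters can be illustrated with a small Java sketch. This is not taken from the MAHOUT-1309, MAHOUT-1310 or MAHOUT-1311 patches; the class `PortablePaths` and its methods are hypothetical names, and the sketch only shows the general approach of asking the JDK for the platform's separators instead of embedding `\` or `\r\n` in the code.

```java
import java.nio.file.Paths;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper showing platform-neutral path and line handling.
public class PortablePaths {
    // Joins path segments with the separator of the current platform
    // ("/" on Unix-like systems, "\" on Windows).
    static String join(String first, String... more) {
        return Paths.get(first, more).toString();
    }

    // Joins lines with the line separator of the current platform
    // instead of a hard-coded "\n" or "\r\n".
    static String joinLines(List<String> lines) {
        return String.join(System.lineSeparator(), lines);
    }

    public static void main(String[] args) {
        // Illustrative directory names only.
        System.out.println(join("examples", "bin", "work"));
        System.out.println(joinLines(Arrays.asList("first line", "second line")));
    }
}
```

Code written this way behaves the same on Windows and on other platforms, which is the property the reply asks for.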

Re: Mahout ported to windows

2013-08-12 Thread Dmitriy Lyubimov
On Mon, Aug 12, 2013 at 11:25 AM, Sergey Svinarchuk < ssvinarc...@hortonworks.com> wrote: > Hi all, > > https://issues.apache.org/jira/browse/MAHOUT-1309 > https://issues.apache.org/jira/browse/MAHOUT-1310 > https://issues.apache.org/jira/browse/MAHOUT-1311 > These tickets are part of porting maho

[jira] [Commented] (MAHOUT-1309) Install mahout on windows

2013-08-12 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13737182#comment-13737182 ] Dmitriy Lyubimov commented on MAHOUT-1309: -- it looks like the patch is against m

Mahout ported to windows

2013-08-12 Thread Sergey Svinarchuk
Hi all, https://issues.apache.org/jira/browse/MAHOUT-1309 https://issues.apache.org/jira/browse/MAHOUT-1310 https://issues.apache.org/jira/browse/MAHOUT-1311 These tickets are part of porting Mahout to Windows. After this change Mahout compiles and builds, and all Mahout example scripts must also work w

[jira] [Commented] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-08-12 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13737116#comment-13737116 ] Ted Dunning commented on MAHOUT-1286: - The hopscotch hashing still looks pretty inter

[jira] [Commented] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-08-12 Thread Peng Cheng (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13737033#comment-13737033 ] Peng Cheng commented on MAHOUT-1286: The data structure used here (1d hashing row/col

[jira] [Comment Edited] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-08-12 Thread Peng Cheng (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13736962#comment-13736962 ] Peng Cheng edited comment on MAHOUT-1286 at 8/12/13 4:54 PM: -

[jira] [Commented] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-08-12 Thread Peng Cheng (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13737023#comment-13737023 ] Peng Cheng commented on MAHOUT-1286: Well, I mean, I partially agree that the effort

[jira] [Commented] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-08-12 Thread Peng Cheng (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13737019#comment-13737019 ] Peng Cheng commented on MAHOUT-1286: Hi Dr Dunning, Indeed, both Gokhan and I have e

[jira] [Comment Edited] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-08-12 Thread Peng Cheng (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13737019#comment-13737019 ] Peng Cheng edited comment on MAHOUT-1286 at 8/12/13 4:44 PM: -

[jira] [Updated] (MAHOUT-1310) 100% unit test pass for mahout during build on Windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk updated MAHOUT-1310: -- Fix Version/s: (was: 0.7) > 100% unit test pass for mahout during build on Win

[jira] [Updated] (MAHOUT-1311) 100% system tests pass for mahout on Windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk updated MAHOUT-1311: -- Attachment: patchfile.patch > 100% system tests pass for mahout on Windows > -

[jira] [Updated] (MAHOUT-1310) 100% unit test pass for mahout during build on Windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk updated MAHOUT-1310: -- Attachment: patchfile.patch > 100% unit test pass for mahout during build on Windo

[jira] [Updated] (MAHOUT-1309) Install mahout on windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk updated MAHOUT-1309: -- Attachment: patchfile.patch > Install mahout on windows >

[jira] [Updated] (MAHOUT-1311) 100% system tests pass for mahout on Windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk updated MAHOUT-1311: -- Attachment: (was: patch_product_code.archive) > 100% system tests pass for mah

[jira] [Updated] (MAHOUT-1310) 100% unit test pass for mahout during build on Windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk updated MAHOUT-1310: -- Attachment: (was: patch_product_code.archive) > 100% unit test pass for mahout

[jira] [Updated] (MAHOUT-1309) Install mahout on windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk updated MAHOUT-1309: -- Attachment: (was: patch_product_code.archive) > Install mahout on windows > --

[jira] [Updated] (MAHOUT-1309) Install mahout on windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk updated MAHOUT-1309: -- Fix Version/s: (was: 0.7) > Install mahout on windows > --

[jira] [Updated] (MAHOUT-1311) 100% system tests pass for mahout on Windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk updated MAHOUT-1311: -- Fix Version/s: (was: 0.7) > 100% system tests pass for mahout on Windows > ---

[jira] [Updated] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-08-12 Thread Peng Cheng (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peng Cheng updated MAHOUT-1286: --- Fix Version/s: 0.9 Labels: collaborative-filtering datamodel patch recommender (was: )

[jira] [Commented] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-08-12 Thread Peng Cheng (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13736992#comment-13736992 ] Peng Cheng commented on MAHOUT-1286: Here is my final solution after numerous experim

[jira] [Updated] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-08-12 Thread Peng Cheng (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peng Cheng updated MAHOUT-1286: --- Attachment: InMemoryDataModelTest.java InMemoryDataModel.java See uploaded files for

[jira] [Reopened] (MAHOUT-1311) 100% system tests pass for mahout on Windows

2013-08-12 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Dunning reopened MAHOUT-1311: - This isn't fixed until the fix is committed. Can you re-attach your patch in a suitable standard di

[jira] [Reopened] (MAHOUT-1309) Install mahout on windows

2013-08-12 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Dunning reopened MAHOUT-1309: - This isn't fixed until the fix is committed. Can you re-attach your patch in a suitable standard di

[jira] [Reopened] (MAHOUT-1310) 100% unit test pass for mahout during build on Windows

2013-08-12 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Dunning reopened MAHOUT-1310: - This isn't fixed until the fix is committed. Can you re-attach your patch in a standard form such

[jira] [Commented] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-08-12 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13736984#comment-13736984 ] Ted Dunning commented on MAHOUT-1286: - Peng, You should be able to create a relative

[jira] [Commented] (MAHOUT-1286) Memory-efficient DataModel, supporting fast online updates and element-wise iteration

2013-08-12 Thread Peng Cheng (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13736962#comment-13736962 ] Peng Cheng commented on MAHOUT-1286: The idea of ArrayMap has been discarded due to i

[jira] [Updated] (MAHOUT-1311) 100% system tests pass for mahout on Windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk updated MAHOUT-1311: -- Attachment: mahout_system_tests.patch patch_product_code.archive Attac

[jira] [Resolved] (MAHOUT-1311) 100% system tests pass for mahout on Windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk resolved MAHOUT-1311. --- Resolution: Fixed > 100% system tests pass for mahout on Windows > -

[jira] [Created] (MAHOUT-1311) 100% system tests pass for mahout on Windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
Sergey Svinarchuk created MAHOUT-1311: - Summary: 100% system tests pass for mahout on Windows Key: MAHOUT-1311 URL: https://issues.apache.org/jira/browse/MAHOUT-1311 Project: Mahout Issue

[jira] [Updated] (MAHOUT-1310) 100% unit test pass for mahout during build on Windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk updated MAHOUT-1310: -- Fix Version/s: 0.7 Reproduced In: 0.7 Labels: patch (was: )

[jira] [Updated] (MAHOUT-1310) 100% unit test pass for mahout during build on Windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk updated MAHOUT-1310: -- Resolution: Fixed Status: Resolved (was: Patch Available) > 100% unit tes

[jira] [Updated] (MAHOUT-1310) 100% unit test pass for mahout during build on Windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk updated MAHOUT-1310: -- Attachment: patch_product_code.archive > 100% unit test pass for mahout during bui

[jira] [Created] (MAHOUT-1310) 100% unit test pass for mahout during build on Windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
Sergey Svinarchuk created MAHOUT-1310: - Summary: 100% unit test pass for mahout during build on Windows Key: MAHOUT-1310 URL: https://issues.apache.org/jira/browse/MAHOUT-1310 Project: Mahout

[jira] [Resolved] (MAHOUT-1309) Install mahout on windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk resolved MAHOUT-1309. --- Resolution: Fixed > Install mahout on windows > - > >

[jira] [Comment Edited] (MAHOUT-1309) Install mahout on windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13736933#comment-13736933 ] Sergey Svinarchuk edited comment on MAHOUT-1309 at 8/12/13 3:12 PM: ---

[jira] [Updated] (MAHOUT-1309) Install mahout on windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Svinarchuk updated MAHOUT-1309: -- Attachment: patch_product_code.archive In this archive patch with changes in mahout fo

[jira] [Created] (MAHOUT-1309) Install mahout on windows

2013-08-12 Thread Sergey Svinarchuk (JIRA)
Sergey Svinarchuk created MAHOUT-1309: - Summary: Install mahout on windows Key: MAHOUT-1309 URL: https://issues.apache.org/jira/browse/MAHOUT-1309 Project: Mahout Issue Type: Task