Re: Review Request: MAHOUT-1192 [2]: Speed up Vector Operations

2013-04-25 Thread Robin Anil
Do in place for rand an dense. Assign a bug to me to speed up rasv GetElement. On Apr 25, 2013 2:56 PM, "Dan Filimon" wrote: > Nearly done splitting the code up, but I'm not sure what the costs should > ideally be. > > Robin, you proposed: cost of iteration + cost of lookup + cost of update > (if

CfP 2013 Workshop on Middleware for HPC and Big Data Systems (MHPC'13)

2013-04-25 Thread MHPC 2013
we apologize if you receive multiple copies of this message === CALL FOR PAPERS 2013 Workshop on Middleware for HPC and Big Data Systems MHPC '13 as part of Euro-Par 2013, Aachen, Germany =

Re: Review Request: MAHOUT-1192 [2]: Speed up Vector Operations

2013-04-25 Thread Dan Filimon
Nearly done splitting the code up, but I'm not sure what the costs should ideally be. Robin, you proposed: cost of iteration + cost of lookup + cost of update (if its not in-place) This sounds like it's per element, rather than for the entire vector. Also, are we just going to assume that there w

[jira] [Commented] (MAHOUT-1196) LogisticModelParameters uses csv.getTargetCategories() even if csv is not used.

2013-04-25 Thread Vineet Krishnan (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13642011#comment-13642011 ] Vineet Krishnan commented on MAHOUT-1196: - Yes, i'm working on something that use

Re: Review Request: MAHOUT-1192 [2]: Speed up Vector Operations

2013-04-25 Thread Robin Anil
Depends on the speed of copying a 1M double array v/s doing a calloc + copying 1000 non zeros (Assuming java is doing that underneath). -- Robin Anil On Thu, Apr 25, 2013 at 2:18 AM, Dan Filimon wrote: > Right, but is clone() generally slower than assigning? That strikes me as > odd; doesn

[jira] [Commented] (MAHOUT-1177) GSOC 2013: Reform and simplify the clustering APIs

2013-04-25 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13641904#comment-13641904 ] Ted Dunning commented on MAHOUT-1177: - Yexi, We would love to have you contribute wi

[jira] [Comment Edited] (MAHOUT-1177) GSOC 2013: Reform and simplify the clustering APIs

2013-04-25 Thread Yexi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13641893#comment-13641893 ] Yexi edited comment on MAHOUT-1177 at 4/25/13 3:34 PM: --- Hi, I am

[jira] [Commented] (MAHOUT-1177) GSOC 2013: Reform and simplify the clustering APIs

2013-04-25 Thread Yexi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13641893#comment-13641893 ] Yexi commented on MAHOUT-1177: -- Hi, I am a graduate student majored in data mining, I am v

[jira] [Commented] (MAHOUT-916) Make Mahout's tests run in parallel

2013-04-25 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13641790#comment-13641790 ] Isabel Drost commented on MAHOUT-916: - Some status in between: Setting "parallel" to c

Build failed in Jenkins: mahout-nightly #1215

2013-04-25 Thread Apache Jenkins Server
See -- [...truncated 1337 lines...] Tests run: 7, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.002 sec Running org.apache.mahout.cf.taste.impl.model.MemoryIDMigratorTest Tests run: 3, Failures: 0, E

[jira] [Commented] (MAHOUT-916) Make Mahout's tests run in parallel

2013-04-25 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13641639#comment-13641639 ] Dawid Weiss commented on MAHOUT-916: I'm not advocating for making the build any more

Re: Review Request: MAHOUT-1192 [2]: Speed up Vector Operations

2013-04-25 Thread Dan Filimon
Right, but is clone() generally slower than assigning? That strikes me as odd; doesn't Java optimize copying the internal structures (there are arrays underneath after all)? On Thu, Apr 25, 2013 at 10:14 AM, Robin Anil wrote: > Seems like for dense clone is slower than like().assign I need to t

Re: Review Request: MAHOUT-1192 [2]: Speed up Vector Operations

2013-04-25 Thread Robin Anil
Seems like for dense clone is slower than like().assign I need to test it with different sizes to be sure. I kept it from the old behavior. On Apr 25, 2013 2:12 AM, "Dan Filimon" wrote: > Okay, so I should split it further into smaller sub-cases that handle each > Vector type. I tried making so t

Re: Review Request: MAHOUT-1192 [2]: Speed up Vector Operations

2013-04-25 Thread Dan Filimon
Okay, so I should split it further into smaller sub-cases that handle each Vector type. I tried making so that these match the cases in the document to the extent possible. You're right. It is ugly and I need to split it up. I removed the OrderedIntDoubleMapping (but with another if...) but one mo