Re: [jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-26 Thread Dmitriy Lyubimov
On Wed, Mar 26, 2014 at 6:00 AM, Hardik Pandya wrote: > Sorry to hijack the thread, > > this seems like first steps of mahout geeting it to work on spark > > there are similar efforts going on with R+Spark aka Spark R > Yeah. And there's rmr and i wrote a very similar thing, CrunchR (R for Crunch

Re: [jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-26 Thread Ted Dunning
It would be great to have you. (go ahead and start new threads when appropriate ... better than hijacking) On Wed, Mar 26, 2014 at 6:00 AM, Hardik Pandya wrote: > Sorry to hijack the thread, > > this seems like first steps of mahout geeting it to work on spark > > there are similar efforts goi

Re: [jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-26 Thread Hardik Pandya
Sorry to hijack the thread, this seems like first steps of mahout geeting it to work on spark there are similar efforts going on with R+Spark aka Spark R not sure if this helpos, played with spark ec2 scripts and it brings up multinode cluster using mesos and its configurable - willing to contri

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-24 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13945337#comment-13945337 ] Pat Ferrel commented on MAHOUT-1464: OK, I do have an alum address but it takes some

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-24 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13945306#comment-13945306 ] Pat Ferrel commented on MAHOUT-1464: I tried by you have to have a .edu or university

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-23 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13944773#comment-13944773 ] Sebastian Schelter commented on MAHOUT-1464: [~pferrel] I planned to test the

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-23 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13944755#comment-13944755 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- bq. Adding 16 cores to my closet's c

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-23 Thread Andrew Musselman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13944743#comment-13944743 ] Andrew Musselman commented on MAHOUT-1464: -- [~kanjilal] Not sure how to make a s

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-23 Thread Saikat Kanjilal (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13944710#comment-13944710 ] Saikat Kanjilal commented on MAHOUT-1464: - +1 on Andrew's suggestion on using AWS

Re: [jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-23 Thread Andrew Musselman
I dug it up and it's a "promotional discount" which I think can be applied to anyone's account. Shall we spin up an AWS/EMR account for mahout-dev? On Sun, Mar 23, 2014 at 5:44 PM, Andrew Musselman < andrew.mussel...@gmail.com> wrote: > I still have (as I recall) a thousand dollars' worth of AW

Re: [jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-23 Thread Andrew Musselman
I still have (as I recall) a thousand dollars' worth of AWS credit the AWS team gave me specifically for Mahout testing, and we could run stuff on EMR very easily. Need to dig up the account number or details and see about sharing around the credentials somehow. On Sun, Mar 23, 2014 at 5:39 PM,

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-23 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13944673#comment-13944673 ] Pat Ferrel commented on MAHOUT-1464: Adding 16 cores to my closet's cluster next week

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-23 Thread Saikat Kanjilal (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13944622#comment-13944622 ] Saikat Kanjilal commented on MAHOUT-1464: - We have a cluster at work , however I'

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-23 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13944545#comment-13944545 ] Sebastian Schelter commented on MAHOUT-1464: Would be awesome to take the pat

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-23 Thread Saikat Kanjilal (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13944543#comment-13944543 ] Saikat Kanjilal commented on MAHOUT-1464: - Sebastien, Can I help out on this issu

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-20 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13941604#comment-13941604 ] Hudson commented on MAHOUT-1464: SUCCESS: Integrated in Mahout-Quality #2531 (See [https

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-20 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13941561#comment-13941561 ] Hudson commented on MAHOUT-1464: SUCCESS: Integrated in Mahout-Quality #2530 (See [https

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-20 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13941545#comment-13941545 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- If anything, at least i see non-negl

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-20 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13941541#comment-13941541 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- Oh, you mean in case of sparse row v

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-20 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13941531#comment-13941531 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- No, i think blockify is fine. it pro

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-20 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13941521#comment-13941521 ] Sebastian Schelter commented on MAHOUT-1464: one possibility would be to allo

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-20 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13941520#comment-13941520 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- On Thu, Mar 20, 2014 at 1:42 AM, Seb

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-20 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13941517#comment-13941517 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- I have non-slim A'A. Of course slim

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-20 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13941516#comment-13941516 ] Sebastian Schelter commented on MAHOUT-1464: In a SparseRowMatrix, this is on

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-20 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13941511#comment-13941511 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- yeah .. those views.. i think they c

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-19 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13941355#comment-13941355 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- Actually, non-slim A'A operator is p

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13939513#comment-13939513 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- http://weatheringthrutechdays.blogs

Re: [jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Dmitriy Lyubimov
On Tue, Mar 18, 2014 at 9:57 AM, Pat Ferrel (JIRA) wrote: > > [ > https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13939471#comment-13939471] > > Pat Ferrel commented on MAHOUT-1464: >

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13939471#comment-13939471 ] Pat Ferrel commented on MAHOUT-1464: Since there are potentially commits by D and S a

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13939468#comment-13939468 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- [~ssc] Looking nice. I guess we wan

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13939451#comment-13939451 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- That's what i normally do, yes. The

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13939420#comment-13939420 ] Pat Ferrel commented on MAHOUT-1464: PDF in the repo is fine by me. Can the patches

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13938958#comment-13938958 ] Sebastian Schelter commented on MAHOUT-1464: The physical operator for non-sk

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13938810#comment-13938810 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- Also, just FYI, much as i love to us

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13938805#comment-13938805 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- 1. {code} val C = A.t %*% A {code}

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13938799#comment-13938799 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- What's the best way to share PDF sou

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13938333#comment-13938333 ] Pat Ferrel commented on MAHOUT-1464: OK, refreshed the repo and now I see all the Spa

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13938259#comment-13938259 ] Sebastian Schelter commented on MAHOUT-1464: I'd like to rework my prototype

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13938254#comment-13938254 ] Sebastian Schelter commented on MAHOUT-1464: I havent tested Spark on Hadoop

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13938020#comment-13938020 ] Pat Ferrel commented on MAHOUT-1464: So am I so no problem. My plan is to update the

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13937987#comment-13937987 ] Sebastian Schelter commented on MAHOUT-1464: @Pat I'm pretty busy with non-Ma

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13937954#comment-13937954 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- Ps spark module has cdh4 maven profi

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13937951#comment-13937951 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- I only ever ran spark code with hdfs

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13937925#comment-13937925 ] Pat Ferrel commented on MAHOUT-1464: Good news. At the danger of asking for too much,

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-16 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13937490#comment-13937490 ] Sebastian Schelter commented on MAHOUT-1464: I've started to work on this. >