[jira] [Commented] (MAHOUT-1490) Data frame R-like bindings

2014-03-26 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948962#comment-13948962 ] Dmitriy Lyubimov commented on MAHOUT-1490: -- could be, i have no opinion on tha

[jira] [Commented] (MAHOUT-1489) Interactive Scala & Spark Bindings Shell & Script processor

2014-03-26 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948959#comment-13948959 ] Dmitriy Lyubimov commented on MAHOUT-1489: -- hm email quoting did not work. i gu

[jira] [Commented] (MAHOUT-1489) Interactive Scala & Spark Bindings Shell & Script processor

2014-03-26 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948952#comment-13948952 ] Dmitriy Lyubimov commented on MAHOUT-1489: -- yes no this is not the scope no t

[jira] [Commented] (MAHOUT-1489) Interactive Scala & Spark Bindings Shell & Script processor

2014-03-26 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948948#comment-13948948 ] Dmitriy Lyubimov commented on MAHOUT-1489: -- This issue is not about it and this i

[jira] [Commented] (MAHOUT-1491) Spectral KMeans Clustering doesn't clean its /tmp dir and fails when seeing it again

2014-03-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948905#comment-13948905 ] Hudson commented on MAHOUT-1491: SUCCESS: Integrated in Mahout-Quality #2541 (See [https

[jira] [Commented] (MAHOUT-1489) Interactive Scala & Spark Bindings Shell & Script processor

2014-03-26 Thread Saikat Kanjilal (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948897#comment-13948897 ] Saikat Kanjilal commented on MAHOUT-1489: - Were you thinking of an in memory 2d a

[jira] [Commented] (MAHOUT-1489) Interactive Scala & Spark Bindings Shell & Script processor

2014-03-26 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948891#comment-13948891 ] Ted Dunning commented on MAHOUT-1489: - I think that creating a distributed object fro

[jira] [Commented] (MAHOUT-1489) Interactive Scala & Spark Bindings Shell & Script processor

2014-03-26 Thread Saikat Kanjilal (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948882#comment-13948882 ] Saikat Kanjilal commented on MAHOUT-1489: - Here is an initial list of functionali

[jira] [Commented] (MAHOUT-1490) Data frame R-like bindings

2014-03-26 Thread Saikat Kanjilal (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948876#comment-13948876 ] Saikat Kanjilal commented on MAHOUT-1490: - Here's a list of features that can exi

[jira] [Updated] (MAHOUT-1491) Spectral KMeans Clustering doesn't clean its /tmp dir and fails when seeing it again

2014-03-26 Thread Andrew Musselman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Musselman updated MAHOUT-1491: - Resolution: Fixed Status: Resolved (was: Patch Available) Committed patch. > S

[jira] [Commented] (MAHOUT-1491) Spectral KMeans Clustering doesn't clean its /tmp dir and fails when seeing it again

2014-03-26 Thread Andrew Musselman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948808#comment-13948808 ] Andrew Musselman commented on MAHOUT-1491: -- Committed patch to trunk. > Spectra

[jira] [Commented] (MAHOUT-1491) Spectral KMeans Clustering doesn't clean its /tmp dir and fails when seeing it again

2014-03-26 Thread Andrew Musselman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948804#comment-13948804 ] Andrew Musselman commented on MAHOUT-1491: -- Confirmed this patch fixes the bug.

[jira] [Updated] (MAHOUT-1491) Spectral KMeans Clustering doesn't clean its /tmp dir and fails when seeing it again

2014-03-26 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi updated MAHOUT-1491: -- Attachment: MAHOUT-1491.patch > Spectral KMeans Clustering doesn't clean its /tmp dir and fail

[jira] [Updated] (MAHOUT-1491) Spectral KMeans Clustering doesn't clean its /tmp dir and fails when seeing it again

2014-03-26 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi updated MAHOUT-1491: -- Status: Patch Available (was: Open) > Spectral KMeans Clustering doesn't clean its /tmp dir a

[jira] [Created] (MAHOUT-1491) Spectral KMeans Clustering doesn't clean its /tmp dir and fails when seeing it again

2014-03-26 Thread Suneel Marthi (JIRA)
Suneel Marthi created MAHOUT-1491: - Summary: Spectral KMeans Clustering doesn't clean its /tmp dir and fails when seeing it again Key: MAHOUT-1491 URL: https://issues.apache.org/jira/browse/MAHOUT-1491

[jira] [Work started] (MAHOUT-1443) Update "How to release page"

2014-03-26 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on MAHOUT-1443 started by Suneel Marthi. > Update "How to release page" > > > Key: MAHOUT-1443 > URL: h

[jira] [Commented] (MAHOUT-1488) DisplaySpectralKMeans fails: examples/output/clusteredPoints/part-m-00000 does not exist.

2014-03-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948517#comment-13948517 ] Hudson commented on MAHOUT-1488: SUCCESS: Integrated in Mahout-Quality #2539 (See [https

[jira] [Commented] (MAHOUT-1346) Spark Bindings (DRM)

2014-03-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948515#comment-13948515 ] Hudson commented on MAHOUT-1346: SUCCESS: Integrated in Mahout-Quality #2539 (See [https

[jira] [Commented] (MAHOUT-1471) Cleanup website on Canopy Clustering

2014-03-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948516#comment-13948516 ] Hudson commented on MAHOUT-1471: SUCCESS: Integrated in Mahout-Quality #2539 (See [https

[jira] [Commented] (MAHOUT-1488) DisplaySpectralKMeans fails: examples/output/clusteredPoints/part-m-00000 does not exist.

2014-03-26 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948402#comment-13948402 ] Saleem Ansari commented on MAHOUT-1488: --- Thanks for promoting this patch. > Displa

[jira] [Updated] (MAHOUT-1488) DisplaySpectralKMeans fails: examples/output/clusteredPoints/part-m-00000 does not exist.

2014-03-26 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi updated MAHOUT-1488: -- Resolution: Fixed Status: Resolved (was: Patch Available) Patch committed to trunk. T

[jira] [Assigned] (MAHOUT-1488) DisplaySpectralKMeans fails: examples/output/clusteredPoints/part-m-00000 does not exist.

2014-03-26 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi reassigned MAHOUT-1488: - Assignee: Suneel Marthi > DisplaySpectralKMeans fails: examples/output/clusteredPoints/p

[jira] [Updated] (MAHOUT-1489) Interactive Scala & Spark Bindings Shell & Script processor

2014-03-26 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitriy Lyubimov updated MAHOUT-1489: - Summary: Interactive Scala & Spark Bindings Shell & Script processor (was: Interactive

[jira] [Commented] (MAHOUT-1490) Data frame R-like bindings

2014-03-26 Thread Saikat Kanjilal (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948335#comment-13948335 ] Saikat Kanjilal commented on MAHOUT-1490: - I can work on this since I have worked

[jira] [Commented] (MAHOUT-1489) Interactive linear algebra shell & script processor

2014-03-26 Thread Saikat Kanjilal (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948332#comment-13948332 ] Saikat Kanjilal commented on MAHOUT-1489: - Yes correct, however for my sake pleas

[jira] [Assigned] (MAHOUT-1490) Data frame R-like bindings

2014-03-26 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitriy Lyubimov reassigned MAHOUT-1490: Assignee: Dmitriy Lyubimov > Data frame R-like bindings >

[jira] [Commented] (MAHOUT-1490) Data frame R-like bindings

2014-03-26 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948280#comment-13948280 ] Dmitriy Lyubimov commented on MAHOUT-1490: -- Very good. I guess we need a DSL pro

[jira] [Commented] (MAHOUT-1489) Interactive linear algebra shell & script processor

2014-03-26 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948276#comment-13948276 ] Dmitriy Lyubimov commented on MAHOUT-1489: -- I cannot assign to a non-committer,

[jira] [Assigned] (MAHOUT-1489) Interactive linear algebra shell & script processor

2014-03-26 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitriy Lyubimov reassigned MAHOUT-1489: Assignee: Dmitriy Lyubimov > Interactive linear algebra shell & script processor >

RE: Mahout on Spark

2014-03-26 Thread Saikat Kanjilal
Created some placeholders for the first two pieces: https://issues.apache.org/jira/browse/MAHOUT-1489 https://issues.apache.org/jira/browse/MAHOUT-1490 @Dmitriy feel free to add some more descriptions/use cases onto these. I'll read through the Spark description and have some more questions for you

[jira] [Created] (MAHOUT-1490) Data frame R-like bindings

2014-03-26 Thread Saikat Kanjilal (JIRA)
Saikat Kanjilal created MAHOUT-1490: --- Summary: Data frame R-like bindings Key: MAHOUT-1490 URL: https://issues.apache.org/jira/browse/MAHOUT-1490 Project: Mahout Issue Type: New Feature

[jira] [Created] (MAHOUT-1489) Interactive linear algebra shell & script processor

2014-03-26 Thread Saikat Kanjilal (JIRA)
Saikat Kanjilal created MAHOUT-1489: --- Summary: Interactive linear algebra shell & script processor Key: MAHOUT-1489 URL: https://issues.apache.org/jira/browse/MAHOUT-1489 Project: Mahout Is

Re: Mahout on Spark

2014-03-26 Thread Dmitriy Lyubimov
No, we probably don't want to create them unless we have someone to assign them to. You are more than welcome to create one if you want to take a stab at any of those. -d On Wed, Mar 26, 2014 at 10:28 AM, Saikat Kanjilal wrote: > @Dmitriy Are there JIRA items created for the wanted pieces? I'd like

Re: [jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-26 Thread Dmitriy Lyubimov
On Wed, Mar 26, 2014 at 6:00 AM, Hardik Pandya wrote: > Sorry to hijack the thread, > > this seems like first steps of mahout getting it to work on spark > > there are similar efforts going on with R+Spark aka Spark R > Yeah. And there's rmr and I wrote a very similar thing, CrunchR (R for Crunch

RE: Mahout on Spark

2014-03-26 Thread Saikat Kanjilal
@Dmitriy Are there JIRA items created for the wanted pieces? I'd like to volunteer to take on the shell and the R bindings. Should I create JIRA items for these? > Date: Wed, 26 Mar 2014 10:12:01 -0700 > Subject: Re: Mahout on Spark > From: dlie...@gmail.com > To: sxk1...@hotmail.com > CC: dev@m

Re: Mahout on Spark

2014-03-26 Thread Dmitriy Lyubimov
Sure. @Saikat et al: Check out the http://mahout.apache.org/users/sparkbindings/home.html "Wanted" section. Of course, data frames and vectorization (feature prep) standardization is very high priority there. Another high priority is interactive shell/scripting (just like spark shell). Something

RE: Mahout on Spark

2014-03-26 Thread Saikat Kanjilal
+1, in fact I would be very much indebted if someone (namely Dmitriy :) ) could do a Google Hangout focused on Spark where folks can ask questions and learn more. To this end I want to bring up something else: it'd be great if Mahout itself either through the apache project foundation or through

Mahout on Spark

2014-03-26 Thread Pat Ferrel
New name for a new thread. A lot of the discussion on MAHOUT-1464 has been around integrating that feature with the Scala DSL. As Saikat says this is of general interest since people seem to agree that this is a good place to integrate efforts. I’m interested in what I think Dmitriy called data

Re: [jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-26 Thread Ted Dunning
It would be great to have you. (go ahead and start new threads when appropriate ... better than hijacking) On Wed, Mar 26, 2014 at 6:00 AM, Hardik Pandya wrote: > Sorry to hijack the thread, > > this seems like first steps of mahout getting it to work on spark > > there are similar efforts goi

Re: [jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-26 Thread Hardik Pandya
Sorry to hijack the thread, this seems like first steps of mahout getting it to work on spark. There are similar efforts going on with R+Spark aka Spark R. Not sure if this helps, played with spark ec2 scripts and it brings up multinode cluster using mesos and it's configurable - willing to contri

[jira] [Updated] (MAHOUT-1488) DisplaySpectralKMeans fails: examples/output/clusteredPoints/part-m-00000 does not exist.

2014-03-26 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saleem Ansari updated MAHOUT-1488: -- Fix Version/s: 1.0 > DisplaySpectralKMeans fails: examples/output/clusteredPoints/part-m-0

[jira] [Comment Edited] (MAHOUT-1488) DisplaySpectralKMeans fails: examples/output/clusteredPoints/part-m-00000 does not exist.

2014-03-26 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13947652#comment-13947652 ] Saleem Ansari edited comment on MAHOUT-1488 at 3/26/14 8:16 AM: ---

[jira] [Commented] (MAHOUT-1488) DisplaySpectralKMeans fails: examples/output/clusteredPoints/part-m-00000 does not exist.

2014-03-26 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13947659#comment-13947659 ] Saleem Ansari commented on MAHOUT-1488: --- I have attached full error and patch file.

[jira] [Updated] (MAHOUT-1488) DisplaySpectralKMeans fails: examples/output/clusteredPoints/part-m-00000 does not exist.

2014-03-26 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saleem Ansari updated MAHOUT-1488: -- Attachment: 0001-MAHOUT-1488-Fix-DisplaySpectralKMeans-failure.patch Patch file to fix the iss

[jira] [Updated] (MAHOUT-1488) DisplaySpectralKMeans fails: examples/output/clusteredPoints/part-m-00000 does not exist.

2014-03-26 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saleem Ansari updated MAHOUT-1488: -- Attachment: error.txt Full error of the issue. > DisplaySpectralKMeans fails: examples/output

[jira] [Comment Edited] (MAHOUT-1488) DisplaySpectralKMeans fails: examples/output/clusteredPoints/part-m-00000 does not exist.

2014-03-26 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13947652#comment-13947652 ] Saleem Ansari edited comment on MAHOUT-1488 at 3/26/14 8:09 AM: ---

[jira] [Updated] (MAHOUT-1488) DisplaySpectralKMeans fails: examples/output/clusteredPoints/part-m-00000 does not exist.

2014-03-26 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saleem Ansari updated MAHOUT-1488: -- Status: Patch Available (was: Open) diff --git a/examples/src/main/java/org/apache/mahout/cl

[jira] [Created] (MAHOUT-1488) DisplaySpectralKMeans fails: examples/output/clusteredPoints/part-m-00000 does not exist.

2014-03-26 Thread Saleem Ansari (JIRA)
Saleem Ansari created MAHOUT-1488: - Summary: DisplaySpectralKMeans fails: examples/output/clusteredPoints/part-m-00000 does not exist. Key: MAHOUT-1488 URL: https://issues.apache.org/jira/browse/MAHOUT-1488