[jira] [Reopened] (SPARK-5312) Use sbt to detect new or changed public classes in PRs

2015-03-12 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brennon York reopened SPARK-5312: - Reopening as believed to be a direct path forward with

[jira] [Commented] (SPARK-5987) Model import/export for GaussianMixtureModel

2015-03-12 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14355274#comment-14355274 ] Manoj Kumar commented on SPARK-5987: [~josephkb] I am stuck at this point. Would be

[jira] [Commented] (SPARK-6232) Spark Streaming: simple application stalls processing

2015-03-12 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14355314#comment-14355314 ] Tathagata Das commented on SPARK-6232: -- [~minisaw] Could you try out 1.3.0 and

[jira] [Commented] (SPARK-6232) Spark Streaming: simple application stalls processing

2015-03-12 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14355311#comment-14355311 ] Tathagata Das commented on SPARK-6232: -- I believe this is the jira that fixed it

[jira] [Resolved] (SPARK-6296) Add equals operator to Column (v1.3)

2015-03-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-6296. Resolution: Fixed Target Version/s: 1.4.0, 1.3.1 (was: 1.3.0) Add equals operator to

[jira] [Commented] (SPARK-4921) TaskSetManager mistakenly returns PROCESS_LOCAL for NO_PREF tasks

2015-03-12 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14355366#comment-14355366 ] Sandy Ryza commented on SPARK-4921: --- I'm going to close this as Won't Fix as this has

[jira] [Commented] (SPARK-6246) spark-ec2 can't handle clusters with 100 nodes

2015-03-12 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14355403#comment-14355403 ] Shivaram Venkataraman commented on SPARK-6246: -- Hmm - This seems like a bad

[jira] [Updated] (SPARK-6300) sc.addFile(path) does not support the relative path.

2015-03-12 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-6300: -- Priority: Critical (was: Major) Target Version/s: 1.3.1 Affects Version/s: 1.3.0

[jira] [Updated] (SPARK-3642) Better document the nuances of shared variables

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3642: - Priority: Minor (was: Major) Assignee: Sandy Ryza Better document the nuances of shared variables

[jira] [Resolved] (SPARK-3642) Better document the nuances of shared variables

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3642. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 2490

[jira] [Updated] (SPARK-6294) PySpark task may hang while call take() on in Java/Scala

2015-03-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6294: - Fix Version/s: 1.3.1 1.4.0 PySpark task may hang while call take() on in

[jira] [Resolved] (SPARK-5814) Remove JBLAS from runtime dependencies

2015-03-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5814. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4699

[jira] [Resolved] (SPARK-5186) Vector.equals and Vector.hashCode are very inefficient and fail on SparseVectors with large size

2015-03-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5186. -- Resolution: Fixed Fix Version/s: (was: 1.3.0) 1.2.2 Issue

[jira] [Updated] (SPARK-6301) Unable to load external jars while submiiting Spark Job

2015-03-12 Thread raju patel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] raju patel updated SPARK-6301: -- Component/s: Spark Submit PySpark Unable to load external jars while submiiting Spark

[jira] [Updated] (SPARK-6301) Unable to load external jars while submitting Spark Job

2015-03-12 Thread raju patel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] raju patel updated SPARK-6301: -- Summary: Unable to load external jars while submitting Spark Job (was: Unable to load external jars

[jira] [Commented] (SPARK-5692) Model import/export for Word2Vec

2015-03-12 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14357069#comment-14357069 ] Manoj Kumar commented on SPARK-5692: okay, great Model import/export for Word2Vec

[jira] [Created] (SPARK-6302) GeneratedAggregate uses wrong schema on updateProjection

2015-03-12 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6302: -- Summary: GeneratedAggregate uses wrong schema on updateProjection Key: SPARK-6302 URL: https://issues.apache.org/jira/browse/SPARK-6302 Project: Spark

[jira] [Commented] (SPARK-5987) Model import/export for GaussianMixtureModel

2015-03-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14357109#comment-14357109 ] Joseph K. Bradley commented on SPARK-5987: -- This isn't a bug in Spark SQL. The

[jira] [Commented] (SPARK-6285) Duplicated code leads to errors

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14357255#comment-14357255 ] Sean Owen commented on SPARK-6285: -- I do not observe any compilation problem in Maven or

[jira] [Commented] (SPARK-6284) Support framework authentication and role in Mesos framework

2015-03-12 Thread Timothy Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14357180#comment-14357180 ] Timothy Chen commented on SPARK-6284: - https://github.com/apache/spark/pull/4960

[jira] [Updated] (SPARK-6301) Unable to load external jars while submiiting Spark Job

2015-03-12 Thread raju patel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] raju patel updated SPARK-6301: -- Description: We are using Jnius to call Java functions from Python. But when we are trying to submit

[jira] [Created] (SPARK-6301) Unable to load external jars while submiiting Spark Job

2015-03-12 Thread raju patel (JIRA)
raju patel created SPARK-6301: - Summary: Unable to load external jars while submiiting Spark Job Key: SPARK-6301 URL: https://issues.apache.org/jira/browse/SPARK-6301 Project: Spark Issue Type:

[jira] [Updated] (SPARK-6301) Unable to load external jars while submiiting Spark Job

2015-03-12 Thread raju patel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] raju patel updated SPARK-6301: -- Priority: Blocker (was: Major) Unable to load external jars while submiiting Spark Job

[jira] [Commented] (SPARK-6302) GeneratedAggregate uses wrong schema on updateProjection

2015-03-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358333#comment-14358333 ] Apache Spark commented on SPARK-6302: - User 'viirya' has created a pull request for

[jira] [Updated] (SPARK-6301) Unable to load external jars while submitting Spark Job

2015-03-12 Thread raju patel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] raju patel updated SPARK-6301: -- Description: We are using Jnius to call Java functions from Python. But when we are trying to submit

[jira] [Commented] (SPARK-6286) Handle TASK_ERROR in TaskState

2015-03-12 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14357310#comment-14357310 ] Iulian Dragos commented on SPARK-6286: -- Good point. It's been [introduced in

[jira] [Commented] (SPARK-6190) create LargeByteBuffer abstraction for eliminating 2GB limit on blocks

2015-03-12 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14357609#comment-14357609 ] Imran Rashid commented on SPARK-6190: - Hi [~rxin], I've been adding scatterered notes

[jira] [Created] (SPARK-6304) Checkpointing doesn't retain driver port

2015-03-12 Thread Marius Soutier (JIRA)
Marius Soutier created SPARK-6304: - Summary: Checkpointing doesn't retain driver port Key: SPARK-6304 URL: https://issues.apache.org/jira/browse/SPARK-6304 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-6189) Pandas to DataFrame conversion should check field names for periods

2015-03-12 Thread mgdadv (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14357610#comment-14357610 ] mgdadv commented on SPARK-6189: --- While the dot is legal in R and SQL, I don't think there is

[jira] [Commented] (SPARK-6282) Strange Python import error when using random() in a lambda function

2015-03-12 Thread Pavel Laskov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358504#comment-14358504 ] Pavel Laskov commented on SPARK-6282: - Hi Sven and Joseph, Thanks for a quick reply

[jira] [Commented] (SPARK-6227) PCA and SVD for PySpark

2015-03-12 Thread Meethu Mathew (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358529#comment-14358529 ] Meethu Mathew commented on SPARK-6227: -- [~mengxr] Please give your inputs on the

[jira] [Commented] (SPARK-6305) Add support for log4j 2.x to Spark

2015-03-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358625#comment-14358625 ] Apache Spark commented on SPARK-6305: - User 'liorchaga' has created a pull request for

[jira] [Commented] (SPARK-6306) Readme points to dead link

2015-03-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358687#comment-14358687 ] Apache Spark commented on SPARK-6306: - User 'thvasilo' has created a pull request for

[jira] [Commented] (SPARK-6313) Fetch File Lock file creation doesnt work when Spark working dir is on a NFS mount

2015-03-12 Thread Nathan McCarthy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359964#comment-14359964 ] Nathan McCarthy commented on SPARK-6313: Suggestion along the lines of;

[jira] [Commented] (SPARK-6222) [STREAMING] All data may not be recovered from WAL when driver is killed

2015-03-12 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359950#comment-14359950 ] Tathagata Das commented on SPARK-6222: -- I proposed another way to fix this here

[jira] [Commented] (SPARK-6222) [STREAMING] All data may not be recovered from WAL when driver is killed

2015-03-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359948#comment-14359948 ] Apache Spark commented on SPARK-6222: - User 'tdas' has created a pull request for this

[jira] [Comment Edited] (SPARK-6222) [STREAMING] All data may not be recovered from WAL when driver is killed

2015-03-12 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359950#comment-14359950 ] Tathagata Das edited comment on SPARK-6222 at 3/13/15 5:09 AM:

[jira] [Commented] (SPARK-6299) ClassNotFoundException when running groupByKey with class defined in REPL.

2015-03-12 Thread Kevin (Sangwoo) Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359789#comment-14359789 ] Kevin (Sangwoo) Kim commented on SPARK-6299: Hi Sean, Surely it should work,

[jira] [Created] (SPARK-6313) Fetch File Lock file creation doesnt work when Spark working dir is on a NFS mount

2015-03-12 Thread Nathan McCarthy (JIRA)
Nathan McCarthy created SPARK-6313: -- Summary: Fetch File Lock file creation doesnt work when Spark working dir is on a NFS mount Key: SPARK-6313 URL: https://issues.apache.org/jira/browse/SPARK-6313

[jira] [Resolved] (SPARK-6311) ChiSqTest should check for too few counts

2015-03-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-6311. Resolution: Duplicate ChiSqTest should check for too few counts

[jira] [Resolved] (SPARK-6310) ChiSqTest should check for too few counts

2015-03-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-6310. Resolution: Duplicate ChiSqTest should check for too few counts

[jira] [Commented] (SPARK-5376) [Mesos] MesosExecutor should have correct resources

2015-03-12 Thread Lukasz Jastrzebski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359943#comment-14359943 ] Lukasz Jastrzebski commented on SPARK-5376: --- One comment, however if you run

[jira] [Updated] (SPARK-6313) Fetch File Lock file creation doesnt work when Spark working dir is on a NFS mount

2015-03-12 Thread Nathan McCarthy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nathan McCarthy updated SPARK-6313: --- Affects Version/s: 1.2.0 1.2.1 Fetch File Lock file creation doesnt

[jira] [Comment Edited] (SPARK-6313) Fetch File Lock file creation doesnt work when Spark working dir is on a NFS mount

2015-03-12 Thread Nathan McCarthy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359972#comment-14359972 ] Nathan McCarthy edited comment on SPARK-6313 at 3/13/15 5:38 AM:

[jira] [Commented] (SPARK-6313) Fetch File Lock file creation doesnt work when Spark working dir is on a NFS mount

2015-03-12 Thread Nathan McCarthy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359972#comment-14359972 ] Nathan McCarthy commented on SPARK-6313: Since the `val lockFileName =

[jira] [Commented] (SPARK-3066) Support recommendAll in matrix factorization model

2015-03-12 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359892#comment-14359892 ] Debasish Das commented on SPARK-3066: - We use the non-level 3 BLAS code in our

[jira] [Commented] (SPARK-4927) Spark does not clean up properly during long jobs.

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358933#comment-14358933 ] Sean Owen commented on SPARK-4927: -- I'm interested in this one. When I run it though it

[jira] [Updated] (SPARK-6301) Unable to load external jars while submitting Spark Job

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6301: - Priority: Major (was: Blocker) Until it's clear what is being reported, this should not be marked

[jira] [Resolved] (SPARK-6275) Miss toDF() function in docs/sql-programming-guide.md

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6275. -- Resolution: Fixed Fix Version/s: 1.4.0 Miss toDF() function in docs/sql-programming-guide.md

[jira] [Updated] (SPARK-6300) sc.addFile(path) does not support the relative path.

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6300: - Priority: Minor (was: Critical) Target Version/s: (was: 1.3.1) sc.addFile(path) does not

[jira] [Updated] (SPARK-6300) sc.addFile(path) does not support the relative path.

2015-03-12 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-6300: -- Priority: Critical (was: Minor) Target Version/s: 1.3.1 sc.addFile(path) does not support

[jira] [Commented] (SPARK-6299) ClassNotFoundException when running groupByKey with class defined in REPL.

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358816#comment-14358816 ] Sean Owen commented on SPARK-6299: -- Hm, is this supposed to work? the class is not

[jira] [Commented] (SPARK-1548) Add Partial Random Forest algorithm to MLlib

2015-03-12 Thread Manish Amde (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358871#comment-14358871 ] Manish Amde commented on SPARK-1548: We should also leave this ticket unassigned for

[jira] [Updated] (SPARK-6273) Got error when one table's alias name is the same with other table's column name

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6273: - Component/s: SQL Description: while one table's alias name is the same with other table's column name

[jira] [Updated] (SPARK-1548) Add Partial Random Forest algorithm to MLlib

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-1548: - Assignee: (was: Frank Dai) Add Partial Random Forest algorithm to MLlib

[jira] [Commented] (SPARK-6306) Readme points to dead link

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358784#comment-14358784 ] Sean Owen commented on SPARK-6306: -- For a trivial change, a JIRA is just overhead. You

[jira] [Updated] (SPARK-6275) Miss toDF() function in docs/sql-programming-guide.md

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6275: - Priority: Trivial (was: Minor) Assignee: zzc This is also too minor to bother with a JIRA. Miss

[jira] [Commented] (SPARK-6286) Handle TASK_ERROR in TaskState

2015-03-12 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358905#comment-14358905 ] Iulian Dragos commented on SPARK-6286: -- Sure, I'll issue a PR for handling

[jira] [Commented] (SPARK-4012) Uncaught OOM in ContextCleaner

2015-03-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359062#comment-14359062 ] Apache Spark commented on SPARK-4012: - User 'CodingCat' has created a pull request for

[jira] [Commented] (SPARK-4927) Spark does not clean up properly during long jobs.

2015-03-12 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358958#comment-14358958 ] Ilya Ganelin commented on SPARK-4927: - Hi Sean - I have a code snippet that reproduced

[jira] [Commented] (SPARK-1564) Add JavaScript into Javadoc to turn ::Experimental:: and such into badges

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359027#comment-14359027 ] Sean Owen commented on SPARK-1564: -- Yeah that's what I did, just made it not tied to the

[jira] [Commented] (SPARK-6286) Handle TASK_ERROR in TaskState

2015-03-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358964#comment-14358964 ] Apache Spark commented on SPARK-6286: - User 'dragos' has created a pull request for

[jira] [Commented] (SPARK-5310) Update SQL programming guide for 1.3

2015-03-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358981#comment-14358981 ] Apache Spark commented on SPARK-5310: - User 'liancheng' has created a pull request for

[jira] [Commented] (SPARK-1564) Add JavaScript into Javadoc to turn ::Experimental:: and such into badges

2015-03-12 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359017#comment-14359017 ] Matei Zaharia commented on SPARK-1564: -- This is still a valid issue AFAIK, isn't it?

[jira] [Commented] (SPARK-6294) PySpark task may hang while call take() on in Java/Scala

2015-03-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359016#comment-14359016 ] Apache Spark commented on SPARK-6294: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-5654) Integrate SparkR into Apache Spark

2015-03-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359166#comment-14359166 ] Patrick Wendell commented on SPARK-5654: I see the decision here as somewhat

[jira] [Commented] (SPARK-6282) Strange Python import error when using random() in a lambda function

2015-03-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359336#comment-14359336 ] Joseph K. Bradley commented on SPARK-6282: -- It looks like winreg is referenced in

[jira] [Commented] (SPARK-1673) GLMNET implementation in Spark

2015-03-12 Thread mike bowles (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359366#comment-14359366 ] mike bowles commented on SPARK-1673: Here's a table of scaling results for our

[jira] [Commented] (SPARK-4927) Spark does not clean up properly during long jobs.

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359437#comment-14359437 ] Sean Owen commented on SPARK-4927: -- OK, behavior looks a little different on YARN. I find

[jira] [Commented] (SPARK-6282) Strange Python import error when using random() in a lambda function

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359362#comment-14359362 ] Sean Owen commented on SPARK-6282: -- [~nchammas] or [~shivaram] might have a clue if it

[jira] [Commented] (SPARK-6282) Strange Python import error when using random() in a lambda function

2015-03-12 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359404#comment-14359404 ] Nicholas Chammas commented on SPARK-6282: - Shouldn't be related to boto. _winreg

[jira] [Updated] (SPARK-5740) Change comment default value from empty string to null in DescribeCommand

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5740: - Priority: Minor (was: Major) Target Version/s: (was: 1.4.0) Fix Version/s: (was:

[jira] [Commented] (SPARK-6286) Handle TASK_ERROR in TaskState

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358799#comment-14358799 ] Sean Owen commented on SPARK-6286: -- [~dragos] I think it would be reasonable to handle

[jira] [Commented] (SPARK-6301) Unable to load external jars while submitting Spark Job

2015-03-12 Thread raju patel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358815#comment-14358815 ] raju patel commented on SPARK-6301: --- I am trying to call Java functions which is

[jira] [Commented] (SPARK-6306) Readme points to dead link

2015-03-12 Thread Theodore Vasiloudis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358850#comment-14358850 ] Theodore Vasiloudis commented on SPARK-6306: I'll keep that in mind in the

[jira] [Commented] (SPARK-6300) sc.addFile(path) does not support the relative path.

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358877#comment-14358877 ] Sean Owen commented on SPARK-6300: -- (Sandy notes it's a regression so yeah it's more

[jira] [Commented] (SPARK-1546) Add AdaBoost algorithm to Spark MLlib

2015-03-12 Thread Manish Amde (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358881#comment-14358881 ] Manish Amde commented on SPARK-1546: I haven't worked on it since we haven't heard a

[jira] [Commented] (SPARK-3424) KMeans Plus Plus is too slow

2015-03-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359241#comment-14359241 ] Xiangrui Meng commented on SPARK-3424: -- Ah, sorry! I typed your email manually in the

[jira] [Commented] (SPARK-4927) Spark does not clean up properly during long jobs.

2015-03-12 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359294#comment-14359294 ] Ilya Ganelin commented on SPARK-4927: - Are you running over yarn? My theory is that

[jira] [Updated] (SPARK-4001) Add FP-growth algorithm to Spark MLlib

2015-03-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4001: - Summary: Add FP-growth algorithm to Spark MLlib (was: Add Apriori algorithm to Spark MLlib)

[jira] [Commented] (SPARK-4927) Spark does not clean up properly during long jobs.

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359200#comment-14359200 ] Sean Owen commented on SPARK-4927: -- Yes that's what I'm running in spark-shell (plus

[jira] [Comment Edited] (SPARK-4927) Spark does not clean up properly during long jobs.

2015-03-12 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358958#comment-14358958 ] Ilya Ganelin edited comment on SPARK-4927 at 3/12/15 6:50 PM: --

[jira] [Created] (SPARK-6305) Add support for log4j 2.x to Spark

2015-03-12 Thread Tal Sliwowicz (JIRA)
Tal Sliwowicz created SPARK-6305: Summary: Add support for log4j 2.x to Spark Key: SPARK-6305 URL: https://issues.apache.org/jira/browse/SPARK-6305 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-5189) Reorganize EC2 scripts so that nodes can be provisioned independent of Spark master

2015-03-12 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-5189: Description: As of 1.2.0, we launch Spark clusters on EC2 by setting up the master first,

[jira] [Commented] (SPARK-5189) Reorganize EC2 scripts so that nodes can be provisioned independent of Spark master

2015-03-12 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359665#comment-14359665 ] Nicholas Chammas commented on SPARK-5189: - For the record, this is the script I

[jira] [Commented] (SPARK-2426) Quadratic Minimization for MLlib ALS

2015-03-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359695#comment-14359695 ] Apache Spark commented on SPARK-2426: - User 'debasish83' has created a pull request

[jira] [Assigned] (SPARK-6210) Generated column name should not include id of column in it.

2015-03-12 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-6210: - Assignee: Davies Liu (was: Michael Armbrust) Generated column name should not include id of

[jira] [Commented] (SPARK-6210) Generated column name should not include id of column in it.

2015-03-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359705#comment-14359705 ] Apache Spark commented on SPARK-6210: - User 'davies' has created a pull request for

[jira] [Created] (SPARK-6308) VectorUDT is displayed as `vecto` in dtypes

2015-03-12 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6308: Summary: VectorUDT is displayed as `vecto` in dtypes Key: SPARK-6308 URL: https://issues.apache.org/jira/browse/SPARK-6308 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-6309) Add MatrixUDT to support dense/sparse matrices in DataFrames

2015-03-12 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6309: Summary: Add MatrixUDT to support dense/sparse matrices in DataFrames Key: SPARK-6309 URL: https://issues.apache.org/jira/browse/SPARK-6309 Project: Spark

[jira] [Resolved] (SPARK-5622) Add connector/handler hive configuration settings to hive-thrift-server

2015-03-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5622. -- Resolution: Won't Fix This sounded more clearly like a WontFix from the PR. Add connector/handler

[jira] [Commented] (SPARK-6190) create LargeByteBuffer abstraction for eliminating 2GB limit on blocks

2015-03-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14359524#comment-14359524 ] Reynold Xin commented on SPARK-6190: If I can guarantee at the block manager level,

[jira] [Resolved] (SPARK-6268) KMeans parameter getter methods

2015-03-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6268. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4974

[jira] [Created] (SPARK-6307) Executers fetches the same rdd-block 100's or 1000's of times

2015-03-12 Thread Tobias Bertelsen (JIRA)
Tobias Bertelsen created SPARK-6307: --- Summary: Executers fetches the same rdd-block 100's or 1000's of times Key: SPARK-6307 URL: https://issues.apache.org/jira/browse/SPARK-6307 Project: Spark

[jira] [Resolved] (SPARK-6294) PySpark task may hang while call take() on in Java/Scala

2015-03-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6294. -- Resolution: Fixed Fix Version/s: (was: 1.3.1) (was: 1.4.0)

[jira] [Commented] (SPARK-6303) Average should be in canBeCodeGened list

2015-03-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358399#comment-14358399 ] Apache Spark commented on SPARK-6303: - User 'viirya' has created a pull request for

[jira] [Created] (SPARK-6303) Average should be in canBeCodeGened list

2015-03-12 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6303: -- Summary: Average should be in canBeCodeGened list Key: SPARK-6303 URL: https://issues.apache.org/jira/browse/SPARK-6303 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-6304) Checkpointing doesn't retain driver port

2015-03-12 Thread Marius Soutier (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marius Soutier updated SPARK-6304: -- Description: In a check-pointed Streaming application running on a fixed driver port, the

[jira] [Commented] (SPARK-5692) Model import/export for Word2Vec

2015-03-12 Thread ANUPAM MEDIRATTA (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358573#comment-14358573 ] ANUPAM MEDIRATTA commented on SPARK-5692: - I tried working on it. I am new to

[jira] [Commented] (SPARK-5692) Model import/export for Word2Vec

2015-03-12 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14358583#comment-14358583 ] Manoj Kumar commented on SPARK-5692: I'm not sure about Eclipse, but I work just on

  1   2   >