[jira] [Commented] (SPARK-3276) Provide a API to specify whether the old files need to be ignored in file input text DStream

2015-01-20 Thread Jack Hu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283662#comment-14283662 ] Jack Hu commented on SPARK-3276: With some cases, the old files (older than current spark

[jira] [Created] (SPARK-5334) NullPointerException when getting files from S3 (hadoop 2.3+)

2015-01-20 Thread Kevin (Sangwoo) Kim (JIRA)
Kevin (Sangwoo) Kim created SPARK-5334: -- Summary: NullPointerException when getting files from S3 (hadoop 2.3+) Key: SPARK-5334 URL: https://issues.apache.org/jira/browse/SPARK-5334 Project:

[jira] [Created] (SPARK-5332) Efficient way to deal with ExecutorLost

2015-01-20 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5332: -- Summary: Efficient way to deal with ExecutorLost Key: SPARK-5332 URL: https://issues.apache.org/jira/browse/SPARK-5332 Project: Spark Issue Type:

[jira] [Updated] (SPARK-5333) [Mesos] MesosTaskLaunchData occurs BufferUnderflowException

2015-01-20 Thread Jongyoul Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jongyoul Lee updated SPARK-5333: Summary: [Mesos] MesosTaskLaunchData occurs BufferUnderflowException (was:

[jira] [Resolved] (SPARK-4803) Duplicate RegisterReceiver messages sent from ReceiverSupervisor

2015-01-20 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-4803. -- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Duplicate

[jira] [Commented] (SPARK-5311) EventLoggingListener throws exception if log directory does not exist

2015-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283732#comment-14283732 ] Apache Spark commented on SPARK-5311: - User 'ganonp' has created a pull request for

[jira] [Commented] (SPARK-5332) Efficient way to deal with ExecutorLost

2015-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283648#comment-14283648 ] Apache Spark commented on SPARK-5332: - User 'viirya' has created a pull request for

[jira] [Created] (SPARK-5333) [Mesos]MesosTaskLaunchData occurs BufferUnderflowException

2015-01-20 Thread Jongyoul Lee (JIRA)
Jongyoul Lee created SPARK-5333: --- Summary: [Mesos]MesosTaskLaunchData occurs BufferUnderflowException Key: SPARK-5333 URL: https://issues.apache.org/jira/browse/SPARK-5333 Project: Spark Issue

[jira] [Commented] (SPARK-5333) [Mesos] MesosTaskLaunchData occurs BufferUnderflowException

2015-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283669#comment-14283669 ] Apache Spark commented on SPARK-5333: - User 'jongyoul' has created a pull request for

[jira] [Updated] (SPARK-5333) [Mesos] MesosTaskLaunchData occurs BufferUnderflowException

2015-01-20 Thread Jongyoul Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jongyoul Lee updated SPARK-5333: Target Version/s: 1.3.0 (was: 1.3.0, 1.2.1) [Mesos] MesosTaskLaunchData occurs

[jira] [Comment Edited] (SPARK-4630) Dynamically determine optimal number of partitions

2015-01-20 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283809#comment-14283809 ] Lianhui Wang edited comment on SPARK-4630 at 1/20/15 1:28 PM: --

[jira] [Commented] (SPARK-4017) Progress bar in console

2015-01-20 Thread Paul Wolfe (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283848#comment-14283848 ] Paul Wolfe commented on SPARK-4017: --- Hello, was wondering if there is a way to turn

[jira] [Commented] (SPARK-4630) Dynamically determine optimal number of partitions

2015-01-20 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283809#comment-14283809 ] Lianhui Wang commented on SPARK-4630: - I think it is better that we use stage's output

[jira] [Commented] (SPARK-5328) Update PySpark MLlib NaiveBayes API to take model type parameter for Bernoulli fit

2015-01-20 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283887#comment-14283887 ] RJ Nowling commented on SPARK-5328: --- The Python API for Naive Bayes is located in

[jira] [Commented] (SPARK-4442) Move common unit test utilities into their own package / module

2015-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283941#comment-14283941 ] Sean Owen commented on SPARK-4442: -- [~matthewcornell] Normally I'd say you don't need to

[jira] [Commented] (SPARK-4017) Progress bar in console

2015-01-20 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284014#comment-14284014 ] Davies Liu commented on SPARK-4017: --- It can be turned of by spark.ui.showConsoleProgress

[jira] [Commented] (SPARK-4442) Move common unit test utilities into their own package / module

2015-01-20 Thread Matthew Cornell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283993#comment-14283993 ] Matthew Cornell commented on SPARK-4442: [~srowen] Thanks for the tip. I tried

[jira] [Created] (SPARK-5336) spark.executor.cores must not be less than spark.task.cpus

2015-01-20 Thread WangTaoTheTonic (JIRA)
WangTaoTheTonic created SPARK-5336: -- Summary: spark.executor.cores must not be less than spark.task.cpus Key: SPARK-5336 URL: https://issues.apache.org/jira/browse/SPARK-5336 Project: Spark

[jira] [Commented] (SPARK-4442) Move common unit test utilities into their own package / module

2015-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284054#comment-14284054 ] Sean Owen commented on SPARK-4442: -- Hm, no works for me. Maybe {{mvn -DskipTests

[jira] [Commented] (SPARK-5336) spark.executor.cores must not be less than spark.task.cpus

2015-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284067#comment-14284067 ] Apache Spark commented on SPARK-5336: - User 'WangTaoTheTonic' has created a pull

[jira] [Commented] (SPARK-4442) Move common unit test utilities into their own package / module

2015-01-20 Thread Matthew Cornell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284069#comment-14284069 ] Matthew Cornell commented on SPARK-4442: Thanks for sticking with me on this,

[jira] [Commented] (SPARK-4442) Move common unit test utilities into their own package / module

2015-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284013#comment-14284013 ] Sean Owen commented on SPARK-4442: -- [~matthewcornell] Oops, I missed again. The test JARs

[jira] [Commented] (SPARK-2890) Spark SQL should allow SELECT with duplicated columns

2015-01-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284012#comment-14284012 ] Yin Huai commented on SPARK-2890: - [~btiernay] Oh, seems the comments thread of this JIRA

[jira] [Created] (SPARK-5337) respect spark.task.cpus when launch executors

2015-01-20 Thread WangTaoTheTonic (JIRA)
WangTaoTheTonic created SPARK-5337: -- Summary: respect spark.task.cpus when launch executors Key: SPARK-5337 URL: https://issues.apache.org/jira/browse/SPARK-5337 Project: Spark Issue Type:

[jira] [Commented] (SPARK-4442) Move common unit test utilities into their own package / module

2015-01-20 Thread Matthew Cornell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284052#comment-14284052 ] Matthew Cornell commented on SPARK-4442: [~srowen] I might have misunderstood. I

[jira] [Comment Edited] (SPARK-2890) Spark SQL should allow SELECT with duplicated columns

2015-01-20 Thread Bob Tiernay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283958#comment-14283958 ] Bob Tiernay edited comment on SPARK-2890 at 1/20/15 4:02 PM: -

[jira] [Commented] (SPARK-2890) Spark SQL should allow SELECT with duplicated columns

2015-01-20 Thread Bob Tiernay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283958#comment-14283958 ] Bob Tiernay commented on SPARK-2890: What if you request {{SELECT x.*, y.*}}? If there

[jira] [Commented] (SPARK-5335) Destroying cluster in VPC with --delete-groups fails to remove security groups

2015-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283936#comment-14283936 ] Apache Spark commented on SPARK-5335: - User 'voukka' has created a pull request for

[jira] [Commented] (SPARK-4442) Move common unit test utilities into their own package / module

2015-01-20 Thread Matthew Cornell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283920#comment-14283920 ] Matthew Cornell commented on SPARK-4442: Please, as a new Spark (and Maven and

[jira] [Commented] (SPARK-750) LocalSparkContext should be included in Spark JAR

2015-01-20 Thread Matthew Cornell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283919#comment-14283919 ] Matthew Cornell commented on SPARK-750: --- Please, as a new Spark (and Maven and SBT)

[jira] [Resolved] (SPARK-5333) [Mesos] MesosTaskLaunchData occurs BufferUnderflowException

2015-01-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-5333. --- Resolution: Fixed [Mesos] MesosTaskLaunchData occurs BufferUnderflowException

[jira] [Updated] (SPARK-5287) Add defaultSizeOf to every data type

2015-01-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5287: Summary: Add defaultSizeOf to every data type (was: NativeType.defaultSizeOf should have default sizes of

[jira] [Updated] (SPARK-5287) Add defaultSizeOf to every data type

2015-01-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5287: Description: Right now, in NativeType, we defined some defaultSizes (it is actually missing some types) and

[jira] [Commented] (SPARK-4442) Move common unit test utilities into their own package / module

2015-01-20 Thread Matthew Cornell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284210#comment-14284210 ] Matthew Cornell commented on SPARK-4442: OK, progress: I cloned master and re-ran

[jira] [Updated] (SPARK-5333) [Mesos] MesosTaskLaunchData occurs BufferUnderflowException

2015-01-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5333: -- Fix Version/s: 1.3.0 [Mesos] MesosTaskLaunchData occurs BufferUnderflowException

[jira] [Commented] (SPARK-5333) [Mesos] MesosTaskLaunchData occurs BufferUnderflowException

2015-01-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284155#comment-14284155 ] Josh Rosen commented on SPARK-5333: --- Fixed by https://github.com/apache/spark/pull/4119

[jira] [Updated] (SPARK-5333) [Mesos] MesosTaskLaunchData occurs BufferUnderflowException

2015-01-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5333: -- Assignee: Jongyoul Lee [Mesos] MesosTaskLaunchData occurs BufferUnderflowException

[jira] [Commented] (SPARK-4442) Move common unit test utilities into their own package / module

2015-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284216#comment-14284216 ] Sean Owen commented on SPARK-4442: -- I don't think 1.3.0 should be different in this

[jira] [Resolved] (SPARK-750) LocalSparkContext should be included in Spark JAR

2015-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-750. - Resolution: Duplicate I'm going to boldly fold this into SPARK-4442 as a more general, related request to

[jira] [Commented] (SPARK-5333) [Mesos] MesosTaskLaunchData occurs BufferUnderflowException

2015-01-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284135#comment-14284135 ] Josh Rosen commented on SPARK-5333: --- Good catch. I've created a link to the SPARK-4014

[jira] [Commented] (SPARK-4014) TaskContext.attemptId returns taskId

2015-01-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284137#comment-14284137 ] Josh Rosen commented on SPARK-4014: --- Note to self: when backporting this to any

[jira] [Comment Edited] (SPARK-4014) TaskContext.attemptId returns taskId

2015-01-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284137#comment-14284137 ] Josh Rosen edited comment on SPARK-4014 at 1/20/15 6:12 PM:

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-01-20 Thread Manoj Samel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284335#comment-14284335 ] Manoj Samel commented on SPARK-2243: Is there a target release for this ? Support

[jira] [Commented] (SPARK-4630) Dynamically determine optimal number of partitions

2015-01-20 Thread Kostas Sakellis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284341#comment-14284341 ] Kostas Sakellis commented on SPARK-4630: I agree that this should be build not

[jira] [Updated] (SPARK-5019) Update GMM API to use MultivariateGaussian

2015-01-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5019: - Fix Version/s: 1.3.0 Update GMM API to use MultivariateGaussian

[jira] [Updated] (SPARK-5019) Update GMM API to use MultivariateGaussian

2015-01-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5019: - Priority: Minor (was: Blocker) Update GMM API to use MultivariateGaussian

[jira] [Updated] (SPARK-5019) Update GMM API to use MultivariateGaussian

2015-01-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5019: - Assignee: Travis Galoppo Update GMM API to use MultivariateGaussian

[jira] [Resolved] (SPARK-5019) Update GMM API to use MultivariateGaussian

2015-01-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5019. -- Resolution: Fixed Fixed by https://github.com/apache/spark/pull/4088 Update GMM API to use

[jira] [Resolved] (SPARK-5186) Vector.equals and Vector.hashCode are very inefficient and fail on SparseVectors with large size

2015-01-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5186. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3997

[jira] [Commented] (SPARK-5342) Allow long running Spark apps to run on secure YARN/HDFS

2015-01-20 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284908#comment-14284908 ] Hari Shreedharan commented on SPARK-5342: - [~pwendell], [~tgraves], [~vanzin],

[jira] [Updated] (SPARK-5342) Allow long running Spark apps to run on secure YARN/HDFS

2015-01-20 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hari Shreedharan updated SPARK-5342: Attachment: SparkYARN.pdf Design doc with proposed design. Original design doc with

[jira] [Updated] (SPARK-4923) Add Developer API to REPL to allow re-publishing the REPL jar

2015-01-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4923: --- Summary: Add Developer API to REPL to allow re-publishing the REPL jar (was: Maven build

[jira] [Updated] (SPARK-4923) Add Developer API to REPL to allow re-publishing the REPL jar

2015-01-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4923: --- Assignee: Chip Senkbeil Add Developer API to REPL to allow re-publishing the REPL jar

[jira] [Resolved] (SPARK-5323) Row shouldn't extend Seq

2015-01-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5323. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4115

[jira] [Updated] (SPARK-5289) Backport publishing of repl, yarn into branch-1.2

2015-01-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5289: --- Fix Version/s: 1.2.1 Backport publishing of repl, yarn into branch-1.2

[jira] [Commented] (SPARK-4259) Add Spectral Clustering Algorithm with Gaussian Similarity Function

2015-01-20 Thread Stephen Boesch (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284470#comment-14284470 ] Stephen Boesch commented on SPARK-4259: --- Xiangrui has provided valuable feedback.

[jira] [Created] (SPARK-5341) Support maven coordinates in spark-shell and spark-submit

2015-01-20 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-5341: -- Summary: Support maven coordinates in spark-shell and spark-submit Key: SPARK-5341 URL: https://issues.apache.org/jira/browse/SPARK-5341 Project: Spark Issue

[jira] [Updated] (SPARK-5186) Vector.equals and Vector.hashCode are very inefficient and fail on SparseVectors with large size

2015-01-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5186: - Assignee: yuhao yang Vector.equals and Vector.hashCode are very inefficient and fail on

[jira] [Resolved] (SPARK-5287) Add defaultSizeOf to every data type

2015-01-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5287. Resolution: Fixed Fix Version/s: 1.3.0 Add defaultSizeOf to every data type

[jira] [Updated] (SPARK-5287) Add defaultSizeOf to every data type

2015-01-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5287: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-5166 Add defaultSizeOf to every data type

[jira] [Updated] (SPARK-5287) Add defaultSizeOf to every data type

2015-01-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5287: --- Assignee: Yin Huai Add defaultSizeOf to every data type

[jira] [Commented] (SPARK-5342) Allow long running Spark apps to run on secure YARN/HDFS

2015-01-20 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284914#comment-14284914 ] Hari Shreedharan commented on SPARK-5342: - Thanks [~adhoot] for helping with

[jira] [Commented] (SPARK-5135) Add support for describe [extended] table to DDL in SQLContext

2015-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284915#comment-14284915 ] Apache Spark commented on SPARK-5135: - User 'rxin' has created a pull request for this

[jira] [Closed] (SPARK-4296) Throw Expression not in GROUP BY when using same expression in group by clause and select clause

2015-01-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust closed SPARK-4296. --- Resolution: Fixed Throw Expression not in GROUP BY when using same expression in group by

[jira] [Commented] (SPARK-3439) Add Canopy Clustering Algorithm

2015-01-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284584#comment-14284584 ] Xiangrui Meng commented on SPARK-3439: -- [~angellandros] Are you interested in

[jira] [Updated] (SPARK-3439) Add Canopy Clustering Algorithm

2015-01-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3439: - Assignee: Muhammad-Ali A'rabi Add Canopy Clustering Algorithm ---

[jira] [Created] (SPARK-5342) Allow long running Spark apps to run on secure YARN/HDFS

2015-01-20 Thread Hari Shreedharan (JIRA)
Hari Shreedharan created SPARK-5342: --- Summary: Allow long running Spark apps to run on secure YARN/HDFS Key: SPARK-5342 URL: https://issues.apache.org/jira/browse/SPARK-5342 Project: Spark

[jira] [Comment Edited] (SPARK-5342) Allow long running Spark apps to run on secure YARN/HDFS

2015-01-20 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284908#comment-14284908 ] Hari Shreedharan edited comment on SPARK-5342 at 1/21/15 12:46 AM:

[jira] [Commented] (SPARK-5275) pyspark.streaming is not included in assembly jar

2015-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284924#comment-14284924 ] Apache Spark commented on SPARK-5275: - User 'davies' has created a pull request for

[jira] [Updated] (SPARK-5294) Hide tables in AllStagePages for Active Stages, Completed Stages and Failed Stages when they are empty

2015-01-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5294: -- Assignee: Kousuke Saruta Hide tables in AllStagePages for Active Stages, Completed Stages and Failed

[jira] [Resolved] (SPARK-5294) Hide tables in AllStagePages for Active Stages, Completed Stages and Failed Stages when they are empty

2015-01-20 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-5294. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4083

[jira] [Resolved] (SPARK-4923) Add Developer API to REPL to allow re-publishing the REPL jar

2015-01-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4923. Resolution: Fixed Target Version/s: 1.3.0 (was: 1.3.0, 1.2.1) I updated the

[jira] [Commented] (SPARK-4259) Add Spectral Clustering Algorithm with Gaussian Similarity Function

2015-01-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284561#comment-14284561 ] Xiangrui Meng commented on SPARK-4259: -- Note: [~javadba]'s update is from an offline

[jira] [Commented] (SPARK-5144) spark-yarn module should be published

2015-01-20 Thread David McWhorter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284662#comment-14284662 ] David McWhorter commented on SPARK-5144: Similar problem here, building an

[jira] [Commented] (SPARK-3452) Maven build should skip publishing artifacts people shouldn't depend on

2015-01-20 Thread David McWhorter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284659#comment-14284659 ] David McWhorter commented on SPARK-3452: Same problem here -- if spark-yarn is not

[jira] [Commented] (SPARK-3789) Python bindings for GraphX

2015-01-20 Thread Ameet Talwalkar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284903#comment-14284903 ] Ameet Talwalkar commented on SPARK-3789: Great. I hope this can make it into 1.3.

[jira] [Commented] (SPARK-5337) respect spark.task.cpus when launch executors

2015-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284986#comment-14284986 ] Apache Spark commented on SPARK-5337: - User 'CodingCat' has created a pull request for

[jira] [Commented] (SPARK-2620) case class cannot be used as key for reduce

2015-01-20 Thread Derrick Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14285058#comment-14285058 ] Derrick Burns commented on SPARK-2620: -- Thanks for the info! It would seem to me

[jira] [Commented] (SPARK-2620) case class cannot be used as key for reduce

2015-01-20 Thread Tobias Schlatter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284969#comment-14284969 ] Tobias Schlatter commented on SPARK-2620: - I am currently looking into the various

[jira] [Created] (SPARK-5343) ShortestPaths traverses backwards

2015-01-20 Thread Michael Malak (JIRA)
Michael Malak created SPARK-5343: Summary: ShortestPaths traverses backwards Key: SPARK-5343 URL: https://issues.apache.org/jira/browse/SPARK-5343 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-5342) Allow long running Spark apps to run on secure YARN/HDFS

2015-01-20 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hari Shreedharan updated SPARK-5342: Attachment: SparkYARN.pdf Minor updates. Allow long running Spark apps to run on secure

[jira] [Updated] (SPARK-5342) Allow long running Spark apps to run on secure YARN/HDFS

2015-01-20 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hari Shreedharan updated SPARK-5342: Attachment: (was: SparkYARN.pdf) Allow long running Spark apps to run on secure

[jira] [Updated] (SPARK-5331) Spark workers can't find tachyon master as spark-ec2 doesn't set spark.tachyonStore.url

2015-01-20 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florian Verhein updated SPARK-5331: --- Component/s: EC2 Description: ps -ef | grep Tachyon shows Tachyon running on the master

[jira] [Commented] (SPARK-5262) coalesce should allow NullType and 1 another type in parameters

2015-01-20 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14285179#comment-14285179 ] Adrian Wang commented on SPARK-5262: Currently if you try coalesce in hivecontext, it

[jira] [Updated] (SPARK-4959) Attributes are case sensitive when using a select query from a projection

2015-01-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4959: Labels: backport-needed (was: ) Attributes are case sensitive when using a select query from a projection

[jira] [Resolved] (SPARK-5257) SparseVector indices must be non-negative

2015-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5257. -- Resolution: Won't Fix [~MechCoder] I will resolve as WontFix. With ~1100 open JIRAs unfortunately I

[jira] [Updated] (SPARK-4959) Attributes are case sensitive when using a select query from a projection

2015-01-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4959: --- Priority: Blocker (was: Critical) Attributes are case sensitive when using a select query

[jira] [Updated] (SPARK-4959) Attributes are case sensitive when using a select query from a projection

2015-01-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4959: --- Assignee: Cheng Hao Attributes are case sensitive when using a select query from a

[jira] [Updated] (SPARK-4959) Attributes are case sensitive when using a select query from a projection

2015-01-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4959: --- Fix Version/s: 1.3.0 Attributes are case sensitive when using a select query from a

[jira] [Updated] (SPARK-4959) Attributes are case sensitive when using a select query from a projection

2015-01-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4959: --- Fix Version/s: (was: 1.2.1) Attributes are case sensitive when using a select query from

[jira] [Commented] (SPARK-4959) Attributes are case sensitive when using a select query from a projection

2015-01-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14285262#comment-14285262 ] Patrick Wendell commented on SPARK-4959: Excuse my last comment, it was on the

[jira] [Commented] (SPARK-5344) HistoryServer cannot recognize that inprogress file was renamed to completed file

2015-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14285261#comment-14285261 ] Apache Spark commented on SPARK-5344: - User 'sarutak' has created a pull request for

[jira] [Comment Edited] (SPARK-4959) Attributes are case sensitive when using a select query from a projection

2015-01-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14285258#comment-14285258 ] Patrick Wendell edited comment on SPARK-4959 at 1/21/15 6:47 AM:

[jira] [Updated] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-01-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5021: - Assignee: Manoj Kumar GaussianMixtureEM should be faster for SparseVector input

[jira] [Resolved] (SPARK-5276) pyspark.streaming is not included in assembly jar

2015-01-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5276. Resolution: Duplicate pyspark.streaming is not included in assembly jar

[jira] [Commented] (SPARK-5262) coalesce should allow NullType and 1 another type in parameters

2015-01-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14285187#comment-14285187 ] Yin Huai commented on SPARK-5262: - OK i see. In HiveContext, we are still using Hive's

[jira] [Commented] (SPARK-4959) Attributes are case sensitive when using a select query from a projection

2015-01-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14285258#comment-14285258 ] Patrick Wendell commented on SPARK-4959: Note that in the 1.2 branch this was

[jira] [Updated] (SPARK-5344) HistoryServer cannot recognize that inprogress file was renamed to completed file

2015-01-20 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-5344: -- Description: FsHistoryProvider tries to update application status but if checkForLogs is called

[jira] [Updated] (SPARK-4959) Attributes are case sensitive when using a select query from a projection

2015-01-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4959: --- Fix Version/s: 1.2.1 Attributes are case sensitive when using a select query from a

[jira] [Updated] (SPARK-5344) HistoryServer cannot recognize that inprogress file was renamed to completed file

2015-01-20 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-5344: -- Description: FsHistoryProvider tries to updates application status but if checkForLogs is

  1   2   >