[jira] [Updated] (SPARK-2476) Have sbt-assembly include runtime dependencies in jar

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2476: --- Issue Type: Sub-task (was: Task) Parent: SPARK-2487 Have sbt-assembly include

[jira] [Updated] (SPARK-2467) Revert SparkBuild to publish-local to both .m2 and .ivy2.

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2467: --- Assignee: Takuya Ueshin Revert SparkBuild to publish-local to both .m2 and .ivy2.

[jira] [Updated] (SPARK-2487) Follow up from SBT build refactor

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2487: --- Description: This is to track follw up issues relating to SPARK-1776, which was a major

[jira] [Updated] (SPARK-2481) The environment variables SPARK_HISTORY_OPTS is covered in start-history-server.sh

2014-07-15 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-2481: --- Summary: The environment variables SPARK_HISTORY_OPTS is covered in start-history-server.sh (was:

[jira] [Updated] (SPARK-2481) The environment variables SPARK_HISTORY_OPTS is covered in start-history-server.sh

2014-07-15 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-2481: --- Description: In If we have the following code in the conf/spark-env.sh {{export

[jira] [Updated] (SPARK-2481) The environment variables SPARK_HISTORY_OPTS is covered in start-history-server.sh

2014-07-15 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-2481: --- Description: In If we have the following code in the conf/spark-env.sh {{export

[jira] [Updated] (SPARK-2481) The environment variables SPARK_HISTORY_OPTS is covered in start-history-server.sh

2014-07-15 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-2481: --- Description: If we have the following code in the conf/spark-env.sh {{export

[jira] [Resolved] (SPARK-2486) Utils.getCallSite can crash under JVMTI profilers

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2486. Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1413

[jira] [Updated] (SPARK-2486) Utils.getCallSite can crash under JVMTI profilers

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2486: --- Assignee: William Benton Utils.getCallSite can crash under JVMTI profilers

[jira] [Commented] (SPARK-2399) Add support for LZ4 compression

2014-07-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14061736#comment-14061736 ] Reynold Xin commented on SPARK-2399: Do you mind submitting a pull request for this? I

[jira] [Updated] (SPARK-2469) Lower shuffle compression buffer memory usage (replace LZF with Snappy for default compression codec)

2014-07-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2469: --- Summary: Lower shuffle compression buffer memory usage (replace LZF with Snappy for default

[jira] [Updated] (SPARK-1994) Aggregates return incorrect results on first execution

2014-07-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-1994: Fix Version/s: 1.1.0 1.0.1 Aggregates return incorrect results on

[jira] [Updated] (SPARK-2399) Add support for LZ4 compression

2014-07-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2399: --- Assignee: Reynold Xin Add support for LZ4 compression ---

[jira] [Created] (SPARK-2488) Model SerDe in MLlib

2014-07-15 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-2488: Summary: Model SerDe in MLlib Key: SPARK-2488 URL: https://issues.apache.org/jira/browse/SPARK-2488 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2014-07-15 Thread wangfei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14061767#comment-14061767 ] wangfei commented on SPARK-2243: What do you mean of but it's something we could support

[jira] [Updated] (SPARK-2399) Add support for LZ4 compression

2014-07-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2399: --- Target Version/s: 1.1.0 Add support for LZ4 compression ---

[jira] [Updated] (SPARK-2390) Files in .sparkStaging on HDFS cannot be deleted and wastes the space of HDFS

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2390: --- Assignee: Kousuke Saruta Files in .sparkStaging on HDFS cannot be deleted and wastes the

[jira] [Updated] (SPARK-2096) Correctly parse dot notations for accessing an array of structs

2014-07-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2096: Priority: Minor (was: Major) Correctly parse dot notations for accessing an array of

[jira] [Updated] (SPARK-2096) Correctly parse dot notations for accessing an array of structs

2014-07-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2096: Target Version/s: 1.1.0 Labels: starter (was: ) Correctly parse dot

[jira] [Commented] (SPARK-2354) BitSet Range Expanded when creating new one

2014-07-15 Thread Yijie Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14061787#comment-14061787 ] Yijie Shen commented on SPARK-2354: --- No issue yet, I just think it's a minor bug, as the

[jira] [Resolved] (SPARK-1984) Maven build requires SCALA_HOME to be set even though it's not needed

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-1984. Resolution: Fixed this was fixed a while ago, not sure why it's open Maven build

[jira] [Updated] (SPARK-2099) Report metrics for running tasks

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2099: --- Assignee: Sandy Ryza Report metrics for running tasks

[jira] [Updated] (SPARK-2099) Report metrics for running tasks

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2099: --- Target Version/s: 1.1.0 Report metrics for running tasks

[jira] [Updated] (SPARK-983) Support external sorting for RDD#sortByKey()

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-983: -- Component/s: Spark Core Priority: Critical (was: Major) Target Version/s:

[jira] [Updated] (SPARK-983) Support external sorting for RDD#sortByKey()

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-983: -- Assignee: (was: Madhu Siddalingaiah) Support external sorting for RDD#sortByKey()

[jira] [Updated] (SPARK-2099) Report TaskMetrics for running tasks

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2099: --- Summary: Report TaskMetrics for running tasks (was: Report metrics for running tasks)

[jira] [Commented] (SPARK-2138) The KMeans algorithm in the MLlib can lead to the Serialized Task size become bigger and bigger

2014-07-15 Thread DjvuLee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14061804#comment-14061804 ] DjvuLee commented on SPARK-2138: [~piotrszul] In my opinion, if your task size if bigger

[jira] [Commented] (SPARK-2138) The KMeans algorithm in the MLlib can lead to the Serialized Task size become bigger and bigger

2014-07-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14061837#comment-14061837 ] Xiangrui Meng commented on SPARK-2138: -- One possible reason is that this happens in

[jira] [Commented] (SPARK-2138) The KMeans algorithm in the MLlib can lead to the Serialized Task size become bigger and bigger

2014-07-15 Thread DjvuLee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14061843#comment-14061843 ] DjvuLee commented on SPARK-2138: Oh, So can we improve this better? The KMeans

[jira] [Commented] (SPARK-2138) The KMeans algorithm in the MLlib can lead to the Serialized Task size become bigger and bigger

2014-07-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14061848#comment-14061848 ] Xiangrui Meng commented on SPARK-2138: -- I'm not sure whether this is the real cause.

[jira] [Resolved] (SPARK-2399) Add support for LZ4 compression

2014-07-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-2399. Resolution: Fixed Fix Version/s: 1.1.0 Add support for LZ4 compression

[jira] [Updated] (SPARK-2477) Using appendBias for adding intercept in GeneralizedLinearAlgorithm

2014-07-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2477: - Assignee: DB Tsai Using appendBias for adding intercept in GeneralizedLinearAlgorithm

[jira] [Resolved] (SPARK-2477) Using appendBias for adding intercept in GeneralizedLinearAlgorithm

2014-07-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2477. -- Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1410

[jira] [Commented] (SPARK-2354) BitSet Range Expanded when creating new one

2014-07-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14061920#comment-14061920 ] Sean Owen commented on SPARK-2354: -- I think it should be closed if there is no existing

[jira] [Created] (SPARK-2489) Unsupported parquet datatype optional fixed_len_byte_array

2014-07-15 Thread Pei-Lun Lee (JIRA)
Pei-Lun Lee created SPARK-2489: -- Summary: Unsupported parquet datatype optional fixed_len_byte_array Key: SPARK-2489 URL: https://issues.apache.org/jira/browse/SPARK-2489 Project: Spark Issue

[jira] [Created] (SPARK-2490) StackOverflowError when RDD dependencies are too long

2014-07-15 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-2490: -- Summary: StackOverflowError when RDD dependencies are too long Key: SPARK-2490 URL: https://issues.apache.org/jira/browse/SPARK-2490 Project: Spark

[jira] [Commented] (SPARK-2459) the user should be able to configure the resources used by JDBC server

2014-07-15 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14061981#comment-14061981 ] Nan Zhu commented on SPARK-2459: yeah, spark-submit may solve the problem... then we need

[jira] [Created] (SPARK-2491) When an exception is thrown,the executor is not properly stopped,

2014-07-15 Thread Guoqiang Li (JIRA)
Guoqiang Li created SPARK-2491: -- Summary: When an exception is thrown,the executor is not properly stopped, Key: SPARK-2491 URL: https://issues.apache.org/jira/browse/SPARK-2491 Project: Spark

[jira] [Commented] (SPARK-2138) The KMeans algorithm in the MLlib can lead to the Serialized Task size become bigger and bigger

2014-07-15 Thread DjvuLee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062026#comment-14062026 ] DjvuLee commented on SPARK-2138: In my experiment, I set the akka.frameSize=200, my data

[jira] [Commented] (SPARK-2492) KafkaReceiver minor changes to align with Kafka 0.8

2014-07-15 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062053#comment-14062053 ] Saisai Shao commented on SPARK-2492: PR submitted:

[jira] [Commented] (SPARK-2356) Exception: Could not locate executable null\bin\winutils.exe in the Hadoop

2014-07-15 Thread Kostiantyn Kudriavtsev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062162#comment-14062162 ] Kostiantyn Kudriavtsev commented on SPARK-2356: --- No Sean, check stack trace

[jira] [Commented] (SPARK-2138) The KMeans algorithm in the MLlib can lead to the Serialized Task size become bigger and bigger

2014-07-15 Thread DjvuLee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062163#comment-14062163 ] DjvuLee commented on SPARK-2138: oh, I am a little sorry that I write some mistaken in my

[jira] [Commented] (SPARK-2356) Exception: Could not locate executable null\bin\winutils.exe in the Hadoop

2014-07-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062182#comment-14062182 ] Sean Owen commented on SPARK-2356: -- Yeah you are right, on a closer look, this is coming

[jira] [Resolved] (SPARK-2222) Add multiclass evaluation metrics

2014-07-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-. -- Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1155

[jira] [Updated] (SPARK-2222) Add multiclass evaluation metrics

2014-07-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-: - Assignee: Alexander Ulanov Add multiclass evaluation metrics -

[jira] [Resolved] (SPARK-2485) Usage of HiveClient not threadsafe.

2014-07-15 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron Davidson resolved SPARK-2485. --- Resolution: Fixed https://github.com/apache/spark/pull/1412 Usage of HiveClient not

[jira] [Commented] (SPARK-1486) Support multi-model training in MLlib

2014-07-15 Thread Erik Erlandson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062339#comment-14062339 ] Erik Erlandson commented on SPARK-1486: --- Does the dev on this issue effectively

[jira] [Commented] (SPARK-2314) RDD actions are only overridden in Scala, not java or python

2014-07-15 Thread Aaron (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062384#comment-14062384 ] Aaron commented on SPARK-2314: -- I added a PR for the java api:

[jira] [Resolved] (SPARK-2480) Resolve sbt warnings NOTE: SPARK_YARN is deprecated, please use -Pyarn flag

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2480. Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1404

[jira] [Created] (SPARK-2493) SBT gen-idea doesn't generate correct Intellij project

2014-07-15 Thread DB Tsai (JIRA)
DB Tsai created SPARK-2493: -- Summary: SBT gen-idea doesn't generate correct Intellij project Key: SPARK-2493 URL: https://issues.apache.org/jira/browse/SPARK-2493 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-2314) RDD actions are only overridden in Scala, not java or python

2014-07-15 Thread Aaron (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062384#comment-14062384 ] Aaron edited comment on SPARK-2314 at 7/15/14 5:54 PM: --- I added a PR

[jira] [Updated] (SPARK-2479) Comparing floating-point numbers using relative error in UnitTests

2014-07-15 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-2479: --- Description: Floating point math is not exact, and most floating-point numbers end up being slightly

[jira] [Updated] (SPARK-2464) Twitter Receiver does not stop correctly when streamingContext.stop is called

2014-07-15 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2464: - Fix Version/s: 1.0.2 1.1.0 Twitter Receiver does not stop correctly when

[jira] [Updated] (SPARK-2377) Create a Python API for Spark Streaming

2014-07-15 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2377: - Fix Version/s: 1.1.0 Create a Python API for Spark Streaming

[jira] [Updated] (SPARK-2377) Create a Python API for Spark Streaming

2014-07-15 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2377: - Affects Version/s: (was: 1.0.0) Create a Python API for Spark Streaming

[jira] [Commented] (SPARK-2463) Creating multiple StreamingContexts from shell generates duplicate Streaming tabs in UI

2014-07-15 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062441#comment-14062441 ] Tathagata Das commented on SPARK-2463: -- [~andrewor14] Any insights on how much work

[jira] [Commented] (SPARK-2463) Creating multiple StreamingContexts from shell generates duplicate Streaming tabs in UI

2014-07-15 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062439#comment-14062439 ] Tathagata Das commented on SPARK-2463: -- I knew this issue. Its not a trivial change

[jira] [Updated] (SPARK-1645) Improve Spark Streaming compatibility with Flume

2014-07-15 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-1645: - Fix Version/s: (was: 1.1.0) Improve Spark Streaming compatibility with Flume

[jira] [Commented] (SPARK-2463) Creating multiple StreamingContexts from shell generates duplicate Streaming tabs in UI

2014-07-15 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062458#comment-14062458 ] Nicholas Chammas commented on SPARK-2463: - Ah, my apologies about the Trivial

[jira] [Commented] (SPARK-2488) Model SerDe in MLlib

2014-07-15 Thread Prachi Sharma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062473#comment-14062473 ] Prachi Sharma commented on SPARK-2488: -- [~mengxr] I want to work on this task, Can

[jira] [Commented] (SPARK-2488) Model SerDe in MLlib

2014-07-15 Thread Erik Erlandson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062486#comment-14062486 ] Erik Erlandson commented on SPARK-2488: --- Related, possible duplicate: SPARK-1406

[jira] [Resolved] (SPARK-1291) Link the spark UI to RM ui in yarn-client mode

2014-07-15 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-1291. -- Resolution: Fixed Fix Version/s: 1.1.0 Target Version/s: 1.1.0 Link the

[jira] [Created] (SPARK-2494) Hash of None is different cross machines in CPython

2014-07-15 Thread Davies Liu (JIRA)
Davies Liu created SPARK-2494: - Summary: Hash of None is different cross machines in CPython Key: SPARK-2494 URL: https://issues.apache.org/jira/browse/SPARK-2494 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-2495) Ability to re-create ML models

2014-07-15 Thread Alexander Albul (JIRA)
Alexander Albul created SPARK-2495: -- Summary: Ability to re-create ML models Key: SPARK-2495 URL: https://issues.apache.org/jira/browse/SPARK-2495 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-2471) SBT assembly does not include runtime dependencies

2014-07-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2471. -- Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1402

[jira] [Updated] (SPARK-2471) SBT assembly does not include runtime dependencies

2014-07-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2471: - Assignee: Xiangrui Meng SBT assembly does not include runtime dependencies

[jira] [Resolved] (SPARK-2483) HiveQL parses accessing struct fields in an array incorrectly.

2014-07-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2483. - Resolution: Fixed Fix Version/s: 1.0.2 1.1.0

[jira] [Resolved] (SPARK-2474) For a registered table in OverrideCatalog, the Analyzer failed to resolve references in the format of tableName.fieldName

2014-07-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2474. - Resolution: Fixed Fix Version/s: 1.1.0 I'm going to mark this as resolved.

[jira] [Resolved] (SPARK-2407) Implement SQL SUBSTR() directly in Catalyst

2014-07-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2407. - Resolution: Fixed Fix Version/s: 1.0.2 1.1.0 Implement SQL

[jira] [Updated] (SPARK-2045) Sort-based shuffle implementation

2014-07-15 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2045: - Attachment: (was: Sort-basedshuffledesign.pdf) Sort-based shuffle implementation

[jira] [Updated] (SPARK-2045) Sort-based shuffle implementation

2014-07-15 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2045: - Attachment: Sort-basedshuffledesign.pdf I've posted a design doc for a simple version of this.

[jira] [Updated] (SPARK-2045) Sort-based shuffle implementation

2014-07-15 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2045: - Attachment: Sort-basedshuffledesign.pdf Oops, attached the wrong file before. Here's the right

[jira] [Created] (SPARK-2496) Compression streams should write its codec info to the stream

2014-07-15 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-2496: -- Summary: Compression streams should write its codec info to the stream Key: SPARK-2496 URL: https://issues.apache.org/jira/browse/SPARK-2496 Project: Spark

[jira] [Updated] (SPARK-2496) Compression streams should write its codec info to the stream

2014-07-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2496: --- Component/s: Spark Core Shuffle Compression streams should write its codec info to

[jira] [Commented] (SPARK-1667) Jobs never finish successfully once bucket file missing occurred

2014-07-15 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062774#comment-14062774 ] Kousuke Saruta commented on SPARK-1667: --- Can anyone review the patch or have

[jira] [Updated] (SPARK-2374) Pattern matching of paths in graphs

2014-07-15 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave updated SPARK-2374: -- Target Version/s: (was: 1.1.0) Pattern matching of paths in graphs

[jira] [Updated] (SPARK-1667) Jobs never finish successfully once bucket file missing occurred

2014-07-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-1667: --- Target Version/s: 1.1.0, 1.0.2 Jobs never finish successfully once bucket file missing occurred

[jira] [Commented] (SPARK-2347) Graph object can not be set to StorageLevel.MEMORY_ONLY_SER

2014-07-15 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062783#comment-14062783 ] Ankur Dave commented on SPARK-2347: --- SPARK-2455 should have fixed this, so I'm closing

[jira] [Commented] (SPARK-1667) Jobs never finish successfully once bucket file missing occurred

2014-07-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062785#comment-14062785 ] Reynold Xin commented on SPARK-1667: Thanks. Just took a look and left some comments.

[jira] [Resolved] (SPARK-2347) Graph object can not be set to StorageLevel.MEMORY_ONLY_SER

2014-07-15 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave resolved SPARK-2347. --- Resolution: Fixed Fix Version/s: 1.1.0 Assignee: Ankur Dave Graph object can not be

[jira] [Commented] (SPARK-1955) VertexRDD can incorrectly assume index sharing

2014-07-15 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062792#comment-14062792 ] Ankur Dave commented on SPARK-1955: --- IndexedRDD fixes this issue. VertexRDD can

[jira] [Created] (SPARK-2497) @DeveloperApi tag does not suppress MIMA warnings

2014-07-15 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-2497: Summary: @DeveloperApi tag does not suppress MIMA warnings Key: SPARK-2497 URL: https://issues.apache.org/jira/browse/SPARK-2497 Project: Spark

[jira] [Commented] (SPARK-2458) Make failed application log visible on History Server

2014-07-15 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062817#comment-14062817 ] Masayoshi TSUZUKI commented on SPARK-2458: -- I think this function should be able

[jira] [Updated] (SPARK-2497) @DeveloperApi tag does not suppress MIMA warnings

2014-07-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-2497: - Issue Type: Sub-task (was: Bug) Parent: SPARK-2487 @DeveloperApi tag does not

[jira] [Commented] (SPARK-2497) @DeveloperApi tag does not suppress MIMA warnings

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062848#comment-14062848 ] Patrick Wendell commented on SPARK-2497: [~prashant_] - I think the build change

[jira] [Updated] (SPARK-2459) the user should be able to configure the resources used by JDBC server

2014-07-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2459: Assignee: Cheng Lian the user should be able to configure the resources used by JDBC

[jira] [Updated] (SPARK-2498) Concurrent initialization of various DataType objects causes exceptions

2014-07-15 Thread Zongheng Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zongheng Yang updated SPARK-2498: - Priority: Critical (was: Major) Concurrent initialization of various DataType objects causes

[jira] [Created] (SPARK-2498) Concurrent initialization of various DataType objects causes exceptions

2014-07-15 Thread Zongheng Yang (JIRA)
Zongheng Yang created SPARK-2498: Summary: Concurrent initialization of various DataType objects causes exceptions Key: SPARK-2498 URL: https://issues.apache.org/jira/browse/SPARK-2498 Project: Spark

[jira] [Commented] (SPARK-2498) Concurrent initialization of various DataType objects causes exceptions

2014-07-15 Thread Zongheng Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062858#comment-14062858 ] Zongheng Yang commented on SPARK-2498: -- Github PR:

[jira] [Commented] (SPARK-2481) The environment variables SPARK_HISTORY_OPTS is covered in start-history-server.sh

2014-07-15 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062870#comment-14062870 ] Masayoshi TSUZUKI commented on SPARK-2481: -- spark-env.sh is loaded after these

[jira] [Created] (SPARK-2499) spark-default.conf parser doesn't ignore inline comments

2014-07-15 Thread Travis Thompson (JIRA)
Travis Thompson created SPARK-2499: -- Summary: spark-default.conf parser doesn't ignore inline comments Key: SPARK-2499 URL: https://issues.apache.org/jira/browse/SPARK-2499 Project: Spark

[jira] [Commented] (SPARK-2500) Move the loginfo for registering BlockManager to BlockManagerMasterActor.register method

2014-07-15 Thread Henry Saputra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062904#comment-14062904 ] Henry Saputra commented on SPARK-2500: -- PR at

[jira] [Commented] (SPARK-2500) Move the loginfo for registering BlockManager to BlockManagerMasterActor.register method

2014-07-15 Thread Henry Saputra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062902#comment-14062902 ] Henry Saputra commented on SPARK-2500: -- Working on this one Move the loginfo for

[jira] [Created] (SPARK-2500) Move the loginfo for registering BlockManager to BlockManagerMasterActor.register method

2014-07-15 Thread Henry Saputra (JIRA)
Henry Saputra created SPARK-2500: Summary: Move the loginfo for registering BlockManager to BlockManagerMasterActor.register method Key: SPARK-2500 URL: https://issues.apache.org/jira/browse/SPARK-2500

[jira] [Commented] (SPARK-2282) PySpark crashes if too many tasks complete quickly

2014-07-15 Thread Ken Carlile (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062916#comment-14062916 ] Ken Carlile commented on SPARK-2282: We may be running into this issue on our cluster.

[jira] [Commented] (SPARK-2499) spark-default.conf parser doesn't ignore inline comments

2014-07-15 Thread Mark Wagner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062912#comment-14062912 ] Mark Wagner commented on SPARK-2499: Travis, spark-default.conf is loaded as a

[jira] [Comment Edited] (SPARK-2282) PySpark crashes if too many tasks complete quickly

2014-07-15 Thread Ken Carlile (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062916#comment-14062916 ] Ken Carlile edited comment on SPARK-2282 at 7/16/14 12:14 AM: --

[jira] [Comment Edited] (SPARK-2282) PySpark crashes if too many tasks complete quickly

2014-07-15 Thread Ken Carlile (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14062916#comment-14062916 ] Ken Carlile edited comment on SPARK-2282 at 7/16/14 12:17 AM: --

[jira] [Updated] (SPARK-2298) Show stage attempt in UI

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2298: --- Assignee: Masayoshi TSUZUKI (was: Andrew Or) Show stage attempt in UI

[jira] [Updated] (SPARK-2501) Handle stage submissions properly in the UI

2014-07-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2501: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-2015 Handle stage submissions

  1   2   >