[jira] [Updated] (SPARK-4441) Close Tachyon client when TachyonBlockManager is shut down

2014-11-18 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4441: --- Assignee: shimingfei Close Tachyon client when TachyonBlockManager is shut down

[jira] [Commented] (SPARK-2321) Design a proper progress reporting event listener API

2014-11-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14218502#comment-14218502 ] Patrick Wendell commented on SPARK-2321: Currently this a programmatic API for

[jira] [Resolved] (SPARK-3962) Mark spark dependency as provided in external libraries

2014-11-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3962. Resolution: Fixed Fix Version/s: 1.2.0 Mark spark dependency as provided in

[jira] [Resolved] (SPARK-4429) Build for Scala 2.11 using sbt fails.

2014-11-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4429. Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Takuya Ueshin

[jira] [Updated] (SPARK-4501) Create build/mvn to automatically download maven/zinc/scalac

2014-11-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4501: --- Assignee: Prashant Sharma Create build/mvn to automatically download maven/zinc/scalac

[jira] [Updated] (SPARK-4376) Put external modules behind build profiles

2014-11-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4376: --- Target Version/s: 1.3.0 (was: 1.2.0) Put external modules behind build profiles

[jira] [Updated] (SPARK-2630) Input data size of CoalescedRDD is incorrect

2014-11-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2630: --- Fix Version/s: (was: 1.2.0) Input data size of CoalescedRDD is incorrect

[jira] [Updated] (SPARK-4384) Too many open files during sort in pyspark

2014-11-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4384: --- Priority: Blocker (was: Critical) Too many open files during sort in pyspark

[jira] [Updated] (SPARK-4516) Race condition in netty

2014-11-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4516: --- Priority: Critical (was: Major) Race condition in netty ---

[jira] [Updated] (SPARK-4516) Race condition in netty

2014-11-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4516: --- Target Version/s: 1.2.0 Race condition in netty ---

[jira] [Updated] (SPARK-2630) Input data size of CoalescedRDD is incorrect

2014-11-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2630: --- Target Version/s: 1.3.0 (was: 1.2.0) Input data size of CoalescedRDD is incorrect

[jira] [Updated] (SPARK-2630) Input data size of CoalescedRDD is incorrect

2014-11-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2630: --- Target Version/s: 1.3.0, 1.2.1 (was: 1.3.0) Input data size of CoalescedRDD is incorrect

[jira] [Resolved] (SPARK-4128) Create instructions on fully building Spark in Intellij

2014-11-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4128. Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Patrick Wendell I updated

[jira] [Commented] (SPARK-4479) Avoid unnecessary defensive copies when Sort based shuffle is on

2014-11-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14220229#comment-14220229 ] Patrick Wendell commented on SPARK-4479: I did some research into this. In the

[jira] [Updated] (SPARK-4525) MesosSchedulerBackend.resourceOffers cannot decline unused offers from acceptedOffers

2014-11-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4525: --- Assignee: Jongyoul Lee MesosSchedulerBackend.resourceOffers cannot decline unused offers

[jira] [Updated] (SPARK-4516) Netty off-heap memory use causes executors to be killed by OS

2014-11-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4516: --- Summary: Netty off-heap memory use causes executors to be killed by OS (was: Lost task with

[jira] [Commented] (SPARK-4516) Netty off-heap memory use causes executors to be killed by OS

2014-11-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14220352#comment-14220352 ] Patrick Wendell commented on SPARK-4516: [~hector.yee] I updated the title, let me

[jira] [Commented] (SPARK-4515) OOM/GC errors with sort-based shuffle

2014-11-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14220358#comment-14220358 ] Patrick Wendell commented on SPARK-4515: I can see that you are running LZF

[jira] [Created] (SPARK-4532) make-distribution in Spark 1.2 does not correctly detect whether Hive is enabled

2014-11-20 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-4532: -- Summary: make-distribution in Spark 1.2 does not correctly detect whether Hive is enabled Key: SPARK-4532 URL: https://issues.apache.org/jira/browse/SPARK-4532

[jira] [Updated] (SPARK-4525) MesosSchedulerBackend.resourceOffers cannot decline unused offers from acceptedOffers

2014-11-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4525: --- Fix Version/s: (was: 1.3.0) (was: 1.2.0)

[jira] [Resolved] (SPARK-4532) make-distribution in Spark 1.2 does not correctly detect whether Hive is enabled

2014-11-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4532. Resolution: Fixed Fix Version/s: 1.2.0 make-distribution in Spark 1.2 does not

[jira] [Commented] (SPARK-4541) Add --version to spark-submit

2014-11-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14221475#comment-14221475 ] Patrick Wendell commented on SPARK-4541: This is a good idea. Add --version to

[jira] [Commented] (SPARK-4516) Netty off-heap memory use causes executors to be killed by OS

2014-11-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14221519#comment-14221519 ] Patrick Wendell commented on SPARK-4516: Okay sounds good. Does changing the netty

[jira] [Commented] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2014-11-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14221690#comment-14221690 ] Patrick Wendell commented on SPARK-4550: Not an expert on the internals of this

[jira] [Commented] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-11-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14221699#comment-14221699 ] Patrick Wendell commented on SPARK-3633: [~nravi] resolved this because his

[jira] [Commented] (SPARK-4516) Netty off-heap memory use causes executors to be killed by OS

2014-11-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222172#comment-14222172 ] Patrick Wendell commented on SPARK-4516: Okay then I think this is just a

[jira] [Updated] (SPARK-2143) Display Spark version on Driver web page

2014-11-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2143: --- Priority: Critical (was: Major) Display Spark version on Driver web page

[jira] [Resolved] (SPARK-4542) Post nightly releases

2014-11-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4542. Resolution: Duplicate Post nightly releases - Key:

[jira] [Updated] (SPARK-1517) Publish nightly snapshots of documentation, maven artifacts, and binary builds

2014-11-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1517: --- Fix Version/s: (was: 1.2.0) Publish nightly snapshots of documentation, maven artifacts,

[jira] [Updated] (SPARK-1517) Publish nightly snapshots of documentation, maven artifacts, and binary builds

2014-11-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1517: --- Priority: Critical (was: Major) Publish nightly snapshots of documentation, maven

[jira] [Updated] (SPARK-1517) Publish nightly snapshots of documentation, maven artifacts, and binary builds

2014-11-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1517: --- Target Version/s: 1.3.0 Publish nightly snapshots of documentation, maven artifacts, and

[jira] [Updated] (SPARK-4507) PR merge script should support closing multiple JIRA tickets

2014-11-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4507: --- Labels: starter (was: ) PR merge script should support closing multiple JIRA tickets

[jira] [Updated] (SPARK-4377) ZooKeeperPersistenceEngine: java.lang.IllegalStateException: Trying to deserialize a serialized ActorRef without an ActorSystem in scope.

2014-11-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4377: --- Fix Version/s: 1.3.0 ZooKeeperPersistenceEngine: java.lang.IllegalStateException: Trying to

[jira] [Commented] (SPARK-4556) binary distribution assembly can't run in local mode

2014-11-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1415#comment-1415 ] Patrick Wendell commented on SPARK-4556: Checkout make-distribution.sh rather than

[jira] [Comment Edited] (SPARK-4556) binary distribution assembly can't run in local mode

2014-11-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1415#comment-1415 ] Patrick Wendell edited comment on SPARK-4556 at 11/22/14 10:17 PM:

[jira] [Resolved] (SPARK-4377) ZooKeeperPersistenceEngine: java.lang.IllegalStateException: Trying to deserialize a serialized ActorRef without an ActorSystem in scope.

2014-11-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4377. Resolution: Fixed ZooKeeperPersistenceEngine: java.lang.IllegalStateException: Trying to

[jira] [Updated] (SPARK-4377) ZooKeeperPersistenceEngine: java.lang.IllegalStateException: Trying to deserialize a serialized ActorRef without an ActorSystem in scope.

2014-11-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4377: --- Target Version/s: (was: 1.2.0) ZooKeeperPersistenceEngine:

[jira] [Updated] (SPARK-4377) ZooKeeperPersistenceEngine: java.lang.IllegalStateException: Trying to deserialize a serialized ActorRef without an ActorSystem in scope.

2014-11-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4377: --- Affects Version/s: (was: 1.2.0) 1.3.0 ZooKeeperPersistenceEngine:

[jira] [Updated] (SPARK-4258) NPE with new Parquet Filters

2014-11-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4258: --- Priority: Critical (was: Blocker) NPE with new Parquet Filters

[jira] [Commented] (SPARK-4258) NPE with new Parquet Filters

2014-11-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222551#comment-14222551 ] Patrick Wendell commented on SPARK-4258: After discussion with [~lian cheng] I am

[jira] [Updated] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2014-11-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4105: --- Target Version/s: 1.2.1 (was: 1.2.0) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle

[jira] [Updated] (SPARK-3958) Possible stream-corruption issues in TorrentBroadcast

2014-11-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3958: --- Target Version/s: 1.2.1 (was: 1.2.0) Possible stream-corruption issues in TorrentBroadcast

[jira] [Created] (SPARK-4568) Publish release candidates under $VERSION-RCX instead of $VERSION

2014-11-23 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-4568: -- Summary: Publish release candidates under $VERSION-RCX instead of $VERSION Key: SPARK-4568 URL: https://issues.apache.org/jira/browse/SPARK-4568 Project: Spark

[jira] [Updated] (SPARK-4568) Publish release candidates under $VERSION-RCX instead of $VERSION

2014-11-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4568: --- Issue Type: Improvement (was: Bug) Publish release candidates under $VERSION-RCX instead of

[jira] [Updated] (SPARK-3628) Don't apply accumulator updates multiple times for tasks in result stages

2014-11-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3628: --- Target Version/s: 1.1.1, 0.9.3, 1.0.3, 1.2.1 (was: 1.1.1, 1.2.0, 0.9.3, 1.0.3) Don't apply

[jira] [Commented] (SPARK-3628) Don't apply accumulator updates multiple times for tasks in result stages

2014-11-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222672#comment-14222672 ] Patrick Wendell commented on SPARK-3628: I took a quick look at the current patch

[jira] [Updated] (SPARK-4548) Python broadcast perf regression from Spark 1.1

2014-11-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4548: --- Summary: Python broadcast perf regression from Spark 1.1 (was: Python broadcast is very

[jira] [Updated] (SPARK-4548) Python broadcast is very slow

2014-11-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4548: --- Priority: Blocker (was: Major) Python broadcast is very slow -

[jira] [Updated] (SPARK-4562) serialization in MLlib is slow

2014-11-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4562: --- Priority: Blocker (was: Major) serialization in MLlib is slow

[jira] [Updated] (SPARK-4562) GLM testing time regressions from Spark 1.1

2014-11-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4562: --- Summary: GLM testing time regressions from Spark 1.1 (was: serialization in MLlib is slow)

[jira] [Updated] (SPARK-4567) Make SparkJobInfo and SparkStageInfo serializable

2014-11-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4567: --- Fix Version/s: (was: 1.2.0) Make SparkJobInfo and SparkStageInfo serializable

[jira] [Updated] (SPARK-3628) Don't apply accumulator updates multiple times for tasks in result stages

2014-11-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3628: --- Component/s: Spark Core Don't apply accumulator updates multiple times for tasks in result

[jira] [Commented] (SPARK-4567) Make SparkJobInfo and SparkStageInfo serializable

2014-11-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222729#comment-14222729 ] Patrick Wendell commented on SPARK-4567: [~xuefuz] please don't set the FixVersion

[jira] [Updated] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3633: --- Fix Version/s: 1.2.0 1.1.1 Fetches failure observed after SPARK-2711

[jira] [Updated] (SPARK-4385) DataSource DDL Parser can't handle table names with '_'

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4385: --- Fix Version/s: 1.2.0 DataSource DDL Parser can't handle table names with '_'

[jira] [Updated] (SPARK-4385) DataSource DDL Parser can't handle table names with '_'

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4385: --- Fix Version/s: (was: 1.2.0) DataSource DDL Parser can't handle table names with '_'

[jira] [Updated] (SPARK-3615) Kafka test should not hard code Zookeeper port

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3615: --- Fix Version/s: 1.2.0 Kafka test should not hard code Zookeeper port

[jira] [Updated] (SPARK-3686) flume.SparkSinkSuite.Success is flaky

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3686: --- Fix Version/s: 1.2.0 flume.SparkSinkSuite.Success is flaky

[jira] [Updated] (SPARK-4264) SQL HashJoin induces refCnt = 0 error in ShuffleBlockFetcherIterator

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4264: --- Fix Version/s: 1.2.0 SQL HashJoin induces refCnt = 0 error in ShuffleBlockFetcherIterator

[jira] [Updated] (SPARK-4468) Wrong Parquet filters are created for all inequality predicates with literals on the left hand side

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4468: --- Fix Version/s: 1.2.0 Wrong Parquet filters are created for all inequality predicates with

[jira] [Updated] (SPARK-1860) Standalone Worker cleanup should not clean up running executors

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1860: --- Fix Version/s: 1.2.0 Standalone Worker cleanup should not clean up running executors

[jira] [Reopened] (SPARK-4515) OOM/GC errors with sort-based shuffle

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-4515: OOM/GC errors with sort-based shuffle -

[jira] [Updated] (SPARK-3452) Maven build should skip publishing artifacts people shouldn't depend on

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3452: --- Fix Version/s: 1.2.0 Maven build should skip publishing artifacts people shouldn't depend on

[jira] [Resolved] (SPARK-4515) OOM/GC errors with sort-based shuffle

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4515. Resolution: Duplicate OOM/GC errors with sort-based shuffle

[jira] [Updated] (SPARK-4266) Avoid expensive JavaScript for StagePages with huge numbers of tasks

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4266: --- Affects Version/s: 1.2.0 Avoid expensive JavaScript for StagePages with huge numbers of

[jira] [Resolved] (SPARK-4145) Create jobs overview and job details pages on the web UI

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4145. Resolution: Fixed Fix Version/s: 1.2.0 Create jobs overview and job details pages

[jira] [Updated] (SPARK-4266) Avoid expensive JavaScript for StagePages with huge numbers of tasks

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4266: --- Priority: Blocker (was: Critical) Avoid expensive JavaScript for StagePages with huge

[jira] [Updated] (SPARK-4548) Python broadcast perf regression from Spark 1.1

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4548: --- Assignee: Davies Liu Python broadcast perf regression from Spark 1.1

[jira] [Assigned] (SPARK-4196) Streaming + checkpointing + saveAsNewAPIHadoopFiles = NotSerializableException for Hadoop Configuration

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reassigned SPARK-4196: -- Assignee: Patrick Wendell Streaming + checkpointing + saveAsNewAPIHadoopFiles =

[jira] [Updated] (SPARK-4196) Streaming + checkpointing + saveAsNewAPIHadoopFiles = NotSerializableException for Hadoop Configuration

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4196: --- Assignee: Tathagata Das (was: Patrick Wendell) Streaming + checkpointing +

[jira] [Resolved] (SPARK-4578) Row.asDict() should keep the type of values

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4578. Resolution: Fixed Fix Version/s: 1.2.0 Thanks davies I've resolved this.

[jira] [Updated] (SPARK-4525) MesosSchedulerBackend.resourceOffers cannot decline unused offers from acceptedOffers

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4525: --- Target Version/s: 1.2.0 (was: 1.2.0, 1.3.0) MesosSchedulerBackend.resourceOffers cannot

[jira] [Resolved] (SPARK-4525) MesosSchedulerBackend.resourceOffers cannot decline unused offers from acceptedOffers

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4525. Resolution: Fixed MesosSchedulerBackend.resourceOffers cannot decline unused offers from

[jira] [Updated] (SPARK-4525) MesosSchedulerBackend.resourceOffers cannot decline unused offers from acceptedOffers

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4525: --- Fix Version/s: 1.2.0 MesosSchedulerBackend.resourceOffers cannot decline unused offers from

[jira] [Updated] (SPARK-1476) 2GB limit in spark for blocks

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1476: --- Target Version/s: (was: 1.2.0) 2GB limit in spark for blocks

[jira] [Updated] (SPARK-4598) Paginate stage page to avoid OOM with 100,000 tasks

2014-11-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4598: --- Summary: Paginate stage page to avoid OOM with 100,000 tasks (was:

[jira] [Updated] (SPARK-4598) java.lang.OutOfMemoryError occurs when opening stage page of an application has 100000 tasks,

2014-11-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4598: --- Priority: Critical (was: Major) java.lang.OutOfMemoryError occurs when opening stage page

[jira] [Commented] (SPARK-4598) Paginate stage page to avoid OOM with 100,000 tasks

2014-11-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14224810#comment-14224810 ] Patrick Wendell commented on SPARK-4598: It is a good idea to paginate this page.

[jira] [Commented] (SPARK-4605) Proposed Contribution: Spark Kernel to enable interactive Spark applications

2014-11-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14225173#comment-14225173 ] Patrick Wendell commented on SPARK-4605: Thanks for sharing this design doc.

[jira] [Commented] (SPARK-4605) Proposed Contribution: Spark Kernel to enable interactive Spark applications

2014-11-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14225264#comment-14225264 ] Patrick Wendell commented on SPARK-4605: I see - so basically this is a standalone

[jira] [Updated] (SPARK-4613) Make JdbcRDD easier to use from Java

2014-11-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4613: --- Description: We might eventually deprecate it, but for now it would be nice to expose a Java

[jira] [Commented] (SPARK-4613) Make JdbcRDD easier to use from Java

2014-11-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14225632#comment-14225632 ] Patrick Wendell commented on SPARK-4613: Yeah the only other tricky bit is the

[jira] [Updated] (SPARK-4613) Make JdbcRDD easier to use from Java

2014-11-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4613: --- Assignee: Cheng Lian Make JdbcRDD easier to use from Java

[jira] [Resolved] (SPARK-4516) Netty off-heap memory use causes executors to be killed by OS

2014-11-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4516. Resolution: Fixed Fix Version/s: 1.2.0 Netty off-heap memory use causes executors

[jira] [Created] (SPARK-4628) Put all external projects behind a build flag

2014-11-26 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-4628: -- Summary: Put all external projects behind a build flag Key: SPARK-4628 URL: https://issues.apache.org/jira/browse/SPARK-4628 Project: Spark Issue Type:

[jira] [Commented] (SPARK-4598) Paginate stage page to avoid OOM with 100,000 tasks

2014-11-27 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14227842#comment-14227842 ] Patrick Wendell commented on SPARK-4598: Having sorting with pagination seems very

[jira] [Updated] (SPARK-3182) Twitter Streaming Geoloaction Filter

2014-11-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3182: --- Fix Version/s: (was: 1.2.0) Twitter Streaming Geoloaction Filter

[jira] [Updated] (SPARK-3182) Twitter Streaming Geoloaction Filter

2014-11-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3182: --- Affects Version/s: (was: 1.0.2) (was: 1.0.0) Twitter

[jira] [Updated] (SPARK-4645) Asynchronous execution in HiveThriftServer2 with Hive 0.13.1 doesn't play well with Simba ODBC driver

2014-11-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4645: --- Assignee: Cheng Lian Asynchronous execution in HiveThriftServer2 with Hive 0.13.1 doesn't

[jira] [Updated] (SPARK-4632) Upgrade MQTT dependency to use latest mqtt-client

2014-11-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4632: --- Target Version/s: 1.3.0 (was: 1.2.0) Upgrade MQTT dependency to use latest mqtt-client

[jira] [Updated] (SPARK-4632) Upgrade MQTT dependency to use latest mqtt-client

2014-11-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4632: --- Priority: Critical (was: Blocker) Upgrade MQTT dependency to use latest mqtt-client

[jira] [Resolved] (SPARK-4643) Remove unneeded staging repositories from build

2014-11-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4643. Resolution: Fixed Fix Version/s: 1.3.0 Remove unneeded staging repositories from

[jira] [Updated] (SPARK-4643) Remove unneeded staging repositories from build

2014-11-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4643: --- Summary: Remove unneeded staging repositories from build (was: spark staging repository

[jira] [Updated] (SPARK-4643) Remove unneeded staging repositories from build

2014-11-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4643: --- Assignee: Adrian Wang Remove unneeded staging repositories from build

[jira] [Resolved] (SPARK-4645) Asynchronous execution in HiveThriftServer2 with Hive 0.13.1 doesn't play well with Simba ODBC driver

2014-11-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4645. Resolution: Fixed Fix Version/s: 1.2.0 Asynchronous execution in HiveThriftServer2

[jira] [Resolved] (SPARK-4193) Disable doclint in Java 8 to prevent from build error.

2014-11-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4193. Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Takuya Ueshin

[jira] [Commented] (SPARK-3694) Allow printing object graph of tasks/RDD's with a debug flag

2014-11-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14228489#comment-14228489 ] Patrick Wendell commented on SPARK-3694: Yes we should print that too - I said

[jira] [Issue Comment Deleted] (SPARK-3694) Allow printing object graph of tasks/RDD's with a debug flag

2014-11-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3694: --- Comment: was deleted (was: User 'davies' has created a pull request for this issue:

[jira] [Commented] (SPARK-4349) Spark driver hangs on sc.parallelize() if exception is thrown during serialization

2014-11-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14228493#comment-14228493 ] Patrick Wendell commented on SPARK-4349: Hey Matt, It turns out that parallel

[jira] [Resolved] (SPARK-4584) 2x Performance regression for Spark-on-YARN

2014-11-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4584. Resolution: Fixed 2x Performance regression for Spark-on-YARN

<    13   14   15   16   17   18   19   20   21   22   >