spark git commit: [SPARK-9649] Fix MasterSuite, third time's a charm

2015-08-13 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.5 883c7d35f - 30460206f [SPARK-9649] Fix MasterSuite, third time's a charm This particular test did not load the default configurations so it continued to start the REST server, which causes port bind exceptions. Project:

spark git commit: [SPARK-9649] Fix MasterSuite, third time's a charm

2015-08-13 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 65fec798c - 8815ba2f6 [SPARK-9649] Fix MasterSuite, third time's a charm This particular test did not load the default configurations so it continued to start the REST server, which causes port bind exceptions. Project:

spark git commit: [SPARK-9870] Disable driver UI and Master REST server in SparkSubmitSuite

2015-08-12 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master f4bc01f1f - 7b13ed27c [SPARK-9870] Disable driver UI and Master REST server in SparkSubmitSuite I think that we should pass additional configuration flags to disable the driver UI and Master REST server in SparkSubmitSuite and
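
As a rough illustration of passing such flags, the sketch below builds a `spark-submit` argument list with two extra `--conf` entries. `spark.ui.enabled` and `spark.master.rest.enabled` are standard Spark properties; the helper name, the class name, and the jar path are hypothetical and are not taken from the commit.

```python
def build_submit_args(main_class, jar, extra_conf=None):
    """Build a spark-submit argument list with the UI and REST server disabled.

    Hypothetical helper for illustration; not part of Spark or its test suite.
    """
    conf = {
        "spark.ui.enabled": "false",           # do not start the driver web UI
        "spark.master.rest.enabled": "false",  # do not start the Master REST server
    }
    conf.update(extra_conf or {})

    args = ["spark-submit", "--class", main_class]
    for key, value in conf.items():
        args += ["--conf", f"{key}={value}"]
    args.append(jar)
    return args


# Placeholder class name and jar path.
submit_args = build_submit_args("com.example.SimpleApp", "app.jar")
print(" ".join(submit_args))
```

With neither service started, a test has fewer ports to bind, which is the kind of conflict the MasterSuite entries above describe.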

spark git commit: [SPARK-9870] Disable driver UI and Master REST server in SparkSubmitSuite

2015-08-12 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.5 ca39c9e91 - 4b547b91d [SPARK-9870] Disable driver UI and Master REST server in SparkSubmitSuite I think that we should pass additional configuration flags to disable the driver UI and Master REST server in SparkSubmitSuite and

spark git commit: [SPARK-9806] [WEB UI] Don't share ReplayListenerBus between multiple applications

2015-08-12 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master b85f9a242 - a807fcbe5 [SPARK-9806] [WEB UI] Don't share ReplayListenerBus between multiple applications Author: Rohit Agarwal roh...@qubole.com Closes #8088 from mindprince/SPARK-9806. Project:

spark git commit: [SPARK-9806] [WEB UI] Don't share ReplayListenerBus between multiple applications

2015-08-12 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.5 2f909184e - 402c0ca9a [SPARK-9806] [WEB UI] Don't share ReplayListenerBus between multiple applications Author: Rohit Agarwal roh...@qubole.com Closes #8088 from mindprince/SPARK-9806. (cherry picked from commit

spark git commit: [SPARK-9829] [WEBUI] Display the update value for peak execution memory

2015-08-12 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master a807fcbe5 - 4e3f4b934 [SPARK-9829] [WEBUI] Display the update value for peak execution memory The peak execution memory is not correct because it shows the sum of finished tasks' values when a task finishes. This PR fixes it by using the

spark git commit: [SPARK-9829] [WEBUI] Display the update value for peak execution memory

2015-08-12 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.5 402c0ca9a - d9d4bdea2 [SPARK-9829] [WEBUI] Display the update value for peak execution memory The peak execution memory is not correct because it shows the sum of finished tasks' values when a task finishes. This PR fixes it by using

spark git commit: [SPARK-9575] [MESOS] Add docuemntation around Mesos shuffle service.

2015-08-12 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 5c99d8bf9 - 741a29f98 [SPARK-9575] [MESOS] Add docuemntation around Mesos shuffle service. andrewor14 Author: Timothy Chen tnac...@gmail.com Closes #7907 from tnachen/mesos_shuffle. Project: http://git-wip-us.apache.org/repos/asf/spark

spark git commit: [SPARK-8366] maxNumExecutorsNeeded should properly handle failed tasks

2015-08-12 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.5 b994f8928 - 2f909184e [SPARK-8366] maxNumExecutorsNeeded should properly handle failed tasks Author: xutingjun xuting...@huawei.com Author: meiyoula 1039320...@qq.com Closes #6817 from XuTingjun/SPARK-8366. (cherry picked from commit

spark git commit: [SPARK-8366] maxNumExecutorsNeeded should properly handle failed tasks

2015-08-12 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master b1581ac28 - b85f9a242 [SPARK-8366] maxNumExecutorsNeeded should properly handle failed tasks Author: xutingjun xuting...@huawei.com Author: meiyoula 1039320...@qq.com Closes #6817 from XuTingjun/SPARK-8366. Project:

spark git commit: [SPARK-8798] [MESOS] Allow additional uris to be fetched with mesos

2015-08-12 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.5 93fc95934 - a2f805729 [SPARK-8798] [MESOS] Allow additional uris to be fetched with mesos Some users like to download additional files in their sandbox that they can refer to from their spark program, or even later mount these files

spark git commit: [SPARK-8798] [MESOS] Allow additional uris to be fetched with mesos

2015-08-12 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master bab892328 - 5c99d8bf9 [SPARK-8798] [MESOS] Allow additional uris to be fetched with mesos Some users like to download additional files in their sandbox that they can refer to from their spark program, or even later mount these files to

spark git commit: [SPARK-9806] [WEB UI] Don't share ReplayListenerBus between multiple applications

2015-08-12 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 6dde38026 - 89c8aea94 [SPARK-9806] [WEB UI] Don't share ReplayListenerBus between multiple applications Author: Rohit Agarwal roh...@qubole.com Closes #8088 from mindprince/SPARK-9806. (cherry picked from commit

spark git commit: [SPARK-9426] [WEBUI] Job page DAG visualization is not shown

2015-08-12 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 4e3f4b934 - bab892328 [SPARK-9426] [WEBUI] Job page DAG visualization is not shown To reproduce the issue, go to the stage page and click DAG Visualization once, then go to the job page to show the job DAG visualization. You will only see

spark git commit: [SPARK-9426] [WEBUI] Job page DAG visualization is not shown

2015-08-12 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.5 d9d4bdea2 - 93fc95934 [SPARK-9426] [WEBUI] Job page DAG visualization is not shown To reproduce the issue, go to the stage page and click DAG Visualization once, then go to the job page to show the job DAG visualization. You will only

spark git commit: [SPARK-9795] Dynamic allocation: avoid double counting when killing same executor twice

2015-08-12 Thread andrewor14
-9745, fixing this potential race condition seems like a strict improvement. I've added a regression test to illustrate the issue. Author: Andrew Or and...@databricks.com Closes #8078 from andrewor14/da-double-kill. (cherry picked from commit be5d1912076c2ffd21ec88611e53d3b3c59b7ecc) Signed-off

spark git commit: [HOTFIX] Fix style error caused by 017b5de

2015-08-11 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 017b5de07 - 736af95bd [HOTFIX] Fix style error caused by 017b5de Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/736af95b Tree:

spark git commit: [HOTFIX] Fix style error caused by ef961ed48a4f45447f0e0ad256b040c7ab2d78d9

2015-08-11 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.5 725e5c7a4 - 1067c7369 [HOTFIX] Fix style error caused by ef961ed48a4f45447f0e0ad256b040c7ab2d78d9 Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/1067c736 Tree:

spark git commit: [SPARK-9731] Standalone scheduling incorrect cores if spark.executor.cores is not set

2015-08-08 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master c564b2744 - ef062c159 [SPARK-9731] Standalone scheduling incorrect cores if spark.executor.cores is not set The issue only happens if `spark.executor.cores` is not set and executor memory is set to a high value. For example, if we have a

spark git commit: [SPARK-8873] [MESOS] Clean up shuffle files if external shuffle service is used

2015-08-03 Thread andrewor14
. There should be no functional differences between this and #7820. Author: Timothy Chen tnac...@gmail.com Author: Andrew Or and...@databricks.com Closes #7881 from andrewor14/tim-cleanup-mesos-shuffle and squashes the following commits: 8894f7d [Andrew Or] Clean up code 2a5fa10 [Andrew Or] Merge

spark git commit: [SPARK-4751] Dynamic allocation in standalone mode

2015-08-01 Thread andrewor14
andrewor14/standalone-da and squashes the following commits: b3c1736 [Andrew Or] Merge branch 'master' of github.com:apache/spark into standalone-da 879e928 [Andrew Or] Add end-to-end tests for standalone dynamic allocation accc8f6 [Andrew Or] Address comments ee686a8 [Andrew Or] Merge branch

spark git commit: [SPARK-9352] [SPARK-9353] Add tests for standalone scheduling code

2015-07-27 Thread andrewor14
. Author: Andrew Or and...@databricks.com Closes #7668 from andrewor14/standalone-scheduling-tests and squashes the following commits: d852faf [Andrew Or] Add tests + fix scheduling with memory limits Conflicts: core/src/test/scala/org/apache/spark/deploy/master/MasterSuite.scala

spark git commit: [SPARK-9352] [SPARK-9353] Add tests for standalone scheduling code

2015-07-26 Thread andrewor14
. Author: Andrew Or and...@databricks.com Closes #7668 from andrewor14/standalone-scheduling-tests and squashes the following commits: d852faf [Andrew Or] Add tests + fix scheduling with memory limits Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org

spark git commit: [SPARK-9352] [SPARK-9353] Add tests for standalone scheduling code

2015-07-26 Thread andrewor14
. Author: Andrew Or and...@databricks.com Closes #7668 from andrewor14/standalone-scheduling-tests and squashes the following commits: d852faf [Andrew Or] Add tests + fix scheduling with memory limits (cherry picked from commit 1cf19760d61a5a17bd175a906d34a2940141b76d) Signed-off-by: Andrew

spark git commit: [HOTFIX] Fix compile in MasterSuite

2015-07-26 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 d0ef42c4e - 5aed92613 [HOTFIX] Fix compile in MasterSuite Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/5aed9261 Tree:

spark git commit: [SPARK-8881] [SPARK-9260] Fix algorithm for scheduling executors on workers

2015-07-25 Thread andrewor14
spark.cores.max/spark.executor.cores (i.e., num_executors) < num_workers, executors are not launched and the app hangs. This PR fixes and refactors the scheduling algorithm. andrewor14 Author: Nishkam Ravi nr...@cloudera.com Author: nishkamravi2 nishkamr...@gmail.com Closes #7274 from nishkamravi2
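
The hang described here, where the number of executors implied by spark.cores.max/spark.executor.cores is smaller than the number of workers, can be reproduced in a toy model. The sketch below is a simplification under stated assumptions, not the Master's actual code: it assumes cores are spread one at a time across workers, and all function names are hypothetical.

```python
def executors_one_core_at_a_time(cores_max, executor_cores, free_cores):
    """Toy model: spread cores one by one across workers, then count how many
    whole executors fit into each worker's share."""
    assigned = [0] * len(free_cores)
    remaining = cores_max
    pos = 0
    while remaining > 0 and any(a < f for a, f in zip(assigned, free_cores)):
        if assigned[pos] < free_cores[pos]:
            assigned[pos] += 1
            remaining -= 1
        pos = (pos + 1) % len(free_cores)
    return sum(a // executor_cores for a in assigned)


def executors_whole_at_a_time(cores_max, executor_cores, free_cores):
    """Toy alternative: grant executor_cores at a time, so every grant
    corresponds to a launchable executor."""
    assigned = [0] * len(free_cores)
    remaining = cores_max
    progress = True
    while remaining >= executor_cores and progress:
        progress = False
        for i, free in enumerate(free_cores):
            if remaining >= executor_cores and free - assigned[i] >= executor_cores:
                assigned[i] += executor_cores
                remaining -= executor_cores
                progress = True
    return sum(a // executor_cores for a in assigned)


# 8 cores total, 4 cores per executor (2 executors wanted), 4 workers with 8 free cores each.
stuck = executors_one_core_at_a_time(8, 4, [8, 8, 8, 8])
fixed = executors_whole_at_a_time(8, 4, [8, 8, 8, 8])
print(stuck, fixed)
```

In the first model every worker ends up with 2 cores, fewer than the 4 an executor needs, so nothing can launch; granting whole executors avoids that.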

spark git commit: [SPARK-8881] [SPARK-9260] Fix algorithm for scheduling executors on workers

2015-07-25 Thread andrewor14
spark.cores.max/spark.executor.cores (i.e., num_executors) < num_workers, executors are not launched and the app hangs. This PR fixes and refactors the scheduling algorithm. andrewor14 Author: Nishkam Ravi nr...@cloudera.com Author: nishkamravi2 nishkamr...@gmail.com Closes #7274 from nishkamravi2

spark git commit: [SPARK-7171] Added a method to retrieve metrics sources in TaskContext

2015-07-21 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 9a4fd875b - 31954910d [SPARK-7171] Added a method to retrieve metrics sources in TaskContext Author: Jacek Lewandowski lewandowski.ja...@gmail.com Closes #5805 from jacek-lewandowski/SPARK-7171 and squashes the following commits:

spark git commit: [SPARK-5423] [CORE] Register a TaskCompletionListener to make sure release all resources

2015-07-21 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 4f7f1ee37 - d45355ee2 [SPARK-5423] [CORE] Register a TaskCompletionListener to make sure release all resources Make `DiskMapIterator.cleanup` idempotent and register a TaskCompletionListener to make sure `cleanup` is called. Author: zsxwing
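
The pattern named here, an idempotent cleanup plus a completion listener that guarantees it runs, can be sketched in a few lines. This is a minimal illustration in Python rather than Spark's Scala code; `DiskIterator` and `TaskContext` below are hypothetical stand-ins, not the real classes.

```python
import os
import tempfile


class DiskIterator:
    """Hypothetical stand-in for an iterator that owns a temporary file."""

    def __init__(self, path):
        self.path = path
        self.cleanup_calls = 0  # how many times the real cleanup work ran
        self._cleaned = False

    def cleanup(self):
        # Idempotent: only the first call does any work.
        if self._cleaned:
            return
        self._cleaned = True
        self.cleanup_calls += 1
        if os.path.exists(self.path):
            os.remove(self.path)


class TaskContext:
    """Hypothetical task context that runs listeners when the task ends."""

    def __init__(self):
        self._listeners = []

    def add_completion_listener(self, fn):
        self._listeners.append(fn)

    def mark_completed(self):
        for fn in self._listeners:
            fn()


fd, path = tempfile.mkstemp()
os.close(fd)

ctx = TaskContext()
it = DiskIterator(path)
ctx.add_completion_listener(it.cleanup)

it.cleanup()          # e.g. the iterator was fully consumed
ctx.mark_completed()  # the listener calls cleanup again, which is now a no-op
print(it.cleanup_calls, os.path.exists(path))
```

Because the second call returns early, the listener is safe to register unconditionally, including for iterators that were already drained.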

spark git commit: [SPARK-9128] [CORE] Get outerclasses and objects with only one method calling in ClosureCleaner

2015-07-21 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master f67da43c3 - 9a4fd875b [SPARK-9128] [CORE] Get outerclasses and objects with only one method calling in ClosureCleaner JIRA: https://issues.apache.org/jira/browse/SPARK-9128 Currently, in `ClosureCleaner`, the outerclasses and objects are

spark git commit: [SPARK-4598] [WEBUI] Task table pagination for the Stage page

2015-07-21 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 31954910d - 4f7f1ee37 [SPARK-4598] [WEBUI] Task table pagination for the Stage page This PR adds pagination for the task table to solve the scalability issue of the stage page.

spark git commit: [SPARK-9036] [CORE] SparkListenerExecutorMetricsUpdate messages not included in JsonProtocol

2015-07-21 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 6592a6058 - f67da43c3 [SPARK-9036] [CORE] SparkListenerExecutorMetricsUpdate messages not included in JsonProtocol This PR implements a JSON serializer and deserializer in the JSONProtocol to handle the (de)serialization of

spark git commit: [SPARK-8646] PySpark does not run on YARN if master not provided in command line

2015-07-16 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 57e9b13bf - 49351c7f5 [SPARK-8646] PySpark does not run on YARN if master not provided in command line andrewor14 davies vanzin can you take a look at this? thanks Author: Lianhui Wang lianhuiwan...@gmail.com Closes #7438 from

spark git commit: [SPARK-8119] HeartbeatReceiver should replace executors, not kill

2015-07-16 Thread andrewor14
simply wants a replacement for the expired executor. **Fix.** Differentiate between the intention to kill and the intention to replace an executor with a fresh one. More details can be found in the commit message. Author: Andrew Or and...@databricks.com Closes #7107 from andrewor14/heartbeat

spark git commit: [SPARK-6284] [MESOS] Add mesos role, principal and secret

2015-07-16 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 49351c7f5 - d86bbb4e2 [SPARK-6284] [MESOS] Add mesos role, principal and secret Mesos supports framework authentication and a role to be set per framework; the role identifies the framework's role, which impacts the sharing

spark git commit: [SPARK-8911] Fix local mode endless heartbeats

2015-07-14 Thread andrewor14
the missing event that needs to be posted. Author: Andrew Or and...@databricks.com Closes #7382 from andrewor14/fix-local-heartbeat and squashes the following commits: 1258bdf [Andrew Or] Post ExecutorAdded event to local executor Project: http://git-wip-us.apache.org/repos/asf/spark/repo

spark git commit: [CORE] [MINOR] change the log level to info

2015-07-10 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 5dd45bde4 - db6d57f87 [CORE] [MINOR] change the log level to info Too many logs are emitted even when the log level is set to warning. Author: Cheng Hao hao.ch...@intel.com Closes #7340 from chenghao-intel/log and squashes the following commits:

spark git commit: [SPARK-8958] Dynamic allocation: change cached timeout to infinity

2015-07-10 Thread andrewor14
be very expensive and we should only allow it if the user knows what he/she is doing. FYI harishreedharan sryza. Author: Andrew Or and...@databricks.com Closes #7329 from andrewor14/da-cached-timeout and squashes the following commits: cef0b4e [Andrew Or] Change timeout to infinity Project: http

spark git commit: [SPARK-7419] [STREAMING] [TESTS] Fix CheckpointSuite.recovery with file input stream

2015-07-09 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 930fe9535 - 88bf43033 [SPARK-7419] [STREAMING] [TESTS] Fix CheckpointSuite.recovery with file input stream Fix this failure:

spark git commit: [SPARK-6287] [MESOS] Add dynamic allocation to the coarse-grained Mesos scheduler

2015-07-09 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master ebdf58538 - c4830598b [SPARK-6287] [MESOS] Add dynamic allocation to the coarse-grained Mesos scheduler This is largely based on extracting the dynamic allocation parts from tnachen's #3861. Author: Iulian Dragos jagua...@gmail.com

spark git commit: [MINOR] [STREAMING] Fix log statements in ReceiverSupervisorImpl

2015-07-09 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 0b0b9ceaf - 7ce3b818f [MINOR] [STREAMING] Fix log statements in ReceiverSupervisorImpl Log statements incorrectly showed that the executor was being stopped when receiver was being stopped. Author: Tathagata Das

spark git commit: [SPARK-2017] [UI] Stage page hangs with many tasks

2015-07-09 Thread andrewor14
) at java.lang.AbstractStringBuilder.expandCapacity(AbstractStringBuilder.java:130) ``` This patch compresses Jetty responses in gzip. The correct long-term fix is to add pagination. Author: Andrew Or and...@databricks.com Closes #7296 from andrewor14/gzip-jetty and squashes the following commits: a051c64 [Andrew Or] Use GZIP
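
To give a feel for why gzip helps with a large task table, the sketch below compresses a synthetic HTML table using Python's standard `gzip` module. It is only an illustration: the row contents are made up, and it does not use Jetty or reproduce the patch itself.

```python
import gzip

# Synthetic stand-in for a stage page with many task rows (contents are made up).
rows = "".join(
    f"<tr><td>{i}</td><td>SUCCESS</td><td>executor-{i % 8}</td></tr>"
    for i in range(50000)
)
page = f"<table>{rows}</table>".encode("utf-8")

compressed = gzip.compress(page)
print(len(page), len(compressed))
```

Repetitive markup like this compresses well, which shrinks the response but does nothing about the cost of rendering every row, hence the note that pagination is the long-term fix.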

spark git commit: [SPARK-2017] [UI] Stage page hangs with many tasks

2015-07-09 Thread andrewor14
) at java.lang.AbstractStringBuilder.expandCapacity(AbstractStringBuilder.java:130) ``` This patch compresses Jetty responses in gzip. The correct long-term fix is to add pagination. Author: Andrew Or and...@databricks.com Closes #7296 from andrewor14/gzip-jetty and squashes the following commits: a051c64 [Andrew Or] Use GZIP

spark git commit: [SPARK-8953] SPARK_EXECUTOR_CORES is not read in SparkSubmit

2015-07-09 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 7ce3b818f - 930fe9535 [SPARK-8953] SPARK_EXECUTOR_CORES is not read in SparkSubmit The configuration `SPARK_EXECUTOR_CORES` is not put into `SparkConf`, so it has no effect on dynamic executor allocation. Author: xutingjun

spark git commit: [SPARK-7419] [STREAMING] [TESTS] Fix CheckpointSuite.recovery with file input stream

2015-07-09 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 c04f0a5cf - dfc99715e [SPARK-7419] [STREAMING] [TESTS] Fix CheckpointSuite.recovery with file input stream Fix this failure:

spark git commit: [SPARK-6707] [CORE] [MESOS] Mesos Scheduler should allow the user to specify constraints based on slave attributes

2015-07-06 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 9ff203346 - 1165b17d2 [SPARK-6707] [CORE] [MESOS] Mesos Scheduler should allow the user to specify constraints based on slave attributes Currently, the mesos scheduler only looks at the 'cpu' and 'mem' resources when trying to determine

spark git commit: [SPARK-8656] [WEBUI] Fix the webUI and JSON API number is not synced

2015-07-06 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 132e7fca1 - 9ff203346 [SPARK-8656] [WEBUI] Fix the webUI and JSON API number is not synced The Spark standalone master web UI shows the total cores, used cores, total memory, and used memory of alive workers. But the JSON API page

spark git commit: [SPARK-8819] Fix build for maven 3.3.x

2015-07-06 Thread andrewor14
in `pom.xml` and tested locally with both versions of maven. Author: Andrew Or and...@databricks.com Closes #7219 from andrewor14/fix-maven-build and squashes the following commits: 1d37e87 [Andrew Or] Merge branch 'master' of github.com:apache/spark into fix-maven-build 3574ae4 [Andrew Or] Review

spark git commit: [SPARK-8819] Fix build for maven 3.3.x

2015-07-06 Thread andrewor14
in `pom.xml` and tested locally with both versions of maven. Author: Andrew Or and...@databricks.com Closes #7219 from andrewor14/fix-maven-build and squashes the following commits: 1d37e87 [Andrew Or] Merge branch 'master' of github.com:apache/spark into fix-maven-build 3574ae4 [Andrew Or] Review

spark git commit: Revert [SPARK-8781] Fix variables in published pom.xml are not resolved

2015-07-06 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 9eae5fa64 - 929dfa24b Revert [SPARK-8781] Fix variables in published pom.xml are not resolved This reverts commit 82cf3315e690f4ac15b50edea6a3d673aa5be4c0. Conflicts: pom.xml Project:

spark git commit: Revert [SPARK-8781] Fix variables in published pom.xml are not resolved

2015-07-06 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.3 5f1d1c0b8 - 960aec976 Revert [SPARK-8781] Fix variables in published pom.xml are not resolved This reverts commit 502e1fd68f9efc0311062146fa058dec3ef0e70b. Conflicts: pom.xml Project:

spark git commit: Revert [SPARK-8781] Fix variables in published pom.xml are not resolved

2015-07-06 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 947b84598 - 997444c78 Revert [SPARK-8781] Fix variables in published pom.xml are not resolved This reverts commit 82cf3315e690f4ac15b50edea6a3d673aa5be4c0. Conflicts: pom.xml Project:

spark git commit: [SPARK-8687] [YARN] Fix bug: Executor can't fetch the new set configuration in yarn-client

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 3697232b7 - 1b0c8e610 [SPARK-8687] [YARN] Fix bug: Executor can't fetch the new set configuration in yarn-client Spark initializes the properties in CoarseGrainedSchedulerBackend.start ```scala // TODO (prashant) send conf instead of

spark git commit: [SPARK-3071] Increase default driver memory

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 377ff4c9e - 3697232b7 [SPARK-3071] Increase default driver memory I've updated default values in comments, documentation, and in the command line builder to be 1g based on comments in the JIRA. I've also updated most usages to point at a

spark git commit: [SPARK-8740] [PROJECT INFRA] Support GitHub OAuth tokens in dev/merge_spark_pr.py

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 15d41cc50 - 377ff4c9e [SPARK-8740] [PROJECT INFRA] Support GitHub OAuth tokens in dev/merge_spark_pr.py This commit allows `dev/merge_spark_pr.py` to use personal GitHub OAuth tokens in order to make authenticated requests. This is

spark git commit: [SPARK-8688] [YARN] Bug fix: disable the cache fs to gain the HDFS connection.

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 792fcd802 - 646366b5d [SPARK-8688] [YARN] Bug fix: disable the cache fs to gain the HDFS connection. If `fs.hdfs.impl.disable.cache` is `false` (the default), `FileSystem` will use the cached `DFSClient`, which uses the old token.

spark git commit: [SPARK-8687] [YARN] Fix bug: Executor can't fetch the new set configuration in yarn-client

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 e33c0f0a4 - 7cbfef23a [SPARK-8687] [YARN] Fix bug: Executor can't fetch the new set configuration in yarn-client Spark initializes the properties in CoarseGrainedSchedulerBackend.start ```scala // TODO (prashant) send conf instead of

spark git commit: [SPARK-8769] [TRIVIAL] [DOCS] toLocalIterator should mention it results in many jobs

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master d14338eaf - 15d41cc50 [SPARK-8769] [TRIVIAL] [DOCS] toLocalIterator should mention it results in many jobs Author: Holden Karau hol...@pigscanfly.ca Closes #7171 from holdenk/SPARK-8769-toLocalIterator-documentation-improvement and

spark git commit: [SPARK-8769] [TRIVIAL] [DOCS] toLocalIterator should mention it results in many jobs

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 5b468cf0c - e33c0f0a4 [SPARK-8769] [TRIVIAL] [DOCS] toLocalIterator should mention it results in many jobs Author: Holden Karau hol...@pigscanfly.ca Closes #7171 from holdenk/SPARK-8769-toLocalIterator-documentation-improvement and

spark git commit: [SPARK-8771] [TRIVIAL] Add a version to the deprecated annotation for the actorSystem

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 646366b5d - d14338eaf [SPARK-8771] [TRIVIAL] Add a version to the deprecated annotation for the actorSystem Author: Holden Karau hol...@pigscanfly.ca Closes #7172 from

spark git commit: [SPARK-8781] Fix variables in published pom.xml are not resolved

2015-07-02 Thread andrewor14
the maven build in a different way. We limit the dependencies of `KinesisReceiverSuite` to avoid having to deal with the complexities in how maven deals with transitive test dependencies. Author: Andrew Or and...@databricks.com Closes #7193 from andrewor14/fix-kinesis-pom and squashes

spark git commit: [SPARK-8781] Fix variables in published pom.xml are not resolved

2015-07-02 Thread andrewor14
and fixes the maven build in a different way. We limit the dependencies of `KinesisReceiverSuite` to avoid having to deal with the complexities in how maven deals with transitive test dependencies. Author: Andrew Or and...@databricks.com Closes #7193 from andrewor14/fix-kinesis-pom and squashes

spark git commit: [SPARK-8781] Fix variables in published pom.xml are not resolved

2015-07-02 Thread andrewor14
and fixes the maven build in a different way. We limit the dependencies of `KinesisReceiverSuite` to avoid having to deal with the complexities in how maven deals with transitive test dependencies. Author: Andrew Or and...@databricks.com Closes #7193 from andrewor14/fix-kinesis-pom and squashes

spark git commit: [SPARK-1564] [DOCS] Added Javascript to Javadocs to create badges for tags like :: Experimental ::

2015-07-02 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 82cf3315e - fcbcba66c [SPARK-1564] [DOCS] Added Javascript to Javadocs to create badges for tags like :: Experimental :: Modified copy_api_dirs.rb and created api-javadocs.js and api-javadocs.css files in order to add badges to javadoc

spark git commit: [SPARK-7835] Refactor HeartbeatReceiverSuite for coverage + cleanup

2015-07-02 Thread andrewor14
constructs like `eventually`. Author: Andrew Or and...@databricks.com Closes #7173 from andrewor14/heartbeat-receiver-tests and squashes the following commits: 4a903d6 [Andrew Or] Increase HeartReceiverSuite coverage and clean up Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http

spark git commit: [SPARK-8581] [SPARK-8584] Simplify checkpointing code + better error message

2015-07-02 Thread andrewor14
of checkpoint directory to provide a clearer error message. Most of the changes here are relatively minor. Author: Andrew Or and...@databricks.com Closes #6968 from andrewor14/checkpoint-cleanup and squashes the following commits: 4ef8263 [Andrew Or] Use global synchronized instead 6f6fd84

spark git commit: [SPARK-8754] [YARN] YarnClientSchedulerBackend doesn't stop gracefully in failure conditions

2015-07-01 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 17def3957 - 5b468cf0c [SPARK-8754] [YARN] YarnClientSchedulerBackend doesn't stop gracefully in failure conditions In YarnClientSchedulerBackend.stop(), added a check for monitorThread. Author: Devaraj K deva...@apache.org Closes

spark git commit: [SPARK-8437] [DOCS] Corrected: Using directory path without wildcard for filename slow for large number of files with wholeTextFiles and binaryFiles

2015-06-30 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 eab1d16a7 - 255b2be94 [SPARK-8437] [DOCS] Corrected: Using directory path without wildcard for filename slow for large number of files with wholeTextFiles and binaryFiles Note that 'dir/*' can be more efficient in some Hadoop FS

spark git commit: [SPARK-8437] [DOCS] Corrected: Using directory path without wildcard for filename slow for large number of files with wholeTextFiles and binaryFiles

2015-06-30 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 689da28a5 - ada384b78 [SPARK-8437] [DOCS] Corrected: Using directory path without wildcard for filename slow for large number of files with wholeTextFiles and binaryFiles Note that 'dir/*' can be more efficient in some Hadoop FS

spark git commit: [SPARK-2645] [CORE] Allow SparkEnv.stop() to be called multiple times without side effects.

2015-06-30 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 79f0b371a - 7dda0844e [SPARK-2645] [CORE] Allow SparkEnv.stop() to be called multiple times without side effects. Fix for SparkContext stop behavior - Allow sc.stop() to be called multiple times without side effects. Author: Joshi

spark git commit: [SPARK-8705] [WEBUI] Don't display rects when totalExecutionTime is 0

2015-06-30 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 3ba23ffd3 - 8c898964f [SPARK-8705] [WEBUI] Don't display rects when totalExecutionTime is 0 Because `System.currentTimeMillis()` is not accurate for tasks that only need several milliseconds, sometimes `totalExecutionTime` in

spark git commit: [SPARK-8372] Do not show applications that haven't recorded their app ID yet.

2015-06-30 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 7dda0844e - 4bb8375fc [SPARK-8372] Do not show applications that haven't recorded their app ID yet. Showing these applications may lead to weird behavior in the History Server. For old logs, if the app ID is recorded later, you may end up

spark git commit: [SPARK-8560] [UI] The Executors page will have negative if having resubmitted tasks

2015-06-30 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 61d7b533d - 79f0b371a [SPARK-8560] [UI] The Executors page will have negative if having resubmitted tasks When `taskEnd.reason` is `Resubmitted`, the task should not be counted in the statistics, because this task has a `SUCCESS` taskEnd

spark git commit: Revert [SPARK-8372] History server shows incorrect information for application not started

2015-06-29 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 715f084ca - ea88b1a50 Revert [SPARK-8372] History server shows incorrect information for application not started This reverts commit 2837e067099921dd4ab6639ac5f6e89f789d4ff4. Project: http://git-wip-us.apache.org/repos/asf/spark/repo

spark git commit: Revert [SPARK-8372] History server shows incorrect information for application not started

2015-06-29 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 9d9c4b476 - f7c200e6a Revert [SPARK-8372] History server shows incorrect information for application not started This reverts commit f0513733d4f6fc34f86feffd3062600cbbd56a28. Project:

spark git commit: [SPARK-8634] [STREAMING] [TESTS] Fix flaky test StreamingListenerSuite receiver info reporting

2015-06-29 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 881662e9c - cec98525f [SPARK-8634] [STREAMING] [TESTS] Fix flaky test StreamingListenerSuite receiver info reporting As per the unit test log in https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35754/ ``` 15/06/24

spark git commit: [SPARK-7287] [SPARK-8567] [TEST] Add sc.stop to applications in SparkSubmitSuite

2015-06-29 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master cec98525f - fbf75738f [SPARK-7287] [SPARK-8567] [TEST] Add sc.stop to applications in SparkSubmitSuite Hopefully, this suite will not be flaky anymore. Author: Yin Huai yh...@databricks.com Closes #7027 from yhuai/SPARK-8567 and squashes

spark git commit: [SPARK-8634] [STREAMING] [TESTS] Fix flaky test StreamingListenerSuite receiver info reporting

2015-06-29 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 6a45d86db - f84f24769 [SPARK-8634] [STREAMING] [TESTS] Fix flaky test StreamingListenerSuite receiver info reporting As per the unit test log in https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35754/ ``` 15/06/24

spark git commit: [SPARK-8019] [SPARKR] Support SparkR spawning worker R processes with a command other then Rscript

2015-06-29 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master d7f796da4 - 4a9e03fa8 [SPARK-8019] [SPARKR] Support SparkR spawning worker R processes with a command other then Rscript This is a simple change to add a new environment variable spark.sparkr.r.command that specifies the command that

spark git commit: [SPARK-8437] [DOCS] Using directory path without wildcard for filename slow for large number of files with wholeTextFiles and binaryFiles

2015-06-29 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 cdfa388dd - b2684557f [SPARK-8437] [DOCS] Using directory path without wildcard for filename slow for large number of files with wholeTextFiles and binaryFiles Note that 'dir/*' can be more efficient in some Hadoop FS implementations

spark git commit: [SPARK-8410] [SPARK-8475] remove previous ivy resolution when using spark-submit

2015-06-29 Thread andrewor14
will be prioritized. cc andrewor14 Author: Burak Yavuz brk...@gmail.com Closes #7089 from brkyvz/delete-prev-ivy-resolution and squashes the following commits: a21f95a [Burak Yavuz] remove previous ivy resolution when using spark-submit Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit

spark git commit: [SPARK-8410] [SPARK-8475] remove previous ivy resolution when using spark-submit

2015-06-29 Thread andrewor14
will be prioritized. cc andrewor14 Author: Burak Yavuz brk...@gmail.com Closes #7089 from brkyvz/delete-prev-ivy-resolution and squashes the following commits: a21f95a [Burak Yavuz] remove previous ivy resolution when using spark-submit (cherry picked from commit

spark git commit: Revert [SPARK-8437] [DOCS] Using directory path without wildcard for filename slow for large number of files with wholeTextFiles and binaryFiles

2015-06-29 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 4a9e03fa8 - 4c1808be4 Revert [SPARK-8437] [DOCS] Using directory path without wildcard for filename slow for large number of files with wholeTextFiles and binaryFiles This reverts commit 5d30eae56051c563a8427f330b09ef66db0a0d21.

spark git commit: Revert [SPARK-8437] [DOCS] Using directory path without wildcard for filename slow for large number of files with wholeTextFiles and binaryFiles

2015-06-29 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 c0fbd6781 - 80d53565a Revert [SPARK-8437] [DOCS] Using directory path without wildcard for filename slow for large number of files with wholeTextFiles and binaryFiles This reverts commit b2684557fa0d2ec14b7529324443c8154d81c348.

spark git commit: [SPARK-8437] [DOCS] Using directory path without wildcard for filename slow for large number of files with wholeTextFiles and binaryFiles

2015-06-29 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master fbf75738f - 5d30eae56 [SPARK-8437] [DOCS] Using directory path without wildcard for filename slow for large number of files with wholeTextFiles and binaryFiles Note that 'dir/*' can be more efficient in some Hadoop FS implementations that

spark git commit: [SPARK-7287] [SPARK-8567] [TEST] Add sc.stop to applications in SparkSubmitSuite

2015-06-29 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 f84f24769 - cdfa388dd [SPARK-7287] [SPARK-8567] [TEST] Add sc.stop to applications in SparkSubmitSuite Hopefully, this suite will not be flaky anymore. Author: Yin Huai yh...@databricks.com Closes #7027 from yhuai/SPARK-8567 and

spark git commit: [SPARK-8567] [SQL] Increase the timeout of HiveSparkSubmitSuite

2015-06-24 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 93793237e - 792ed7a4b [SPARK-8567] [SQL] Increase the timeout of HiveSparkSubmitSuite https://issues.apache.org/jira/browse/SPARK-8567 Author: Yin Huai yh...@databricks.com Closes #6957 from yhuai/SPARK-8567 and squashes the

spark git commit: [SPARK-8567] [SQL] Increase the timeout of HiveSparkSubmitSuite

2015-06-24 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master dca21a83a - 7daa70292 [SPARK-8567] [SQL] Increase the timeout of HiveSparkSubmitSuite https://issues.apache.org/jira/browse/SPARK-8567 Author: Yin Huai yh...@databricks.com Closes #6957 from yhuai/SPARK-8567 and squashes the following

spark git commit: [SPARK-8095][BACKPORT] Resolve dependencies of --packages in local ivy cache

2015-06-22 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.3 0b8dce0c0 - 45b4527e3 [SPARK-8095][BACKPORT] Resolve dependencies of --packages in local ivy cache Backported PR #6788 cc andrewor14 Author: Burak Yavuz brk...@gmail.com Closes #6923 from brkyvz/backport-local-ivy and squashes

spark git commit: [HOTFIX] [TESTS] Typo mqqt -> mqtt

2015-06-22 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 96aa01378 - 1dfb0f7b2 [HOTFIX] [TESTS] Typo mqqt -> mqtt This was introduced in #6866. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/1dfb0f7b Tree:

spark git commit: [HOTFIX] [SPARK-8489] Correct JIRA number in previous commit

2015-06-19 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 093c34838 - bec40e52b [HOTFIX] [SPARK-8489] Correct JIRA number in previous commit It should be SPARK-8489, not SPARK-8498. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

spark git commit: [HOTFIX] [SPARK-8489] Correct JIRA number in previous commit

2015-06-19 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 a7b773a8b - 9b16508d2 [HOTFIX] [SPARK-8489] Correct JIRA number in previous commit It should be SPARK-8489, not SPARK-8498. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

spark git commit: Add example that reads a local file, writes to a DFS path provided by th...

2015-06-19 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 0c32fc125 - a9858036b Add example that reads a local file, writes to a DFS path provided by th... ...e user, reads the file back from the DFS, and compares word counts on the local and DFS versions. Useful for verifying DFS correctness.

spark git commit: [SPARK-8451] [SPARK-7287] SparkSubmitSuite should check exit code

2015-06-19 Thread andrewor14
: Andrew Or and...@databricks.com Closes #6886 from andrewor14/spark-submit-suite-fix and squashes the following commits: 3f99ff1 [Andrew Or] Move destroy to finally block 9a62188 [Andrew Or] Re-enable ignored tests 2382672 [Andrew Or] Check for exit code (cherry picked from commit

spark git commit: [SPARK-8451] [SPARK-7287] SparkSubmitSuite should check exit code

2015-06-19 Thread andrewor14
: Andrew Or and...@databricks.com Closes #6886 from andrewor14/spark-submit-suite-fix and squashes the following commits: 3f99ff1 [Andrew Or] Move destroy to finally block 9a62188 [Andrew Or] Re-enable ignored tests 2382672 [Andrew Or] Check for exit code Project: http://git-wip-us.apache.org/repos

spark git commit: [SPARK-5836] [DOCS] [STREAMING] Clarify what may cause long-running Spark apps to preserve shuffle files

2015-06-19 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 68a2dca29 - 4be53d039 [SPARK-5836] [DOCS] [STREAMING] Clarify what may cause long-running Spark apps to preserve shuffle files Clarify what may cause long-running Spark apps to preserve shuffle files Author: Sean Owen so...@cloudera.com

spark git commit: [SPARK-8451] [SPARK-7287] SparkSubmitSuite should check exit code

2015-06-19 Thread andrewor14
: Andrew Or and...@databricks.com Closes #6886 from andrewor14/spark-submit-suite-fix and squashes the following commits: 3f99ff1 [Andrew Or] Move destroy to finally block 9a62188 [Andrew Or] Re-enable ignored tests 2382672 [Andrew Or] Check for exit code (cherry picked from commit

spark git commit: [SPARK-5836] [DOCS] [STREAMING] Clarify what may cause long-running Spark apps to preserve shuffle files

2015-06-19 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.4 aedd893b4 - 3415fb978 [SPARK-5836] [DOCS] [STREAMING] Clarify what may cause long-running Spark apps to preserve shuffle files Clarify what may cause long-running Spark apps to preserve shuffle files Author: Sean Owen

spark git commit: [SPARK-5836] [DOCS] [STREAMING] Clarify what may cause long-running Spark apps to preserve shuffle files

2015-06-19 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.3 1d441475d - 0b8dce0c0 [SPARK-5836] [DOCS] [STREAMING] Clarify what may cause long-running Spark apps to preserve shuffle files Clarify what may cause long-running Spark apps to preserve shuffle files Author: Sean Owen
