spark git commit: [SPARK-11026] [YARN] spark.yarn.user.classpath.first does work for 'spark-submit --jars hdfs://user/foo.jar'

2015-10-13 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.5 2217f4f8b -> 47bc6c0fa [SPARK-11026] [YARN] spark.yarn.user.classpath.first does work for 'spark-submit --jars hdfs://user/foo.jar' when spark.yarn.user.classpath.first=true and using 'spark-submit --jars hdfs://user/foo.jar', it can

spark git commit: [SPARK-11026] [YARN] spark.yarn.user.classpath.first does work for 'spark-submit --jars hdfs://user/foo.jar'

2015-10-13 Thread tgraves
Repository: spark Updated Branches: refs/heads/master c4da5345a -> 626aab79c [SPARK-11026] [YARN] spark.yarn.user.classpath.first does work for 'spark-submit --jars hdfs://user/foo.jar' when spark.yarn.user.classpath.first=true and using 'spark-submit --jars hdfs://user/foo.jar', it can not

spark git commit: [SPARK-9439] [YARN] External shuffle service robust to NM restarts using leveldb

2015-08-21 Thread tgraves
Repository: spark Updated Branches: refs/heads/master bb220f657 - 708036c1d [SPARK-9439] [YARN] External shuffle service robust to NM restarts using leveldb https://issues.apache.org/jira/browse/SPARK-9439 In general, Yarn apps should be robust to NodeManager restarts. However, if you run

spark git commit: [SPARK-8405] [DOC] Add how to view logs on Web UI when yarn log aggregation is enabled

2015-07-27 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.4 2b1973dd2 - a671dad62 [SPARK-8405] [DOC] Add how to view logs on Web UI when yarn log aggregation is enabled Some users may not be aware that the logs are available on Web UI even if Yarn log aggregation is enabled. Update the doc to

spark git commit: [SPARK-8405] [DOC] Add how to view logs on Web UI when yarn log aggregation is enabled

2015-07-27 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 72981bc8f - 622838165 [SPARK-8405] [DOC] Add how to view logs on Web UI when yarn log aggregation is enabled Some users may not be aware that the logs are available on Web UI even if Yarn log aggregation is enabled. Update the doc to

spark git commit: [SPARK-8851] [YARN] In Client mode, make sure the client logs in and updates tokens

2015-07-17 Thread tgraves
Repository: spark Updated Branches: refs/heads/master ec8973d12 - c043a3e9d [SPARK-8851] [YARN] In Client mode, make sure the client logs in and updates tokens In client side, the flow is SparkSubmit - SparkContext - yarn/Client. Since the yarn client only gets a cloned config and the

spark git commit: [SPARK-8574] org/apache/spark/unsafe doesn't honor the java source/ta…

2015-06-25 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.4 74001db04 - 13802163d [SPARK-8574] org/apache/spark/unsafe doesn't honor the java source/ta… …rget versions. I basically copied the compatibility rules from the top level pom.xml into here. Someone more familiar with all the

spark git commit: [SPARK-7524] [SPARK-7846] add configs for keytab and principal, pass these two configs with different way in different modes

2015-05-29 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 8db40f671 - a51b133de [SPARK-7524] [SPARK-7846] add configs for keytab and principal, pass these two configs with different way in different modes * As spark now supports long running service by updating tokens for namenode, but only

spark git commit: [SPARK-6869] [PYSPARK] Add pyspark archives path to PYTHONPATH

2015-05-08 Thread tgraves
Repository: spark Updated Branches: refs/heads/master c2f0821aa - ebff7327a [SPARK-6869] [PYSPARK] Add pyspark archives path to PYTHONPATH Based on https://github.com/apache/spark/pull/5478 that provide a PYSPARK_ARCHIVES_PATH env. within this PR, we just should export

hadoop git commit: YARN-3600. AM container link is broken (Naganarasimha G R via tgraves (cherry picked from commit 5d708a4725529cf09d2dd8b5b4aa3542cc8610b0)

2015-05-08 Thread tgraves
Repository: hadoop Updated Branches: refs/heads/branch-2 28e0593b9 - 547b06988 YARN-3600. AM container link is broken (Naganarasimha G R via tgraves (cherry picked from commit 5d708a4725529cf09d2dd8b5b4aa3542cc8610b0) Project: http://git-wip-us.apache.org/repos/asf/hadoop/repo Commit: http

hadoop git commit: YARN-3600. AM container link is broken (Naganarasimha G R via tgraves

2015-05-08 Thread tgraves
Repository: hadoop Updated Branches: refs/heads/trunk bcf289050 - 5d708a472 YARN-3600. AM container link is broken (Naganarasimha G R via tgraves Project: http://git-wip-us.apache.org/repos/asf/hadoop/repo Commit: http://git-wip-us.apache.org/repos/asf/hadoop/commit/5d708a47 Tree: http://git

hadoop git commit: YARN-20. More information for yarn.resourcemanager.webapp.address in yarn-default.xml (Bartosz Ługowski vai tgraves) (cherry picked from commit f0f5e3c0751bcadcacd6d91e2c5504803ec3d

2015-05-08 Thread tgraves
Repository: hadoop Updated Branches: refs/heads/branch-2 68d85e92b - 638feaaa3 YARN-20. More information for yarn.resourcemanager.webapp.address in yarn-default.xml (Bartosz Ługowski vai tgraves) (cherry picked from commit f0f5e3c0751bcadcacd6d91e2c5504803ec3d0a5) Project: http://git-wip

hadoop git commit: YARN-20. More information for yarn.resourcemanager.webapp.address in yarn-default.xml (Bartosz Ługowski vai tgraves)

2015-05-08 Thread tgraves
Repository: hadoop Updated Branches: refs/heads/trunk a2d40bced - f0f5e3c07 YARN-20. More information for yarn.resourcemanager.webapp.address in yarn-default.xml (Bartosz Ługowski vai tgraves) Project: http://git-wip-us.apache.org/repos/asf/hadoop/repo Commit: http://git-wip-us.apache.org

spark git commit: [SPARK-5342] [YARN] Allow long running Spark apps to run on secure YARN/HDFS

2015-05-01 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 4dc8d7449 - b1f4ca82d [SPARK-5342] [YARN] Allow long running Spark apps to run on secure YARN/HDFS Take 2. Does the same thing as #4688, but fixes Hadoop-1 build. Author: Hari Shreedharan hshreedha...@apache.org Closes #5823 from

spark git commit: [SPARK-5342] [YARN] Allow long running Spark apps to run on secure YARN/HDFS

2015-04-30 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 7dacc08ab - 6c65da6bb [SPARK-5342] [YARN] Allow long running Spark apps to run on secure YARN/HDFS Current Spark apps running on Secure YARN/HDFS would not be able to write data to HDFS after 7 days, since delegation tokens cannot be

hadoop git commit: YARN-3517. RM web ui for dumping scheduler logs should be for admins only (Varun Vasudev via tgraves)

2015-04-29 Thread tgraves
Repository: hadoop Updated Branches: refs/heads/trunk 3dd6395bb - 2e215484b YARN-3517. RM web ui for dumping scheduler logs should be for admins only (Varun Vasudev via tgraves) Project: http://git-wip-us.apache.org/repos/asf/hadoop/repo Commit: http://git-wip-us.apache.org/repos/asf/hadoop

hadoop git commit: YARN-3517. RM web ui for dumping scheduler logs should be for admins only (Varun Vasudev via tgraves) (cherry picked from commit 2e215484bd05cd5e3b7a81d3558c6879a05dd2d2)

2015-04-29 Thread tgraves
Repository: hadoop Updated Branches: refs/heads/branch-2 460127e6f - 2e13183f6 YARN-3517. RM web ui for dumping scheduler logs should be for admins only (Varun Vasudev via tgraves) (cherry picked from commit 2e215484bd05cd5e3b7a81d3558c6879a05dd2d2) Project: http://git-wip-us.apache.org

spark git commit: [SPARK-6918] [YARN] Secure HBase support.

2015-04-29 Thread tgraves
Repository: spark Updated Branches: refs/heads/master f49284b5b - baed3f2c7 [SPARK-6918] [YARN] Secure HBase support. Obtain HBase security token with Kerberos credentials locally to be sent to executors. Tested on eBay's secure HBase cluster. Similar to obtainTokenForNamenodes and fails

hadoop git commit: YARN-3434. Interaction between reservations and userlimit can result in significant ULF violation (cherry picked from commit 189a63a719c63b67a1783a280bfc2f72dcb55277)

2015-04-23 Thread tgraves
Parents: 889b92f Author: tgraves tgra...@apache.org Authored: Thu Apr 23 14:39:25 2015 + Committer: tgraves tgra...@apache.org Committed: Thu Apr 23 14:49:24 2015 + -- hadoop-yarn-project/CHANGES.txt | 3

hadoop git commit: YARN-3434. Interaction between reservations and userlimit can result in significant ULF violation

2015-04-23 Thread tgraves
/commit/189a63a7 Tree: http://git-wip-us.apache.org/repos/asf/hadoop/tree/189a63a7 Diff: http://git-wip-us.apache.org/repos/asf/hadoop/diff/189a63a7 Branch: refs/heads/trunk Commit: 189a63a719c63b67a1783a280bfc2f72dcb55277 Parents: baf8bc6 Author: tgraves tgra...@apache.org Authored: Thu Apr 23 14:39

spark git commit: [SPARK-2669] [yarn] Distribute client configuration to AM.

2015-04-17 Thread tgraves
Repository: spark Updated Branches: refs/heads/master c84d91692 - 50ab8a654 [SPARK-2669] [yarn] Distribute client configuration to AM. Currently, when Spark launches the Yarn AM, the process will use the local Hadoop configuration on the node where the AM launches, if one is present. A more

spark git commit: [SPARK-6207] [YARN] [SQL] Adds delegation tokens for metastore to conf.

2015-04-13 Thread tgraves
Repository: spark Updated Branches: refs/heads/master b29663eee - 77620be76 [SPARK-6207] [YARN] [SQL] Adds delegation tokens for metastore to conf. Adds hive2-metastore delegation token to conf when running in secure mode. Without this change, running on YARN in cluster mode fails with a GSS

spark git commit: [SPARK-3591][YARN]fire and forget for YARN cluster mode

2015-04-07 Thread tgraves
Repository: spark Updated Branches: refs/heads/master ae980eb41 - b65bad65c [SPARK-3591][YARN]fire and forget for YARN cluster mode https://issues.apache.org/jira/browse/SPARK-3591 The output after this patch: doggie153:/opt/oss/spark-1.3.0-bin-hadoop2.4/bin # ./spark-submit --class

spark git commit: [SPARK-6050] [yarn] Relax matching of vcore count in received containers.

2015-03-02 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.3 a83b9bbb2 - 650d1e7fb [SPARK-6050] [yarn] Relax matching of vcore count in received containers. Some YARN configurations return a vcore count for allocated containers that does not match the requested resource. That means Spark would

spark git commit: [SPARK-6050] [yarn] Relax matching of vcore count in received containers.

2015-03-02 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 582e5a24c - 6b348d90f [SPARK-6050] [yarn] Relax matching of vcore count in received containers. Some YARN configurations return a vcore count for allocated containers that does not match the requested resource. That means Spark would

spark git commit: SPARK-5393. Flood of util.RackResolver log messages after SPARK-1714

2015-01-30 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 6f21dce5f - 254eaa4d3 SPARK-5393. Flood of util.RackResolver log messages after SPARK-1714 Previously I had tried to solve this with by adding a line in Spark's log4j-defaults.properties. The issue with the message in

spark git commit: SPARK-5370. [YARN] Remove some unnecessary synchronization in YarnAlloca...

2015-01-22 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 246111d17 - 820ce0359 SPARK-5370. [YARN] Remove some unnecessary synchronization in YarnAlloca... ...tor Author: Sandy Ryza sa...@cloudera.com Closes #4164 from sryza/sandy-spark-5370 and squashes the following commits: 0c8d736 [Sandy

spark git commit: SPARK-1714. Take advantage of AMRMClient APIs to simplify logic in YarnA...

2015-01-21 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 8c06a5faa - 2eeada373 SPARK-1714. Take advantage of AMRMClient APIs to simplify logic in YarnA... ...llocator The goal of this PR is to simplify YarnAllocator as much as possible and get it up to the level of code quality we see in the

spark git commit: [SPARK-5336][YARN]spark.executor.cores must not be less than spark.task.cpus

2015-01-21 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 424d8c6ff - 8c06a5faa [SPARK-5336][YARN]spark.executor.cores must not be less than spark.task.cpus https://issues.apache.org/jira/browse/SPARK-5336 Author: WangTao barneystin...@aliyun.com Author: WangTaoTheTonic barneystin...@aliyun.com

spark git commit: [SPARK-5169][YARN]fetch the correct max attempts

2015-01-09 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 167a5ab0b - f3da4bd72 [SPARK-5169][YARN]fetch the correct max attempts Soryy for fetching the wrong max attempts in this commit https://github.com/apache/spark/commit/8fdd48959c93b9cf809f03549e2ae6c4687d1fcd. We need to fix it now.

spark git commit: [SPARK-2165][YARN]add support for setting maxAppAttempts in the ApplicationSubmissionContext

2015-01-07 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 5fde66163 - 8fdd48959 [SPARK-2165][YARN]add support for setting maxAppAttempts in the ApplicationSubmissionContext ...xt https://issues.apache.org/jira/browse/SPARK-2165 I still have 2 questions: * If this config is not set, we should

spark git commit: [YARN][SPARK-4929] Bug fix: fix the yarn-client code to support HA

2015-01-07 Thread tgraves
Repository: spark Updated Branches: refs/heads/master e21acc197 - 5fde66163 [YARN][SPARK-4929] Bug fix: fix the yarn-client code to support HA Nowadays, yarn-client will exit directly when the HA change happens no matter how many times the am should retry. The reason may be that the default

spark git commit: [YARN][SPARK-4929] Bug fix: fix the yarn-client code to support HA

2015-01-07 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.2 db83acb1f - 7a4be0b45 [YARN][SPARK-4929] Bug fix: fix the yarn-client code to support HA Nowadays, yarn-client will exit directly when the HA change happens no matter how many times the am should retry. The reason may be that the

spark git commit: [SPARK-4966][YARN]The MemoryOverhead value is setted not correctly

2014-12-29 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.2 23d64cf08 - 2cd446a90 [SPARK-4966][YARN]The MemoryOverhead value is setted not correctly Author: meiyoula 1039320...@qq.com Closes #3797 from XuTingjun/MemoryOverhead and squashes the following commits: 5a780fc [meiyoula] Update

spark git commit: [SPARK-4344][DOCS] adding documentation on spark.yarn.user.classpath.first

2014-11-25 Thread tgraves
Repository: spark Updated Branches: refs/heads/master fef27b294 - d24076019 [SPARK-4344][DOCS] adding documentation on spark.yarn.user.classpath.first The documentation for the two parameters is the same with a pointer from the standalone parameter to the yarn parameter Author: arahuja

spark git commit: [SPARK-4344][DOCS] adding documentation on spark.yarn.user.classpath.first

2014-11-25 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.2 b026546e3 - a689ab98d [SPARK-4344][DOCS] adding documentation on spark.yarn.user.classpath.first The documentation for the two parameters is the same with a pointer from the standalone parameter to the yarn parameter Author: arahuja

spark git commit: SPARK-4457. Document how to build for Hadoop versions greater than 2.4

2014-11-24 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 9b2a3c612 - 29372b631 SPARK-4457. Document how to build for Hadoop versions greater than 2.4 Author: Sandy Ryza sa...@cloudera.com Closes #3322 from sryza/sandy-spark-4457 and squashes the following commits: 5e72b77 [Sandy Ryza] Feedback

spark git commit: [SPARK-3722][Docs]minor improvement and fix in docs

2014-11-14 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 825709a0b - e421072da [SPARK-3722][Docs]minor improvement and fix in docs https://issues.apache.org/jira/browse/SPARK-3722 Author: WangTao barneystin...@aliyun.com Closes #2579 from WangTaoTheTonic/docsWork and squashes the following

spark git commit: SPARK-4305 [BUILD] yarn-alpha profile won't build due to network/yarn module

2014-11-11 Thread tgraves
Repository: spark Updated Branches: refs/heads/master deefd9d73 - f820b563d SPARK-4305 [BUILD] yarn-alpha profile won't build due to network/yarn module SPARK-3797 introduced the `network/yarn` module, but its YARN code depends on YARN APIs not present in older versions covered by the

spark git commit: SPARK-4305 [BUILD] yarn-alpha profile won't build due to network/yarn module

2014-11-11 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.2 cc1f3a0d6 - 8f7e80f30 SPARK-4305 [BUILD] yarn-alpha profile won't build due to network/yarn module SPARK-3797 introduced the `network/yarn` module, but its YARN code depends on YARN APIs not present in older versions covered by the

spark git commit: [SPARK-4282][YARN] Stopping flag in YarnClientSchedulerBackend should be volatile

2014-11-11 Thread tgraves
Repository: spark Updated Branches: refs/heads/master f820b563d - 7f3718842 [SPARK-4282][YARN] Stopping flag in YarnClientSchedulerBackend should be volatile In YarnClientSchedulerBackend, a variable stopping is used as a flag and it's accessed by some threads so it should be volatile.

git commit: SPARK-3837. Warn when YARN kills containers for exceeding memory limits

2014-10-31 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 58a6077e5 - acd4ac7c9 SPARK-3837. Warn when YARN kills containers for exceeding memory limits I triggered the issue and verified the message gets printed on a pseudo-distributed cluster. Author: Sandy Ryza sa...@cloudera.com Closes

git commit: [SPARK-4116][YARN]Delete the abandoned log4j-spark-container.properties

2014-10-28 Thread tgraves
Repository: spark Updated Branches: refs/heads/master fae095bc7 - 47346cd02 [SPARK-4116][YARN]Delete the abandoned log4j-spark-container.properties Since its name reduced at https://github.com/apache/spark/pull/560, the log4j-spark-container.properties was never used again. And I have

git commit: [SPARK-4098][YARN]use appUIAddress instead of appUIHostPort in yarn-client mode

2014-10-28 Thread tgraves
Repository: spark Updated Branches: refs/heads/master e8813be65 - 0ac52e305 [SPARK-4098][YARN]use appUIAddress instead of appUIHostPort in yarn-client mode https://issues.apache.org/jira/browse/SPARK-4098 Author: WangTaoTheTonic barneystin...@aliyun.com Closes #2958 from

git commit: [SPARK-3788] [yarn] Fix compareFs to do the right thing for HDFS namespaces.

2014-10-08 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 35afdfd62 - 7fca8f41c [SPARK-3788] [yarn] Fix compareFs to do the right thing for HDFS namespaces. HA and viewfs use namespaces instead of host names, so you can't resolve them since that will fail. So be smarter to avoid doing unnecessary

git commit: [SPARK-3788] [yarn] Fix compareFs to do the right thing for HDFS namespaces (1.1 version).

2014-10-08 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.1 a1f833f75 - a44af7302 [SPARK-3788] [yarn] Fix compareFs to do the right thing for HDFS namespaces (1.1 version). HA and viewfs use namespaces instead of host names, so you can't resolve them since that will fail. So be smarter to

git commit: [SPARK-3848] yarn alpha doesn't build on master

2014-10-08 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 7fca8f41c - f18dd5962 [SPARK-3848] yarn alpha doesn't build on master yarn alpha build was broken by #2432 as it added an argument to YarnAllocator but not to yarn/alpha YarnAllocationHandler commit

git commit: Modify default YARN memory_overhead-- from an additive constant to a multiplier

2014-10-02 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 82a6a083a - b4fb7b80a Modify default YARN memory_overhead-- from an additive constant to a multiplier Redone against the recent master branch (https://github.com/apache/spark/pull/1391) Author: Nishkam Ravi nr...@cloudera.com Author:

git commit: [YARN] SPARK-2668: Add variable of yarn log directory for reference from the log4j configuration

2014-09-23 Thread tgraves
Repository: spark Updated Branches: refs/heads/master f9d6220c7 - 14f8c3404 [YARN] SPARK-2668: Add variable of yarn log directory for reference from the log4j configuration Assign value of yarn container log directory to java opts spark.yarn.app.container.log.dir, So user defined

[1/2] [SPARK-3477] Clean up code in Yarn Client / ClientBase

2014-09-23 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 14f8c3404 - c4022dd52 http://git-wip-us.apache.org/repos/asf/spark/blob/c4022dd5/yarn/stable/src/main/scala/org/apache/spark/deploy/yarn/Client.scala -- diff --git

[2/2] git commit: [SPARK-3477] Clean up code in Yarn Client / ClientBase

2014-09-23 Thread tgraves
[SPARK-3477] Clean up code in Yarn Client / ClientBase This is part of a broader effort to clean up the Yarn integration code after #2020. The high-level changes in this PR include: - Removing duplicate code, especially across the alpha and stable APIs - Simplify unnecessarily complex method

git commit: [SPARK-3304] [YARN] ApplicationMaster's Finish status is wrong when uncaught exception is thrown from ReporterThread

2014-09-23 Thread tgraves
Repository: spark Updated Branches: refs/heads/master c4022dd52 - 11c10df82 [SPARK-3304] [YARN] ApplicationMaster's Finish status is wrong when uncaught exception is thrown from ReporterThread Author: Kousuke Saruta saru...@oss.nttdata.co.jp Closes #2198 from sarutak/SPARK-3304 and squashes

git commit: SPARK-3177 (on Master Branch)

2014-09-17 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 983609a4d - 7d1a37239 SPARK-3177 (on Master Branch) The JIRA and PR was original created for branch-1.1, and move to master branch now. Chester The Issue is due to that yarn-alpha and yarn have different APIs for certain class fields.

git commit: [SPARK-3410] The priority of shutdownhook for ApplicationMaster should not be integer literal

2014-09-15 Thread tgraves
Repository: spark Updated Branches: refs/heads/master f493f7982 - cc1464446 [SPARK-3410] The priority of shutdownhook for ApplicationMaster should not be integer literal I think, it need to keep the priority of shutdown hook for ApplicationMaster than the priority of shutdown hook for

git commit: SPARK-3014. Log a more informative messages in a couple failure scenario...

2014-09-12 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 15a564598 - 1d767967e SPARK-3014. Log a more informative messages in a couple failure scenario... ...s Author: Sandy Ryza sa...@cloudera.com Closes #1934 from sryza/sandy-spark-3014 and squashes the following commits: ae19cc1 [Sandy

git commit: [SPARK-2140] Updating heap memory calculation for YARN stable and alpha.

2014-09-11 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.1 e51ce9a55 - 06fb2d057 [SPARK-2140] Updating heap memory calculation for YARN stable and alpha. Updated pull request, reflecting YARN stable and alpha states. I am getting intermittent test failures on my own test infrastructure. Is

git commit: [SPARK-3286] - Cannot view ApplicationMaster UI when Yarn’s url scheme i...

2014-09-10 Thread tgraves
Repository: spark Updated Branches: refs/heads/master b734ed0c2 - 6f7a76838 [SPARK-3286] - Cannot view ApplicationMaster UI when Yarn’s url scheme i... ...s https Author: Benoy Antony be...@apache.org Closes #2276 from benoyantony/SPARK-3286 and squashes the following commits: c3d51ee

git commit: SPARK-1713. Use a thread pool for launching executors.

2014-09-10 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 26503fdf2 - 1f4a648d4 SPARK-1713. Use a thread pool for launching executors. This patch copies the approach used in the MapReduce application master for launching containers. Author: Sandy Ryza sa...@cloudera.com Closes #663 from

git commit: [SPARK-3260] yarn - pass acls along with executor launch

2014-09-05 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 6a37ed838 - 51b53a758 [SPARK-3260] yarn - pass acls along with executor launch Pass along the acl settings when we launch a container so that they can be applied to viewing the logs on a running NodeManager. Author: Thomas Graves

git commit: [SPARK-3375] spark on yarn container allocation issues

2014-09-05 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 51b53a758 - 62c557609 [SPARK-3375] spark on yarn container allocation issues If yarn doesn't get the containers immediately it stops asking for them and the yarn application hangs with never getting any executors. The issue here is that

git commit: [SPARK-3347] [yarn] Fix yarn-alpha compilation.

2014-09-02 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 8f1f9aaf4 - 066f31a6b [SPARK-3347] [yarn] Fix yarn-alpha compilation. Missing import. Oops. Author: Marcelo Vanzin van...@cloudera.com Closes #2236 from vanzin/SPARK-3347 and squashes the following commits: 594fc39 [Marcelo Vanzin]

[1/2] [SPARK-2933] [yarn] Refactor and cleanup Yarn AM code.

2014-08-27 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 6f671d04f - b92d823ad http://git-wip-us.apache.org/repos/asf/spark/blob/b92d823a/yarn/common/src/main/scala/org/apache/spark/scheduler/cluster/YarnClientClusterScheduler.scala

[2/2] git commit: [SPARK-2933] [yarn] Refactor and cleanup Yarn AM code.

2014-08-27 Thread tgraves
[SPARK-2933] [yarn] Refactor and cleanup Yarn AM code. This change modifies the Yarn module so that all the logic related to running the ApplicationMaster is localized. Instead of, previously, 4 different classes with mostly identical code, now we have: - A single, shared ApplicationMaster

git commit: [SPARK-3072] YARN - Exit when reach max number failed executors

2014-08-19 Thread tgraves
Repository: spark Updated Branches: refs/heads/master cd0720ca7 - 7eb9cbc27 [SPARK-3072] YARN - Exit when reach max number failed executors In some cases on hadoop 2.x the spark application master doesn't properly exit and hangs around for 10 minutes after its really done. We should make

git commit: SPARK-1528 - spark on yarn, add support for accessing remote HDFS

2014-08-05 Thread tgraves
Repository: spark Updated Branches: refs/heads/master e87075df9 - 2c0f705e2 SPARK-1528 - spark on yarn, add support for accessing remote HDFS Add a config (spark.yarn.access.namenodes) to allow applications running on yarn to access other secure HDFS cluster. User just specifies the

git commit: SPARK-1528 - spark on yarn, add support for accessing remote HDFS

2014-08-05 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.1 b92a45058 - 6c0c65fc8 SPARK-1528 - spark on yarn, add support for accessing remote HDFS Add a config (spark.yarn.access.namenodes) to allow applications running on yarn to access other secure HDFS cluster. User just specifies the

git commit: SPARK-1890 and SPARK-1891- add admin and modify acls

2014-08-05 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 2c0f705e2 - 1ca23 SPARK-1890 and SPARK-1891- add admin and modify acls It was easier to combine these 2 jira since they touch many of the same places. This pr adds the following: - adds modify acls - adds admin acls (list of

git commit: SPARK-1890 and SPARK-1891- add admin and modify acls

2014-08-05 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.1 6c0c65fc8 - e3fe6571d SPARK-1890 and SPARK-1891- add admin and modify acls It was easier to combine these 2 jira since they touch many of the same places. This pr adds the following: - adds modify acls - adds admin acls (list of

git commit: SPARK-1680: use configs for specifying environment variables on YARN

2014-08-05 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 74f82c71b - 41e0a21b2 SPARK-1680: use configs for specifying environment variables on YARN Note that this also documents spark.executorEnv.* which to me means its public. If we don't want that please speak up. Author: Thomas Graves

git commit: SPARK-1680: use configs for specifying environment variables on YARN

2014-08-05 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.1 46b698307 - 7b798e10e SPARK-1680: use configs for specifying environment variables on YARN Note that this also documents spark.executorEnv.* which to me means its public. If we don't want that please speak up. Author: Thomas Graves

git commit: SPARK-2150: Provide direct link to finished application UI in yarn resou...

2014-07-24 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 42dfab7d3 - 46e224aaa SPARK-2150: Provide direct link to finished application UI in yarn resou... ...rce manager UI Use the event logger directory to provide a direct link to finished application UI in yarn resourcemanager UI. Author:

git commit: [SPARK-2037]: yarn client mode doesn't support spark.yarn.max.executor.failures

2014-07-24 Thread tgraves
Repository: spark Updated Branches: refs/heads/master c960b5051 - 323a83c52 [SPARK-2037]: yarn client mode doesn't support spark.yarn.max.executor.failures Author: GuoQiang Li wi...@qq.com Closes #1180 from witgo/SPARK-2037 and squashes the following commits: 3d52411 [GuoQiang Li] review

git commit: [YARN][SPARK-2606]:In some cases, the spark UI pages display incorrect

2014-07-22 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 5f7b99168 - ddadf1b00 [YARN][SPARK-2606]:In some cases,the spark UI pages display incorrect The issue is caused by #1112 . Author: GuoQiang Li wi...@qq.com Closes #1501 from witgo/webui_style and squashes the following commits: 4b34998

git commit: SPARK-1707. Remove unnecessary 3 second sleep in YarnClusterScheduler

2014-07-21 Thread tgraves
Repository: spark Updated Branches: refs/heads/master cd273a238 - f89cf65d7 SPARK-1707. Remove unnecessary 3 second sleep in YarnClusterScheduler Author: Sandy Ryza sa...@cloudera.com Closes #634 from sryza/sandy-spark-1707 and squashes the following commits: 2f6e358 [Sandy Ryza] Default

git commit: SPARK-1291: Link the spark UI to RM ui in yarn-client mode

2014-07-15 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 9dd635eb5 - 72ea56da8 SPARK-1291: Link the spark UI to RM ui in yarn-client mode Author: witgo wi...@qq.com Closes #1112 from witgo/SPARK-1291 and squashes the following commits: 6022bcd [witgo] review commit 1fbb925 [witgo] add

git commit: [SPARK-1946] Submit tasks after (configured ratio) executors have been registered

2014-07-14 Thread tgraves
Repository: spark Updated Branches: refs/heads/master d60b09bb6 - 3dd8af7a6 [SPARK-1946] Submit tasks after (configured ratio) executors have been registered Because submitting tasks and registering executors are asynchronous, in most situation, early stages' tasks run without preferred

git commit: SPARK-2400 : fix spark.yarn.max.executor.failures explaination

2014-07-08 Thread tgraves
Repository: spark Updated Branches: refs/heads/master c8a2313cd - b520b6453 SPARK-2400 : fix spark.yarn.max.executor.failures explaination According to ```scala private val maxNumExecutorFailures = sparkConf.getInt(spark.yarn.max.executor.failures,

svn commit: r1606091 - in /hadoop/common/tags: release-0.23.11-rc0/ release-0.23.11/

2014-06-27 Thread tgraves
Author: tgraves Date: Fri Jun 27 13:25:29 2014 New Revision: 1606091 URL: http://svn.apache.org/r1606091 Log: Hadoop 0.23.11 release. Added: hadoop/common/tags/release-0.23.11/ (props changed) - copied from r1606090, hadoop/common/tags/release-0.23.11-rc0/ Removed: hadoop/common

git commit: Remove use of spark.worker.instances

2014-06-26 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.0 47f8829e0 - 2d3080855 Remove use of spark.worker.instances spark.worker.instances was added as part of this commit: https://github.com/apache/spark/commit/1617816090e7b20124a512a43860a21232ebf511 My understanding is that

git commit: [SPARK-2051]In yarn.ClientBase spark.yarn.dist.* do not work

2014-06-19 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 67fca189c - bce0897bc [SPARK-2051]In yarn.ClientBase spark.yarn.dist.* do not work Author: witgo wi...@qq.com Closes #969 from witgo/yarn_ClientBase and squashes the following commits: 8117765 [witgo] review commit 3bdbc52 [witgo] Merge

svn commit: r1603642 - in /hadoop/common/branches/branch-0.23/hadoop-hdfs-project: hadoop-hdfs-httpfs/pom.xml hadoop-hdfs/CHANGES.txt hadoop-hdfs/pom.xml hadoop-hdfs/src/test/aop/build/aop.xml pom.xml

2014-06-18 Thread tgraves
Author: tgraves Date: Wed Jun 18 21:47:05 2014 New Revision: 1603642 URL: http://svn.apache.org/r1603642 Log: Preparing for 0.23.12 development Modified: hadoop/common/branches/branch-0.23/hadoop-hdfs-project/hadoop-hdfs-httpfs/pom.xml hadoop/common/branches/branch-0.23/hadoop-hdfs

svn commit: r1603694 - in /hadoop/common/branches/branch-0.23.11/hadoop-hdfs-project: hadoop-hdfs-httpfs/pom.xml hadoop-hdfs/CHANGES.txt hadoop-hdfs/pom.xml hadoop-hdfs/src/test/aop/build/aop.xml pom.

2014-06-18 Thread tgraves
Author: tgraves Date: Thu Jun 19 01:17:43 2014 New Revision: 1603694 URL: http://svn.apache.org/r1603694 Log: Preparing for release 0.23.11 Modified: hadoop/common/branches/branch-0.23.11/hadoop-hdfs-project/hadoop-hdfs-httpfs/pom.xml hadoop/common/branches/branch-0.23.11/hadoop-hdfs

svn commit: r1603694 - in /hadoop/common/branches/branch-0.23.11/hadoop-mapreduce-project: ./ hadoop-mapreduce-client/ hadoop-mapreduce-client/hadoop-mapreduce-client-app/ hadoop-mapreduce-client/hado

2014-06-18 Thread tgraves
Author: tgraves Date: Thu Jun 19 01:17:43 2014 New Revision: 1603694 URL: http://svn.apache.org/r1603694 Log: Preparing for release 0.23.11 Modified: hadoop/common/branches/branch-0.23.11/hadoop-mapreduce-project/CHANGES.txt hadoop/common/branches/branch-0.23.11/hadoop-mapreduce-project

svn commit: r1603694 - in /hadoop/common/branches/branch-0.23.11/hadoop-yarn-project: ./ hadoop-yarn/ hadoop-yarn/hadoop-yarn-api/ hadoop-yarn/hadoop-yarn-applications/ hadoop-yarn/hadoop-yarn-applica

2014-06-18 Thread tgraves
Author: tgraves Date: Thu Jun 19 01:17:43 2014 New Revision: 1603694 URL: http://svn.apache.org/r1603694 Log: Preparing for release 0.23.11 Modified: hadoop/common/branches/branch-0.23.11/hadoop-yarn-project/CHANGES.txt hadoop/common/branches/branch-0.23.11/hadoop-yarn-project/hadoop

svn commit: r1603641 - /hadoop/common/branches/branch-0.23.11/

2014-06-18 Thread tgraves
Author: tgraves Date: Wed Jun 18 21:43:10 2014 New Revision: 1603641 URL: http://svn.apache.org/r1603641 Log: Branching for 0.23.11 releases Added: hadoop/common/branches/branch-0.23.11/ (props changed) - copied from r1603640, hadoop/common/branches/branch-0.23/ Propchange: hadoop

svn commit: r1603642 - in /hadoop/common/branches/branch-0.23: ./ hadoop-assemblies/ hadoop-client/ hadoop-dist/ hadoop-minicluster/ hadoop-project-dist/ hadoop-project/ hadoop-tools/ hadoop-tools/had

2014-06-18 Thread tgraves
Author: tgraves Date: Wed Jun 18 21:47:05 2014 New Revision: 1603642 URL: http://svn.apache.org/r1603642 Log: Preparing for 0.23.12 development Modified: hadoop/common/branches/branch-0.23/hadoop-assemblies/pom.xml hadoop/common/branches/branch-0.23/hadoop-client/pom.xml hadoop

svn commit: r1603642 - in /hadoop/common/branches/branch-0.23/hadoop-common-project: hadoop-annotations/pom.xml hadoop-auth-examples/pom.xml hadoop-auth/pom.xml hadoop-common/CHANGES.txt hadoop-common

2014-06-18 Thread tgraves
Author: tgraves Date: Wed Jun 18 21:47:05 2014 New Revision: 1603642 URL: http://svn.apache.org/r1603642 Log: Preparing for 0.23.12 development Modified: hadoop/common/branches/branch-0.23/hadoop-common-project/hadoop-annotations/pom.xml hadoop/common/branches/branch-0.23/hadoop-common

svn commit: r1603694 - in /hadoop/common/branches/branch-0.23.11: ./ hadoop-assemblies/ hadoop-client/ hadoop-dist/ hadoop-minicluster/ hadoop-project-dist/ hadoop-project/ hadoop-tools/ hadoop-tools/

2014-06-18 Thread tgraves
Author: tgraves Date: Thu Jun 19 01:17:43 2014 New Revision: 1603694 URL: http://svn.apache.org/r1603694 Log: Preparing for release 0.23.11 Modified: hadoop/common/branches/branch-0.23.11/hadoop-assemblies/pom.xml hadoop/common/branches/branch-0.23.11/hadoop-client/pom.xml hadoop

svn commit: r1603694 [3/3] - in /hadoop/common/branches/branch-0.23.11/hadoop-common-project: ./ hadoop-annotations/ hadoop-auth-examples/ hadoop-auth/ hadoop-common/ hadoop-common/src/main/docs/

2014-06-18 Thread tgraves
Modified: hadoop/common/branches/branch-0.23.11/hadoop-common-project/pom.xml URL: http://svn.apache.org/viewvc/hadoop/common/branches/branch-0.23.11/hadoop-common-project/pom.xml?rev=1603694r1=1603693r2=1603694view=diff

svn commit: r1603694 [1/3] - in /hadoop/common/branches/branch-0.23.11/hadoop-common-project: ./ hadoop-annotations/ hadoop-auth-examples/ hadoop-auth/ hadoop-common/ hadoop-common/src/main/docs/

2014-06-18 Thread tgraves
Author: tgraves Date: Thu Jun 19 01:17:43 2014 New Revision: 1603694 URL: http://svn.apache.org/r1603694 Log: Preparing for release 0.23.11 Modified: hadoop/common/branches/branch-0.23.11/hadoop-common-project/hadoop-annotations/pom.xml hadoop/common/branches/branch-0.23.11/hadoop

svn commit: r1603696 - /hadoop/common/tags/release-0.23.11-rc0/

2014-06-18 Thread tgraves
Author: tgraves Date: Thu Jun 19 01:32:25 2014 New Revision: 1603696 URL: http://svn.apache.org/r1603696 Log: Hadoop 0.23.11-rc0 release. Added: hadoop/common/tags/release-0.23.11-rc0/ (props changed) - copied from r1603695, hadoop/common/branches/branch-0.23.11/ Propchange: hadoop

git commit: [SPARK-2080] Yarn: report HS URL in client mode, correct user in cluster mode.

2014-06-12 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 83c226d45 - ecde5b837 [SPARK-2080] Yarn: report HS URL in client mode, correct user in cluster mode. Yarn client mode was not setting the app's tracking URL to the History Server's URL when configured by the user. Now client mode behaves

git commit: SPARK-1639. Tidy up some Spark on YARN code

2014-06-11 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 6e1193031 - 2a4225dd9 SPARK-1639. Tidy up some Spark on YARN code This contains a bunch of small tidyings of the Spark on YARN code. I focused on the yarn stable code. @tgravescs, let me know if you'd like me to make these for the alpha

git commit: [SPARK-1978] In some cases, spark-yarn does not automatically restart the failed container

2014-06-10 Thread tgraves
Repository: spark Updated Branches: refs/heads/master a9a461c59 - 884ca718b [SPARK-1978] In some cases, spark-yarn does not automatically restart the failed container Author: witgo wi...@qq.com Closes #921 from witgo/allocateExecutors and squashes the following commits: bc3aa66 [witgo]

git commit: [SPARK-1522] : YARN ClientBase throws a NPE if there is no YARN Application CP

2014-06-09 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 6cf335d79 - e27344768 [SPARK-1522] : YARN ClientBase throws a NPE if there is no YARN Application CP The current implementation of ClientBase.getDefaultYarnApplicationClasspath inspects the MRJobConfig class for the field

git commit: SPARK-1557 Set permissions on event log files/directories

2014-04-29 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 9a1184a8a - 8db0f7e28 SPARK-1557 Set permissions on event log files/directories This adds minimal setting of event log directory/files permissions. To have a secure environment the user must manually create the top level event log

svn commit: r1588283 - in /hadoop/common/branches/branch-2/hadoop-yarn-project: CHANGES.txt hadoop-yarn/hadoop-yarn-common/src/main/java/org/apache/hadoop/yarn/util/Apps.java

2014-04-17 Thread tgraves
Author: tgraves Date: Thu Apr 17 15:25:27 2014 New Revision: 1588283 URL: http://svn.apache.org/r1588283 Log: Merge 1588281 to branch-2. YARN-1931. Private API change in YARN-1824 in 2.4 broke compatibility with previous releases (Sandy Ryza via tgraves) Modified: hadoop/common/branches

svn commit: r1588281 - in /hadoop/common/trunk/hadoop-yarn-project: CHANGES.txt hadoop-yarn/hadoop-yarn-common/src/main/java/org/apache/hadoop/yarn/util/Apps.java

2014-04-17 Thread tgraves
Author: tgraves Date: Thu Apr 17 15:23:22 2014 New Revision: 1588281 URL: http://svn.apache.org/r1588281 Log: YARN-1931. Private API change in YARN-1824 in 2.4 broke compatibility with previous releases (Sandy Ryza via tgraves) Modified: hadoop/common/trunk/hadoop-yarn-project/CHANGES.txt

svn commit: r1588287 - in /hadoop/common/branches/branch-2.4/hadoop-yarn-project: CHANGES.txt hadoop-yarn/hadoop-yarn-common/src/main/java/org/apache/hadoop/yarn/util/Apps.java

2014-04-17 Thread tgraves
Author: tgraves Date: Thu Apr 17 15:31:58 2014 New Revision: 1588287 URL: http://svn.apache.org/r1588287 Log: Merge 1588281 to branch-2.4. YARN-1931. Private API change in YARN-1824 in 2.4 broke compatibility with previous releases (Sandy Ryza via tgraves) Modified: hadoop/common/branches

git commit: [SPARK-1395] Allow local: URIs to work on Yarn.

2014-04-17 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.0 b3ad707c4 - a83a794f1 [SPARK-1395] Allow local: URIs to work on Yarn. This only works for the three paths defined in the environment (SPARK_JAR, SPARK_YARN_APP_JAR and SPARK_LOG4J_CONF). Tested by running SparkPi with local: and file:

<    1   2   3   4   5   6   7   8   9   10   >