[GitHub] spark pull request #22362: [SPARK-25372][YARN][K8S] Deprecate and generalize...

2018-09-26 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/22362


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request #22362: [SPARK-25372][YARN][K8S] Deprecate and generalize...

2018-09-25 Thread ifilonenko
Github user ifilonenko commented on a diff in the pull request:

https://github.com/apache/spark/pull/22362#discussion_r220364426
  
--- Diff: core/src/main/scala/org/apache/spark/SparkConf.scala ---
@@ -726,7 +726,11 @@ private[spark] object SparkConf extends Logging {
 DRIVER_MEMORY_OVERHEAD.key -> Seq(
   AlternateConfig("spark.yarn.driver.memoryOverhead", "2.3")),
 EXECUTOR_MEMORY_OVERHEAD.key -> Seq(
-  AlternateConfig("spark.yarn.executor.memoryOverhead", "2.3"))
+  AlternateConfig("spark.yarn.executor.memoryOverhead", "2.3")),
+KEYTAB.key -> Seq(
+  AlternateConfig("spark.yarn.keytab", "2.4")),
--- End diff --

Okay. I will then point to 2.5.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request #22362: [SPARK-25372][YARN][K8S] Deprecate and generalize...

2018-09-25 Thread vanzin
Github user vanzin commented on a diff in the pull request:

https://github.com/apache/spark/pull/22362#discussion_r220360201
  
--- Diff: core/src/main/scala/org/apache/spark/SparkConf.scala ---
@@ -726,7 +726,11 @@ private[spark] object SparkConf extends Logging {
 DRIVER_MEMORY_OVERHEAD.key -> Seq(
   AlternateConfig("spark.yarn.driver.memoryOverhead", "2.3")),
 EXECUTOR_MEMORY_OVERHEAD.key -> Seq(
-  AlternateConfig("spark.yarn.executor.memoryOverhead", "2.3"))
+  AlternateConfig("spark.yarn.executor.memoryOverhead", "2.3")),
+KEYTAB.key -> Seq(
+  AlternateConfig("spark.yarn.keytab", "2.4")),
--- End diff --

This change doesn't make a whole lot of sense without the rest of the k8s 
kerberos work, and that change is a lot more controversial to pull into 2.4 at 
this point. So I'm leaning towards no.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request #22362: [SPARK-25372][YARN][K8S] Deprecate and generalize...

2018-09-25 Thread ifilonenko
Github user ifilonenko commented on a diff in the pull request:

https://github.com/apache/spark/pull/22362#discussion_r220356187
  
--- Diff: core/src/main/scala/org/apache/spark/SparkConf.scala ---
@@ -726,7 +726,11 @@ private[spark] object SparkConf extends Logging {
 DRIVER_MEMORY_OVERHEAD.key -> Seq(
   AlternateConfig("spark.yarn.driver.memoryOverhead", "2.3")),
 EXECUTOR_MEMORY_OVERHEAD.key -> Seq(
-  AlternateConfig("spark.yarn.executor.memoryOverhead", "2.3"))
+  AlternateConfig("spark.yarn.executor.memoryOverhead", "2.3")),
+KEYTAB.key -> Seq(
+  AlternateConfig("spark.yarn.keytab", "2.4")),
--- End diff --

Thank you for the test! I appreciate it. As the final RC Is not cut, it 
could be added if you think it should.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request #22362: [SPARK-25372][YARN][K8S] Deprecate and generalize...

2018-09-25 Thread vanzin
Github user vanzin commented on a diff in the pull request:

https://github.com/apache/spark/pull/22362#discussion_r220354878
  
--- Diff: core/src/main/scala/org/apache/spark/SparkConf.scala ---
@@ -726,7 +726,11 @@ private[spark] object SparkConf extends Logging {
 DRIVER_MEMORY_OVERHEAD.key -> Seq(
   AlternateConfig("spark.yarn.driver.memoryOverhead", "2.3")),
 EXECUTOR_MEMORY_OVERHEAD.key -> Seq(
-  AlternateConfig("spark.yarn.executor.memoryOverhead", "2.3"))
+  AlternateConfig("spark.yarn.executor.memoryOverhead", "2.3")),
+KEYTAB.key -> Seq(
+  AlternateConfig("spark.yarn.keytab", "2.4")),
--- End diff --

This should be 2.5, also below. This change won't go into 2.4 at this 
point...


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request #22362: [SPARK-25372][YARN][K8S] Deprecate and generalize...

2018-09-21 Thread ifilonenko
Github user ifilonenko commented on a diff in the pull request:

https://github.com/apache/spark/pull/22362#discussion_r219638207
  
--- Diff: R/pkg/R/sparkR.R ---
@@ -624,8 +624,8 @@ sparkConfToSubmitOps[["spark.driver.extraClassPath"]]   
<- "--driver-class-path"
 sparkConfToSubmitOps[["spark.driver.extraJavaOptions"]] <- 
"--driver-java-options"
 sparkConfToSubmitOps[["spark.driver.extraLibraryPath"]] <- 
"--driver-library-path"
 sparkConfToSubmitOps[["spark.master"]] <- "--master"
-sparkConfToSubmitOps[["spark.yarn.keytab"]] <- "--keytab"
--- End diff --

Added backwards compatibility


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request #22362: [SPARK-25372][YARN][K8S] Deprecate and generalize...

2018-09-08 Thread felixcheung
Github user felixcheung commented on a diff in the pull request:

https://github.com/apache/spark/pull/22362#discussion_r216122659
  
--- Diff: 
core/src/main/scala/org/apache/spark/deploy/SparkSubmitArguments.scala ---
@@ -199,8 +199,8 @@ private[deploy] class SparkSubmitArguments(args: 
Seq[String], env: Map[String, S
 numExecutors = Option(numExecutors)
   .getOrElse(sparkProperties.get("spark.executor.instances").orNull)
 queue = 
Option(queue).orElse(sparkProperties.get("spark.yarn.queue")).orNull
-keytab = 
Option(keytab).orElse(sparkProperties.get("spark.yarn.keytab")).orNull
-principal = 
Option(principal).orElse(sparkProperties.get("spark.yarn.principal")).orNull
+keytab = 
Option(keytab).orElse(sparkProperties.get("spark.kerberos.keytab")).orNull
--- End diff --

agreed, shouldn't the "old" config still work? `spark.yarn.keytab` etc


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request #22362: [SPARK-25372][YARN][K8S] Deprecate and generalize...

2018-09-07 Thread vanzin
Github user vanzin commented on a diff in the pull request:

https://github.com/apache/spark/pull/22362#discussion_r216108106
  
--- Diff: 
core/src/main/scala/org/apache/spark/deploy/SparkSubmitArguments.scala ---
@@ -199,8 +199,8 @@ private[deploy] class SparkSubmitArguments(args: 
Seq[String], env: Map[String, S
 numExecutors = Option(numExecutors)
   .getOrElse(sparkProperties.get("spark.executor.instances").orNull)
 queue = 
Option(queue).orElse(sparkProperties.get("spark.yarn.queue")).orNull
-keytab = 
Option(keytab).orElse(sparkProperties.get("spark.yarn.keytab")).orNull
-principal = 
Option(principal).orElse(sparkProperties.get("spark.yarn.principal")).orNull
+keytab = 
Option(keytab).orElse(sparkProperties.get("spark.kerberos.keytab")).orNull
--- End diff --

This might be easier to fix though; `prepareSubmitEnvironment` (where these 
things are used) could just read them from `SparkConf` at that point. Something 
like this in that method, instead of this code:

Option(args.keytab).orElse(sparkConf.get(KEYTAB))


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request #22362: [SPARK-25372][YARN][K8S] Deprecate and generalize...

2018-09-07 Thread vanzin
Github user vanzin commented on a diff in the pull request:

https://github.com/apache/spark/pull/22362#discussion_r216107597
  
--- Diff: 
core/src/main/scala/org/apache/spark/deploy/SparkSubmitArguments.scala ---
@@ -199,8 +199,8 @@ private[deploy] class SparkSubmitArguments(args: 
Seq[String], env: Map[String, S
 numExecutors = Option(numExecutors)
   .getOrElse(sparkProperties.get("spark.executor.instances").orNull)
 queue = 
Option(queue).orElse(sparkProperties.get("spark.yarn.queue")).orNull
-keytab = 
Option(keytab).orElse(sparkProperties.get("spark.yarn.keytab")).orNull
-principal = 
Option(principal).orElse(sparkProperties.get("spark.yarn.principal")).orNull
+keytab = 
Option(keytab).orElse(sparkProperties.get("spark.kerberos.keytab")).orNull
--- End diff --

This has a similar issue as the R code above re: backwards compatibility.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request #22362: [SPARK-25372][YARN][K8S] Deprecate and generalize...

2018-09-07 Thread vanzin
Github user vanzin commented on a diff in the pull request:

https://github.com/apache/spark/pull/22362#discussion_r216107505
  
--- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala ---
@@ -520,6 +520,10 @@ private[spark] class SparkSubmit extends Logging {
 confKey = "spark.driver.extraJavaOptions"),
   OptionAssigner(args.driverExtraLibraryPath, ALL_CLUSTER_MGRS, 
ALL_DEPLOY_MODES,
 confKey = "spark.driver.extraLibraryPath"),
+  OptionAssigner(args.principal, ALL_CLUSTER_MGRS, ALL_DEPLOY_MODES,
+confKey = "spark.kerberos.principal"),
--- End diff --

Since you're changing this, could you use `PRINCIPAL.key` and `KEYTAB.key` 
in the places where the config name is referenced?


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request #22362: [SPARK-25372][YARN][K8S] Deprecate and generalize...

2018-09-07 Thread vanzin
Github user vanzin commented on a diff in the pull request:

https://github.com/apache/spark/pull/22362#discussion_r216107385
  
--- Diff: R/pkg/R/sparkR.R ---
@@ -624,8 +624,8 @@ sparkConfToSubmitOps[["spark.driver.extraClassPath"]]   
<- "--driver-class-path"
 sparkConfToSubmitOps[["spark.driver.extraJavaOptions"]] <- 
"--driver-java-options"
 sparkConfToSubmitOps[["spark.driver.extraLibraryPath"]] <- 
"--driver-library-path"
 sparkConfToSubmitOps[["spark.master"]] <- "--master"
-sparkConfToSubmitOps[["spark.yarn.keytab"]] <- "--keytab"
--- End diff --

@felixcheung 

This seems wrong, since existing apps are using the old config, not the new 
one, so you may need some backwards compatibility code.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request #22362: [SPARK-25372][YARN][K8S] Deprecate and generalize...

2018-09-07 Thread ifilonenko
GitHub user ifilonenko opened a pull request:

https://github.com/apache/spark/pull/22362

[SPARK-25372][YARN][K8S] Deprecate and generalize keytab / principal config

## What changes were proposed in this pull request?

SparkSubmit already logs in the user if a keytab is provided, the only 
issue is that it uses the existing configs which have "yarn" in their name. As 
such, the configs were changed to:

`spark.kerberos.keytab` and `spark.kerberos.principal`.

## How was this patch tested?

Will be tested with K8S tests, but needs to be tested with Yarn

- [ ] K8S Secure HDFS tests
- [ ] Yarn Secure HDFS tests @vanzin 

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/ifilonenko/spark SPARK-25372

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/22362.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #22362


commit 30286f85ac315cad2578fe2e11c8718b8883630a
Author: Ilan Filonenko 
Date:   2018-09-07T22:28:25Z

first step




---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org