[GitHub] spark pull request #20383: [SPARK-23200] Reset Kubernetes-specific config on...

2018-01-26 Thread jerryshao
Github user jerryshao commented on a diff in the pull request:

https://github.com/apache/spark/pull/20383#discussion_r164106372
  
--- Diff: 
streaming/src/main/scala/org/apache/spark/streaming/Checkpoint.scala ---
@@ -53,6 +53,21 @@ class Checkpoint(ssc: StreamingContext, val 
checkpointTime: Time)
   "spark.driver.host",
   "spark.driver.bindAddress",
   "spark.driver.port",
+  "spark.kubernetes.driver.pod.name",
+  "spark.kubernetes.executor.podNamePrefix",
+  "spark.kubernetes.initcontainer.executor.configmapname",
+  "spark.kubernetes.initcontainer.executor.configmapkey",
+  "spark.kubernetes.initcontainer.downloadJarsResourceIdentifier",
+  "spark.kubernetes.initcontainer.downloadJarsSecretLocation",
+  "spark.kubernetes.initcontainer.downloadFilesResourceIdentifier",
+  "spark.kubernetes.initcontainer.downloadFilesSecretLocation",
+  "spark.kubernetes.initcontainer.remoteJars",
+  "spark.kubernetes.initcontainer.remoteFiles",
+  "spark.kubernetes.mountdependencies.jarsDownloadDir",
+  "spark.kubernetes.mountdependencies.filesDownloadDir",
+  "spark.kubernetes.initcontainer.executor.stagingServerSecret.name",
--- End diff --

I think it will not affect the correctness of streaming application. 


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request #20383: [SPARK-23200] Reset Kubernetes-specific config on...

2018-01-26 Thread foxish
Github user foxish commented on a diff in the pull request:

https://github.com/apache/spark/pull/20383#discussion_r164084503
  
--- Diff: 
streaming/src/main/scala/org/apache/spark/streaming/Checkpoint.scala ---
@@ -53,6 +53,21 @@ class Checkpoint(ssc: StreamingContext, val 
checkpointTime: Time)
   "spark.driver.host",
   "spark.driver.bindAddress",
   "spark.driver.port",
+  "spark.kubernetes.driver.pod.name",
+  "spark.kubernetes.executor.podNamePrefix",
+  "spark.kubernetes.initcontainer.executor.configmapname",
+  "spark.kubernetes.initcontainer.executor.configmapkey",
+  "spark.kubernetes.initcontainer.downloadJarsResourceIdentifier",
+  "spark.kubernetes.initcontainer.downloadJarsSecretLocation",
+  "spark.kubernetes.initcontainer.downloadFilesResourceIdentifier",
+  "spark.kubernetes.initcontainer.downloadFilesSecretLocation",
+  "spark.kubernetes.initcontainer.remoteJars",
+  "spark.kubernetes.initcontainer.remoteFiles",
+  "spark.kubernetes.mountdependencies.jarsDownloadDir",
+  "spark.kubernetes.mountdependencies.filesDownloadDir",
+  "spark.kubernetes.initcontainer.executor.stagingServerSecret.name",
--- End diff --

Sorry - should have caught this. The `stagingServer` configs don't exist in 
upstream. We should remove those, although it's a NOP here I'm guessing. 

@ssaavedra @felixcheung 


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request #20383: [SPARK-23200] Reset Kubernetes-specific config on...

2018-01-25 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/20383


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request #20383: [SPARK-23200] Reset Kubernetes-specific config on...

2018-01-24 Thread ssaavedra
GitHub user ssaavedra opened a pull request:

https://github.com/apache/spark/pull/20383

[SPARK-23200] Reset Kubernetes-specific config on Checkpoint restore

## What changes were proposed in this pull request?

When using the Kubernetes cluster-manager and spawning a Streaming 
workload, it is important to reset many spark.kubernetes.* properties that are 
generated by spark-submit but which would get rewritten when restoring a 
Checkpoint. This is so, because the spark-submit codepath creates Kubernetes 
resources, such as a ConfigMap, a Secret and other variables, which have an 
autogenerated name and the previous one will not resolve anymore.

In short, this change enables checkpoint restoration for streaming 
workloads, and thus enables Spark Streaming workloads in Kubernetes, which were 
not possible to restore from a checkpoint before if the workload went down.

## How was this patch tested?

This patch was tested with the twitter-streaming example in AWS, using 
checkpoints in s3 with the s3a:// protocol, as supported by Hadoop.

This is similar to the YARN related code for resetting a Spark Streaming 
workload, but for the Kubernetes scheduler. I'm adding the initcontainers 
properties because even if the discussion is not completely settled on the 
mailing list, my understanding is that at this moment they are going forward 
for the moment.

For a previous discussion, see the non-rebased work at: 
https://github.com/apache-spark-on-k8s/spark/pull/516

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/ssaavedra/spark fix-k8s-checkpointing

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/20383.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #20383


commit cd9b73a1e85c45886243ae7176a1179de7329c16
Author: Santiago Saavedra 
Date:   2017-10-17T13:18:03Z

Reset Kubernetes-specific config on Checkpoint restore

Several configuration parameters related to Kubernetes need to be
reset, as they are changed with each invokation of spark-submit and
thus prevents recovery of Spark Streaming tasks.




---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org