[ 
https://issues.apache.org/jira/browse/SPARK-37537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hyukjin Kwon updated SPARK-37537:
---------------------------------
    Priority: Major  (was: Blocker)

> Spark 3.2.0 driver pod does not mount checkpoint filesystem from Kubernetes 
> PVC
> -------------------------------------------------------------------------------
>
>                 Key: SPARK-37537
>                 URL: https://issues.apache.org/jira/browse/SPARK-37537
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Submit
>    Affects Versions: 3.2.0
>            Reporter: Petri
>            Priority: Major
>
> I have Spark 3.2.0 driver executing in Kubernetes pod in client mode and 
> following configs has been defined in spark-submit:
> {code:java}
> --deploy-mode client
> --conf 
> spark.kubernetes.driver.volumes.persistentVolumeClaim.glustervol.mount.path=/mnt/distributedDisk
> --conf 
> spark.kubernetes.driver.volumes.persistentVolumeClaim.glustervol.readOnly=false
> --conf 
> spark.kubernetes.driver.volumes.persistentVolumeClaim.glustervol.options.claimName=lolastreamingapp-conf
>  
> spark.kubernetes.executor.volumes.persistentVolumeClaim.glustervol.mount.path=/mnt/distributedDisk
> --conf 
> spark.kubernetes.executor.volumes.persistentVolumeClaim.glustervol.readOnly=false
> --conf 
> spark.kubernetes.executor.volumes.persistentVolumeClaim.glustervol.options.claimName=lolastreamingapp
>   {code}
> I face a problem when starting the driver pod that it cannot access the 
> filesystem mounted from GlusterFS PVC. I can see that driver pod has not 
> mounted the PVC when describing the pod. I can also see that PVC is not 
> mounted when describing the PVC.
> This has been working with Spark version 2.4.x, but not with Spark 3.2.0.
> Only notable change we have between using Spark version 2.4.x and 3.2.0 is 
> that in 2.4.x we used deploy-mode cluster and in 3.2.0 we use deploy-mode 
> client.
>  
> Because the filesystem used for checkpointing is not mounted properly, we get 
> following kind of error in our application:
> {code:java}
> java.io.FileNotFoundException: File 
> /mnt/distributedDisk/SE/LolaStreamingApp/1.0.0/1468589949 does not exist
>         at 
> org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:779)
>  ~[hadoop-client-api-3.3.1.jar:?]
>         at 
> org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:1100)
>  ~[hadoop-client-api-3.3.1.jar:?]
>         at 
> org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:769)
>  ~[hadoop-client-api-3.3.1.jar:?]
>         at 
> org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:462)
>  ~[hadoop-client-api-3.3.1.jar:?]
>         at 
> org.apache.spark.streaming.StreamingContext.checkpoint(StreamingContext.scala:240)
>  ~[spark-streaming_2.12-3.2.0.jar:3.2.0]
>         at 
> org.apache.spark.streaming.api.java.JavaStreamingContext.checkpoint(JavaStreamingContext.scala:509)
>  ~[spark-streaming_2.12-3.2.0.jar:3.2.0] {code}
>  
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to