[ 
https://issues.apache.org/jira/browse/SPARK-43504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Fei Wang updated SPARK-43504:
-----------------------------
    Description: 
Since SPARK-25815 [,|https://github.com/apache/spark/pull/22911,] the hadoop 
config map is not in executor side.

Per the  
[https://github.com/apache/spark/pull/22911|https://github.com/apache/spark/pull/22911,]
 description:
{code:java}
The main two things that don't need to happen in executors anymore are:
1. adding the Hadoop config to the executor pods: this is not needed
since the Spark driver will serialize the Hadoop config and send
it to executors when running tasks. {code}
But in fact, the executor still need the hadoop configuration.

 

!https://user-images.githubusercontent.com/6757692/238268640-8ff41144-5812-4232-b572-2de2408348ed.png!

 

As shown in above picture, the driver can resolve `hdfs://zeus`, but the 
executor can not.

so we still need to mount the hadoop config map in executor side.

  was:
Since SPARK-25815[,|https://github.com/apache/spark/pull/22911,] the hadoop 
config map is not in executor side.

Per the  
[https://github.com/apache/spark/pull/22911|https://github.com/apache/spark/pull/22911,]
 description:
{code:java}
The main two things that don't need to happen in executors anymore are:
1. adding the Hadoop config to the executor pods: this is not needed
since the Spark driver will serialize the Hadoop config and send
it to executors when running tasks. {code}
But in fact, the executor still need the hadoop configuration.

 

!https://user-images.githubusercontent.com/6757692/238268640-8ff41144-5812-4232-b572-2de2408348ed.png!

 

As shown in above picture, the driver can resolve `hdfs://zeus`, but the 
executor can not.


> [K8S] Mount hadoop config map in executor side
> ----------------------------------------------
>
>                 Key: SPARK-43504
>                 URL: https://issues.apache.org/jira/browse/SPARK-43504
>             Project: Spark
>          Issue Type: Improvement
>          Components: Kubernetes
>    Affects Versions: 3.4.0
>            Reporter: Fei Wang
>            Priority: Major
>
> Since SPARK-25815 [,|https://github.com/apache/spark/pull/22911,] the hadoop 
> config map is not in executor side.
> Per the  
> [https://github.com/apache/spark/pull/22911|https://github.com/apache/spark/pull/22911,]
>  description:
> {code:java}
> The main two things that don't need to happen in executors anymore are:
> 1. adding the Hadoop config to the executor pods: this is not needed
> since the Spark driver will serialize the Hadoop config and send
> it to executors when running tasks. {code}
> But in fact, the executor still need the hadoop configuration.
>  
> !https://user-images.githubusercontent.com/6757692/238268640-8ff41144-5812-4232-b572-2de2408348ed.png!
>  
> As shown in above picture, the driver can resolve `hdfs://zeus`, but the 
> executor can not.
> so we still need to mount the hadoop config map in executor side.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to