[ 
https://issues.apache.org/jira/browse/SPARK-46343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jingwei (Sophie) Zhang updated SPARK-46343:
-------------------------------------------
    Attachment: Screenshot 2023-05-16 221916.png

> Spark cannot support Docker bridge network in YARN
> --------------------------------------------------
>
>                 Key: SPARK-46343
>                 URL: https://issues.apache.org/jira/browse/SPARK-46343
>             Project: Spark
>          Issue Type: Bug
>          Components: YARN
>    Affects Versions: 4.0.0, 3.5.1
>         Environment: OS: Ubuntu 22.04.2 LTS
> JDK Version: 1.8
> Hadoop Version: 3.3.6
> Spark Version: 3.5.1
>            Reporter: Jingwei (Sophie) Zhang
>            Priority: Major
>         Attachments: Screenshot 2023-05-16 221916.png
>
>
> Hello Spark team,
> I recently found a possible bug in Spark YarnAllocator.
> Basically when I try to run Spark applications on YARN with Docker bridge 
> network, the job failed with binding address error at Executor side.
> I believe it is caused by the YarnAllocator implementation in Spark, the 
> executor is trying to bind the hostname of the NodeManager instead of the 
> hostname of the container. In host network it's fine but bridge network will 
> break.
> For more details please checkout [RCA - Spark + YARN Docker Bridge 
> Network|https://github.com/EC528-Fall-2023/Kata-Containers-for-SPARK/blob/main/docs/troubleshoot/rca-docker-bridge-net.md].
> It looks like YARN Container API does not contain the container hostname 
> related information, which mean to solve this issue, we may also need to make 
> changes at Hadoop YARN side?
>  
> Please let me know if you have any questions, many thanks!
> —
> Best Regards,
> Jingwei Zhang



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to