[
https://issues.apache.org/jira/browse/SPARK-3736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14174753#comment-14174753
]
Andrew Ash commented on SPARK-3736:
---
The configuration for Hadoop's retry policy was
[
https://issues.apache.org/jira/browse/SPARK-3736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14174049#comment-14174049
]
Apache Spark commented on SPARK-3736:
-
User 'mccheah' has created a pull request for
[
https://issues.apache.org/jira/browse/SPARK-3736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14171358#comment-14171358
]
Nan Zhu commented on SPARK-3736:
if the worker itself timeout, the Master will remove the
[
https://issues.apache.org/jira/browse/SPARK-3736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14171360#comment-14171360
]
Nan Zhu commented on SPARK-3736:
BTW, master will not send heartbeat to Worker proactively
[
https://issues.apache.org/jira/browse/SPARK-3736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14165513#comment-14165513
]
Matt Cheah commented on SPARK-3736:
---
Are the two linked cases above different though?
[
https://issues.apache.org/jira/browse/SPARK-3736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14164578#comment-14164578
]
Patrick Wendell commented on SPARK-3736:
I spoke a bit offline with [~ilikerps]
[
https://issues.apache.org/jira/browse/SPARK-3736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14152594#comment-14152594
]
Andrew Ash commented on SPARK-3736:
---
I can't tell for sure but this is possibly related