[ https://issues.apache.org/jira/browse/SPARK-2563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14065735#comment-14065735 ]
Shivaram Venkataraman commented on SPARK-2563: ---------------------------------------------- https://github.com/apache/spark/pull/1471 > Make number of connection retries configurable > ---------------------------------------------- > > Key: SPARK-2563 > URL: https://issues.apache.org/jira/browse/SPARK-2563 > Project: Spark > Issue Type: Improvement > Components: Spark Core > Reporter: Shivaram Venkataraman > Priority: Minor > > In a large EC2 cluster, I often see the first shuffle stage in a job fail due > to connection timeout exceptions. We should make the number of retries before > failing configurable to handle these cases. -- This message was sent by Atlassian JIRA (v6.2#6252)