[
https://issues.apache.org/jira/browse/SPARK-23182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Petar Petrov updated SPARK-23182:
---------------------------------
Affects Version/s: (was: 2.2.0)
2.4.0
> Allow enabling of TCP keep alive for master RPC connections
> -----------------------------------------------------------
>
> Key: SPARK-23182
> URL: https://issues.apache.org/jira/browse/SPARK-23182
> Project: Spark
> Issue Type: Improvement
> Components: Spark Core
> Affects Versions: 2.4.0
> Reporter: Petar Petrov
> Priority: Major
>
> We rely heavily on preemptible worker machines in GCP/GCE. These machines
> disappear without closing the TCP connections to the master which increases
> the number of established connections and new workers can not connect because
> of "Too many open files" on the master.
> To solve the problem we need to enable TCP keep alive for the RPC connections
> to the master but it's not possible to do so via configuration.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]