Was able to resolve the idle connections being terminated issue using
EnvoyFilter

On Sat, 3 Sept 2022 at 18:14, Ilan Filonenko <i...@cornell.edu> wrote:

> Must be set in envoy (maybe could passthrough via istio)
>
> https://www.envoyproxy.io/docs/envoy/latest/api-v3/config/core/v3/protocol.proto#envoy-v3-api-field-config-core-v3-httpprotocoloptions-idle-timeout
>
>
> On Sat, Sep 3, 2022 at 4:23 AM Deepak Sharma <deepakmc...@gmail.com>
> wrote:
>
>> Thank for the reply IIan .
>> Can we set this in spark conf or does it need to goto istio / envoy conf?
>>
>>
>>
>> On Sat, 3 Sept 2022 at 10:28, Ilan Filonenko <i...@cornell.edu> wrote:
>>
>>> This might be a result of the idle_timeout that is configured in envoy.
>>> The default is an hour.
>>>
>>> On Sat, Sep 3, 2022 at 12:17 AM Deepak Sharma <deepakmc...@gmail.com>
>>> wrote:
>>>
>>>> Hi All,
>>>> In 1 of our cluster , we enabled Istio where spark is running in
>>>> distributed mode.
>>>> Spark works fine when we run it with Istio in standalone mode.
>>>> In spark distributed mode , we are seeing that every 1 hour or so the
>>>> workers are getting disassociated from master and then master is not able
>>>> to spawn any jobs on these workers , until we restart spark rest server.
>>>>
>>>> Here is the error we see in the worker logs:
>>>>
>>>>
>>>> *ERROR CoarseGrainedExecutorBackend: Executor self-exiting due to :
>>>> Driver spark-rest-service:44463 disassociated! Shutting down.*
>>>>
>>>> For 1 hour or so (until this issue happens) , spark distributed mode
>>>> works just fine.
>>>>
>>>>
>>>> Thanks
>>>> Deepak
>>>>
>>>

Reply via email to