warrenzhu25 commented on PR #41083:
URL: https://github.com/apache/spark/pull/41083#issuecomment-1552141121
> These looks like things which can be handled by appropriate configuration
tuning ? The PR itself requires a bit more work if that is not a feasible
direction (efficient cleanup, han
warrenzhu25 commented on PR #41083:
URL: https://github.com/apache/spark/pull/41083#issuecomment-1548009051
> How are you observing recoverable fetch failures ?
I have seen 2 cases when target executor has busy shuffle fetch and upload
due to shuffle migration:
1. All Netty request
warrenzhu25 commented on PR #41083:
URL: https://github.com/apache/spark/pull/41083#issuecomment-1545904179
@dongjoon-hyun any comments on this?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to th
warrenzhu25 commented on PR #41083:
URL: https://github.com/apache/spark/pull/41083#issuecomment-1537523100
@dongjoon-hyun @mridulm Help take a look?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go