[
https://issues.apache.org/jira/browse/YARN-10462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
dzcxzl updated YARN-10462:
--------------------------
Attachment: YARN-10462.001.patch
> Configurable shutdown cleanup slop
> ----------------------------------
>
> Key: YARN-10462
> URL: https://issues.apache.org/jira/browse/YARN-10462
> Project: Hadoop YARN
> Issue Type: Improvement
> Components: nodemanager
> Affects Versions: 3.1.0
> Reporter: dzcxzl
> Priority: Trivial
> Attachments: YARN-10462.001.patch
>
>
> When stopping NM or decommission NM, stopping all containers, the waiting
> time is composed of three values
> sleep-delay-before-sigkill+process-kill-wait+SHUTDOWN_CLEANUP_SLOP_MS
> (constant 1000)
> yarn.nodemanager.sleep-delay-before-sigkill.ms=250
> yarn.nodemanager.process-kill-wait.ms=5000
> SHUTDOWN_CLEANUP_SLOP_MS=1000
> The parameters of sleep-delay-before-sigkill and process-kill-wait are the
> time to kill a container/process. When there are too many container lists to
> be killed, it is usually not completely killed.
> We can make SHUTDOWN_CLEANUP_SLOP_MS a configurable parameter, so that in
> some scenarios, we can wait as long as possible to kill all containers to
> complete.
>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]