[ 
https://issues.apache.org/jira/browse/YARN-10462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

dzcxzl updated YARN-10462:
--------------------------
    Attachment: YARN-10462.001.patch

> Configurable shutdown cleanup slop
> ----------------------------------
>
>                 Key: YARN-10462
>                 URL: https://issues.apache.org/jira/browse/YARN-10462
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: nodemanager
>    Affects Versions: 3.1.0
>            Reporter: dzcxzl
>            Priority: Trivial
>         Attachments: YARN-10462.001.patch
>
>
> When stopping NM or decommission NM, stopping all containers, the waiting 
> time is composed of three values 
> sleep-delay-before-sigkill+process-kill-wait+SHUTDOWN_CLEANUP_SLOP_MS 
> (constant 1000)
> yarn.nodemanager.sleep-delay-before-sigkill.ms=250
> yarn.nodemanager.process-kill-wait.ms=5000
> SHUTDOWN_CLEANUP_SLOP_MS=1000
> The parameters of sleep-delay-before-sigkill and process-kill-wait are the 
> time to kill a container/process. When there are too many container lists to 
> be killed, it is usually not completely killed.
> We can make SHUTDOWN_CLEANUP_SLOP_MS a configurable parameter, so that in 
> some scenarios, we can wait as long as possible to kill all containers to 
> complete.
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to