[ https://issues.apache.org/jira/browse/YARN-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15139451#comment-15139451 ]
Karthik Kambatla commented on YARN-4679:
----------------------------------------

bq. There are other reasons we will want to persist node information, like YARN-998

YARN-291 was implemented before work-preserving NM restart was enabled. Now, one could dynamically change a node's resource config and bounce the NM. Do we recommend one approach over the other?

> When work-preserving restart is enabled, the scheduler should wait for the
> earlier of recovery completion and configured wait time
> ----------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: YARN-4679
>                 URL: https://issues.apache.org/jira/browse/YARN-4679
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: resourcemanager
>            Reporter: Karthik Kambatla
>
> When work-preserving restart is enabled, it appears the restart (or failover)
> is unconditionally blocked for the configured delay even if the recovery
> itself finishes sooner than this. This should be updated to wait for the
> earlier of the two conditions. Also, it would be nice to allow setting the
> config to -1 to indicate wait as long as needed for the recovery to be
> completed.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
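
The behavior requested in the issue description (wait for the earlier of recovery completion and the configured delay, with -1 meaning no upper bound) could be sketched roughly as below. This is only an illustration under stated assumptions, not the ResourceManager's actual code: the class RecoveryGate and its methods markRecoveryComplete and awaitRecovery are hypothetical names that do not appear in YARN, and the sketch assumes some other component signals when recovery has finished.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

/**
 * Hypothetical helper (not a YARN class): lets a caller wait for the
 * earlier of "recovery finished" and "configured wait time elapsed".
 */
final class RecoveryGate {
    // Released exactly once, when recovery is reported as complete.
    private final CountDownLatch recovered = new CountDownLatch(1);

    /** Assumed to be called by whatever component detects that recovery is done. */
    void markRecoveryComplete() {
        recovered.countDown();
    }

    /**
     * Blocks until recovery completes or waitMs elapses, whichever comes first.
     * A negative waitMs (for example -1) means wait as long as recovery needs.
     *
     * @return true if recovery completed, false on timeout or interruption
     */
    boolean awaitRecovery(long waitMs) {
        try {
            if (waitMs < 0) {
                recovered.await();   // no upper bound on the wait
                return true;
            }
            // Returns early, with true, as soon as the latch is released.
            return recovered.await(waitMs, TimeUnit.MILLISECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();  // preserve interrupt status
            return false;
        }
    }
}
```

With such a gate, a caller would invoke awaitRecovery with the configured delay before scheduling begins. A false return value means the delay expired first, which matches the current behavior only in the worst case rather than unconditionally.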