[ 
https://issues.apache.org/jira/browse/YARN-2001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jian He updated YARN-2001:
--------------------------

    Description: After failover, RM may require a certain threshold to 
determine whether it’s safe to make scheduling decisions and start accepting 
new container requests from AMs. The threshold could be a certain amount of 
nodes. i.e. RM waits until a certain amount of nodes joining before accepting 
new container requests.  Or it could simply be a timeout, only after the 
timeout RM accepts new requests.  (was: RM may not accept allocate requests 
from AMs until all the NMs have re-synced back with RM. This is to eliminate 
some race conditions like containerIds overlapping between 
)

> Threshold for RM to accept requests from AM after failover
> ----------------------------------------------------------
>
>                 Key: YARN-2001
>                 URL: https://issues.apache.org/jira/browse/YARN-2001
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: resourcemanager
>            Reporter: Jian He
>            Assignee: Jian He
>
> After failover, RM may require a certain threshold to determine whether it’s 
> safe to make scheduling decisions and start accepting new container requests 
> from AMs. The threshold could be a certain amount of nodes. i.e. RM waits 
> until a certain amount of nodes joining before accepting new container 
> requests.  Or it could simply be a timeout, only after the timeout RM accepts 
> new requests.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to