[ 
https://issues.apache.org/jira/browse/YARN-209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13615545#comment-13615545
 ] 

Vinod Kumar Vavilapalli commented on YARN-209:
----------------------------------------------

Patch looks good, can you test this manually too? Something like this: Starting 
one NM with very little resources first and another with sufficient resources 
later and an app which can only run on the second NM.
                
> Capacity scheduler doesn't trigger app-activation after adding nodes
> --------------------------------------------------------------------
>
>                 Key: YARN-209
>                 URL: https://issues.apache.org/jira/browse/YARN-209
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Bikas Saha
>            Assignee: Zhijie Shen
>             Fix For: 3.0.0
>
>         Attachments: YARN-209.1.patch, YARN-209.2.patch, YARN-209.3.patch, 
> YARN-209.4.patch, YARN-209-test.patch
>
>
> Say application A is submitted but at that time it does not meet the bar for 
> activation because of resource limit settings for applications. After that if 
> more hardware is added to the system and the application becomes valid it 
> still remains in pending state, likely forever.
> This might be rare to hit in real life because enough NM's heartbeat to the 
> RM before applications can get submitted. But a change in settings or 
> heartbeat interval might make it easier to repro. In RM restart scenarios, 
> this will likely hit more if its implemented by re-playing events and 
> re-submitting applications to the scheduler before the RPC to NM's is 
> activated.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to