Jason Lowe created YARN-5290: -------------------------------- Summary: ResourceManager can place more containers on a node than the node size allows Key: YARN-5290 URL: https://issues.apache.org/jira/browse/YARN-5290 Project: Hadoop YARN Issue Type: Bug Components: resourcemanager Reporter: Jason Lowe
When the ResourceManager or an ApplicationMaster kills a container the RM scheduler instantly thinks the container is dead and frees those resources within the scheduler bookkeeping. However that container can still be running on the node until the node heartbeats back into the RM and is told to kill the container. If the RM allocates the space associated with the released container and gives it to an AM quickly enough, the AM can launch a new container while the old container is still running on the NM. That leads to a scenario where we're technically running more resources on the node than the node advertised to the RM. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org