[ https://issues.apache.org/jira/browse/YARN-257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13752764#comment-13752764 ]
Eli Collins commented on YARN-257:
----------------------------------

This seems like a blocker for GA given that MR1 handles disk failures.

> NM should gracefully handle a full local disk
> ---------------------------------------------
>
>                 Key: YARN-257
>                 URL: https://issues.apache.org/jira/browse/YARN-257
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: nodemanager
>    Affects Versions: 2.0.2-alpha, 0.23.5
>            Reporter: Jason Lowe
>
> When a local disk becomes full, the node will fail every container launched
> on it because the container is unable to localize. It tries to create an
> app-specific directory in each of the local and log directories. If any of
> those directory creations fails (due to lack of free space), the container fails.
> It would be nice if the node could continue to launch containers using the
> space available on other disks rather than failing all containers trying to
> launch on the node.
> This is somewhat related to YARN-91 but is centered around the disk becoming
> full rather than the disk failing.
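The behavior requested in the description (keep launching containers on the disks that still have room) could be sketched roughly as below. This is only an illustration, not NodeManager code: `LocalDirSelector`, `usableDirs`, `usableBytes`, and the 1 MiB threshold are all hypothetical names and values chosen for the example. The only real API used is `java.io.File.getUsableSpace()`.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.ToLongFunction;

// Hypothetical helper for illustration; not part of the NodeManager code base.
class LocalDirSelector {

    // Returns the directories whose reported free space is at least
    // minFreeBytes, preserving the input order. The free-space probe is
    // passed in so the filtering logic can be exercised without real disks.
    static List<String> usableDirs(List<String> dirs, long minFreeBytes,
                                   ToLongFunction<String> freeSpace) {
        List<String> usable = new ArrayList<>();
        for (String dir : dirs) {
            if (freeSpace.applyAsLong(dir) >= minFreeBytes) {
                usable.add(dir);
            }
        }
        return usable;
    }

    // Free-space probe backed by the real file system.
    static long usableBytes(String dir) {
        return new File(dir).getUsableSpace();
    }

    public static void main(String[] args) {
        // Directories to check are taken from the command line.
        List<String> configured = Arrays.asList(args);
        long minFreeBytes = 1024L * 1024L; // assumed threshold: 1 MiB
        List<String> usable =
            usableDirs(configured, minFreeBytes, LocalDirSelector::usableBytes);
        if (usable.isEmpty()) {
            System.out.println("No local directory has enough free space");
        } else {
            System.out.println("Directories still usable: " + usable);
        }
    }
}
```

With a filter like this, a container launch would fail only when the list comes back empty, instead of failing as soon as any single configured directory is full.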