[ https://issues.apache.org/jira/browse/YARN-2701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14177750#comment-14177750 ]
Xuan Gong commented on YARN-2701:
---------------------------------

Had an offline discussion with [~jianhan]. We think that, for now, reverting the previous method changes is probably the safest way to solve this issue. Uploaded a new patch that does this.

> Potential race condition in startLocalizer when using LinuxContainerExecutor
> ------------------------------------------------------------------------------
>
>                 Key: YARN-2701
>                 URL: https://issues.apache.org/jira/browse/YARN-2701
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Xuan Gong
>            Assignee: Xuan Gong
>            Priority: Blocker
>         Attachments: YARN-2701.1.patch, YARN-2701.2.patch, YARN-2701.3.patch,
> YARN-2701.4.patch, YARN-2701.5.patch, YARN-2701.6.patch
>
>
> When LinuxContainerExecutor runs startLocalizer, it calls into the native
> code in container-executor.c:
> {code}
> if (stat(npath, &sb) != 0) {
>   if (mkdir(npath, perm) != 0) {
> {code}
> This is a check-then-create pattern for creating the appDir under /usercache.
> If two containers try to do this at the same time, a race condition may
> occur: both can see stat() fail, and the mkdir() call that runs second then
> fails because the directory already exists.
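
One common way to close a check-then-create race like the one described above is to call mkdir() first and treat EEXIST as success once the existing path is confirmed to be a directory. The sketch below only illustrates that idea. The helper name create_dir_race_safe is hypothetical, it is not code from container-executor.c, and it is not necessarily what the attached patches do (the latest patch reverts the earlier method changes instead).

```c
#include <assert.h>
#include <errno.h>
#include <stdio.h>
#include <string.h>
#include <sys/stat.h>
#include <sys/types.h>

/*
 * Hypothetical helper (not from container-executor.c).
 * Creates npath with the given permissions, tolerating the case where
 * another process created the same directory first.
 * Returns 0 if the directory exists when the call returns, -1 otherwise.
 */
int create_dir_race_safe(const char *npath, mode_t perm) {
  struct stat sb;

  assert(npath != NULL);

  /* Try to create first; mkdir() is atomic, so only one caller wins. */
  if (mkdir(npath, perm) == 0) {
    return 0;
  }

  /* Any error other than "already exists" is a real failure. */
  if (errno != EEXIST) {
    fprintf(stderr, "mkdir(%s) failed: %s\n", npath, strerror(errno));
    return -1;
  }

  /* Someone else created the path; accept it only if it is a directory. */
  if (stat(npath, &sb) != 0 || !S_ISDIR(sb.st_mode)) {
    fprintf(stderr, "%s exists but is not a usable directory\n", npath);
    return -1;
  }
  return 0;
}
```

With this shape, two containers calling create_dir_race_safe(npath, perm) concurrently should both get 0: one from the successful mkdir(), the other from the EEXIST branch. Whether the existing directory's owner and permissions are acceptable is a separate check that this sketch does not perform.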