[ https://issues.apache.org/jira/browse/YARN-2701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14179560#comment-14179560 ]
Xuan Gong commented on YARN-2701:
---------------------------------

[~aw] Thanks for the summary. Let us not revert the current code. I have uploaded an addendum patch. In this patch, I reverted the current mkdirs code to the code that was committed in YARN-2161, and made the changes needed to fix the race condition. A review would be very helpful.

> Potential race condition in startLocalizer when using LinuxContainerExecutor
> ------------------------------------------------------------------------------
>
> Key: YARN-2701
> URL: https://issues.apache.org/jira/browse/YARN-2701
> Project: Hadoop YARN
> Issue Type: Bug
> Reporter: Xuan Gong
> Assignee: Xuan Gong
> Priority: Blocker
> Fix For: 2.6.0
>
> Attachments: YARN-2701.1.patch, YARN-2701.2.patch, YARN-2701.3.patch,
> YARN-2701.4.patch, YARN-2701.5.patch, YARN-2701.6.patch,
> YARN-2701.addendum.1.patch
>
>
> When LinuxContainerExecutor runs startLocalizer, it uses the native code in
> container-executor.c.
> {code}
> if (stat(npath, &sb) != 0) {
>   if (mkdir(npath, perm) != 0) {
> {code}
> This is a check-then-create approach to creating the appDir under /usercache.
> If two containers try to do this at the same time, a race condition may
> occur: both can see that the directory is missing, and the one whose mkdir
> runs second then fails because the directory already exists.