Matthew Sharp created YARN-10250:
------------------------------------
Summary: Container Relaunch - find: File system loop detected
Key: YARN-10250
URL: https://issues.apache.org/jira/browse/YARN-10250
Project: Hadoop YARN
Issue Type: Bug
Affects Versions: 3.2.0
Reporter: Matthew Sharp
The Hive LLAP YARN service tries to relaunch after a container failure, and
when it retries on the same node the relaunch fails with:
{code:java}
find: File system loop detected; ‘./lib/llap-27Apr2020.tar.gz’ is part of the same file system loop as ‘./lib’.
{code}
YARN-8667 attempted to clean up the prior symlinks before relaunching, but in
this case the loop still occurs, since the symlinks are recreated right before
the output to directory.info for logging is attempted.
The following line appears to be the culprit:
[https://github.com/apache/hadoop/blob/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/launcher/ContainerLaunch.java#L1346]
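The failure mode can be illustrated outside of YARN with a minimal sketch. It assumes (this is not confirmed by the report) that the loop arises because the ./lib symlink is created a second time while the first one still resolves to the localized directory, so the new link lands inside that directory and points back at it. The class name, the directory layout, and the helper methods are hypothetical; the walk uses the standard java.nio.file API with FOLLOW_LINKS as a stand-in for a link-following directory listing, not the actual NodeManager code.

```java
import java.io.IOException;
import java.nio.file.FileSystemLoopException;
import java.nio.file.FileVisitOption;
import java.nio.file.FileVisitResult;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.SimpleFileVisitor;
import java.nio.file.attribute.BasicFileAttributes;
import java.util.ArrayList;
import java.util.EnumSet;
import java.util.List;

public class SymlinkLoopDemo {

    /** Builds a hypothetical container work dir whose "lib" link was created twice. */
    static Path buildLoopedWorkDir() throws IOException {
        Path base = Files.createTempDirectory("yarn-10250-demo");
        // Stand-in for the localized (extracted) resource directory.
        Path localized = Files.createDirectories(
            base.resolve("filecache").resolve("llap-27Apr2020.tar.gz"));
        Path work = Files.createDirectories(base.resolve("container_work"));

        // First launch: ./lib -> localized directory.
        Files.createSymbolicLink(work.resolve("lib"), localized);

        // Assumed relaunch behavior: the link is created again while ./lib
        // still resolves to the directory, so the new link is placed inside
        // it and points back at its own parent.
        Files.createSymbolicLink(
            work.resolve("lib").resolve(localized.getFileName()), localized);
        return work;
    }

    /** Walks root while following symlinks and collects entries that form a loop. */
    static List<String> findLoops(Path root) throws IOException {
        List<String> loops = new ArrayList<>();
        Files.walkFileTree(root, EnumSet.of(FileVisitOption.FOLLOW_LINKS), 5,
            new SimpleFileVisitor<Path>() {
                @Override
                public FileVisitResult visitFileFailed(Path file, IOException exc)
                        throws IOException {
                    if (exc instanceof FileSystemLoopException) {
                        // Record the looping entry instead of aborting the walk.
                        loops.add(root.relativize(file).toString());
                        return FileVisitResult.CONTINUE;
                    }
                    throw exc;
                }
            });
        return loops;
    }

    public static void main(String[] args) throws IOException {
        Path work = buildLoopedWorkDir();
        System.out.println("Looping entries: " + findLoops(work));
    }
}
```

If the assumption holds, a fix would need to ensure that stale links are removed (or replaced without dereferencing) at the point where they are recreated, not only earlier in the relaunch path.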