[ https://issues.apache.org/jira/browse/YARN-11709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17868614#comment-17868614 ]
ASF GitHub Bot commented on YARN-11709: --------------------------------------- ferdelyi opened a new pull request, #6960: URL: https://github.com/apache/hadoop/pull/6960 …nnot run program /var/lib/yarn-ce/bin/container-executor <!-- Thanks for sending a pull request! 1. If this is your first time, please read our contributor guidelines: https://cwiki.apache.org/confluence/display/HADOOP/How+To+Contribute 2. Make sure your PR title starts with JIRA issue id, e.g., 'HADOOP-17799. Your PR title ...'. --> ### Description of PR ### How was this patch tested? ### For code changes: - [x] Does the title or this PR starts with the corresponding JIRA issue id (e.g. 'HADOOP-17799. Your PR title ...')? - [ ] Object storage: have the integration tests been executed and the endpoint declared according to the connector-specific documentation? - [ ] If adding new dependencies to the code, are these dependencies licensed in a way that is compatible for inclusion under [ASF 2.0](http://www.apache.org/legal/resolved.html#category-a)? - [ ] If applicable, have you updated the `LICENSE`, `LICENSE-binary`, `NOTICE-binary` files? > NodeManager should be shut down or blacklisted when it cannot run program > "/var/lib/yarn-ce/bin/container-executor" > ------------------------------------------------------------------------------------------------------------------- > > Key: YARN-11709 > URL: https://issues.apache.org/jira/browse/YARN-11709 > Project: Hadoop YARN > Issue Type: Improvement > Components: container-executor > Reporter: Ferenc Erdelyi > Assignee: Ferenc Erdelyi > Priority: Major > > When NodeManager encounters the below "No such file or directory" error > reported against the "container-executor", it should give up participating in > the cluster as it is not capable to run any container, but just fail the jobs. > {code:java} > 2023-01-18 10:08:10,600 WARN > org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor: Exit code > from container container_e159_1673543180101_9407_02_ > 000014 startLocalizer is : -1 > org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationException: > java.io.IOException: Cannot run program > "/var/lib/yarn-ce/bin/container-executor": error=2, No such file or directory > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor.executePrivilegedOperation(PrivilegedOperationExecutor.java:183) > at > org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.startLocalizer(LinuxContainerExecutor.java:403) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.j > ava:1250) > Caused by: java.io.IOException: Cannot run program > "/var/lib/yarn-ce/bin/container-executor": error=2, No such file or directory > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org