[ 
https://issues.apache.org/jira/browse/YARN-8645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936875#comment-16936875
 ] 

meng.ye commented on YARN-8645:
-------------------------------

I met the same issue with YARN 3.1.1 of HDP3.1 after enabling GPU by Ambari
{code:java}
yarn version Hadoop 3.1.1.3.1.0.0-78
{code}

> Yarn NM fail to start when remount cpu control group
> ----------------------------------------------------
>
>                 Key: YARN-8645
>                 URL: https://issues.apache.org/jira/browse/YARN-8645
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>            Reporter: Jiandan Yang 
>            Priority: Major
>
> NM failed to start when we update Yarn to latest version. NM logs are as 
> follows:
> {code:java}
> 2018-08-08 16:07:01,244 INFO [main] 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.CGroupsHandlerImpl:
>  Mounting controller cpu at /sys/fs/cgroup/cpu
> 2018-08-08 16:07:01,246 WARN [main] 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor:
>  Shell execution returned exit code: 32. Privileged Execution Operation 
> Stderr:
> Feature disabled: mount cgroup
> Stdout:
> Full command array for failed execution:
> [/home/hadoop/hadoop_hbase/hadoop-current/bin/container-executor, 
> --mount-cgroups, hadoop-yarn, cpu,cpuset,cpuacct=/sys/fs/cgroup/cpu]
> 2018-08-08 16:07:01,247 ERROR [main] 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.CGroupsHandlerImpl:
>  Failed to mount controller: cpu
> 2018-08-08 16:07:01,247 ERROR [main] 
> org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor: Failed to 
> bootstrap configured resource subsystems!
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.ResourceHandlerException:
>  Failed to mount controller: cpu
>  {code}
> The cause of error is that 351cf87c92872d90f62c476f85ae4d02e485769c disable 
> mounting cgroups by default in container-executor, which make 
> container-executor return non-zero when executing mount-cgroups



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

Reply via email to