Hi Chris,
I ran into the same thing with RHEL 7, and you are correct, it's related
to the combined mount of the cpu and cpuacct cgroups (on
/sys/fs/cgroup/cpu,cpuacct). When cgroups are already mounted combined,
it's apparently not possible to mount one of the separately. When I try
a manual mount, I get:
# mount -t cgroup -o cpuacct none /cgroup/cpuacct
mount: none is already mounted or /cgroup/cpuacct busy
I've taken the easy way out and use the linux JobAcctGather plugin
instead of the cgroup one... ;-)
Best,
Robbert
On 12/21/2015 06:32 AM, Christopher Samuel wrote:
Hi folks,
I'm helping bring up a new cluster with Slurm 15.08.5 on RHEL 7.2
and I've run into an odd case where trying to launch a process with
srun triggers this failure:
[2015-12-21T15:38:00.213] unable to mount cpuacct cgroup namespace: Device or
resource busy
[2015-12-21T15:38:00.213] jobacct_gather/cgroup: unable to create cpuacct
namespace
I suspect this might be systemd related, but as I've limited
experience with it so far I'm not certain.
This is what is failing according to strace:
12725 mount("cgroup", "/cgroup/cpuacct", "cgroup", MS_NOSUID|MS_NODEV|MS_NOEXEC,
"cpuacct") = -1 EBUSY (Device or resource busy)
...and it might be related to this existing mount courtesy
of systemd in /proc/mounts:
cgroup /sys/fs/cgroup/cpu,cpuacct cgroup
rw,nosuid,nodev,noexec,relatime,cpuacct,cpu 0 0
Anyone else seen this, or got any ideas?
All the best,
Chris
--
Robbert Eggermont Intelligent Systems
r.eggerm...@tudelft.nl Electr.Eng., Mathematics & Comp.Science
+31 15 27 83234 Delft University of Technology