Package: munge
Version: 0.5.13-1
Severity: important

Dear Maintainer,

munged fails to start at system boot. From sample journal entries:

Jul 16 22:17:56 ron5sid systemd[1]: munge.service: Start operation timed out. 
Terminating.
Jul 16 22:19:26 ron5sid systemd[1]: munge.service: State 'stop-final-sigterm' 
timed out. Killing.
Jul 16 22:19:26 ron5sid systemd[1]: munge.service: Killing process 723 (munged) 
with signal SIGKILL.
Jul 16 22:19:26 ron5sid systemd[1]: munge.service: Failed with result 'timeout'.
Jul 16 22:19:26 ron5sid systemd[1]: Failed to start MUNGE authentication 
service.

... deleted ...

Jul 16 22:20:21 ron5sid kernel: random: crng init done
Jul 16 22:20:21 ron5sid kernel: random: 7 urandom warning(s) missed due to 
ratelimiting

I've included the latter part because I suspect munged timeout is caused
by too long a delay reading /dev/urandom or /dev/random, causing systemd
to time it out.

As a workaround, I inserted this is the [Service] section of
/lib/systemd/system/munge.service:

TimeoutSec=60s

With that in place, munged starts successfully at boot, at least for the
few restarts I've done. I haven't tried to find a lower timeout value
than 60s.

I don't know how long this problem has been present, but no earlier than
01 July, because I ran some tests under Slurm then.  Perhaps it started
around the time Sid got the 4.17 kernel, which I installed 03 July.
Wasn't there a 4.17 change in kernel random number generation which
might have introduced (more) blocking on early random number fetches?

I'm runing Sid in a VM under VMware Workstation Player 14.1.2 hosted on
Windows 7 on a fairly quick laptop. Other setups and hardware would have
different entropy generation characteristics so maybe not all munge
installations will see this problem.

Severity rating "important" is based on Slurm batch system being useless
without Munge service.

Best regards,
Ron

- System Information:
Debian Release: buster/sid
  APT prefers unstable
  APT policy: (500, 'unstable')
Architecture: amd64 (x86_64)

Kernel: Linux 4.17.0-1-amd64 (SMP w/2 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), 
LANGUAGE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled

Versions of packages munge depends on:
ii  adduser      3.117
ii  libbz2-1.0   1.0.6-8.1
ii  libc6        2.27-5
ii  libgcrypt20  1.8.3-1
ii  libmunge2    0.5.13-1
ii  lsb-base     9.20170808
ii  zlib1g       1:1.2.11.dfsg-1

munge recommends no packages.

munge suggests no packages.

-- no debconf information

Reply via email to