Hi All,
So I had noticed some waiters on my NSD servers that I thought were unrelated
to the mmchdisk. However, I decided to try rebooting my NSD servers one at a
time (mmshutdown failed!) to clear that up … and evidently one of them had
things hung up because the mmchdisk start completed.
Th
Hmm...have you dumped waiters across the entire cluster or just on the NSD
servers/fs managers? Maybe there’s a slow node out there participating in the
suspend effort? Might be worth running some quick tracing on the FS manager to
see what it’s up to.
On July 15, 2018 at 13:27:54 EDT, Bute
Hi All,
We are in a partial cluster downtime today to do firmware upgrades on our
storage arrays. It is a partial downtime because we have two GPFS filesystems:
1. gpfs23 - 900+ TB and which corresponds to /scratch and /data, and which
I’ve unmounted across the cluster because it has data rep