On 24/10/2022 09:32, Ole Holm Nielsen wrote:
On 10/24/22 06:12, Richard Chang wrote:
I have a two node Slurmctld setup and both will mount an NFS exported directory 
as the state save location.

It is definitely a BAD idea to store Slurm StateSaveLocation on a slow NFS 
directory!  SchedMD recommends to use local NVME or SSD disks because there 
will be many IOPS to this file system!

I recommend you to read "Field Notes 6: From The Frontlines of Slurm Support", 
Jason Booth, SchedMD available from https://slurm.schedmd.com/publications.html.  Read 
the Hardware pages 18-20 which recommend:

Fast path to the StateSaveLocation
■ IOPS this filesystem can sustain is a major bottleneck to job throughput
● At least 2 directories and two files created per job
● The corresponding unlink() calls will add to the load

So what is the recommended way if you want to have HA with slurmctld?

Ward

Attachment: smime.p7s
Description: S/MIME Cryptographic Signature

Reply via email to