Steve,

I suspect you did not install the packages.


You need to install slurm-slurmctld to get the slurmctld systemd files:

   /# rpm -qlp slurm-slurmctld-20.11.9-1.el7.x86_64.rpm//
   ///run/slurm/slurmctld.pid//
   /*//usr/lib/systemd/system/slurmctld.service/*/
   ///usr/sbin/slurmctld//
   ///usr/share/man/man8/slurmctld.8.gz//
   /


The same for slurm-slurmdbd. Both of those are management daemons and should only be running on one (two if you configure failover) systems.

Your compute nodes need slurm-slurmd, which will provide the systemd files for slurmd.



On 11/27/2022 7:34 PM, 刘 博涵 wrote:
Hi all,

I'm a newcomer to cluster computing and have been trying to setup a Slurm cluster myself. Right now I'm stuck at starting up Slurm's systemd services. I checked out the following tutorials:

 1. Slurm Workload Manager - Quick Start Administrator Guide
    (schedmd.com) <https://slurm.schedmd.com/quickstart_admin.html>
 2. https://wiki.fysik.dtu.dk/Niflheim_system/Slurm_installation/
 3. https://wiki.bkslab.org/index.php/Slurm_Installation_Guide
 4. Slurm installation (southgreenplatform.github.io)
    <https://southgreenplatform.github.io/trainings/hpc/slurminstallation/>

All of them state that I should run /systemctl enable/start slurmd/slurmdbd/slurmctld/ after installation, however they always fail because the corresponding systemd config files do not exist, regardless of whether I installed Slurm from source or from EPEL repos. All my systems are CentOS 7.9 with the latest updates prior to Slurm installation, and I was trying to install Slurm 22.05.6 from source. My question is are the systemd config files actually created during installation process as the tutorials imply, or do I have to write them myself? If the latter, then how should I write my slurm systemd config file (what parameters should I put in etc.), any templates I can follow?

Many thanks,

Steve

Reply via email to