[slurm-users] SLURM , maximum scalable instance is which one
Dear All, Like to know that what is the maximum scalled up instance of SLURM so far. From which web site I can get the information of the highest scalable instance of SLURM and other popular setup using SLURM Thanks Joseph John
[slurm-users] Selecting only a subset of GPU's from all available GPU's
I'm submitting jobs to a cluster via SLURM scheduler, and let's say I have access to 8 GPUs in my cluster in same node. They are GPUs of type A,B,C,D,E,F,G,H. I would like to submit a job that requests the use of GPUs of type A or B or C but NOT of type D/E/F/G/H. So I need some type of OR logic with the --gres flag. Eg : When I request GPU of type A , I can do sbatch –gres=gpu:TypeA:1, I need to input a subset of GPU’s and let slurm schedule job utilizing one of the GPU from this allowed list. *Regards* *Minulakshmi S*
[slurm-users] Couldn't find the specified plugin name for auth/munge looking at all files
Hello, I need to use SLURM for a project. I installed it by this quick start guide ( https://ibmimaster.cs.uni-tuebingen.de/quickstart_admin.html ). First I just want to run it on one cluster. - I did steps 1 to 7, create the slurm user with my slurm binaries as home dir - created the necessary dirs for log and spool files - created slurm.conf.with the easy configurator - munge was also installed via apt packet manager.(using Ubuntu 20) I do not know what step 8 exactly mean and what the library_location is, so I skipped this one. When running slurmd or slurmctld, I get the following error: root@me:# slurmd slurmd: error: Couldn't find the specified plugin name for auth/munge looking at all files slurmd: error: cannot find auth plugin for auth/munge slurmd: error: cannot create auth context for auth/munge slurmd: fatal: failed to initialize auth plugin root@me:# slurmctld slurmctld: error: Couldn't find the specified plugin name for auth/munge looking at all files slurmctld: error: cannot find auth plugin for auth/munge slurmctld: error: cannot create auth context for auth/munge slurmctld: fatal: failed to initialize auth plugin Any idea how I could fix this? I'd be very thankful. Kind regards
[slurm-users] Sinfo options not working in SLURM 23.11
Hello , I am working on SLURM 23.11 version. sinfo option commands are not working properly (-a , --all , -o , -m etc) e.g : sinfo is giving me below 45637@inv456748703$sinfo PARTITION AVAILTIMELIMIT NODES STATE NODELIST FPGA*up infinite 1 idle FPGA01 also sinfo --help gives same result 45637@inv456748703$ sinfo ---help PARTITION AVAIL TIMELIMITNODES STATE NODELIST FPGA* up infinite 1 idle FPGA01 Any pointers will help. Regards, DJ
Re: [slurm-users] Sinfo options not working in SLURM 23.11
Hello Deepak, Deepak J writes: > Hello , > > > > I am working on SLURM 23.11 version. > > sinfo option commands are not working properly (-a , --all , -o , -m etc) > > > > e.g : sinfo is giving me below > > 45637@inv456748703$sinfo > > > > PARTITION AVAILTIMELIMIT NODES STATE NODELIST > > > > > FPGA*up infinite 1 idle > FPGA01 > > also sinfo --help gives same result > > 45637@inv456748703$ sinfo ---help > > PARTITION AVAIL TIMELIMITNODES STATE NODELIST > > > > > > FPGA* up infinite 1 idle >FPGA01 > > Any pointers will help. Why do you think that the output above is wrong? Cheers, Loris > Regards, > > DJ > -- Dr. Loris Bennett (Herr/Mr) ZEDAT, Freie Universität Berlin