Hi David,

On 1/6/22 22:39, David Henkemeyer wrote:
> When my team used PBS, we had several nodes that had a TON of CPUs, so many, in fact, that we ended up setting np to a smaller value, in order to not starve the system of memory.
>
> What is the best way to do this with Slurm?  I tried modifying # of CPUs in the slurm.conf file, but I noticed that Slurm enforces that "CPUs" is equal to Boards * SocketsPerBoard * CoresPerSocket * ThreadsPerCore.  This left me with having to "fool" Slurm into thinking there were either fewer ThreadsPerCore, fewer CoresPerSocket, or fewer SocketsPerBoard.  This is a less-than-ideal solution, it seems to me.  At least, it left me feeling like there has to be a better way.

If your goal is to limit the amount of RAM per job, then kernel cgroups are probably the answer. I've collected some information on my Wiki page:
https://wiki.fysik.dtu.dk/niflheim/Slurm_configuration#cgroup-configuration
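As a rough illustration only (the Wiki page has the full details, and the memory values below are just placeholders you would tune to your nodes), the cgroup-based memory limiting typically involves settings along these lines:

  # cgroup.conf (sketch):
  ConstrainCores=yes
  ConstrainRAMSpace=yes

  # slurm.conf (sketch):
  ProctrackType=proctrack/cgroup
  TaskPlugin=task/affinity,task/cgroup
  SelectType=select/cons_tres
  SelectTypeParameters=CR_Core_Memory
  DefMemPerCPU=4000    # placeholder: roughly RAM-per-core in MB
  MaxMemPerCPU=4000    # placeholder

With something like this, each job is confined to the memory it requested, so oversubscribing memory (the problem you solved in PBS by lowering np) shouldn't happen in the first place.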

If some users need more RAM than is available per core, they have to request a larger number of cores in their job to get it, as in the example below. This makes a lot of sense, IMHO.
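For example (hypothetical numbers and job script name): with DefMemPerCPU=4000, a job needing about 16 GB would simply ask for 4 cores,

  sbatch --ntasks=4 my_job.sh

or request the memory explicitly and let Slurm scale up the core count to satisfy MaxMemPerCPU:

  sbatch --ntasks=1 --mem=16000 my_job.sh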

SchedMD is working on support for cgroups v2; see the talk "Slurm 21.08 and Beyond" by Tim Wickberg, SchedMD: https://slurm.schedmd.com/publications.html

You could probably "fool" Slurm as you describe it, but that shouldn't be necessary.

I hope this helps.

/Ole
