Hi all,

We are using slurm 14.03.1-2, with select/cons_res (CR_CPU_memory) and
task/cgroup plugins, and Shared=NO partitions. I have noticed that when
users specify --mem=0, slurm "grants" them all memory but not marking it
as allocated, causing other users to be able to use it as well.

example:

$ sbatch --mem=0 -ptest -whm-39 --wrap xterm

within the xterm:

hm-39$ cat 
/sys/fs/cgroup/memory/slurm/uid_${SLURM_JOB_UID}/job_${SLURM_JOBID}/memory.limit_in_bytes
67637346304
hm-39$ scontrol show node ${SLURMD_NODENAME} | grep AllocMem
   OS=Linux RealMemory=64504 AllocMem=0 Sockets=2 Boards=1

This somewhat makes the --mem parameter useless - as it doesn't
guarantee allocated memory will not be shared, and everyone can just use
--mem=0 and they won't be accountable for it. I do want to allow users
to request all memory (preferably with --exclusive).

Am I missing something?
Is it just some misconfiguration on my part?
Do I need to write a job_submit (or other) plugin that will verify that
mem=0 was not set?


Thanks in advance,
    Yair.

Reply via email to