We have the same issue; see:
* https://bugs.schedmd.com/show_bug.cgi?id=8527
* As a temporary fix, we switched back to DefMemPerCpu.
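
For anyone wanting to try the same workaround, here is a rough sketch of what the change might look like in slurm.conf. The memory values and the SelectTypeParameters line are assumptions for illustration only, not our actual settings; size the per-CPU default for your own nodes.

```
# slurm.conf (fragment) -- illustrative values only, not a tested config
SelectType=select/cons_tres
SelectTypeParameters=CR_Core_Memory   # assumed; memory must be a consumable resource

# Setting that triggers the problem for us (value in MB, hypothetical):
#DefMemPerGPU=16384

# Temporary workaround: fall back to a per-CPU default (value in MB, hypothetical)
DefMemPerCPU=4096
```

The changed defaults only take effect once slurmctld has re-read the config (for example via "scontrol reconfigure"), and jobs can still override them with an explicit --mem request.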
Regards
On 26/03/2020 16:42, Wayne Hendricks wrote:
When using 20.02/cons_tres and defining DefMemPerGPU, jobs submitted that
request GPUs without defining "--mem" will not run more than one job per node. I
can see where it is allocating the correct amount of memory for the job per
GPUs requested, but no other jobs will run on the node. If a