Re: [slurm-users] Suspend QOS help

2022-02-18 Thread Walls, Mitchell
1 node04 45124 test stress.s user2 R 4:24 1 node04 From: slurm-users on behalf of Walls, Mitchell Sent: Friday, February 18, 2022 10:24 AM To: slurm-users@lists.schedmd.com Subject: Re: [slurm-users] Suspend QOS help Time

Re: [slurm-users] Suspend QOS help

2022-02-18 Thread Walls, Mitchell
test stress-s user1 S 2:00 1 node04 #partition PartitionName=test Default=NO Nodes=node04 OverSubscribe=FORCE:1 MaxTime=30-00:00:00 AllowQos=general,suspend From: slurm-users on behalf of Walls, Mitchell Sent: Friday, F

Re: [slurm-users] Suspend QOS help

2022-02-18 Thread Walls, Mitchell
=suspend AllowQos=suspend From: Walls, Mitchell Sent: Friday, February 18, 2022 9:54 AM To: slurm-users@lists.schedmd.com Subject: Re: [slurm-users] Suspend QOS help Both jobs would be using the whole node same as below but with two nodes. I've reduce

Re: [slurm-users] Suspend QOS help

2022-02-18 Thread Walls, Mitchell
7:20 AM, Walls, Mitchell wrote: > Hello, > > Hoping someone can shed some light on what is causing jobs to run on same > nodes simultaneously rather than being actually suspended for the lower > priority job? I can provide more info if someone can think of something to > help

[slurm-users] Suspend QOS help

2022-02-18 Thread Walls, Mitchell
Hello, Hoping someone can shed some light on what is causing jobs to run on same nodes simultaneously rather than being actually suspended for the lower priority job? I can provide more info if someone can think of something to help! # Relevant config. PreemptType=preempt/qos PreemptMode=SUSPEN

[slurm-users] DefaultQOS not set for cluster when running sacctmgr load file

2022-02-16 Thread Walls, Mitchell
For some reason it seems that DefaultQOS does not get set on the cluster level when loading from file? Any ideas on why or if I have something wrong? I've removed some output below for simplicity. As you can see below it is added when doing sacctmgr modify cluster cluster set DefaultQOS='general

Re: [slurm-users] Secondary Unix group id of users not being issued in interactive srun command

2022-01-28 Thread Walls, Mitchell
Do you see the uid in /sys/fs/cgroup? (i.e. find /sys/fs/cgroup -name "*71953*"). If not that could point to cgroup config. From: slurm-users on behalf of Ratnasamy, Fritz Sent: Friday, January 28, 2022 11:13 AM To: Rémi Palancher; Slurm User Community
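
For anyone who would rather script the check suggested above, here is a minimal Python sketch of the same idea as the find command: walk a directory tree and report every file or directory whose name contains a given string. The function name find_cgroup_entries and its parameters are hypothetical and chosen only for illustration; the default root and the uid 71953 are taken from the message. It assumes nothing about how Slurm names its cgroup directories.

```python
import os


def find_cgroup_entries(needle, root="/sys/fs/cgroup"):
    """Return sorted paths under root whose base name contains needle.

    Roughly equivalent to: find <root> -name "*<needle>*"
    (hypothetical helper, not part of Slurm or any library).
    """
    matches = []
    # os.walk skips directories it cannot read instead of raising.
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            if needle in name:
                matches.append(os.path.join(dirpath, name))
    return sorted(matches)


if __name__ == "__main__":
    # uid from the original message; substitute the uid you are checking.
    for path in find_cgroup_entries("71953"):
        print(path)
```

An empty result would be consistent with the hint in the message that the cgroup configuration is worth a look, though it does not prove it on its own.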

Re: [slurm-users] Sbatch python question

2021-03-13 Thread Walls, Mitchell
Are you using '#!/usr/bin/env python'? That could make a difference as an example:
#!/usr/bin/env python3
#SBATCH --job-name=Python
from time import sleep
sleep(1)
print("done")
From: slurm-users on behalf of Walters, Richard I ERDC-RDE-ITL-MS Contract

[slurm-users] Resource limits issue

2020-08-05 Thread Walls, Mitchell
I am having an issue with limits not being enforced when I set `AccountingStorageEnforce=limits,qos`. Is this expected? It also allows any user to submit jobs, even if they do not have an association. It seems to apply the default qos to every job but then ignores associations and per user/account