Re: [slurm-users] Slurm stats in JSON format

2021-06-07 Thread Ole Holm Nielsen
On 6/8/21 12:27 AM, Sid Young wrote: Is there a tool that will extract the job counts in JSON format? Such as #running, #in pending #onhold etc I am trying to build some custom dashboards for the our new cluster and this would be a really useful set of metrics to gather and display. We have

Re: [slurm-users] Maui equivalent Nodeallocationpolicy

2021-06-07 Thread Juergen Salk
* David Chaffin [210607 14:44]: > > we get a lot of small sub-node jobs that we want to pack together. Maui > does this pretty well with the smallest node that will hold the job, > NODEALLOCATIONPOLICY MINRESOURCE > I can't figure out the slurm equivalent. Default backfill isn't working >

[slurm-users] sbatch: error: memory allocation failure

2021-06-07 Thread Yap, Mike
Hi All Can another advise the possibilities of me encountering the error message as below when submitting a job ? sbatch: error: memory allocation failure The same script use work perfectly fine until I include #SBATCH --nodelist=(compute[015-046]) (once removed it work as it should) The

[slurm-users] Slurm stats in JSON format

2021-06-07 Thread Sid Young
G'Day all, Is there a tool that will extract the job counts in JSON format? Such as #running, #in pending #onhold etc I am trying to build some custom dashboards for the our new cluster and this would be a really useful set of metrics to gather and display. Sid Young W:

[slurm-users] Maui equivalent Nodeallocationpolicy

2021-06-07 Thread David Chaffin
Hi all, we get a lot of small sub-node jobs that we want to pack together. Maui does this pretty well with the smallest node that will hold the job, NODEALLOCATIONPOLICY MINRESOURCE I can't figure out the slurm equivalent. Default backfill isn't working well. Anyone know of one? Thanks, David

[slurm-users] Use Task affinity on a per partition basis?

2021-06-07 Thread Herc Silverstein
Hi, Is there a way to use task affinity on a per-partition basis?  We couldn't find anything in the docs that described doing this.  And our attempts to specify this on a per partition basis failed. Thanks, Herc

Re: [slurm-users] Different max number of jobs in individual and array jobs

2021-06-07 Thread Sebastian T Smith
Hi, This doesn't solve your problem but might be an option: In similar cases, we instruct our users to create `n` Jobs of `m` Steps. Some experimentation may be required to determine the number of Steps to maximize Job run time without hitting your limits. Our max limit is 14 days, so this

Re: [slurm-users] slurmrestd

2021-06-07 Thread Ward Poelmans
Hi, On 7/06/2021 04:33, David Schanzenbach wrote: > In our .rpmmacros file we use, the following option is set: > %_with_slurmrestd 1 You also need libjwt: https://bugs.schedmd.com/show_bug.cgi?id=4 Ward