[slurm-users] sbatch: error: memory allocation failure

2021-06-07 Thread Yap, Mike
Hi All Can another advise the possibilities of me encountering the error message as below when submitting a job ? sbatch: error: memory allocation failure The same script use work perfectly fine until I include #SBATCH --nodelist=(compute[015-046]) (once removed it work as it should) The

Re: [slurm-users] Fairshare +FairTree Algorithm + TRESBillingWeights

2021-04-06 Thread Yap, Mike
want to make predominant. From: slurm-users On Behalf Of Yap, Mike Sent: Wednesday, 7 April 2021 9:57 AM To: Slurm User Community List Subject: Re: [slurm-users] Fairshare +FairTree Algorithm + TRESBillingWeights Thanks Luke.. Will go through the 2 commands (will try to digest them) Wondering

Re: [slurm-users] Fairshare +FairTree Algorithm + TRESBillingWeights

2021-04-06 Thread Yap, Mike
Fix the issue with TRESBillingWeights, It seems like I will need to set PartitionName for it to work https://bugs.schedmd.com/show_bug.cgi?id=3753 PartitionName=DEFAULT TRESBillingWeights="CPU=1.0,Mem=0.25G,GRES/gpu=2.0" From: slurm-users On Behalf Of Yap, Mike Sent: Wednesday, 7 Ap

Re: [slurm-users] Fairshare +FairTree Algorithm + TRESBillingWeights

2021-04-06 Thread Yap, Mike
alue for GrpTRESMins for your "Association Records": scontrol show assoc_mgr Hope that helps! Luke From: slurm-users mailto:slurm-users-boun...@lists.schedmd.com>> On Behalf Of Yap, Mike Sent: Wednesday, March 31, 2021 4:50 PM To: slurm-us...@schedmd.com<mailto:slurm-us...@sch

[slurm-users] Fairshare +FairTree Algorithm + TRESBillingWeights

2021-03-31 Thread Yap, Mike
Hi All Need some clarification on Fairshare (multifactor priority plugin) and FairTree Algorithm If I read correctly, the current default for slurm is FairTree algorithm in which 1. Priority can set on various level 2. No fairshare-actual usage is being consider 3. Job submitted will

Re: [slurm-users] Slurm - UnkillableStepProgram

2021-03-23 Thread Yap, Mike
larger than 126 seconds: https://bugs.schedmd.com/show_bug.cgi?id=11103 From: slurm-users mailto:slurm-users-boun...@lists.schedmd.com>> On Behalf Of Yap, Mike Sent: Monday, March 22, 2021 7:13 PM To: slurm-users@lists.schedmd.com<mailto:slurm-users@lists.schedmd.com> Subject: [slurm-

Re: [slurm-users] Slurm - UnkillableStepProgram

2021-03-23 Thread Yap, Mike
Hi Chris Thanks for the clarification Mike -Original Message- From: slurm-users On Behalf Of Chris Samuel Sent: Tuesday, 23 March 2021 5:30 PM To: slurm-users@lists.schedmd.com Subject: Re: [slurm-users] Slurm - UnkillableStepProgram Hi Mike, On 22/3/21 7:12 pm, Yap, Mike wrote

Re: [slurm-users] Slurm prolog export variable

2021-03-23 Thread Yap, Mike
& Compute Services (SSCS) Kommunikations- und Informationszentrum (kiz) Universität Ulm Telefon: +49 (0)731 50-22478 Telefax: +49 (0)731 50-22471 * Yap, Mike [210323 02:12]: > Hi All > > Can anyone assist the following > > We're using Bright Cluster 9.1 with CentOS 7.9 r

[slurm-users] Slurm - UnkillableStepProgram

2021-03-22 Thread Yap, Mike
Hi All Have been reading on the archive hoping to implement unkillablesteptimeout and unkillablesteprogram to the slurm But I'm kind of confuse with it application 1. I presume UnkillableStepTimeout is set in slurm.conf. and it act as a timer to trigger UnkillableStepProgram 2.

[slurm-users] Slurm prolog export variable

2021-03-22 Thread Yap, Mike
Hi All Can anyone assist the following We're using Bright Cluster 9.1 with CentOS 7.9 running with slurm 2.02.6 We have a script running on prolog exporting the SCRATCH as variable for user running job Addition command on the script to create a user folder accordingly When submitting the job,