[slurm-users] Cleanup of job_container/tmpfs

2023-02-28 Thread Jason Ellul
Hi, we have recently moved to Slurm 22.05.8 and have configured job_container/tmpfs to allow private tmp folders. job_container.conf contains: AutoBasePath=true, BasePath=/slurm. In slurm.conf we have set JobContainerType=job_container/tmpfs. I can see the folders being created and they are b

Re: [slurm-users] Single Node cluster. How to manage oversubscribing

2023-02-28 Thread Doug Meyer
Hi, I forgot one thing you didn't mention. When you change the node descriptors and partitions you have to also restart slurmctld. scontrol reconfigure works for the nodes but the main daemon has to be told to reread the config. Until you restart the daemon it will be referencing the config fro

Re: [slurm-users] Chaining srun commands

2023-02-28 Thread Doug Meyer
Hi, I read the problem differently. You might also want to look at heterogeneous jobs: Slurm Workload Manager - Heterogeneous Job Support (schedmd.com). Doug

Re: [slurm-users] Chaining srun commands

2023-02-28 Thread Jake Jellinek
Hi Brian, thanks for your response. > I am guessing you are using srun to get an interactive session on a node. > That approach is being deprecated and you get a shell by default with salloc. This is exactly what I'm trying to do. I didn't know about the salloc thing. Let me do some more testin

[slurm-users] Slurm version 23.02 is now available

2023-02-28 Thread Tim Wickberg
We are pleased to announce the availability of Slurm version 23.02. To highlight some new features in 23.02: - Added a new (optional) RPC rate limiting system in slurmctld. - Added usage gathering for gpu/nvml (Nvidia) and gpu/rsmi (AMD) plugins. - Added a new jobcomp/kafka plugin. - Overhaule

Re: [slurm-users] Chaining srun commands

2023-02-28 Thread Brian Andrus
Jake, it may help more to understand what you are trying to accomplish rather than find out how to do it the way you expect. I am guessing you are using srun to get an interactive session on a node. That approach is being deprecated and you get a shell by default with salloc. If you are

[slurm-users] Chaining srun commands

2023-02-28 Thread Jake Jellinek
Hi all, I come from an SGE/UGE background and am used to the convention that I can qrsh to a node and, from there, start a new qrsh to a different node with different parameters. I've tried this with Slurm and found that it doesn't work the same way. For example, if I issue an 'srun' command, I get

Re: [slurm-users] Power saving and node weight

2023-02-28 Thread Brian Andrus
Gizo, I had that issue and opened a ticket. It is not considered a bug but a feature request. They have no plans to address it at this time: 9734 – Jobs sent to higher weight idle node instead of starting lower weight node (schedmd.com). You ma

[slurm-users] Power saving and node weight

2023-02-28 Thread Gizo Nanava
Hello, it seems that if Slurm power saving is enabled, the parameter "Weight" is ignored for nodes that are in a powered-down state. Is there any way to make the option work for a cluster running Slurm in power-saving mode? I am aware of the note to the weight option in the

[slurm-users] Evaluation: How to collect data regarding Slurm's cloud scheduling performance?

2023-02-28 Thread Xaver Stiensmeier
Dear slurm-user list, I am currently investigating ways to evaluate Slurm's cloud scheduling performance. As we are all aware, there are many tuning knobs when it comes to cloud scheduling. We can change the regular scheduling (prioritizing, ...), powerup and powerdown times. Ther