[slurm-dev] Dynamic partitions on Linux cluster

2014-08-14 Thread Uwe Sauter
Hi all, I got a question about a configuration detail: dynamic partitions Situation: I operate a Linux cluster of currently 54 nodes for a cooperation of two different institutes at the university. To reflect the ratio of cash those institutes invested I configured SLURM with two partition, one

[slurm-dev] Re: Dynamic partitions on Linux cluster

2014-08-14 Thread Bill Barth
Why not make one partition and use fairshare to balance the usage over time? That way both institutes can run large jobs that span the whole machine when others are not using it. Bill. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435

[slurm-dev] Re: Dynamic partitions on Linux cluster

2014-08-14 Thread Uwe Sauter
Hi Bill, if I understand the concept of fairshare correctly, this could result in a situation where one institute uses all resources. Because of this fairshare is out of the question as I have to enforce the ratio between the institutes - I cannot allow usage that would result in one institute

[slurm-dev] Re: Dynamic partitions on Linux cluster

2014-08-14 Thread Bill Barth
Yes, yes it does. I don't mean to be harsh, but doing it their way is a potentially huge waste of resources. Why not get each institute to agree to share the whole machine in proportion to what they paid? Each institute gets an allocation of time (through accounting) and a fairshare fraction in

[slurm-dev] Re: Dynamic partitions on Linux cluster

2014-08-14 Thread Ryan Cox
I would also recommend QOS if you absolutely can't use fairshare. Set up a QOS per institute with a GrpNodes limit that is the correct ratio and only allow institute members to their QOS (make it their default too). Alternatively you can also do one account per institute and set GrpNodes

[slurm-dev] Re: Change requested cpus of running job

2014-08-14 Thread jette
Quoting Christopher B Coffey chris.cof...@nau.edu: Hi, Is it possible with scontrol to change the number of cpus that were granted to a job while its running? Only if the user's program/script is cooperating. See: http://slurm.schedmd.com/faq.html#job_size -- Morris Moe Jette CTO, SchedMD