Hi all,
I have a question about a configuration detail: dynamic partitions.
Situation:
I operate a Linux cluster of currently 54 nodes for a cooperation of two
different institutes at the university. To reflect the ratio of cash
those institutes invested, I configured SLURM with two partitions, one
Why not make one partition and use fairshare to balance the usage over
time? That way both institutes can run large jobs that span the whole
machine when others are not using it.
Bill.
--
Bill Barth, Ph.D., Director, HPC
bba...@tacc.utexas.edu | Phone: (512) 232-7069
Office: ROC 1.435
Hi Bill,
If I understand the concept of fairshare correctly, this could result in
a situation where one institute uses all resources.
Because of this, fairshare is out of the question, as I have to enforce
the ratio between the institutes - I cannot allow usage that would
result in one institute
Yes, yes it does. I don't mean to be harsh, but doing it their way is a
potentially huge waste of resources. Why not get each institute to agree
to share the whole machine in proportion to what they paid? Each institute
gets an allocation of time (through accounting) and a fairshare fraction
in
I would also recommend QOS if you absolutely can't use fairshare. Set up
a QOS per institute with a GrpNodes limit that is the correct ratio and
only allow institute members to use their own QOS (make it their default too).
Alternatively you can also do one account per institute and set GrpNodes
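
A minimal sketch of the per-institute QOS idea above, in case it helps to see the arithmetic. It splits the 54 nodes in proportion to each institute's share and prints the sacctmgr commands one might run. The QOS names (inst_a, inst_b) and the 2:1 ratio are made up for illustration; the thread does not state the real ratio. GrpNodes is the limit named in this thread; as far as I know, newer Slurm releases express the same limit as GrpTRES=node=N, so check your version's sacctmgr man page before using any of this.

```python
def split_nodes(total_nodes, shares):
    """Split total_nodes in proportion to shares (largest-remainder rounding).

    shares maps a QOS name to an integer weight, e.g. money invested.
    Integer arithmetic is used so the result always sums to total_nodes.
    """
    total_share = sum(shares.values())
    # Whole nodes each institute gets before distributing the remainder.
    limits = {name: total_nodes * s // total_share for name, s in shares.items()}
    remainders = {name: total_nodes * s % total_share for name, s in shares.items()}
    leftover = total_nodes - sum(limits.values())
    # Give the leftover nodes to the institutes with the largest remainders.
    for name in sorted(remainders, key=remainders.get, reverse=True)[:leftover]:
        limits[name] += 1
    return limits


def qos_commands(limits):
    """Build sacctmgr command lines that create one QOS per institute."""
    cmds = []
    for name, nodes in limits.items():
        cmds.append(f"sacctmgr -i add qos {name}")
        # GrpNodes as named in the thread; may be GrpTRES=node=N on newer Slurm.
        cmds.append(f"sacctmgr -i modify qos {name} set GrpNodes={nodes}")
    return cmds


if __name__ == "__main__":
    # Hypothetical 2:1 investment ratio on the 54-node cluster.
    limits = split_nodes(54, {"inst_a": 2, "inst_b": 1})
    for cmd in qos_commands(limits):
        print(cmd)
```

The script only prints the commands so they can be reviewed first. Attaching users to their QOS and making it their default, as suggested above, would still have to be done separately.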
Quoting Christopher B Coffey chris.cof...@nau.edu:
Hi,
Is it possible with scontrol to change the number of CPUs that were
granted to a job while it's running?
Only if the user's program/script is cooperating. See:
http://slurm.schedmd.com/faq.html#job_size
--
Morris Moe Jette
CTO, SchedMD