Hi,

Job Size Factor in Multifactor Priority Plugin gets its value considering
relative job size, and this size is relative to "node_record_count". The
problems I see with this are two:

- "node_record_count" includes my login node, which is never going to be
used to run jobs. I would solve this by just substracting one to this value.
- "node_record_count" includes all existing nodes in the cluster, doesn't
matter if they are down. I think Job Size priority should be relative to
the maximun size of a job that could be run if there were no other jobs
running in the cluster. So if I have a 70 node cluster, with 2 nodes down,
and a 10 node job, priority for this job should be 10/68, not 10/70.

What would be the easiest way of getting the number of allocated or idle
nodes? I have been trough slurmctld and sinfo code, but I understand they
use loops for this, and I would prefer not having to do this every time I
recalculate priorities.

Thanks,

Miguel

Reply via email to