[slurm-dev] Job submit plugin to improve backfill

2013-06-28 Thread Daniel M. Weeks
At CCNI, we use backfill scheduling on all our systems. However, we have found that users typically do not specify a time limit for their job so the scheduler assumes the maximum from QoS/user limits/partition limits/etc. This really hurts backfilling since the scheduler remains ignorant of short j

[slurm-dev] Re: Job submit plugin to improve backfill

2013-06-28 Thread Ryan Cox
An alternative that we do is choose very low defaults for people: PartitionName=Default DefaultTime=30:00 #plus other options DefMemPerCPU=512 The disadvantage to this approach is that it doesn't give an obvious error message at submit time. However, it's not hard to figure out what h

[slurm-dev] Re: Job submit plugin to improve backfill

2013-06-28 Thread Daniel M. Weeks
Hi Ryan, Thanks. We had considered this approach but went in a different direction for a couple reasons: We have a good number of users that script job submissions and may blast out up to several hundred jobs. A user might not realize their jobs are getting cutoff until many of them run and it's

[slurm-dev] Re: Job submit plugin to improve backfill

2013-06-28 Thread Nikita Burtsev
Hello, Why not enable this functionality by setting DefaultTime=0 in slurm.conf which would let us set this on per-partition basis, rather than through job submit plugin. (Unless i'm missing something obvious here) Also currently setting DefaultTime=0 (on 2.5.6 at least) gives following mess

[slurm-dev] Re: Job submit plugin to improve backfill

2013-06-28 Thread Eckert, Phil
Another route that could be taken is to set the DefaultTime for a partition to 0, and the small patch attached to this email will reject a job when is has no time limit specified and the default_time limit is 0. I also modified the ESLURM_INVALID_TIME_LIMIT to include information that the error mig