Hi.

Il 06.05.2014 21:58, Reuti ha scritto:
Hi,

Am 06.05.2014 um 18:45 schrieb Roberto Nunnari:

I'm running a small cluster using Oracle Grid Engine 6.2u7

At times it happens that one user submits a job that requires several resources 
(-pe, -l mem_free, etc).

For instance, user A submits a job X requiring 32 slots out of 100 available.
The other users, keeps submitting serial jobs filling up all the slots and 
always having more jobs waiting on the queue.

The serial jobs will get ahead of job X, and be scheduled as soon as one slot 
is available and job X will be waiting in the queue forever and never get to 
run until no more serial jobs will be submitted and 32 slots will be available.

I would like the scheduler to also consider how much the job has been waiting 
in the queue, and possibly also the values regarding the historic users 
resources usage, as returned by qacct -o username

What are the possible solutions to solve this problem?

You can also look into Resource Reservation, so that the parallel job collects 
the necessary resources while waiting:

That looks more promising.. I guess the user has to know that he has to use the -R y flags, .right?

What happens if I set max_reservation to 32 and the user submits a job (using -R y) requiring 64 slots?

Thank you and best regards.
Robi

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to