Hi All,
I'm currently having a problem in GE 6.2u5 with a resource quota configuration.
Essentially, I have 5 batch queues set up (veryshort, short, medium, long,
unlimited) and 1 interactive queue. I have written the following resource
quota to ensure that any one user cannot consume more than 40 slots at any one
time:
{
name limit_slots_batch
description "Limit users to 40 slots on batch queue"
enabled TRUE
limit users {*} queues !interactive.q to slots=40
}
However, I am getting a problem when scheduling parallel jobs where GE seems to
think that the slot quota has been exceeded, even though there are plenty of
free slots in the relevant queues. Sometimes restarting sgemaster helps,
sometimes it does not. The output from qstat -j is as below, the offending
line appears to be the penultimate one.
Does anyone have an idea what might be causing this, and how I could fix it?
Thanks,
Chris
==============================================================
job_number: 83168
exec_file: job_scripts/83168
submission_time: Wed May 4 15:57:22 2011
owner: stsiab
uid: 1000
group: st
gid: 1001
sge_o_home: /home/stsiab
sge_o_log_name: stsiab
sge_o_path:
/usr/local/packages/python-2.7/bin:/usr/local/packages/R-2.11.1/bin:/usr/local/packages/eclipse-3.6:/usr/local/packages/valgrind-3.5.0/bin:/usr/local/packages/gdb-6.8/bin:/usr/local/packages/jdk1.6.0_21/bin:/usr/local/packages/openmpi-1.4.3/bin:/usr/local/packages/gcc-4.4.0/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/sge/bin/lx24-amd64:/usr/sge/bin/lx24-amd64
sge_o_shell: /bin/bash
sge_o_workdir:
/home/stsiab/brandenburg/brandenburg/release/src/unitTests
sge_o_host: buster
account: sge
cwd:
/home/stsiab/brandenburg/brandenburg/release/src/unitTests
hard resource_list: h_vmem=500M,h_rt=86400
mail_list: [email protected]
notify: FALSE
job_name: auseiMcmcNC
jobshare: 0
hard_queue_list: !interactive.q
shell_list: NONE:/bin/bash
env_list:
script_file: auseiMcmc.com
parallel environment: mpi range: 16
version: 1
scheduling info: cannot run in queue "interactive.q" because it is
not contained in its hard queue list (-q)
cannot run because it exceeds limit
"stsiab/////" in rule "limit_slots_batch/1"
cannot run in PE "mpi" because it only
offers 0 slots
--
Dr Chris Jewell
Department of Statistics
University of Warwick
Coventry
CV4 7AL
UK
Tel: +44 (0)24 7615 0778
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users