I am trying to submit a job to a specific host in the queue:
# qrsh -verbose -q gui.q@ibm056
Your job 5049542 ("QRLOGIN") has been submitted
waiting for interactive job to be scheduled ...
However it is in waiting state:
# qstat -u johnt
job-ID prior name user state submit/start at queue
slots ja-task-ID
-----------------------------------------------------------------------------------------------------------------
5049542 0.55500 QRLOGIN johnt qw 04/19/2017 14:51:19
1
# qstat -j 5049542 |grep gui.q
hard_queue_list: gui.q@ibm056
queue instance "gui.q@dsbm05" dropped because it is
full
Here is the current status of the queue:
# qstat -f |grep gui.q
gui.q@dsbm04 BIP 0/5/45 8.87 lx24-amd64
gui.q@dsbm05 BIP 0/55/55 7.75 lx24-amd64
gui.q@ibm056 BIP 0/11/30 3.15 lx24-amd64
gui.q@ibm057 BIP 0/11/30 1.34 lx24-amd64
gui.q@ibm058 BIP 0/11/45 3.47 lx24-amd64
The same goes for ibm057 and ibm058. It seems that dsbm05 being full blocks all
following servers in the queue list. In fact I can submit to dsbm04, which
precedes dsbm05.
I recently upgraded from sge6.1 to sge6.2u6, though I can't be sure that's the
only thing that's changed. How do I even begin to debug this?
Thanks for any help
John
________________________________
This email (including its attachments, if any) may be confidential and
proprietary information of SMIC, and intended only for the use of the named
recipient(s) above. Any unauthorized use or disclosure of this email is
strictly prohibited. If you are not the intended recipient(s), please notify
the sender immediately and delete this email from your computer.
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users