Hi,

a "qstat -j" of a simple job yields inter alia:

| scheduling info:            queue instance 
"longrun-...@willow.toolserver.org" dropped because it is temporarily not 
available
|                             queue instance "short-...@willow.toolserver.org" 
dropped because it is temporarily not available
|                             queue instance 
"medium...@mayapple.toolserver.org" dropped because it is temporarily not 
available
|                             queue instance 
"longrun3-...@willow.toolserver.org" dropped because it is temporarily not 
available
|                             queue instance 
"longrun2-...@clematis.toolserver.org" dropped because it is disabled
|                             queue instance 
"longrun2-...@hawthorn.toolserver.org" dropped because it is disabled
|                             queue instance 
"medium-...@ortelius.toolserver.org" dropped because it is overloaded: 
np_load_short=0.791601 (= 0.391601 + 0.8 * 2.000000 with nproc=4) >= 0.75
|                             queue instance "medium...@yarrow.toolserver.org" 
dropped because it is overloaded: np_load_short=1.215000 (= 0.015000 + 0.8 * 
6.000000 with nproc=4) >= 1.2
|                             queue instance 
"medium...@nightshade.toolserver.org" dropped because it is overloaded: 
np_load_short=1.227500 (= 0.127500 + 0.8 * 11.000000 with nproc=8) >= 1.2
|                             queue instance 
"medium-...@wolfsbane.toolserver.org" dropped because it is overloaded: 
np_load_short=0.778613 (= 0.078613 + 0.8 * 7.000000 with nproc=8) >= 0.75
|                             queue instance 
"short-...@wolfsbane.toolserver.org" dropped because it is overloaded: 
np_load_short=1.278613 (= 0.078613 + 0.8 * 12.000000 with nproc=8) >= 1.2
|                             queue instance 
"short-...@ortelius.toolserver.org" dropped because it is overloaded: 
np_load_short=1.391601 (= 0.391601 + 0.8 * 5.000000 with nproc=4) >= 1.2
|                             queue instance "longrun...@yarrow.toolserver.org" 
dropped because it is overloaded: np_load_short=3.215000 (= 0.015000 + 0.8 * 
16.000000 with nproc=4) >= 3.1
|                             queue instance 
"longrun...@nightshade.toolserver.org" dropped because it is overloaded: 
mem_free=-420765696.524288 (= 14098.726562M - 500M * 29.000000) <= 500

At the moment, we have /no/ jobs scheduled by SGE running.
Meanwhile, the hosts are idling:

| queuename                      qtype resv/used/tot. load_avg arch          
states
| 
---------------------------------------------------------------------------------
| short-sol@ortelius.toolserver. B     0/0/8          1.52     sol-amd64
| 
---------------------------------------------------------------------------------
| short-...@willow.toolserver.or B     0/0/8          -NA-     sol-amd64     au
| 
---------------------------------------------------------------------------------
| short-sol@wolfsbane.toolserver B     0/0/12         0.64     sol-amd64
| 
---------------------------------------------------------------------------------
| medium-lx@mayapple.toolserver. B     0/0/32         -NA-     linux-x64     adu
| 
---------------------------------------------------------------------------------
| medium-lx@nightshade.toolserve B     0/0/8          1.05     linux-x64
| 
---------------------------------------------------------------------------------
| medium...@yarrow.toolserver.or B     0/0/8          0.02     linux-x64
| 
---------------------------------------------------------------------------------
| longrun-lx@nightshade.toolserv BI    0/0/64         1.05     linux-x64
| 
---------------------------------------------------------------------------------
| longrun-lx@yarrow.toolserver.o BI    0/0/64         0.02     linux-x64
| 
---------------------------------------------------------------------------------
| longrun-sol@willow.toolserver. BI    0/0/64         -NA-     sol-amd64     au
| 
---------------------------------------------------------------------------------
| medium-sol@ortelius.toolserver B     0/0/4          1.52     sol-amd64
| 
---------------------------------------------------------------------------------
| medium-sol@wolfsbane.toolserve B     0/0/4          0.64     sol-amd64
| 
---------------------------------------------------------------------------------
| longrun2-sol@clematis.toolserv B     0/0/8          0.03     sol-amd64     d
| 
---------------------------------------------------------------------------------
| longrun2-sol@hawthorn.toolserv B     0/0/8          0.23     sol-amd64     d
| 
---------------------------------------------------------------------------------
| longrun3-sol@willow.toolserver B     0/0/4          -NA-     sol-amd64     
aduE

I filed https://jira.toolserver.org/browse/TS-1650 on Monday
to no avail so far.

Tim


_______________________________________________
Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org)
https://lists.wikimedia.org/mailman/listinfo/toolserver-l
Posting guidelines for this list: 
https://wiki.toolserver.org/view/Mailing_list_etiquette

Reply via email to