Hi,
a "qstat -j" of a simple job yields inter alia:
| scheduling info: queue instance "longrun-sol(a)willow.toolserver.org"
dropped because it is temporarily not available
| queue instance "short-sol(a)willow.toolserver.org"
dropped because it is temporarily not available
| queue instance "medium-lx(a)mayapple.toolserver.org"
dropped because it is temporarily not available
| queue instance
"longrun3-sol(a)willow.toolserver.org" dropped because it is temporarily not
available
| queue instance
"longrun2-sol(a)clematis.toolserver.org" dropped because it is disabled
| queue instance
"longrun2-sol(a)hawthorn.toolserver.org" dropped because it is disabled
| queue instance
"medium-sol(a)ortelius.toolserver.org" dropped because it is overloaded:
np_load_short=0.791601 (= 0.391601 + 0.8 * 2.000000 with nproc=4) >= 0.75
| queue instance "medium-lx(a)yarrow.toolserver.org"
dropped because it is overloaded: np_load_short=1.215000 (= 0.015000 + 0.8 * 6.000000 with
nproc=4) >= 1.2
| queue instance
"medium-lx(a)nightshade.toolserver.org" dropped because it is overloaded:
np_load_short=1.227500 (= 0.127500 + 0.8 * 11.000000 with nproc=8) >= 1.2
| queue instance
"medium-sol(a)wolfsbane.toolserver.org" dropped because it is overloaded:
np_load_short=0.778613 (= 0.078613 + 0.8 * 7.000000 with nproc=8) >= 0.75
| queue instance
"short-sol(a)wolfsbane.toolserver.org" dropped because it is overloaded:
np_load_short=1.278613 (= 0.078613 + 0.8 * 12.000000 with nproc=8) >= 1.2
| queue instance "short-sol(a)ortelius.toolserver.org"
dropped because it is overloaded: np_load_short=1.391601 (= 0.391601 + 0.8 * 5.000000 with
nproc=4) >= 1.2
| queue instance "longrun-lx(a)yarrow.toolserver.org"
dropped because it is overloaded: np_load_short=3.215000 (= 0.015000 + 0.8 * 16.000000
with nproc=4) >= 3.1
| queue instance
"longrun-lx(a)nightshade.toolserver.org" dropped because it is overloaded:
mem_free=-420765696.524288 (= 14098.726562M - 500M * 29.000000) <= 500
At the moment, we have /no/ jobs scheduled by SGE running.
Meanwhile, the hosts are idling:
| queuename qtype resv/used/tot. load_avg arch states
| ---------------------------------------------------------------------------------
| short-sol(a)ortelius.toolserver. B 0/0/8 1.52 sol-amd64
| ---------------------------------------------------------------------------------
| short-sol(a)willow.toolserver.or B 0/0/8 -NA- sol-amd64 au
| ---------------------------------------------------------------------------------
| short-sol(a)wolfsbane.toolserver B 0/0/12 0.64 sol-amd64
| ---------------------------------------------------------------------------------
| medium-lx(a)mayapple.toolserver. B 0/0/32 -NA- linux-x64 adu
| ---------------------------------------------------------------------------------
| medium-lx(a)nightshade.toolserve B 0/0/8 1.05 linux-x64
| ---------------------------------------------------------------------------------
| medium-lx(a)yarrow.toolserver.or B 0/0/8 0.02 linux-x64
| ---------------------------------------------------------------------------------
| longrun-lx(a)nightshade.toolserv BI 0/0/64 1.05 linux-x64
| ---------------------------------------------------------------------------------
| longrun-lx(a)yarrow.toolserver.o BI 0/0/64 0.02 linux-x64
| ---------------------------------------------------------------------------------
| longrun-sol(a)willow.toolserver. BI 0/0/64 -NA- sol-amd64 au
| ---------------------------------------------------------------------------------
| medium-sol(a)ortelius.toolserver B 0/0/4 1.52 sol-amd64
| ---------------------------------------------------------------------------------
| medium-sol(a)wolfsbane.toolserve B 0/0/4 0.64 sol-amd64
| ---------------------------------------------------------------------------------
| longrun2-sol(a)clematis.toolserv B 0/0/8 0.03 sol-amd64 d
| ---------------------------------------------------------------------------------
| longrun2-sol(a)hawthorn.toolserv B 0/0/8 0.23 sol-amd64 d
| ---------------------------------------------------------------------------------
| longrun3-sol(a)willow.toolserver B 0/0/4 -NA- sol-amd64 aduE
I filed
https://jira.toolserver.org/browse/TS-1650 on Monday
to no avail so far.
Tim