Hi,
a "qstat -j" of a simple job yields inter alia:
| scheduling info: queue instance "longrun-sol@willow.toolserver.org" dropped because it is temporarily not available | queue instance "short-sol@willow.toolserver.org" dropped because it is temporarily not available | queue instance "medium-lx@mayapple.toolserver.org" dropped because it is temporarily not available | queue instance "longrun3-sol@willow.toolserver.org" dropped because it is temporarily not available | queue instance "longrun2-sol@clematis.toolserver.org" dropped because it is disabled | queue instance "longrun2-sol@hawthorn.toolserver.org" dropped because it is disabled | queue instance "medium-sol@ortelius.toolserver.org" dropped because it is overloaded: np_load_short=0.791601 (= 0.391601 + 0.8 * 2.000000 with nproc=4) >= 0.75 | queue instance "medium-lx@yarrow.toolserver.org" dropped because it is overloaded: np_load_short=1.215000 (= 0.015000 + 0.8 * 6.000000 with nproc=4) >= 1.2 | queue instance "medium-lx@nightshade.toolserver.org" dropped because it is overloaded: np_load_short=1.227500 (= 0.127500 + 0.8 * 11.000000 with nproc=8) >= 1.2 | queue instance "medium-sol@wolfsbane.toolserver.org" dropped because it is overloaded: np_load_short=0.778613 (= 0.078613 + 0.8 * 7.000000 with nproc=8) >= 0.75 | queue instance "short-sol@wolfsbane.toolserver.org" dropped because it is overloaded: np_load_short=1.278613 (= 0.078613 + 0.8 * 12.000000 with nproc=8) >= 1.2 | queue instance "short-sol@ortelius.toolserver.org" dropped because it is overloaded: np_load_short=1.391601 (= 0.391601 + 0.8 * 5.000000 with nproc=4) >= 1.2 | queue instance "longrun-lx@yarrow.toolserver.org" dropped because it is overloaded: np_load_short=3.215000 (= 0.015000 + 0.8 * 16.000000 with nproc=4) >= 3.1 | queue instance "longrun-lx@nightshade.toolserver.org" dropped because it is overloaded: mem_free=-420765696.524288 (= 14098.726562M - 500M * 29.000000) <= 500
At the moment, we have /no/ jobs scheduled by SGE running. Meanwhile, the hosts are idling:
| queuename qtype resv/used/tot. load_avg arch states | --------------------------------------------------------------------------------- | short-sol@ortelius.toolserver. B 0/0/8 1.52 sol-amd64 | --------------------------------------------------------------------------------- | short-sol@willow.toolserver.or B 0/0/8 -NA- sol-amd64 au | --------------------------------------------------------------------------------- | short-sol@wolfsbane.toolserver B 0/0/12 0.64 sol-amd64 | --------------------------------------------------------------------------------- | medium-lx@mayapple.toolserver. B 0/0/32 -NA- linux-x64 adu | --------------------------------------------------------------------------------- | medium-lx@nightshade.toolserve B 0/0/8 1.05 linux-x64 | --------------------------------------------------------------------------------- | medium-lx@yarrow.toolserver.or B 0/0/8 0.02 linux-x64 | --------------------------------------------------------------------------------- | longrun-lx@nightshade.toolserv BI 0/0/64 1.05 linux-x64 | --------------------------------------------------------------------------------- | longrun-lx@yarrow.toolserver.o BI 0/0/64 0.02 linux-x64 | --------------------------------------------------------------------------------- | longrun-sol@willow.toolserver. BI 0/0/64 -NA- sol-amd64 au | --------------------------------------------------------------------------------- | medium-sol@ortelius.toolserver B 0/0/4 1.52 sol-amd64 | --------------------------------------------------------------------------------- | medium-sol@wolfsbane.toolserve B 0/0/4 0.64 sol-amd64 | --------------------------------------------------------------------------------- | longrun2-sol@clematis.toolserv B 0/0/8 0.03 sol-amd64 d | --------------------------------------------------------------------------------- | longrun2-sol@hawthorn.toolserv B 0/0/8 0.23 sol-amd64 d | --------------------------------------------------------------------------------- | longrun3-sol@willow.toolserver B 0/0/4 -NA- sol-amd64 aduE
I filed https://jira.toolserver.org/browse/TS-1650 on Monday to no avail so far.
Tim