Marlen Caemmerer <marlen.caemmerer(a)wikimedia.de> wrote:
I would like to reboot ortelius, one of the web
servers, tomorrow, Tuesday, at 18:30 UTC.
Apparently, wolfsbane rebooted today as well:
| timl@wolfsbane:~$ uptime
| 16:49pm up 5:00, 2 users, load average: 1.16, 1.24, 1.47
| timl@wolfsbane:~$
Perhaps related to that, SGE queues on ortelius and
wolfsbane are in state "au" (alarm, unknown):
| timl@wolfsbane:~$ qstat -f -explain a | sed -ne '1,2p' -e '/ortelius\|wolfsbane/,/^-/p'
| queuename qtype resv/used/tot. load_avg arch states
| ---------------------------------------------------------------------------------
| short-sol@ortelius.toolserver. B 0/0/8 -NA- sol-amd64 au
| error: no value for "np_load_short" because execd is in unknown state
| error: no value for "np_load_avg" because execd is in unknown state
| error: no value for "cpu" because execd is in unknown state
| error: no value for "mem_free" because execd is in unknown state
| alarm gf:tmp_free=100G load-threshold=200M
| alarm gf:available=1 load-threshold=0
| ---------------------------------------------------------------------------------
| short-sol@wolfsbane.toolserver B 0/10/12 -NA- sol-amd64 au
| error: no value for "np_load_short" because execd is in unknown state
| error: no value for "np_load_avg" because execd is in unknown state
| error: no value for "cpu" because execd is in unknown state
| error: no value for "mem_free" because execd is in unknown state
| alarm gf:tmp_free=100G load-threshold=200M
| alarm gf:available=1 load-threshold=0
| ---------------------------------------------------------------------------------
| medium-sol@ortelius.toolserver B 0/0/4 -NA- sol-amd64 au
| error: no value for "np_load_short" because execd is in unknown state
| error: no value for "np_load_avg" because execd is in unknown state
| error: no value for "np_load_long" because execd is in unknown state
| error: no value for "cpu" because execd is in unknown state
| error: no value for "mem_free" because execd is in unknown state
| alarm gf:tmp_free=100G load-threshold=100M
| alarm gf:available=1 load-threshold=0
| ---------------------------------------------------------------------------------
| medium-sol@wolfsbane.toolserve B 0/3/4 -NA- sol-amd64 au
| error: no value for "np_load_short" because execd is in unknown state
| error: no value for "np_load_avg" because execd is in unknown state
| error: no value for "np_load_long" because execd is in unknown state
| error: no value for "cpu" because execd is in unknown state
| error: no value for "mem_free" because execd is in unknown state
| alarm gf:tmp_free=100G load-threshold=100M
| alarm gf:available=1 load-threshold=0
| ---------------------------------------------------------------------------------
| timl@wolfsbane:~$
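For anyone who would rather check this from a script than read the
full listing, here is a rough sketch that picks the queue lines out of
"qstat -f" output and reports those whose states column contains given
flags. The function name queues_in_state is made up for illustration,
and it assumes the column layout shown above (slots as resv/used/tot.
in the third column, states as an optional sixth column); it is not
part of SGE.

```python
import re

# Matches the resv/used/tot. column of a queue line, e.g. "0/10/12".
SLOTS = re.compile(r"^\d+/\d+/\d+$")


def queues_in_state(qstat_text, flags="au"):
    """Return (queue, states) pairs whose states column contains every flag.

    Assumption: queue lines look like
        <queuename> <qtype> <resv/used/tot.> <load_avg> <arch> [<states>]
    Header, separator, "error:" and "alarm" lines do not have a
    digits/digits/digits third field and are skipped.
    """
    hits = []
    for line in qstat_text.splitlines():
        fields = line.split()
        # A queue line has at least name, qtype, slots, load_avg, arch.
        if len(fields) < 5 or not SLOTS.match(fields[2]):
            continue
        # The states column is empty (absent) for a healthy queue.
        states = fields[5] if len(fields) > 5 else ""
        if all(flag in states for flag in flags):
            hits.append((fields[0], states))
    return hits
```

Feed it the captured text of "qstat -f"; with the default flags it
returns only queues that are both in alarm and unknown state, and with
flags="u" it returns every queue whose execd is not reporting.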
Tim