Hello, At Tuesday 13 November 2012 14:04:40 DaB. wrote:
What about puppetd ? If it's still running, it could provide a way to restart the server.
It stopped working along with the other services. We had that idea too :-).
Neither munin nor nagios shows any memory problems on thyme before the crash, and normally the Solaris SVC system would restart SSH even if it had been killed. The only strange thing I see in nagios is that thyme lost both SAN connections during the first phase of the crash, regained them, recovered for 2 minutes, and crashed again, but kept the SAN connections this time.
Because thyme will be hard-restarted (power down/up), we will lose the "state" of the server, but maybe the syslog can tell us a little bit.
Sincerely, DaB.