Robert Ullmann wrote:
All servers should be monitored, on several levels (ping, various queries, checking processes)
Nagios should have been monitoring them.
Someone should be "watching" the monitor 24x7. (being right there, or by SMS, whatever ;)
Don't know if there can be a nagios "silent" failure, where it doesn't get disconnected from irc.
When restarted, the things it was doing should be restarted (this has not been done yet at this writing).
The worry bit is that it seems srv136 will now work as apache. So, where will dumps be done?