Notification Type: PROBLEM
Service: Check systemd state
Host: cloudcephmon1002
Address: 10.64.20.68
State: CRITICAL
Date/Time: Mon Oct 24 23:27:27 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by :
Additional Info:
CRITICAL - degraded: The following units failed: ceph-mgr(a)cloudcephmon1002.service
Notification Type: RECOVERY
Service: Check systemd state
Host: cloudcephmon1001
Address: 10.64.20.67
State: OK
Date/Time: Mon Oct 24 23:22:37 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by :
Additional Info:
OK - running: The system is fully operational
Notification Type: RECOVERY
Service: Check systemd state
Host: cloudcephmon1002
Address: 10.64.20.68
State: OK
Date/Time: Mon Oct 24 23:14:59 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by :
Additional Info:
OK - running: The system is fully operational
Notification Type: PROBLEM
Service: Check systemd state
Host: cloudcephmon1002
Address: 10.64.20.68
State: CRITICAL
Date/Time: Mon Oct 24 22:52:17 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by :
Additional Info:
CRITICAL - degraded: The following units failed: ceph-mgr(a)cloudcephmon1002.service
Notification Type: RECOVERY
Service: Check systemd state
Host: cloudservices1004
Address: 208.80.154.11
State: OK
Date/Time: Mon Oct 24 22:13:51 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by :
Additional Info:
OK - running: The system is fully operational
Notification Type: RECOVERY
Service: haproxy service failover
Host: cloudcontrol1005
Address: 208.80.154.85
State: OK
Date/Time: Mon Oct 24 22:08:34 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/HAProxy
Acknowledged by :
Additional Info:
OK check_failover servers up 38 down 0:
Notification Type: RECOVERY
Service: haproxy service failover
Host: cloudcontrol1007
Address: 208.80.155.104
State: OK
Date/Time: Mon Oct 24 22:08:33 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/HAProxy
Acknowledged by :
Additional Info:
OK check_failover servers up 38 down 0:
Notification Type: RECOVERY
Service: haproxy service failover
Host: cloudcontrol1006
Address: 208.80.154.149
State: OK
Date/Time: Mon Oct 24 22:08:33 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/HAProxy
Acknowledged by :
Additional Info:
OK check_failover servers up 38 down 0: