Notification Type: PROBLEM
Service: Check systemd state
Host: labstore1004
Address: 10.64.37.19
State: CRITICAL
Date/Time: Mon Feb 10 15:05:15 UTC 2020
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by :
Additional Info:
CRITICAL - degraded: The system is operational but one or more units failed.

Notification Type: PROBLEM
Service: Check systemd state
Host: cloudcontrol1004
Address: 208.80.154.132
State: CRITICAL
Date/Time: Mon Feb 10 15:05:07 UTC 2020
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by :
Additional Info:
CRITICAL - degraded: The system is operational but one or more units failed.

Notification Type: PROBLEM
Host: cloudvirt1016.mgmt
State: DOWN
Address: 10.65.5.67
Info: PING CRITICAL - Packet loss = 100%
Date/Time: Fri Feb 7 18:17:38 UTC 2020
Acknowledged by :

Notification Type: RECOVERY
Service: Check systemd state
Host: cloudcontrol1003
Address: 208.80.154.23
State: OK
Date/Time: Fri Feb 7 00:09:27 UTC 2020
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by :
Additional Info:
OK - running: The system is fully operational

Notification Type: PROBLEM
Service: Check systemd state
Host: cloudcontrol1003
Address: 208.80.154.23
State: CRITICAL
Date/Time: Fri Feb 7 00:04:15 UTC 2020
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by :
Additional Info:
CRITICAL - degraded: The system is operational but one or more units failed.

Notification Type: RECOVERY
Service: tools project instance distribution
Host: cloudcontrol1003
Address: 208.80.154.23
State: OK
Date/Time: Mon Feb 3 14:13:19 UTC 2020
Notes URLs: https://wikitech.wikimedia.org/wiki/Portal:Cloud_VPS/Admin/Troubleshooting
Acknowledged by :
Additional Info:
OK: All critical instances are spread out enough

Notification Type: PROBLEM
Service: tools project instance distribution
Host: cloudcontrol1003
Address: 208.80.154.23
State: CRITICAL
Date/Time: Mon Feb 3 12:52:17 UTC 2020
Notes URLs: https://wikitech.wikimedia.org/wiki/Portal:Cloud_VPS/Admin/Troubleshooting
Acknowledged by :
Additional Info:
CRITICAL: prometheus class instances not spread out enough