Notification Type: RECOVERY
Service: Check unit status of labs-ip-alias-dump
Host: cloudservices1004
Address: 208.80.154.11
State: OK
Date/Time: Tue Mar 7 15:30:39 UTC 2023
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/systemd_unit_state
Acknowledged by :
Additional Info:
OK: Status of the systemd unit labs-ip-alias-dump
Notification Type: RECOVERY
Service: Check systemd state
Host: cloudservices1004
Address: 208.80.154.11
State: OK
Date/Time: Tue Mar 7 15:30:07 UTC 2023
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by :
Additional Info:
OK - running: The system is fully operational
Notification Type: PROBLEM
Service: Check unit status of labs-ip-alias-dump
Host: cloudservices1004
Address: 208.80.154.11
State: CRITICAL
Date/Time: Tue Mar 7 14:49:42 UTC 2023
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/systemd_unit_state
Acknowledged by :
Additional Info:
CRITICAL: Status of the systemd unit labs-ip-alias-dump
Notification Type: PROBLEM
Service: Check unit status of wmcs_monitoring_graphite_rsync
Host: cloudmetrics1004
Address: 10.64.37.6
State: CRITICAL
Date/Time: Tue Mar 7 14:43:50 UTC 2023
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/systemd_unit_state
Acknowledged by :
Additional Info:
CRITICAL: Status of the systemd unit wmcs_monitoring_graphite_rsync
Notification Type: PROBLEM
Service: Check systemd state
Host: cloudservices1004
Address: 208.80.154.11
State: CRITICAL
Date/Time: Tue Mar 7 14:43:08 UTC 2023
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by :
Additional Info:
CRITICAL - degraded: The following units failed: labs-ip-alias-dump.service
Notification Type: PROBLEM
Service: Check systemd state
Host: cloudmetrics1004
Address: 10.64.37.6
State: CRITICAL
Date/Time: Tue Mar 7 14:38:20 UTC 2023
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by :
Additional Info:
CRITICAL - degraded: The following units failed: wmcs_monitoring_graphite_rsync.service
Notification Type: RECOVERY
Service: haproxy service failover
Host: cloudcontrol1006
Address: 208.80.154.149
State: OK
Date/Time: Tue Mar 7 14:35:38 UTC 2023
Notes URLs: https://wikitech.wikimedia.org/wiki/HAProxy
Acknowledged by :
Additional Info:
OK check_failover servers up 38 down 0:
Notification Type: RECOVERY
Service: haproxy service failover
Host: cloudcontrol1007
Address: 208.80.155.104
State: OK
Date/Time: Tue Mar 7 14:35:38 UTC 2023
Notes URLs: https://wikitech.wikimedia.org/wiki/HAProxy
Acknowledged by :
Additional Info:
OK check_failover servers up 38 down 0:
Notification Type: RECOVERY
Service: haproxy service failover
Host: cloudcontrol1005
Address: 208.80.154.85
State: OK
Date/Time: Tue Mar 7 14:35:24 UTC 2023
Notes URLs: https://wikitech.wikimedia.org/wiki/HAProxy
Acknowledged by :
Additional Info:
OK check_failover servers up 38 down 0:
Notification Type: PROBLEM
Service: haproxy service failover
Host: cloudcontrol1005
Address: 208.80.154.85
State: CRITICAL
Date/Time: Tue Mar 7 14:29:30 UTC 2023
Notes URLs: https://wikitech.wikimedia.org/wiki/HAProxy
Acknowledged by :
Additional Info:
CRITICAL check_failover servers up 37 down 1: