Notification Type: RECOVERY
Host: cloudvirt1051
State: UP
Address: 10.64.148.9
Info: PING OK - Packet loss = 0%, RTA = 0.26 ms
Date/Time: Thu Nov 9 19:41:13 UTC 2023
Acknowledged by :
Notification Type: PROBLEM
Host: cloudvirt1051
State: DOWN
Address: 10.64.148.9
Info: PING CRITICAL - Packet loss = 100%
Date/Time: Thu Nov 9 19:39:15 UTC 2023
Acknowledged by :
Notification Type: RECOVERY
Service: Check for large files in client bucket
Host: cloudvirt1060
Address: 10.64.149.12
State: OK
Date/Time: Thu Nov 9 19:38:05 UTC 2023
Notes URLs: https://wikitech.wikimedia.org/wiki/Puppet%23check_client_bucket_large_file
Acknowledged by :
Additional Info:
OK: client bucket file ok
Notification Type: PROBLEM
Service: Check for large files in client bucket
Host: cloudvirt1060
Address: 10.64.149.12
State: CRITICAL
Date/Time: Thu Nov 9 19:32:39 UTC 2023
Notes URLs: https://wikitech.wikimedia.org/wiki/Puppet%23check_client_bucket_large_file
Acknowledged by :
Additional Info:
CHECK_NRPE: Error - Could not connect to 10.64.149.12: Connection reset by peer
Notification Type: RECOVERY
Service: Check unit status of backup_vms
Host: cloudbackup1004
Address: 10.64.20.23
State: OK
Date/Time: Thu Nov 9 19:16:30 UTC 2023
Notes URLs: https://wikitech.wikimedia.org/wiki/Portal:Cloud_VPS/Admin/Runbooks/Check_u…
Acknowledged by :
Additional Info:
OK: Status of the systemd unit backup_vms
Notification Type: PROBLEM
Service: Check unit status of remove_dangling_cinder_snapshots
Host: cloudbackup2001
Address: 10.192.0.130
State: CRITICAL
Date/Time: Thu Nov 9 19:14:50 UTC 2023
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/systemd_unit_state
Acknowledged by :
Additional Info:
CRITICAL: Status of the systemd unit remove_dangling_cinder_snapshots
Notification Type: RECOVERY
Service: Check unit status of remove_dangling_cinder_snapshots
Host: cloudbackup2001
Address: 10.192.0.130
State: OK
Date/Time: Thu Nov 9 19:03:21 UTC 2023
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/systemd_unit_state
Acknowledged by :
Additional Info:
OK: Status of the systemd unit remove_dangling_cinder_snapshots