Notification Type: RECOVERY
Service: NFS port is open on cluster IP
Host: labstore1005
Address: 10.64.37.20
State: OK
Date/Time: Thu May 12 15:38:34 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Portal:Data_Services/Admin/Labstore
Acknowledged by :
Additional Info:
TCP OK - 0.000 second response time on 10.64.37.18 port 2049
Notification Type: RECOVERY
Service: NFS port is open on cluster IP
Host: labstore1004
Address: 10.64.37.19
State: OK
Date/Time: Thu May 12 15:33:58 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Portal:Data_Services/Admin/Labstore
Acknowledged by :
Additional Info:
TCP OK - 0.000 second response time on 10.64.37.18 port 2049
Notification Type: PROBLEM
Service: NFS port is open on cluster IP
Host: labstore1004
Address: 10.64.37.19
State: CRITICAL
Date/Time: Thu May 12 15:18:31 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Portal:Data_Services/Admin/Labstore
Acknowledged by :
Additional Info:
CRITICAL - Socket timeout after 2 seconds
Notification Type: PROBLEM
Service: Ensure NFS exports are maintained for new instances with NFS
Host: labstore1004
Address: 10.64.37.19
State: CRITICAL
Date/Time: Thu May 12 15:18:27 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/systemd_unit_state
Acknowledged by :
Additional Info:
NRPE: Command check_nfs-exportd-state not defined
Notification Type: PROBLEM
Service: NFS port is open on cluster IP
Host: labstore1005
Address: 10.64.37.20
State: CRITICAL
Date/Time: Thu May 12 15:11:01 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Portal:Data_Services/Admin/Labstore
Acknowledged by :
Additional Info:
CRITICAL - Socket timeout after 2 seconds
Notification Type: RECOVERY
Service: Check for VMs leaked by the nova-fullstack test
Host: cloudcontrol1003
Address: 208.80.154.23
State: OK
Date/Time: Thu May 12 04:34:42 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Portal:Cloud_VPS/Admin/Runbooks/Check_f…
Acknowledged by :
Additional Info:
1 instances in the admin-monitoring project
Notification Type: ACKNOWLEDGEMENT
Service: Check unit status of backup_cinder_volumes
Host: cloudcontrol1005
Address: 208.80.154.85
State: CRITICAL
Date/Time: Wed May 11 21:50:24 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Portal:Cloud_VPS/Admin/Runbooks/Check_u…
Acknowledged by andrew bogott: I will investigate this if I get a moment before bedtime
Additional Info:
CRITICAL: Status of the systemd unit backup_cinder_volumes
Notification Type: ACKNOWLEDGEMENT
Service: Check for VMs leaked by the nova-fullstack test
Host: cloudcontrol1003
Address: 208.80.154.23
State: CRITICAL
Date/Time: Wed May 11 21:50:24 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Portal:Cloud_VPS/Admin/Runbooks/Check_f…
Acknowledged by andrew bogott: I will investigate this if I get a moment before bedtime
Additional Info:
10 instances in the admin-monitoring project