Notification Type: ACKNOWLEDGEMENT
Service: ensure kvm processes are running
Host: cloudvirt1032
Address: 10.64.20.74
State: CRITICAL
Date/Time: Mon Aug 10 14:43:27 UTC 2020
Notes URLs: https://wikitech.wikimedia.org/wiki/Portal:Cloud_VPS/Admin/Troubleshooting
Acknowledged by andrew bogott: rebuild in progress
Additional Info:
CHECK_NRPE: Error - Could not connect to 10.64.20.74: Connection reset by peer
Notification Type: ACKNOWLEDGEMENT
Service: dhclient process
Host: cloudvirt1032
Address: 10.64.20.74
State: CRITICAL
Date/Time: Mon Aug 10 14:43:27 UTC 2020
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_dhclient
Acknowledged by andrew bogott: rebuild in progress
Additional Info:
CHECK_NRPE: Error - Could not connect to 10.64.20.74: Connection reset by peer
Notification Type: ACKNOWLEDGEMENT
Service: configured eth
Host: cloudvirt1032
Address: 10.64.20.74
State: CRITICAL
Date/Time: Mon Aug 10 14:43:27 UTC 2020
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_eth
Acknowledged by andrew bogott: rebuild in progress
Additional Info:
CHECK_NRPE: Error - Could not connect to 10.64.20.74: Connection reset by peer
Notification Type: ACKNOWLEDGEMENT
Service: Long running screen/tmux
Host: cloudvirt1032
Address: 10.64.20.74
State: CRITICAL
Date/Time: Mon Aug 10 14:43:27 UTC 2020
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/Long_running_screens
Acknowledged by andrew bogott: rebuild in progress
Additional Info:
connect to address 10.64.20.74 port 5666: Connection refused
Notification Type: ACKNOWLEDGEMENT
Service: IPMI Sensor Status
Host: cloudvirt1032
Address: 10.64.20.74
State: CRITICAL
Date/Time: Mon Aug 10 14:43:26 UTC 2020
Notes URLs: https://wikitech.wikimedia.org/wiki/Dc-operations/Hardware_Troubleshooting_…
Acknowledged by andrew bogott: rebuild in progress
Additional Info:
connect to address 10.64.20.74 port 5666: Connection refused
Notification Type: ACKNOWLEDGEMENT
Service: Check systemd state
Host: cloudvirt1032
Address: 10.64.20.74
State: CRITICAL
Date/Time: Mon Aug 10 14:43:26 UTC 2020
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by andrew bogott: rebuild in progress
Additional Info:
CHECK_NRPE: Error - Could not connect to 10.64.20.74: Connection reset by peer
Notification Type: PROBLEM
Service: configured eth
Host: cloudvirt1032
Address: 10.64.20.74
State: CRITICAL
Date/Time: Mon Aug 10 14:41:50 UTC 2020
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_eth
Acknowledged by :
Additional Info:
CHECK_NRPE: Error - Could not connect to 10.64.20.74: Connection reset by peer