lists.wikimedia.org
Sign In
Sign Up
Sign In
Sign Up
Manage this list
×
Keyboard Shortcuts
Thread View
j
: Next unread message
k
: Previous unread message
j a
: Jump to all threads
j l
: Jump to MailingList overview
2024
December
November
October
September
August
July
June
May
April
March
February
January
2023
December
November
October
September
August
July
June
May
April
March
February
January
2022
December
November
October
September
August
July
June
May
April
March
February
January
2021
December
November
October
September
August
July
June
May
April
March
February
January
2020
December
November
October
September
August
July
June
May
April
March
February
January
2019
December
November
October
September
August
July
June
May
April
March
February
January
2018
December
November
October
September
August
July
June
May
April
March
February
List overview
Download
Cloud-admin-feed
January 2020
----- 2024 -----
December 2024
November 2024
October 2024
September 2024
August 2024
July 2024
June 2024
May 2024
April 2024
March 2024
February 2024
January 2024
----- 2023 -----
December 2023
November 2023
October 2023
September 2023
August 2023
July 2023
June 2023
May 2023
April 2023
March 2023
February 2023
January 2023
----- 2022 -----
December 2022
November 2022
October 2022
September 2022
August 2022
July 2022
June 2022
May 2022
April 2022
March 2022
February 2022
January 2022
----- 2021 -----
December 2021
November 2021
October 2021
September 2021
August 2021
July 2021
June 2021
May 2021
April 2021
March 2021
February 2021
January 2021
----- 2020 -----
December 2020
November 2020
October 2020
September 2020
August 2020
July 2020
June 2020
May 2020
April 2020
March 2020
February 2020
January 2020
----- 2019 -----
December 2019
November 2019
October 2019
September 2019
August 2019
July 2019
June 2019
May 2019
April 2019
March 2019
February 2019
January 2019
----- 2018 -----
December 2018
November 2018
October 2018
September 2018
August 2018
July 2018
June 2018
May 2018
April 2018
March 2018
February 2018
cloud-admin-feed@lists.wikimedia.org
1 participants
106 discussions
Start a n
N
ew thread
** RECOVERY alert - labstore1007/High 1m load average is OK **
by nagios@icinga1001.wikimedia.org
26 Jan '20
26 Jan '20
Notification Type: RECOVERY Service: High 1m load average Host: labstore1007 Address: 208.80.155.106 State: OK Date/Time: Sun Jan 26 15:01:07 UTC 2020 Notes URLs:
https://wikitech.wikimedia.org/wiki/Portal:Data_Services/Admin/Labstore
https://grafana.wikimedia.org/dashboard/db/labs-monitoring
Acknowledged by : Additional Info: All metrics within thresholds.
1
0
0
0
** PROBLEM alert - labstore1007/High 1m load average is CRITICAL **
by nagios@icinga1001.wikimedia.org
26 Jan '20
26 Jan '20
Notification Type: PROBLEM Service: High 1m load average Host: labstore1007 Address: 208.80.155.106 State: CRITICAL Date/Time: Sun Jan 26 14:38:43 UTC 2020 Notes URLs:
https://wikitech.wikimedia.org/wiki/Portal:Data_Services/Admin/Labstore
https://grafana.wikimedia.org/dashboard/db/labs-monitoring
Acknowledged by : Additional Info: cluster=wmcs instance=labstore1007:9100 job=node site=eqiad
1
0
0
0
** PROBLEM alert - labstore1007/High 1m load average is CRITICAL **
by nagios@icinga1001.wikimedia.org
26 Jan '20
26 Jan '20
Notification Type: PROBLEM Service: High 1m load average Host: labstore1007 Address: 208.80.155.106 State: CRITICAL Date/Time: Sun Jan 26 13:38:51 UTC 2020 Notes URLs:
https://wikitech.wikimedia.org/wiki/Portal:Data_Services/Admin/Labstore
https://grafana.wikimedia.org/dashboard/db/labs-monitoring
Acknowledged by : Additional Info: cluster=wmcs instance=labstore1007:9100 job=node site=eqiad
1
0
0
0
** ACKNOWLEDGEMENT alert - cloudvirt1024/MegaRAID is CRITICAL **
by nagios@icinga1001.wikimedia.org
24 Jan '20
24 Jan '20
Notification Type: ACKNOWLEDGEMENT Service: MegaRAID Host: cloudvirt1024 Address: 10.64.20.43 State: CRITICAL Date/Time: Fri Jan 24 15:31:07 UTC 2020 Notes URLs:
https://wikitech.wikimedia.org/wiki/MegaCli%23Monitoring
Acknowledged by nagiosadmin: RAID handler auto-ack:
https://phabricator.wikimedia.org/T243605
Additional Info: CRITICAL: 1 failed LD(s) (Degraded)
1
0
0
0
** ACKNOWLEDGEMENT alert - cloudvirt1024/MegaRAID is CRITICAL **
by nagios@icinga1001.wikimedia.org
23 Jan '20
23 Jan '20
Notification Type: ACKNOWLEDGEMENT Service: MegaRAID Host: cloudvirt1024 Address: 10.64.20.43 State: CRITICAL Date/Time: Thu Jan 23 21:53:56 UTC 2020 Notes URLs:
https://wikitech.wikimedia.org/wiki/MegaCli%23Monitoring
Acknowledged by nagiosadmin: RAID handler auto-ack:
https://phabricator.wikimedia.org/T243555
Additional Info: CRITICAL: 1 failed LD(s) (Degraded)
1
0
0
0
** RECOVERY alert - labstore1004/High 1m load average is OK **
by nagios@icinga1001.wikimedia.org
23 Jan '20
23 Jan '20
Notification Type: RECOVERY Service: High 1m load average Host: labstore1004 Address: 10.64.37.19 State: OK Date/Time: Thu Jan 23 19:49:42 UTC 2020 Notes URLs:
https://wikitech.wikimedia.org/wiki/Portal:Data_Services/Admin/Labstore
https://grafana.wikimedia.org/dashboard/db/labs-monitoring
Acknowledged by : Additional Info: All metrics within thresholds.
1
0
0
0
** PROBLEM alert - labstore1004/High 1m load average is CRITICAL **
by nagios@icinga1001.wikimedia.org
23 Jan '20
23 Jan '20
Notification Type: PROBLEM Service: High 1m load average Host: labstore1004 Address: 10.64.37.19 State: CRITICAL Date/Time: Thu Jan 23 19:33:22 UTC 2020 Notes URLs:
https://wikitech.wikimedia.org/wiki/Portal:Data_Services/Admin/Labstore
https://grafana.wikimedia.org/dashboard/db/labs-monitoring
Acknowledged by : Additional Info: cluster=labsnfs instance=labstore1004:9100 job=node site=eqiad
1
0
0
0
** RECOVERY alert - checker.tools.wmflabs.org/toolschecker: check mtime mod from tools cron job is OK **
by nagios@icinga1001.wikimedia.org
23 Jan '20
23 Jan '20
Notification Type: RECOVERY Service: toolschecker: check mtime mod from tools cron job Host:
checker.tools.wmflabs.org
Address:
checker.tools.wmflabs.org
State: OK Date/Time: Thu Jan 23 18:07:41 UTC 2020 Notes URLs:
https://wikitech.wikimedia.org/wiki/Portal:Toolforge/Admin/Toolschecker
Acknowledged by : Additional Info: HTTP OK: HTTP/1.1 200 OK - 158 bytes in 0.015 second response time
1
0
0
0
** PROBLEM alert - checker.tools.wmflabs.org/toolschecker: check mtime mod from tools cron job is CRITICAL **
by nagios@icinga1001.wikimedia.org
23 Jan '20
23 Jan '20
Notification Type: PROBLEM Service: toolschecker: check mtime mod from tools cron job Host:
checker.tools.wmflabs.org
Address:
checker.tools.wmflabs.org
State: CRITICAL Date/Time: Thu Jan 23 18:00:21 UTC 2020 Notes URLs:
https://wikitech.wikimedia.org/wiki/Portal:Toolforge/Admin/Toolschecker
Acknowledged by : Additional Info: HTTP CRITICAL: HTTP/1.1 503 SERVICE UNAVAILABLE - string OK not found on
http://checker.tools.wmflabs.org:80/cron
- 177 bytes in 0.010 second response time
1
0
0
0
** RECOVERY alert - checker.tools.wmflabs.org/toolschecker: All k8s worker nodes are healthy is OK **
by nagios@icinga1001.wikimedia.org
23 Jan '20
23 Jan '20
Notification Type: RECOVERY Service: toolschecker: All k8s worker nodes are healthy Host:
checker.tools.wmflabs.org
Address:
checker.tools.wmflabs.org
State: OK Date/Time: Thu Jan 23 17:06:30 UTC 2020 Notes URLs:
https://wikitech.wikimedia.org/wiki/Portal:Toolforge/Admin/Toolschecker
Acknowledged by : Additional Info: HTTP OK: HTTP/1.1 200 OK - 158 bytes in 0.195 second response time
1
0
0
0
← Newer
1
2
3
4
...
11
Older →
Jump to page:
1
2
3
4
5
6
7
8
9
10
11
Results per page:
10
25
50
100
200