Team,
We had a spike on EL last night that was caught by our alarms. The spike can be seen here:
http://graphite.wikimedia.org/render/?width=588&height=311&_salt=140...
As far as I can see, it caused no service issues, so there is no need to notify users. I am cc-ing the Media team on this e-mail because, from a brief inspection of events, it looks like they started logging at a higher rate on the 25th.
Thanks,
Nuria
---------- Forwarded message ----------
From: icinga@neon.wikimedia.org
Date: Wed, Jun 25, 2014 at 4:46 AM
Subject: ** PROBLEM alert - tungsten/Throughput of event logging events is CRITICAL **
To: nuria@wikimedia.org
❤❤❤❤❤ Icinga ❤❤❤❤❤
Notification Type: PROBLEM
Service: Throughput of event logging events
Host: tungsten
Address: 10.64.0.18
State: CRITICAL
Date/Time: Wed Jun 25 02:46:22 UTC 2014
Additional Info:
CRITICAL: 7.14% of data exceeded the critical threshold [500.0]
Love, Icinga