We had an spike on EL yesterday nite that was caught by our alarms. The spike can be seen here:
As far as I can see it caused no service issues so there is no need to notify users. I am cc-ing the Media team on this e-mail cause from a brief inspection of events it looks like they started to log at a higher rate on the 25th.
---------- Forwarded message ----------
From:
<icinga@neon.wikimedia.org>
Date: Wed, Jun 25, 2014 at 4:46 AM
Subject: ** PROBLEM alert - tungsten/Throughput of event logging events is CRITICAL **
To:
nuria@wikimedia.org❤❤❤❤❤ Icinga ❤❤❤❤❤
Notification Type: PROBLEM
Service: Throughput of event logging events
Host: tungsten
Address: 10.64.0.18
State: CRITICAL
Date/Time: Wed Jun 25 02:46:22 UTC 2014
Additional Info:
CRITICAL: 7.14% of data exceeded the critical threshold [500.0]
Love, Icinga