Hello,
There was an outage yesterday that took out the whole SCB cluster [1]. Due to the number of services hosted there, this was a user-facing event that lasted approximately 20 minutes. Further notes and thoughts are welcome!
Cheers, Marko
[1] https://wikitech.wikimedia.org/wiki/Incident_documentation/20160608-SCB