[Labs-l] Partial outage in progress -- update
Andrew Bogott
abogott at wikimedia.org
Tue Feb 17 20:01:22 UTC 2015
On 2/17/15 10:49 AM, Andrew Bogott wrote:
> No data was lost. I'm currently migrating VMs from virt1005 onto a
> new, more trustworthy server. All affected instances will start back
> up one by one over the next hour or so.
There turn out to be some MASSIVE instances on that box, so the copy is
taking longer than I expected. Still chugging along though.
-A
>
> -Andrew
>
>
> On 2/17/15 9:31 AM, Andrew Bogott wrote:
>> One of the labs virtualization hosts, virt1005, is suffering a disk
>> failure. I'm restarting right now -- that may allow us to gradually
>> recover. If we're less lucky, then rebuilding from the outage will
>> be a prolonged process.
>>
>> A list of affected instances can be found here:
>>
>> https://phabricator.wikimedia.org/P305
>>
>> Note that this box was hosting the Tools web proxy, so the web
>> interface for most tools is currently down. That should be easy to
>> rebuild if necessary, and will be a high priority.
>>
>> Updates as events warrant!
>>
>> -Andrew
>
More information about the Labs-l
mailing list