[Labs-l] IMPORTANT: Many instances slated for reboot and downtime this weekend
Moritz Schubotz
physik at physikerwelt.de
Tue Sep 16 16:04:05 UTC 2014
Hi Andrew,
I just wanted to let you know that rebooting should not be a problem for
math: mws
Best
Moritz
On Tue, Sep 16, 2014 at 11:45 AM, Andrew Bogott <abogott at wikimedia.org> wrote:
> -- Executive Summary:
>
> Many instances will be rebooted at some point this weekend or next week.
> The total list of instances subject to reboot is here:
>
> https://wikitech.wikimedia.org/wiki/Virt1006_rebuild
>
> Tools and Beta users can ignore this email.
>
>
> -- The full story:
>
> Sorry about sending two different IMPORTANT emails this week; we generally
> try to keep labs crises to a minimum. Indeed, this email is about avoiding
> a potential crisis.
>
> The labs server known as 'virt1006' has been acting poorly lately. Several
> times in the last month we've seen instances that live on virt1006 get into
> inconsistent states during reboot... they reboot and never come back up, or
> they stay in a perpetual 'rebooting' state.
>
> So far we've been able to rescue such instances, but the misbehavior of a
> Labs server is very disconcerting. Rather than wait for a full collapse
> (and resulting sudden death of 50+ VMs) we've decided to migrate all
> instances off of virt1006 and then either rebuild the system or
> discard the hardware. Moving an instance off of a server is fairly
> painless, but it does require a few minutes of downtime and a reboot.
>
> I've spoken to a few of you directly about the reboots; the affected Tools
> and Deployment-prep instances have already been handled. There are a lot
> more to go, though. If your instance is stable and has its init scripts set
> up properly and a reboot is no big deal, then, congratulations! Otherwise,
> please take whatever steps you need to take to batten down the hatches and
> get ready for a reboot.
>
> If you need the reboot to happen at a scheduled time while you are standing
> by, that's totally fine. In that case please schedule a reboot window on
> this page:
>
> https://wikitech.wikimedia.org/wiki/Virt1006_rebuild
>
> Thanks for your cooperation.
>
> -Andrew
>
> _______________________________________________
> Labs-l mailing list
> Labs-l at lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/labs-l
--
Kind regards
Moritz Schubotz
Phone (office): +49 30 314 22784
Phone (private): +49 30 488 27330
E-Mail: schubotz at itp.physik.tu-berlin.de
Web: http://www.physikerwelt.de
Skype: Schubi87
ICQ: 200302764
Msn: Moritz at Schubotz.de