[Labs-l] Labs instance hesitations over the coming week

Andrew Bogott abogott at wikimedia.org
Thu Apr 23 21:17:36 UTC 2015


We've encountered an unexpected problem with this -- there's a kernel 
bug running on the new hardware which is causing instances to behave poorly.

So, I need to upgrade the kernel and reboot.  I'm moving all tools 
instances out of the way first so they aren't hit by the reboot; a few 
other projects (notably deployment-prep and staging) will suffer rolling 
reboots.

Fortunately this issue appeared early enough that most instances are 
still running on the old hardware, so most of you will be unaffected.

-Andrew


On 4/22/15 11:09 AM, Andrew Bogott wrote:
> Greetings!
>
> I'll be gradually moving most labs instances to new hardware over the 
> coming 7-10 days.  For virtually all instances this move will be 
> invisible to users -- the worst case scenario is that an instance will 
> freeze for a minute or two during the final post-copy sync. I've 
> already moved several projects without incident.
>
> Nevertheless, services which have extremely touchy timeouts may error 
> out or throw warnings.  For example, a few things in deployment-prep 
> sent 'service down' alerts which lasted a few seconds before being 
> resolved.  So if you see something like that, it's probably a result 
> of the move.
>
> I will be moving the Tools instances first, starting tomorrow morning 
> (approximately 14:00 UTC).  The tools migration will take around 18 
> hours.  On Friday morning I'll start a scripted move of all other 
> instances; the complete move will take around 7 days.
>
> If this process concerns you and you'd like your instances moved at a 
> pre-determined time, feel free to contact me off-list and we can make 
> an appointment for your project.
>
> -Andrew
>
>




More information about the Labs-l mailing list