[Labs-l] Reboot post-mortem

Andrew Bogott abogott at wikimedia.org
Thu Mar 24 16:37:04 UTC 2016


On 3/24/16 10:06 AM, Stephen E Slevinski Jr wrote:
> Yesterday, during the reboot, one of my instances died.  It reported 
> it was active, but the web-proxy failed with a 502 bad gateway and ssh 
> failed with "channel 0: open failed: connect failed: No route to 
> host".  Manual reboots appeared successful, but the problems 
> continued.  On IRC, Yuvipanda was unable to debug the problem, so I 
> created a new instance this morning.
The kernel updates applied yesterday required us to mess with grub -- 
it's possible that this instance suffered from a series of unlucky 
coincidences and lost its bootloader entirely.

>
> I no longer need instance "signwriting-icon-server-2".  Should I 
> delete the instance or does someone want to do an autopsy?
I don't have time right this minute but I'd like to investigate it; I'll 
delete it when I'm done.  Thanks!

-A


>
> For project SignWriting, I was going to update the documentation to 
> reflect the new instance name, but I'm having more problems. The "Edit 
> documentation" link opens a blank page, rather than the project 
> documentation form.
> https://wikitech.wikimedia.org/wiki/Nova_Resource:Signwriting
>
> Additionally, the "Instances for this project" section does not report 
> the new instance yet, but this might be a caching issue that will 
> resolve on it's own eventually.
>
> Regards,
> ∼Steve
>
> _______________________________________________
> Labs-l mailing list
> Labs-l at lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/labs-l




More information about the Labs-l mailing list