[Labs-l] Reboot post-mortem
Andrew Bogott
abogott at wikimedia.org
Thu Mar 24 16:37:04 UTC 2016
On 3/24/16 10:06 AM, Stephen E Slevinski Jr wrote:
> Yesterday, during the reboot, one of my instances died. It reported
> it was active, but the web-proxy failed with a 502 bad gateway and ssh
> failed with "channel 0: open failed: connect failed: No route to
> host". Manual reboots appeared successful, but the problems
> continued. On IRC, Yuvipanda was unable to debug the problem, so I
> created a new instance this morning.
The kernel updates applied yesterday required us to mess with grub --
it's possible that this instance suffered from a series of unlucky
coincidences and lost its bootloader entirely.
>
> I no longer need instance "signwriting-icon-server-2". Should I
> delete the instance or does someone want to do an autopsy?
I don't have time right this minute but I'd like to investigate it; I'll
delete it when I'm done. Thanks!
-A
>
> For project SignWriting, I was going to update the documentation to
> reflect the new instance name, but I'm having more problems. The "Edit
> documentation" link opens a blank page, rather than the project
> documentation form.
> https://wikitech.wikimedia.org/wiki/Nova_Resource:Signwriting
>
> Additionally, the "Instances for this project" section does not report
> the new instance yet, but this might be a caching issue that will
> resolve on its own eventually.
>
> Regards,
> ~Steve
>
> _______________________________________________
> Labs-l mailing list
> Labs-l at lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/labs-l