We will begin the first step of this maintenance shortly. Again, some NFS operations may
fail briefly and load may rise on clients, but the disruption should be short-lived.
Brooke Storm
Operations Engineer
Wikimedia Cloud Services
bstorm(a)wikimedia.org <mailto:bstorm@wikimedia.org>
IRC: bstorm_
On Nov 28, 2018, at 3:42 PM, Brooke Storm <bstorm(a)wikimedia.org> wrote:
On Monday, December 3rd, 2018 at 1700 UTC, we will be rebooting one of the two dumps NFS
servers (labstore1006.wikimedia.org). This should briefly raise load on clients, but the
reboot should be quick enough that failing over services is unlikely to help. We will fail
over the web service before that time and fail it back before rebooting the partner server
(labstore1007.wikimedia.org) on Friday, December 7th at 1700 UTC. This should not interrupt
service to dumps.wikimedia.org (the site hosted on these systems), since it should be
failed over to the non-rebooting partner each time.
Brooke Storm
Operations Engineer
Wikimedia Cloud Services
bstorm(a)wikimedia.org <mailto:bstorm@wikimedia.org>
IRC: bstorm_