[Labs-l] Labs datacenter migration
Andrew Bogott
abogott at wikimedia.org
Fri Nov 15 19:29:13 UTC 2013
Almost a year ago, the Wikimedia Foundation migrated most of our
services from our old data center in Tampa to the new one in Ashburn
[1]. In the next couple of months Labs and Tool Labs will be following
suit -- we expect to have everything moved to Ashburn by mid-January at
the latest.
This move will provide some immediate benefits (lower latency with
production, quicker database replication) and many long-term benefits
(better stability, happier Operations staff). We don't yet have a
specific timeline for stages of the migration, but there are a few
things you can do now to help us prepare for the change and to bolster
your projects against possible disruption.
1) Subscribe to Labs-l, and read it. [2] Labs-l is low-volume, and
future migration announcements may not be sent to other lists.
2) Tool Labs users: As long as your tools are properly managed by the
grid engine and can survive stops and restarts, the migration will be
quite painless. If your tools aren't, or can't... fix them :)
3) Labs project admins: Clean up old projects and instances. If you
have instances that are no longer of interest, delete them. If you know
of entire projects that are no longer in use, please contact me directly
and I'll mop up.
4) Labs instance owners: Make sure that puppet is running properly on
your instances. If '$sudo puppetd -tv' produces any red lines, then fix
them or contact me for help with fixing. When instances move to the new
data center we'll be relying on puppet to update location-specific
settings, so instances without puppet may not survive the move. If your
instance uses self-hosting puppet (via puppetmaster::self or
role::puppet::self) then you will also need to update your local puppet
repo. [3]
5) All Labs users: if you have valuable data residing on local instance
storage, start backing it up to shared storage in /data/project. You
should be doing this anyway -- no instance is safe from catastrophe, ever.
6) If your project or tool generates log files, have a look at purging
old log data. The last time we did a data migration there was at least
one terabyte-sized logfile that really gummed up the works.
Updates about this change will be posted to this list as soon as we
know about them. Any potential downtime will be announced well in
advance. In the meantime, don't hesitate to ask questions about the
above steps on IRC or the mailing list.
-Andrew
[1]
https://blog.wikimedia.org/2013/01/19/wikimedia-sites-move-to-primary-data-center-in-ashburn-virginia/
[2] https://lists.wikimedia.org/mailman/listinfo/labs-l
[3] https://wikitech.wikimedia.org/wiki/Help:Self-hosted_puppetmaster#FAQ
More information about the Labs-l
mailing list