> Why not provide a fourth host and have those three servers act as
> slaves of it?
While that is a reasonable approach (it is exactly how we do it in
production), it is almost impossible to do for all users.
First, a blind read-write split doesn't work; it needs to be curated for
each application, and we do not have control over when reads and writes
happen, so if there is lag, most applications will break. I will now
explain why there would always be lag.
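To illustrate what "curated" means here: a minimal sketch (not WMF code; all names are hypothetical) of the routing decision each application would have to make itself, because a blind split cannot know how much staleness a given read can tolerate:

```python
# Hypothetical read/write router: writes always go to the master, and
# reads fall back to the master when replica lag exceeds what this
# particular application can tolerate. A "blind" split skips the lag
# check, which is exactly why it breaks under heavy writes.
def route(is_write: bool, replica_lag_s: float, max_lag_s: float = 5.0) -> str:
    """Return which server should handle the query."""
    if is_write:
        return "master"   # writes can only go to the master
    if replica_lag_s > max_lag_s:
        return "master"   # replica too stale: read-your-writes would break
    return "replica"

print(route(False, 0.2))    # replica
print(route(False, 120.0))  # master
print(route(True, 0.0))     # master
```

The `max_lag_s` threshold is application-specific, which is the point: no single global value works for every tool.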
Second, it creates a SPOF on replication for each individual application.
While 90% of data services users run reasonable queries and use sane data
structures, a minority do extremely fast imports, create tables in the
MyISAM format (which corrupts on every crash), or do other things that
make replication lag or break. **That means that because a single user
does heavy writes, all users, including those doing simple production
replica reads, would be affected**. Also, being able to create tables and
perform writes on the wikireplicas caused lots of complaints from other
users, as those writes often locked the wiki data, causing lag. As Bryan
says, we are open to other methods (e.g. we could pregenerate/import
summary tables from production), but those should be puppetized,
automated, and checked for sanity (InnoDB only, no blocking or breakage
of replication, etc.). Creating summary tables through replication means
people can share that data instead of it being replicated multiple
times. These are not theoretical concerns; we had to blacklist some
databases from replication on toolsdb because they were causing similar
replication issues:
https://phabricator.wikimedia.org/T127164
Another thing we could do is experiment with FEDERATEDX/CONNECT tables
(making virtual tables from the wikireplicas available from toolsdb),
something that was done in the past, but it has the same problem as
userspace joins: its performance is not great.
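For context, the standard MariaDB FEDERATED-style DDL looks roughly like this (server, credentials, and table names are purely illustrative, and this fragment needs a live remote server to work):

```sql
-- Hypothetical example: expose a wikireplica table inside toolsdb
-- as a virtual table. Each query against enwiki_page is forwarded
-- over the network to the remote server, which is where the
-- performance cost comes from.
CREATE TABLE enwiki_page (
  page_id INT UNSIGNED NOT NULL,
  page_title VARBINARY(255) NOT NULL
) ENGINE=FEDERATED
  CONNECTION='mysql://user:pass@wikireplica-host:3306/enwiki/page';
```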
It is important to note that before taking this decision, we checked
existing users of user tables on the wikireplicas and saw that a) the
number of users of local databases was much lower than the total number
of users; b) in most cases, the tables were just used for summary tables
filled with INSERT...SELECT (which, by the way, was the main cause of lag
on the replica servers), and that is trivial to migrate to SELECT -> user
space -> INSERT; c) in those cases where actual JOINs are used, which we
believe to be very few, we are open to solutions like the ones mentioned
above.
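The SELECT -> user space -> INSERT migration in b) can be sketched like this, using two in-memory SQLite databases to stand in for a wikireplica and toolsdb (table and column names are made up for the example):

```python
import sqlite3

# Stand-ins: one connection for a (read-only) wikireplica, one for the
# user's toolsdb database. In reality these are two separate servers,
# which is why a single cross-server INSERT ... SELECT is not possible.
replica = sqlite3.connect(":memory:")
toolsdb = sqlite3.connect(":memory:")

replica.execute("CREATE TABLE page (page_id INTEGER, page_title TEXT)")
replica.executemany("INSERT INTO page VALUES (?, ?)",
                    [(1, "Foo"), (2, "Bar"), (3, "Baz")])

toolsdb.execute("CREATE TABLE page_summary (page_id INTEGER, page_title TEXT)")

# SELECT from the replica in bounded batches, then INSERT into toolsdb.
cur = replica.execute("SELECT page_id, page_title FROM page")
while True:
    rows = cur.fetchmany(1000)  # batch size keeps client memory bounded
    if not rows:
        break
    toolsdb.executemany("INSERT INTO page_summary VALUES (?, ?)", rows)
toolsdb.commit()

count = toolsdb.execute("SELECT COUNT(*) FROM page_summary").fetchone()[0]
print(count)  # 3
```

Because the copy runs as an ordinary client, it generates no write load on the replication stream at all.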
Moving to toolsdb will mean more available resources in most cases, and
better availability for both the replicas and the user dbs: if the
master hardware crashes, there is a passive replica to fail over to
(unlike with local tables).
On Thu, Oct 19, 2017 at 4:01 AM, Platonides <platonides(a)gmail.com> wrote:
> > It is not a happy thing for us to force anyone to change their
> > software, but as explained in the wiki page [0] we can not find a
> > reliable method to ensure that the same user created tables are
> > available on all three of the new backend servers
>
> Why not provide a fourth host and have those three servers act as
> slaves of it?
>
> Writes go to the first one, but reads and joins can go to the replicas.
>
> I'm afraid that "make the JOINs in user space" may end up on some
> cases with the app fetching all the rows of the tool table or the wiki
> replica in order to perform a JOIN that used to be straightforward.
> And the tools aren't really the place to implement all kinds of
> partial-fetch implementations and query estimates (without EXPLAIN,
> even!). Unless you know of such an efficient user space JOIN
> implementation, perhaps?
>
> Regards
>
> _______________________________________________
> Wikimedia Cloud Services mailing list
> Cloud(a)lists.wikimedia.org (formerly labs-l(a)lists.wikimedia.org)
>
> https://lists.wikimedia.org/mailman/listinfo/cloud
>
--
Jaime Crespo
<http://wikimedia.org>