On Thu, May 17, 2012 at 12:45 AM, John <phoenixoverride(a)gmail.com> wrote:
Simple.wikipedia is nothing like en.wikipedia I care
to dispute that
statement, All WMF wikis are setup basically the same (an odd extension here
or there is different, and different namespace names at times) but for the
purpose of recovery simplewiki_p is a very standard example. this issue isnt
just about enwiki_p but *all* wmf wikis. Doing a data recovery for enwiki vs
simplewiki is just a matter of time, for enwiki a 5 day estimate would be
fairly standard (depending on server setup) and lower times for smaller
databases. typically you can explain it in a rate of X revisions processed
per Y time unit, regardless of the project. and that rate should be similar
for everything given the same hardware setup.
Are you compressing old revisions, or not? Does the WMF database
compress old revisions, or not?
In any case, I'm sorry, a 20 gig mysql database does not scale
linearly to a 20 terabyte mysql database.