Hi,
That is excellent.
However, it does not solve the longstanding problem of having current pages
dumps and all-history dumps in the same queue. The current pages dump for a
small project that takes a few minutes is thus queued behind history dumps
for large projects that take weeks.
It is essential that the history dumps be in a separate queue, or that
threads are reserved for smaller projects.
best,
Robert
FYI: for anyone interested (although I suspect anyone on the en.wikt already
knows this): there are daily XML dumps for the en.wikt available at
http://devtionary.info/w/dump/xmlu/ ... these are done by incremental
revisions to the previous dump (i.e. not by magic ;-)
On Mon, Oct 6, 2008 at 7:39 PM, Brion Vibber <brion(a)wikimedia.org> wrote:
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Mathias Schindler wrote:
On Mon, Sep 22, 2008 at 6:59 PM, Brion Vibber
<brion(a)wikimedia.org>
wrote:
We ended up with an incompatible disk array for
the new dumps server;
replacement delivery ETA is September 29.
Thanks for the info. In the meantime, would it be possible just to
produce pages-articles.xml.bz2 files without the history part, saving
enough disk space for this task to be run?
Well, there's no *meantime* left -- I'll just start them all today.
- -- brion
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.8 (Darwin)
Comment: Using GnuPG with Mozilla -
http://enigmail.mozdev.org
iEYEARECAAYFAkjqP0YACgkQwRnhpk1wk44rEwCfRk1A4bMZBeHxozrzfdjJRIXI
hZoAnjz9cf2+oSbJZ+f2HWcuSEKZxzIz
=yFzn
-----END PGP SIGNATURE-----
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l