Hi,
That is excellent.
However, it does not solve the long-standing problem of current-pages dumps and full-history dumps sharing the same queue. The current-pages dump for a small project, which takes a few minutes, is thus queued behind history dumps for large projects, which take weeks.
It is essential that the history dumps be put in a separate queue, or that threads be reserved for smaller projects.
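To illustrate the difference, here is a rough sketch in Python. It is not the actual dump scheduler: the job names, the durations (in minutes), the two-worker setup and the simulate() helper are all made up for the example. It compares a shared queue with one where a worker is reserved for current-pages dumps.

```python
import heapq

def simulate(jobs, workers):
    """Toy scheduler, not the real dump system.

    jobs: list of (name, kind, minutes) in queue order, all queued at time 0.
    workers: list of sets; each set names the job kinds that worker accepts.
    Returns {job name: finish time in minutes}.
    """
    pending = list(jobs)
    # Min-heap of (time the worker becomes free, worker index).
    free_at = [(0, i) for i in range(len(workers))]
    heapq.heapify(free_at)
    finished = {}
    while pending and free_at:
        now, w = heapq.heappop(free_at)
        # Take the first queued job this worker is allowed to run.
        for idx, (name, kind, minutes) in enumerate(pending):
            if kind in workers[w]:
                del pending[idx]
                finished[name] = now + minutes
                heapq.heappush(free_at, (now + minutes, w))
                break
        # If nothing is eligible, the worker simply stays idle (no new jobs arrive).
    return finished

# Hypothetical workload: two history dumps ahead of one small current-pages dump.
jobs = [
    ("bigwiki-history", "history", 20160),   # assumed: two weeks
    ("midwiki-history", "history", 10080),   # assumed: one week
    ("smallwiki-current", "current", 5),     # assumed: five minutes
]

# Case 1: one shared queue, both workers take anything.
shared = simulate(jobs, [{"history", "current"}, {"history", "current"}])

# Case 2: second worker reserved for current-pages dumps.
reserved = simulate(jobs, [{"history", "current"}, {"current"}])

print("shared queue:   ", shared["smallwiki-current"], "minutes")
print("reserved worker:", reserved["smallwiki-current"], "minutes")
```

With these made-up numbers the small dump finishes after about a week in the shared queue and after five minutes with a reserved worker, at the cost of the second history dump finishing later.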
best, Robert
FYI, for anyone interested (although I suspect anyone on en.wikt already knows this): daily XML dumps for en.wikt are available at http://devtionary.info/w/dump/xmlu/ ... They are produced by applying incremental revisions to the previous dump (i.e. not by magic ;-)
On Mon, Oct 6, 2008 at 7:39 PM, Brion Vibber brion@wikimedia.org wrote:
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
Mathias Schindler wrote:
On Mon, Sep 22, 2008 at 6:59 PM, Brion Vibber brion@wikimedia.org wrote:
We ended up with an incompatible disk array for the new dumps server; replacement delivery ETA is September 29.
Thanks for the info. In the meantime, would it be possible just to produce pages-articles.xml.bz2 files without the history part, saving enough disk space for this task to be run?
Well, there's no *meantime* left -- I'll just start them all today.
- -- brion
-----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.8 (Darwin) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
iEYEARECAAYFAkjqP0YACgkQwRnhpk1wk44rEwCfRk1A4bMZBeHxozrzfdjJRIXI hZoAnjz9cf2+oSbJZ+f2HWcuSEKZxzIz =yFzn -----END PGP SIGNATURE-----