Greetings XML Dump users and contributors!
This is your automatic monthly Dumps FAQ update email. This update
contains figures for the 20230501 full revision history content run.
We are currently dumping 971 projects in total.
---------------------
Stats for brwiki on date 20230501
Total size of page content dump files for articles, current content only:
262,313,241
Total size of page content dump files for all pages, current content only:
286,771,185
Total size of page content dump files for all pages, all revisions:
10,115,030,157
---------------------
Stats for enwiki on date 20230501
Total size of page content dump files for articles, current content only:
94,233,153,457
Total size of page content dump files for all pages, current content only:
194,978,260,880
Total size of page content dump files for all pages, all revisions:
26,666,705,143,224
---------------------
Sincerely,
Your friendly Wikimedia Dump Info Collector
Hi, I'm starting a project that will involve repeated processing of HTML
wikipedia articles.
Using the enterprise dumps seems like it would be much simpler than
converting the XML dumps, but I don't know what the "experimental"
status really means.
I see in the original announcement post from a year and a half ago that
there is a warning about bugs and downtime, but the meta wiki page and
dumps site don't have any more information.
Is there less of a commitment to keep posting the enterprise dumps
compared to the database XML dumps?
Thanks,
Evan