Greetings XML Dump users and contributors!
This is your automatic monthly Dumps FAQ update email. This update
contains figures for the 20181001 full revision history content run.
We are currently dumping 912 projects in total.
---------------------
Stats for lnwiktionary on date 20181001
Total size of page content dump files for articles, current content only:
1174386
Total size of page content dump files for all pages, current content only:
1783775
Total size of page content dump files for all pages, all revisions:
21197429
---------------------
Stats for enwiki on date 20181001
Total size of page content dump files for articles, current content only:
69100524487
Total size of page content dump files for all pages, current content only:
154486924229
Total size of page content dump files for all pages, all revisions:
18083706962235
---------------------
Sincerely,
Your friendly Wikimedia Dump Info Collector
Hi,
The dump for enwiki seems to be stuck at the "First-pass for page XML data
dumps" step:
2018-10-20 10:49:29 in-progress First-pass for page XML data dumps
2018-10-22 09:32:02: enwiki (ID 298305) 416 pages (0.8|30.1/sec all|curr),
36000 revs (70.5|72.4/sec all|curr), ETA 2019-03-13 10:20:01 [max 865183738]
Other dumps, such as frwiki, also seem to be blocked; maybe they are
waiting for enwiki?
Nico
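For anyone wanting to sanity-check the ETA in a progress line like the one
quoted above, here is a minimal Python sketch. It rests on assumptions that
the message does not confirm: that the first rate in each "all|curr" pair is
the overall average, and that "max" is the upper bound on revisions to
process. The regular expression and the function name estimate_eta are made
up for illustration and are not part of the dump scripts.

```python
import re
from datetime import datetime, timedelta

# The progress line from the message, joined onto a single line.
PROGRESS = ("2018-10-22 09:32:02: enwiki (ID 298305) 416 pages "
            "(0.8|30.1/sec all|curr), 36000 revs (70.5|72.4/sec all|curr), "
            "ETA 2019-03-13 10:20:01 [max 865183738]")

# Hypothetical pattern, written to match the line above only.
PATTERN = re.compile(
    r"^(?P<now>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}): (?P<wiki>\w+) \(ID \d+\) "
    r"(?P<pages>\d+) pages \([\d.]+\|[\d.]+/sec all\|curr\), "
    r"(?P<revs>\d+) revs \((?P<rev_rate>[\d.]+)\|[\d.]+/sec all\|curr\), "
    r"ETA (?P<eta>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) \[max (?P<max>\d+)\]$"
)

FMT = "%Y-%m-%d %H:%M:%S"


def estimate_eta(line):
    """Return (extrapolated ETA, reported ETA) for one progress line.

    Assumption: remaining work is (max - revs) revisions, processed at the
    overall revisions-per-second rate.
    """
    m = PATTERN.match(line)
    if m is None:
        raise ValueError("unrecognized progress line")
    now = datetime.strptime(m["now"], FMT)
    remaining = int(m["max"]) - int(m["revs"])
    seconds_left = remaining / float(m["rev_rate"])
    return now + timedelta(seconds=seconds_left), datetime.strptime(m["eta"], FMT)


estimated, reported = estimate_eta(PROGRESS)
print("extrapolated:", estimated.strftime(FMT))
print("reported:    ", reported.strftime(FMT))
```

Under these assumptions the extrapolated time lands within a few minutes of
the reported ETA, which suggests the job was crawling at about 70 revisions
per second rather than being fully stopped.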
If you are a user of the adds-changes (so-called "incremental") dumps, read
on.
All dumps use database servers in our eqiad data center. For the past
month, the wiki projects have used primary database masters out of our
codfw data center; on one of these days, a number of revisions did not
replicate properly to eqiad for about 50 minutes. This was not discovered
until we switched back to using the database servers in eqiad as primary
masters.
Replication was broken on September 13th. Because dumps use eqiad database
servers, the adds-changes dumps published for that date are also missing
those revisions.
We suggest that if you are using adds-changes dumps for a local mirror, you
import the next regular run of xml/sql dumps, starting on Oct 20th (for
current revisions only) or Nov 1st (for full history), which should contain
the missing revisions.
For more information on this incident, you may follow:
https://phabricator.wikimedia.org/T206743
Ariel
Hello dumps users!
You may have noticed that a number of wikis have had dump failures on the
Flow dumps step. The cause is known (a cleanup of MediaWiki core that
didn't carry over to the extension) and these jobs should be fixed up today
or tomorrow.
Ariel
The failure was a side effect of a configuration change that will,
ironically enough, make it easier to test the 'other' dumps, including
eventually these ones, in mediawiki-vagrant; see
https://phabricator.wikimedia.org/T201478 for more information about that.
They should run tomorrow and contain the content for the missing run as
well.
Apologies for the inconvenience.
Ariel
Greetings XML Dump users and contributors!
This is your automatic monthly Dumps FAQ update email. This update
contains figures for the 20180901 full revision history content run.
We are currently dumping 912 projects in total.
---------------------
Stats for pamwiki on date 20180901
Total size of page content dump files for articles, current content only:
41777521
Total size of page content dump files for all pages, current content only:
44732421
Total size of page content dump files for all pages, all revisions:
1314983462
---------------------
Stats for enwiki on date 20180901
Total size of page content dump files for articles, current content only:
68715508465
Total size of page content dump files for all pages, current content only:
153659978710
Total size of page content dump files for all pages, all revisions:
17959415517241
---------------------
Sincerely,
Your friendly Wikimedia Dump Info Collector
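The enwiki figures in this 20180901 update can be compared with those in the
20181001 update to see how much content was added in a month. The short
Python sketch below does that arithmetic. The numbers are copied from the two
updates; the dictionary layout and the function name growth are illustrative
only, and the unit of the figures is assumed to be bytes, which the updates
do not state.

```python
# enwiki totals copied from the 20180901 and 20181001 update emails.
# Assumption: the figures are sizes in bytes.
STATS = {
    "20180901": {
        "articles, current content only": 68715508465,
        "all pages, current content only": 153659978710,
        "all pages, all revisions": 17959415517241,
    },
    "20181001": {
        "articles, current content only": 69100524487,
        "all pages, current content only": 154486924229,
        "all pages, all revisions": 18083706962235,
    },
}


def growth(old, new):
    """Map each label to (absolute increase, percentage increase)."""
    return {
        label: (new[label] - old[label],
                100.0 * (new[label] - old[label]) / old[label])
        for label in old
    }


for label, (delta, pct) in growth(STATS["20180901"], STATS["20181001"]).items():
    print(f"{label}: +{delta} ({pct:.2f}%)")
```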