Happy almost March, everyone!
Kowiki dumps jobs now take long enough to run for certain steps that the
wiki has been moved to the 'big wikis' list. This means that 6 parallel
jobs will produce output for stubs and page content dumps, similarly to
frwiki, dewiki and so on. See [1] for more.
This will take effect with the next dump run, starting tomorrow.
Please adjust your scripts accordingly.
Ariel
[1] https://phabricator.wikimedia.org/T245721
Hi, Xmldatadumps team
As you know, the format of xml dumps should change after February 1, 2020:
https://lists.wikimedia.org/pipermail/xmldatadumps-l/2019-November/001508.h…
However, I cannot find any changes on the Japanese dumps such as jawiki-20200201-pages-articles.xml.bz2.
If the format change plan was postponed, could you tell me the date when this change will occur?
Cheers,
Greetings XML Dump users and contributors!
This is your automatic monthly Dumps FAQ update email. This update
contains figures for the 20200101 full revision history content run.
We are currently dumping 910 projects in total.
---------------------
Stats for mgwikibooks on date 20200101
Total size of page content dump files for articles, current content only:
329426
Total size of page content dump files for all pages, current content only:
874963
Total size of page content dump files for all pages, all revisions:
4294486
---------------------
Stats for enwiki on date 20200101
Total size of page content dump files for articles, current content only:
75163254305
Total size of page content dump files for all pages, current content only:
167460765690
Total size of page content dump files for all pages, all revisions:
20097373129222
---------------------
Sincerely,
Your friendly Wikimedia Dump Info Collector