Greetings XML Dump users and contributors!
This is your automatic monthly Dumps FAQ update email. This update
contains figures for the 20190701 full revision history content run.
We are currently dumping 918 projects in total.
---------------------
Stats for hywwiki on date 20190701
Total size of page content dump files for articles, current content only:
65820501
Total size of page content dump files for all pages, current content only:
67052501
Total size of page content dump files for all pages, all revisions:
1245901540
---------------------
Stats for enwiki on date 20190701
Total size of page content dump files for articles, current content only:
72868882777
Total size of page content dump files for all pages, current content only:
162570052534
Total size of page content dump files for all pages, all revisions:
19266708182163
---------------------
Sincerely,
Your friendly Wikimedia Dump Info Collector
The production of revision history content files takes between 2.5 and 3
days for each of them; these are the longest to run of the wikis not yet
parallelized.
I plan to switch them over for the August 1st run; please adjust your
scripts accordingly. Follow along if you are interested, at
https://phabricator.wikimedia.org/T228558
Thanks!
Hello all,
Due to technical issues, the JSON dumps for this week and last week
couldn't be properly generated.
- 20190624 because of a commit that was fixing another bug (phab:T226601
<https://phabricator.wikimedia.org/T226601>)
- 20190701 due to an issue with qualifier hashes (phab:T227207
<https://phabricator.wikimedia.org/T227207>)
We apologize for this inconvenience. The problem is about to be solved, so
we expect the situation to be back to normal next week. However, we will
not generate the previous failed dumps.
Thanks for your understanding,
--
Léa Lacroix
Project Manager Community Communication for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für
Körperschaften I Berlin, Steuernummer 27/029/42207.
*(sorry for cross-posting)*
Hello all,
Starting on July 15th, the name of the Wikidata RDF dumps will be changed
to *remove the "beta"*. For example, the file that would have been named
wikidata-20190717-all-BETA.ttl.bz2 with the former name format will be
named wikidata-20190717-all.ttl.bz2.
This will only impact the *new generated dumps*, not the previous ones.
If you have questions or issues, feel free to ask on the Phabricator ticket
<https://phabricator.wikimedia.org/T226153>.
Cheers,
--
Léa Lacroix
Project Manager Community Communication for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für
Körperschaften I Berlin, Steuernummer 27/029/42207.
Good afternoon,
I have a question regarding the Wikidata Entities data dump and I was not able to find a suitable place where I could ask it.
We have been using the Wikidata Entities data dump for quite a while, but the last two weeks we have been having an issue where the data dump archive has disappeared from the website, or it has not been there at all.
I mean here: https://dumps.wikimedia.org/other/wikidata/ <https://dumps.wikimedia.org/other/wikidata/>
20190624.json.gz returns a File Not Found.
Could you please tell me where I could find this file or redirect me to someone who could give me more information?
Thank you very much for your great work.
Kind regards,
Petra K.
Greetings XML Dump users and contributors!
This is your automatic monthly Dumps FAQ update email. This update
contains figures for the 20190601 full revision history content run.
We are currently dumping 918 projects in total.
---------------------
Stats for cswikisource on date 20190601
Total size of page content dump files for articles, current content only:
274550327
Total size of page content dump files for all pages, current content only:
283612553
Total size of page content dump files for all pages, all revisions:
2123732295
---------------------
Stats for enwiki on date 20190601
Total size of page content dump files for articles, current content only:
72489507264
Total size of page content dump files for all pages, current content only:
161816742721
Total size of page content dump files for all pages, all revisions:
19134735092919
---------------------
Sincerely,
Your friendly Wikimedia Dump Info Collector