Hi,
Since the dumps without history show a tremendous increase in size, I assume something interesing happened in the recent history:
3.5G Oct 12 10:13 wikidatawiki-20141009-pages-articles.xml.bz2 3.9G Nov 9 10:11 wikidatawiki-20141106-pages-articles.xml.bz2 3.9G Dec 8 05:25 wikidatawiki-20141205-pages-articles.xml.bz2 4.6G Jan 16 23:27 wikidatawiki-20150113-pages-articles.xml.bz2
But there seems to be a problem with the latest dumps with complete history, so it is not possible to investigate this events.
The last dump that is available is: wikidatawiki-20141106-pages-meta-history.xml.bz2
Can someone have a look at the dump process and tell we when the next actual dump with history will be available?
Lukas
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
Am 06.02.2015 um 11:13 schrieb Lukas Benedix:
Hi,
Since the dumps without history show a tremendous increase in size, I assume something interesing happened in the recent history:
We changed the serialization format used in the dumps around that time. From that, I may have expected an increase or 50% in uncompressed size, but I didn't expect an increase of 20% in compressed site. So I'm not sure that's the issue.
I do see a newer full dump though:
on https://dumps.wikimedia.org/wikidatawiki/20150113/, there is wikidatawiki-20150113-pages-meta-history.xml.bz2 9.7 GB. Just the 7z version seems to be missing for some reason.
- -- Daniel Kinzler Senior Software Developer
Wikimedia Deutschland Gesellschaft zur Förderung Freien Wissens e.V.
Nope, the dump from 20150113 is not finished, it tells "in-progress" since about a month.
LB
Am Fr 06.02.2015 um 11:22 schrieb Daniel Kinzler:
Am 06.02.2015 um 11:13 schrieb Lukas Benedix:
Hi,
Since the dumps without history show a tremendous increase in size, I assume something interesing happened in the recent history:
We changed the serialization format used in the dumps around that time. From that, I may have expected an increase or 50% in uncompressed size, but I didn't expect an increase of 20% in compressed site. So I'm not sure that's the issue.
I do see a newer full dump though:
on https://dumps.wikimedia.org/wikidatawiki/20150113/, there is wikidatawiki-20150113-pages-meta-history.xml.bz2 9.7 GB. Just the 7z version seems to be missing for some reason.
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l