Different keys can still be found in the actual xml dump wikidatawiki-20141009-pages-articles.xml.bz2. I'm not sure if this bug is present in the dump with history.
page_id, wd_id, keys 111, Q15, ['aliases', 'claims', 'descriptions', 'id', 'labels', 'sitelinks', 'type'] 137, Q24, ['aliases', 'claims', 'description', 'entity', 'label', 'links'] 31500, Q28119, ['aliases', 'description', 'entity', 'label', 'links'] 225144, ?, ['entity', 'redirect']
Lukas
Am Do 09.10.2014 19:32, schrieb Lydia Pintscher:
On Thu, Oct 9, 2014 at 3:19 PM, Magnus Manske magnusmanske@googlemail.com wrote:
I managed to do the task at hand by switching to JSON dumps (because that's the new, officially supported, long-term-stable Wikidata dump format, right? Right???), so no hurry there.
Maybe the XML dump process was run in the middle of the switch to the new format, or got a stale cache for some items?
It looks like the switch happened in the middle of a dump creation so this one is half old and half new format mixed. The ones after that should be all new format. And yay for switching to JSON!
Cheers Lydia