Both sets of keys can still be found in the current XML dump,
wikidatawiki-20141009-pages-articles.xml.bz2.
This bug/feature is also present in the current dump with history.
page_id   wd_id    keys
111       Q15      ['aliases', 'claims', 'descriptions', 'id', 'labels',
                    'sitelinks', 'type']
137       Q24      ['aliases', 'claims', 'description', 'entity',
                    'label', 'links']
31500     Q28119   ['aliases', 'description', 'entity', 'label', 'links']
225144    ?        ['entity', 'redirect']
3916689   P6       ['aliases', 'claims', 'datatype', 'descriptions',
                    'id', 'labels', 'type']
3916937   P10      ['aliases', 'claims', 'datatype', 'description',
                    'entity', 'label']
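
For anyone who has to cope with the mixed dump, here is a minimal sketch of how the two formats might be told apart by their top-level keys. It assumes each revision text is a JSON object with keys like the ones listed above. The function names and the marker-key heuristic ('id' plus 'type' for the new format, 'entity' for the old one, 'redirect' for redirects) are my own assumptions for illustration, not part of any Wikidata tooling.

```python
import json
from collections import Counter

# Heuristic marker keys, inferred only from the key lists above (assumption).
NEW_MARKERS = {"id", "type"}


def classify_keys(keys):
    """Guess the serialization format from an entity's top-level keys."""
    keys = set(keys)
    if "redirect" in keys:
        return "redirect"
    if NEW_MARKERS <= keys:
        return "new"
    if "entity" in keys:
        return "old"
    return "unknown"


def classify_revision_text(text):
    """Parse one revision's JSON text and classify it."""
    return classify_keys(json.loads(text).keys())


def summarize(texts):
    """Count how many revision texts fall into each format."""
    return Counter(classify_revision_text(t) for t in texts)


if __name__ == "__main__":
    # Tiny hand-made samples, not real dump content.
    samples = [
        '{"id": "Q15", "type": "item", "labels": {}}',
        '{"entity": "q24", "label": {}, "links": {}}',
        '{"entity": "q1", "redirect": "q2"}',
    ]
    print(summarize(samples))
```

Feeding it the text of every revision in the dump would give a rough count of how many pages are still in the old format.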
Lukas
On Thu 09.10.2014 19:32, Lydia Pintscher wrote:
> On Thu, Oct 9, 2014 at 3:19 PM, Magnus Manske
> <magnusmanske(a)googlemail.com> wrote:
>> I managed to do the task at hand by switching to JSON dumps (because that's
>> the new, officially supported, long-term-stable Wikidata dump format, right?
>> Right???), so no hurry there.
>>
>> Maybe the XML dump process was run in the middle of the switch to the new
>> format, or got a stale cache for some items?
>
> It looks like the switch happened in the middle of a dump creation so
> this one is half old and half new format mixed. The ones after that
> should be all new format. And yay for switching to JSON!
>
>
> Cheers
> Lydia
>