Different keys can still be found in the current XML dump,
wikidatawiki-20141009-pages-articles.xml.bz2. I'm not sure whether this bug
is also present in the dump with history.
page_id, wd_id, keys
111, Q15, ['aliases', 'claims', 'descriptions', 'id', 'labels', 'sitelinks', 'type']
137, Q24, ['aliases', 'claims', 'description', 'entity', 'label', 'links']
31500, Q28119, ['aliases', 'description', 'entity', 'label', 'links']
225144, ?, ['entity', 'redirect']
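
For anyone who wants to check other pages, here is a minimal Python sketch. It assumes the page text has already been decoded into a dict, and it sorts an entity into old or new format purely by the top-level keys from the listing above. The function name and the returned labels are made up for illustration, and 'aliases' and 'claims' are ignored because they show up in both formats.

```python
# Keys that only appear in one of the two serializations, taken from
# the listing above. 'aliases' and 'claims' occur in both, so they
# cannot be used to tell the formats apart.
NEW_FORMAT_KEYS = {"id", "type", "labels", "descriptions", "sitelinks"}
OLD_FORMAT_KEYS = {"entity", "label", "description", "links"}


def classify_entity(entity):
    """Return 'new', 'old', 'redirect' or 'unknown' for a decoded entity.

    Hypothetical helper: `entity` is assumed to be the dict obtained by
    parsing the JSON text of one page from the dump.
    """
    keys = set(entity)
    # Redirect stubs also carry 'entity', so test for them first.
    if "redirect" in keys:
        return "redirect"
    has_new = bool(keys & NEW_FORMAT_KEYS)
    has_old = bool(keys & OLD_FORMAT_KEYS)
    if has_new and not has_old:
        return "new"
    if has_old and not has_new:
        return "old"
    # Either no distinguishing key at all, or a mix of both formats.
    return "unknown"
```

Running this over every page and counting the results per label would show how much of a given dump is still in the old format.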
Lukas
On Thu 09.10.2014 19:32, Lydia Pintscher wrote:
On Thu, Oct 9, 2014 at 3:19 PM, Magnus Manske <magnusmanske(a)googlemail.com> wrote:
I managed to do the task at hand by switching to JSON dumps (because that's
the new, officially supported, long-term-stable Wikidata dump format, right?
Right???), so no hurry there.
Maybe the XML dump process was run in the middle of the switch to the new
format, or got a stale cache for some items?
It looks like the switch happened in the middle of a dump creation so
this one is half old and half new format mixed. The ones after that
should be all new format. And yay for switching to JSON!
Cheers
Lydia