I haven't really checked; for now I'll ignore the XML dumps, which are obviously broken for the time being, and use the JSON dumps instead.
Speaking of which, the latest one seems to have failed: 20141006.json.gz has been stuck at 700 bytes for two days now.
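A rough way to tell a truncated dump from a finished one is to decompress it to the end and see whether the gzip stream closes cleanly. The sketch below is only that; the function name gzip_is_complete is made up for illustration and is not part of any dump tooling.

```python
import gzip
import zlib


def gzip_is_complete(source):
    """Return True if source (a path or a binary file object) decompresses
    cleanly to the end of the gzip stream."""
    try:
        with gzip.open(source, "rb") as fh:
            # Read in 1 MiB chunks so a large dump is never held in memory.
            while fh.read(1 << 20):
                pass
    except (EOFError, OSError, zlib.error):
        # EOFError: stream ended early; OSError: not gzip data;
        # zlib.error: corrupt compressed data.
        return False
    return True
```

Note that this only catches truncation or corruption. A tiny but well-formed file would still pass, so a size check is still needed on top of it.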
On Wed, Oct 8, 2014 at 9:31 PM, Lukas Benedix lukas.benedix@fu-berlin.de wrote:
Is the problem with the different representation of empty values still in there?
links: [] vs. links: ""
Lukas
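Until the representation is consistent, a consumer could fold both empty forms into one value before processing. This is a minimal sketch that assumes the non-empty form is an object (a dict after parsing); normalize_empty is a hypothetical helper, not something the dumps provide.

```python
def normalize_empty(value):
    """Map the two empty forms seen in the dumps ([] and "") to an empty dict.

    Assumption: a non-empty value is a dict, so {} is the natural empty form.
    """
    if value == [] or value == "":
        return {}
    return value
```

For example, normalize_empty([]) and normalize_empty("") both give {}, while a non-empty dict is returned unchanged.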
On Wed, 08.10.2014 at 21:29, Magnus Manske wrote:
Oh, I just noticed: same with "links" and "sitelinks".
On Wed, Oct 8, 2014 at 8:28 PM, Magnus Manske <magnusmanske@googlemail.com> wrote:
Hi all,
in the dump file wikidatawiki-20140912-pages-articles.xml.bz2
I seem to find some items with a key "description", some with "descriptions".
For example, near the beginning of the file:
Q15 seems to have key "description"
Q17 seems to have key "descriptions"
This is rather unhelpful when running e.g. my stats script.
a) Can someone please confirm that I'm not crazy? I mean, in this instance.
b) Is this a bug, or a feature?
c) If a bug, is it already fixed for the next dump? Which key will it be?
(If a feature: why?)
Thanks, Magnus
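As a stopgap for scripts that read the affected dump, the variant keys could be folded into one spelling before any counting is done. The sketch below assumes each item has already been parsed into a dict. The alias table and the choice of the longer spellings as canonical are arbitrary assumptions, since which key the dumps will settle on is exactly the open question here.

```python
# Hypothetical alias table covering the two key pairs reported in this thread.
# Treating the right-hand spellings as canonical is an arbitrary choice.
KEY_ALIASES = {"description": "descriptions", "links": "sitelinks"}


def normalize_keys(item):
    """Return a copy of item with the variant key spellings folded into one."""
    out = {}
    for key, value in item.items():
        canonical = KEY_ALIASES.get(key, key)
        # If both spellings are present, keep whichever one appears first.
        out.setdefault(canonical, value)
    return out
```

A stats script could then call normalize_keys on every item and only ever look up "descriptions" and "sitelinks".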
Wikidata-tech mailing list Wikidata-tech@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-tech