On Tue, Sep 2, 2014 at 5:21 PM, Markus Krötzsch < markus@semantic-mediawiki.org> wrote:
Hi again,
I am in Berlin today and got my answers first hand, so for the record, here they are:
(2a) If the answer to (1) is no: what are/will be the first (or last) full/current/daily dump files that use the new format?
I did not get an answer to this question,
It looks like http://dumps.wikimedia.org/wikidatawiki/20140823/ uses the new format, although these dumps started before the format switch and ended after. There's a possibility that they have some strange mix of both formats. (?)
Next full xml dumps will have the new format. Switch for daily dumps should have been on August 27.
Cheers, Katie
but since it is certain that each file is in a single format, a viable
strategy is to parse with the new format first; if there are errors, try parsing with the old format; if this succeeds even once, the whole remaining file should be parsed in the old format.
(2b) If the answer to (1) is yes: what is the revision number at which the change was made (i.e., what is the largest revision number that is still in the old format)?
Not applicable.
Markus
Wikidata-tech mailing list Wikidata-tech@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-tech