On Wed, Aug 11, 2010 at 6:02 PM, David Gerard dgerard@gmail.com wrote:
On 11 August 2010 17:11, Liangent liangent@gmail.com wrote:
On 8/11/10, Daniel Kinzler daniel@brightbyte.de wrote:
Or keep the old parser around to deal with old revisions. Revisions that work with the new parser can be flagged as such.
This made me think of Quirks mode in browsers. We can make the new parser (and standardized wikitext syntax) more strict by treating many edge cases as errors, and if the new parser detected any error when parsing, use the old parser instead.
If Tim will buy it :-) The non-quirks mode had better cover *almost all* current revisions on the major WMF wikis, at the least. (The most recent current version dumps would be suitable test data.)
Do we have a short list of "worst case scenario" pages, which use lots of special cases for some reason, and that we could use as a test set? Not something specially constructed, but real, live wikipedia pages.
Magnus