Jeremy Dunck wrote:
I'm planning to parse the page text from wikipedia downloads.
Is there a document of all the supported markups (past and present), or is the PHP code all there is to go off of?
Unfortunately there is no formal, official grammar for the wiki markup. I'm afraid you'll have to work off the code (which isn't always 'right') and various help pages (which aren't always right either ;)
There is a set of parser test cases in maintenance/parserTests.txt (not all of which are passed by the current code).
-- brion vibber (brion @ pobox.com)