On Wed, 7 May 2003, Nick Reinking wrote:
Just to update everybody on my progress with the C wikitext parser:
To do:
- Lists of any sort
*shudder* :)
- Sections, subsections, and subsubsections (==, ===, and ==== respectively)
Should work from = to ====== (h1 to h6).
- Emphasis, strong emphasis, and very strong emphasis ('', ''', and ''''')
Make sure the following cases work (and produce correct HTML, unlike our current code): ''italic '''bold-italic''' italic'' '''''bold-italic''' italic'' ''italic '''bold-italic'''''
'''bold ''bold-italic'' bold''' '''''bold-italic'' bold''' '''bold ''bold-italic'''''
Must be done by PHP:
- Handle links / link lookup
- Ignore links in <nowiki>
<nowiki> and <math> sections should probably be pulled out _before_ parsing, and their contents processed and reinserted after parsing.
When Wikitext is pulled from the database, what are the newlines? Are they always \n?
They sure should be...
Also, what format is the wikitext stored in the database as? UTF-8? UTF-16?
At the moment, ISO-8859-1 for the following languages: English, Danish, German, French, Dutch, Spanish, Swedish
UTF-8 for everything else that's on phase 3. The remaining latin-1s will get bumped up to UTF-8 at some point, once someone gets around to ensuring that it won't break with browsers that are violently unfriendly to editing UTF-8 text in forms.
-- brion vibber (brion @ pobox.com)