Re: [Wikitech-l] Re: Chat about Wikipedia performance?

8 May 2003

      On Wed, 7 May 2003, Nick Reinking wrote:
...
Just to update everybody on my progress with the C wikitext parser:
To do:

Lists of any sort

*shudder* :)
...

Sections, subsections, and subsubsections (==, ===, and ====
respectively)

Should work from = to ====== (h1 to h6).
...

Emphasis, strong emphasis, and very strong emphasis ('', ''', and
''''')

Make sure the following cases work (and produce correct HTML, unlike our
current code):
''italic '''bold-italic''' italic''
'''''bold-italic''' italic''
''italic '''bold-italic'''''
'''bold ''bold-italic'' bold'''
'''''bold-italic'' bold'''
'''bold ''bold-italic'''''
...
Must be done by PHP:

Handle links / link lookup
Ignore links in <nowiki>

<nowiki> and <math> sections should probably be pulled out _before_
parsing, and their contents processed and reinserted after parsing.
...
When Wikitext is pulled from the database, what are the newlines?
Are they always \n?
They sure should be...
...
Also, what format is the
wikitext stored in the database as?  UTF-8?  UTF-16?
At the moment, ISO-8859-1 for the following languages:
English, Danish, German, French, Dutch, Spanish, Swedish
UTF-8 for everything else that's on phase 3. The remaining latin-1s will
get bumped up to UTF-8 at some point, once someone gets around to ensuring
that it won't break with browsers that are violently unfriendly to
editing UTF-8 text in forms.
-- brion vibber (brion @ pobox.com)

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

Re: [Wikitech-l] Re: Chat about Wikipedia performance?