On Thu, 2004-07-29 at 19:59 +0000, Ævar Arnfjörð Bjarmason wrote:
So in other words the code exists but is not currently enabled?
The code exists in php, but is a few times slower than the current parser. That tokenizer also handles only a few of [count > 20] passes.
I'm currently writing a new parser using BisonGen (builds both a C python module and a pure python parser) that handles the entire parsing in one step. The C version also performs very well (0.014 seconds vs. 0.17 seconds for the pure python version). The output will be a DOM object tree, includes and the like will be handled by manipulating that tree before dumping it as [insert your favourite format here]. Where feasible, this parser also supports the current Moin syntax additional to the MW one, it's intended to work with Moin of course (which has a relatively clean design and profits from the python infrastructure). Some more details at http://moinmoin.wikiwikiweb.de/NewWikiParser.