Hi everyone. I just joined the list; this is my first post, but I've been following the developments with interest.
I just learned about another promising MW parser, from the AboutUs guys, and thought I'd share it here in the list:
Kiwi: A Fast, Formal WikiText Parser - http://blog.aboutus.org/2011/01/31/kiwi-a-fast-formal-wikitext-parser/
Cheers, Waldir
On Tue, Feb 01, 2011 at 07:39:45PM +0000, Waldir Pimenta wrote:
Hi everyone. I just joined the list; this is my first post, but I've been following the developments with interest.
I just learned about another promising MW parser, from the AboutUs guys, and thought I'd share it here in the list:
Kiwi: A Fast, Formal WikiText Parser - [1]http://blog.aboutus.org/2011/01/31/kiwi-a-fast-formal-wikitext-parser/
Cheers, Waldir
Interesting. Is the PEG grammar available for this parser?
-Alan
Alan Post wrote:
Interesting. Is the PEG grammar available for this parser?
-Alan
It's at https://github.com/AboutUs/kiwi/blob/master/src/syntax.leg
Get peg/leg from http://piumarta.com/software/peg/
I just tried it and already found a bug on the first Hello World (it surrounds headers inside paragraphs). It strangely converts templates into underscored words. They may be expecting some other parser piece to restore it. I'm pretty sure there are corner cases in the preprocessor (eg. just looking at the peg file they don't handle mixed case noincludes), but I don't think that should need to be handled by the parser itself.
The grammar looks elegant. I doubt it can really handle full wikitext. But it would be so nice if it did...
wikitext-l@lists.wikimedia.org