Re: [Mediawiki-l] Trying to write a generic wikitext parser

18 Dec 2005

On Saturday, 17 Dec 2005, at 17:33 +0200, Yaroslav Fedevych wrote:
...
 The main goal is, of course, to be able to convert wikitext to as many
 output formats as possible. That is why I'll first try to write a
 parser function that returns a document tree I can then work with.

 So I sat down for half an hour and tried to identify each and every
 entity wikitext might consist of.

 The results are available here: http://docs.linux.org.ua/~jaroslaw/?p=8

 If anyone is interested in this, I encourage you to comment on this
 table, especially if I've got something wrong or the table is incomplete.

For a wiki-to-XML converter that works reasonably well (though a little
slow, especially with tables), see the "php" directory of the "wiki2xml"
module in CVS. Or:

http://cvs.sourceforge.net/viewcvs.py/wikipedia/wiki2xml/php/

It takes either wikitext or a list of page titles as its source. It can
even replace templates "live".
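
To make the "document tree first, output formats second" idea concrete,
here is a minimal sketch. It is not wiki2xml's code (that is PHP); it is
Python chosen purely for illustration, and the element names (article,
heading, paragraph, bold, italic, link) are assumptions rather than the
wiki2xml schema. It handles only headings, paragraphs, bold, italic and
internal links; nested quotes, tables and templates are left out.

```python
import re
import xml.etree.ElementTree as ET

# Inline markup handled by this sketch: '''bold''', ''italic'', [[target|label]].
INLINE = re.compile(
    r"'''(?P<bold>.+?)'''"
    r"|''(?P<italic>.+?)''"
    r"|\[\[(?P<target>[^\]|]+)(?:\|(?P<label>[^\]]+))?\]\]"
)
# A heading line: the same run of 1-6 '=' on both sides of the title.
HEADING = re.compile(r"^(={1,6})\s*(.+?)\s*\1\s*$")


def add_inline(parent, text):
    """Append text to parent, turning inline markup into child elements."""
    pos = 0
    last = None  # most recent child; plain text after it goes into its tail

    def add_plain(chunk):
        if last is None:
            parent.text = (parent.text or "") + chunk
        else:
            last.tail = (last.tail or "") + chunk

    for m in INLINE.finditer(text):
        add_plain(text[pos:m.start()])
        if m.group("bold") is not None:
            last = ET.SubElement(parent, "bold")
            last.text = m.group("bold")
        elif m.group("italic") is not None:
            last = ET.SubElement(parent, "italic")
            last.text = m.group("italic")
        else:
            last = ET.SubElement(parent, "link", target=m.group("target"))
            last.text = m.group("label") or m.group("target")
        pos = m.end()
    add_plain(text[pos:])


def parse_wikitext(source):
    """Return an ElementTree document tree for a small wikitext subset."""
    root = ET.Element("article")
    pending = []  # lines of the paragraph currently being collected

    def flush():
        if pending:
            add_inline(ET.SubElement(root, "paragraph"), " ".join(pending))
            pending.clear()

    for line in source.splitlines():
        m = HEADING.match(line)
        if m:
            flush()
            heading = ET.SubElement(root, "heading", level=str(len(m.group(1))))
            add_inline(heading, m.group(2))
        elif not line.strip():
            flush()  # a blank line ends the current paragraph
        else:
            pending.append(line.strip())
    flush()
    return root


if __name__ == "__main__":
    sample = "== Intro ==\nThis is '''bold''' and a [[Main Page|link]].\n\nSecond ''para''."
    print(ET.tostring(parse_wikitext(sample), encoding="unicode"))
```

Since the parser only builds a tree, each further output format can be a
separate walk over that same tree instead of a second parser.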

I've started adding OpenDocument text output. Any help would be
appreciated.

Magnus
