Saqib wrote:
I'm looking to parse the text contents ("old_text" field) of articles in
the "text" table from a pages_articles.xml dump into MySQL, without using
MediaWiki.
It's polite to read the list archives before asking questions; a
question to this effect was asked 13 days ago.
You'll want to read Magnus' message:
http://mail.wikipedia.org/pipermail/wikitech-l/2005-November/032395.html
If you want to stay fully outside of MediaWiki, building on Magnus' code
is your best bet. You could either change what it emits directly, or use
XSLT to do the simple transformations you need on the XML and produce HTML.
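
For the extraction step alone (getting each article's title and raw
wikitext out of the dump before any conversion), here is a minimal sketch
in Python. The language is my choice for illustration and is not taken
from Magnus' message. The names iter_pages, local_name and SAMPLE are
hypothetical, and the element names (page, title, text) and the namespace
URI in the sample are assumptions about the dump layout, so check them
against your actual file. It streams the XML, so the whole dump never has
to fit in memory.

```python
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip a '{namespace}' prefix so the export schema version is irrelevant."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(source):
    """Yield (title, wikitext) for each <page> in a dump.

    `source` is a filename or a binary file object. Assumes one <text>
    per page, as in a current-revisions-only dump.
    """
    title, text = None, ""
    for _event, elem in ET.iterparse(source, events=("end",)):
        name = local_name(elem.tag)
        if name == "title":
            title = elem.text
        elif name == "text":
            text = elem.text or ""
        elif name == "page":
            yield title, text
            title, text = None, ""
            elem.clear()  # drop the finished page's subtree


# Tiny stand-in for a real dump; the namespace URI is an assumption.
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.3/">
  <page>
    <title>Example</title>
    <revision>
      <text xml:space="preserve">'''Example''' wikitext.</text>
    </revision>
  </page>
</mediawiki>"""

if __name__ == "__main__":
    for page_title, wikitext in iter_pages(io.BytesIO(SAMPLE)):
        print(page_title, len(wikitext))
```

This only pulls the raw wikitext out; it does not render it. Each pair
could then be written to MySQL with a parameterized INSERT, and turning
the wikitext into HTML is still the part where Magnus' code comes in.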
--
Ivan Krstic <krstic(a)fas.harvard.edu> | 0x147C722D