Hi,
I am looking for a convenient way to convert individual pages returned by MediaWiki into standalone HTML documents. Currently I am retrieving pages through index.php with action=render:
http://en.wikipedia.org/w/index.php?action=render&title="
I am encountering two problems:
1) I have to put a "shell" of crudely cut and pasted <html><head><body> etc. around the rendered HTML.
2) I need to strip out the external hrefs from the results, and I don't have a good way to do this.
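
To make the two problems concrete, here is a minimal Python sketch of both steps using only the standard library. It assumes that MediaWiki marks external links with an "external" class on the <a> tag (which is what the rendered output usually shows; worth checking against your own pages). The names ExternalLinkStripper and to_standalone_html are made up for illustration.

```python
from html import escape
from html.parser import HTMLParser


class ExternalLinkStripper(HTMLParser):
    """Re-emit HTML unchanged, except that external <a> tags are unwrapped.

    Assumption: external links carry class="external ..." in the rendered
    output. The link text is kept; only the <a> wrapper is dropped.
    """

    def __init__(self):
        super().__init__(convert_charrefs=False)
        self.out = []
        self.a_kept = []  # one bool per open <a>: was its start tag emitted?

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            classes = (dict(attrs).get("class") or "").split()
            keep = "external" not in classes
            self.a_kept.append(keep)
            if not keep:
                return
        self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        # Drop the </a> that closes an <a> we skipped; emit everything else.
        if tag == "a" and self.a_kept and not self.a_kept.pop():
            return
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        self.out.append(data)

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")

    def handle_comment(self, data):
        self.out.append(f"<!--{data}-->")


def to_standalone_html(title, fragment):
    """Strip external links from a rendered fragment and wrap it in a shell."""
    stripper = ExternalLinkStripper()
    stripper.feed(fragment)
    stripper.close()
    body = "".join(stripper.out)
    return (
        "<!DOCTYPE html>\n"
        '<html lang="en">\n<head>\n<meta charset="utf-8">\n'
        f"<title>{escape(title)}</title>\n</head>\n<body>\n{body}\n</body>\n</html>\n"
    )
```

Calling to_standalone_html("Some title", rendered_fragment) returns the complete document as a string. A real HTML library would be more robust against malformed markup; this only shows the shape of the two steps.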
I am taking a fresh look and wondering whether I should be retrieving the docs in JSON or XML and then using a conversion program to turn those into nice clean HTML docs. Does anyone have any suggestions or working examples?
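
For the JSON route, a rough Python sketch of what retrieval could look like through api.php with action=parse. It assumes the English Wikipedia endpoint and formatversion=2, where the rendered HTML comes back as a plain string under parse.text. The function names and the User-Agent string are placeholders, not anything prescribed by the API.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Assumed endpoint; substitute the api.php URL of the wiki you are querying.
API = "https://en.wikipedia.org/w/api.php"


def parse_url(title):
    """Build an action=parse request URL that asks for rendered HTML as JSON."""
    params = {
        "action": "parse",
        "page": title,
        "prop": "text",
        "format": "json",
        "formatversion": "2",
    }
    return f"{API}?{urlencode(params)}"


def extract_html(payload):
    """Return (title, html) from a JSON response body, or raise on an API error."""
    data = json.loads(payload)
    if "error" in data:
        raise RuntimeError(data["error"].get("info", "API error"))
    return data["parse"]["title"], data["parse"]["text"]


def fetch_rendered(title):
    """Fetch one page over the network (placeholder User-Agent)."""
    req = Request(parse_url(title), headers={"User-Agent": "standalone-html-sketch/0.1"})
    with urlopen(req, timeout=30) as resp:
        return extract_html(resp.read().decode("utf-8"))
```

The returned HTML is still a fragment, so it would need the same wrapping and link cleanup as the action=render output. The main gain is getting the page title and error reporting in a structured form.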
FredZ
mediawiki-api@lists.wikimedia.org