Hi,

I am looking for a convenient way to convert individual documents returned from the MediaWiki API to standalone HTML documents. Currently I am retrieving documents via the action/render construct

http://en.wikipedia.org/w/index.php?action=render&title="

I am encountering two problems:

1) I have to put a "shell" of crudely cut and pasted <html><head><body> etc. around the rendered html.
2) I need to strip out the external hrefs from the results, and I don't have a good way to do this.

I am taking a fresh look and wondering whether I should be retrieving the docs in JSON or XML and then using a conversion program to turn those into nice clean HTML docs.  Does anyone have any suggestions or working examples?

FredZ


-----------------------------------------------------
Subscribe to the Nimble Books Mailing List  http://eepurl.com/czS- for monthly updates