Hi,
I am looking for a convenient way to convert individual documents returned
from the MediaWiki API to standalone HTML documents. Currently I am
retrieving documents via the action/render construct
http://en.wikipedia.org/w/index.php?action=render&title="
I am encountering two problems:
1) I have to put a "shell" of crudely cut and pasted
<html><head><body>
etc. around the rendered html.
2) I need to strip out the external hrefs from the results, and I don't
have a good way to do this.
I am taking a fresh look and wondering whether I should be retrieving the
docs in JSON or XML and then using a conversion program to turn those into
nice clean HTML docs. Does anyone have any suggestions or working examples?
FredZ
-----------------------------------------------------
Subscribe to the Nimble Books Mailing List
http://eepurl.com/czS- for
monthly updates