On Thu, May 11, 2006 at 10:23:58AM +0200, Tels wrote:
Just for the record, I am also interested (and working) on the wiki2xml project. The goal is to extract a "tree" from:
http://bloodgate.com/wiki/Wiki-Presentations
automatically, e.g. get all pages and convert them in one go into an OpenOffice (or whatever) document.
At the moment I am still struggling with finding my way around Magnus' code, but eventually we get there :)
The current idea is to use a template which lists all the pages as a source, but of course it would be equally possible to generate the list of pages to extract from either a category, or spidering it from a start page or whatever. Producing the list of articles to be extracted is really a seperate issue from converting one or more articles to another format :)
Spidering it was my preferred approach, yes, though something someone alluded to which amounted to "spider out the page names into a file (with visible indention) and then run over that file, as amended by a human" rather then automatically spidering directly into the conversion code, is likely a better approach. Indeed, this approach would allow you to select a section numbering protocol and see how it would look before doing The Big Run.
Cheers, -- jra