Robert Carter wrote:
I've been experimenting with the parameters to Special:Export to retrieve the whole history of an article. I haven't been able to get more than 1000 revisions (from en wikipedia).
Does anyone know of a way to obtain the full history of an article? Those huge 7z exports seem too crazy to work with to extract data for only one page.
You can use api.php with rvprop=content and rvcontinue to fetch the text of all revisions of a page. Please do this in a single thread with a substantial delay between requests, since this is a very expensive operation for our servers. Do not attempt to do it for a large number of pages, for that, use the XML download instead. Do not do it regularly or set up a web gateway which allows users to initiate these requests.
-- Tim Starling