Robert Carter wrote:
I've been experimenting with the parameters to
Special:Export to
retrieve the whole history of an article. I haven't been able to get
more than 1000 revisions (from en wikipedia).
Does anyone know of a way to obtain the full history of an article?
Those huge 7z exports seem too crazy to work with to extract data for
only one page.
You can use api.php with rvprop=content and rvcontinue to fetch the
text of all revisions of a page. Please do this in a single thread
with a substantial delay between requests, since this is a very
expensive operation for our servers. Do not attempt to do it for a
large number of pages, for that, use the XML download instead. Do not
do it regularly or set up a web gateway which allows users to initiate
these requests.
-- Tim Starling