On Fri, Apr 4, 2008 at 7:33 PM, White Cat wikipedia.kawaii.neko@gmail.com wrote:
That may be very difficult. Such a query would be very expensive both CPU-wise and BW-wise.
It could be run over several days, giving the server some time between requests to avoid DoSing it. The list of redirects could be obtained with a simple script using http://en.wikipedia.org/w/api.php?action=query&list=allpages&apfilterredir=redirects&aplimit=500 as a base, setting the apfrom parameter as necessary. Then http://en.wikipedia.org/w/api.php?action=query&prop=revisions&titles=...&rvlimit=50 for looking at revisions. It would not be a great system but give it a week or so and you'd have a good chunk of data to look at.
Optionally, someone with toolserver access could cook up a nice SQL query to kill the DB server with. :)