[WikiEN-l] exporting sets of pages

Tim Starling tstarling at wikimedia.org
Wed Jan 5 12:06:01 UTC 2011


On 05/01/11 10:03, Rajarshi Guha wrote:
> Is there a way I could export the articles containing Drugboxes or do
> I need to install Wikipedia locally?

The best way to do it would be to get the list of articles using the API:

http://www.mediawiki.org/wiki/API

If that's too hard, you could download templatelinks.sql.gz from

http://download.wikimedia.org/enwiki/latest/

and load it into a MySQL database, and then use that to get the list
of articles. But it's a big file and it's out of date. Either way, you
should get a list of articles and then download them in small batches
(say 10 articles at a time) using Special:Export. This may require a
small amount of scripting.
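
For illustration, here is a rough sketch of what that scripting might look
like in Python. It is one possible approach under several assumptions, not
a tested tool: it assumes the current API endpoint at
en.wikipedia.org/w/api.php, the list=embeddedin query module with
"continue"-style continuation, that the template is named Template:Drugbox,
and that Special:Export accepts a POST with "pages" (newline-separated
titles) and "curonly". The User-Agent string, output file names and the
one-second delay are placeholders.

```python
import json
import time
import urllib.parse
import urllib.request

# Assumed endpoints; adjust for the wiki you are working with.
API_URL = "https://en.wikipedia.org/w/api.php"
EXPORT_URL = "https://en.wikipedia.org/w/index.php?title=Special:Export&action=submit"
# Placeholder User-Agent; replace with something that identifies you.
USER_AGENT = "drugbox-export-sketch/0.1 (contact: you@example.org)"


def batches(items, size=10):
    """Yield consecutive slices of `items`, each at most `size` long."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def embeddedin_params(template, cont=None):
    """Build the query string parameters for one list=embeddedin request."""
    params = {
        "action": "query",
        "list": "embeddedin",
        "eititle": template,   # template whose transclusions we want
        "einamespace": "0",    # articles only
        "eilimit": "500",
        "format": "json",
    }
    params.update(cont or {})  # continuation values from the previous reply
    return params


def fetch(url, data=None):
    """GET `url`, or POST the form fields in `data` if given; return bytes."""
    body = urllib.parse.urlencode(data).encode("utf-8") if data else None
    req = urllib.request.Request(url, data=body, headers={"User-Agent": USER_AGENT})
    with urllib.request.urlopen(req) as resp:
        return resp.read()


def list_titles(template="Template:Drugbox"):
    """Collect the titles of all articles that transclude `template`."""
    titles, cont = [], None
    while True:
        query = urllib.parse.urlencode(embeddedin_params(template, cont))
        reply = json.loads(fetch(API_URL + "?" + query))
        titles.extend(page["title"] for page in reply["query"]["embeddedin"])
        cont = reply.get("continue")
        if not cont:
            return titles


def export_batch(titles):
    """Ask Special:Export for the current revision of each title (XML bytes)."""
    return fetch(EXPORT_URL, {"pages": "\n".join(titles), "curonly": "1"})


def main():
    """Export every Drugbox article, 10 at a time, one XML file per batch."""
    for n, batch in enumerate(batches(list_titles())):
        with open(f"drugbox_{n:05d}.xml", "wb") as out:
            out.write(export_batch(batch))
        time.sleep(1)  # be gentle with the servers
```

Calling main() would write one XML file per batch of 10 articles into the
current directory. The batch size is the "size" argument of batches(), so
it is easy to change if 10 turns out to be too small or too large.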

-- Tim Starling



