[WikiEN-l] exporting sets of pages
Tim Starling
tstarling at wikimedia.org
Wed Jan 5 12:06:01 UTC 2011
On 05/01/11 10:03, Rajarshi Guha wrote:
> Is there a way I could export the articles containing Drugboxes or do
> I need to install Wikipedia locally?
The best way to do it would be to get the list of articles using the API:
http://www.mediawiki.org/wiki/API
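As a rough illustration of the API route, the sketch below asks for pages that transclude a template via the `list=embeddedin` query module. The endpoint URL, the template name `Template:Drugbox`, the User-Agent string, and all function names are assumptions made for this example. The `continue` handling follows the current API convention and may differ on older installations.

```python
import json
import urllib.parse
import urllib.request

# Assumed endpoint for English Wikipedia; adjust for other wikis.
API_URL = "https://en.wikipedia.org/w/api.php"


def embeddedin_params(template, cont=None):
    """Build query parameters listing main-namespace pages that transclude `template`."""
    params = {
        "action": "query",
        "list": "embeddedin",
        "eititle": template,   # e.g. "Template:Drugbox" (assumed template name)
        "einamespace": "0",    # articles only
        "eilimit": "500",
        "format": "json",
    }
    if cont:
        # Continuation values returned by the previous response, passed back as-is.
        params.update(cont)
    return params


def titles_from_response(data):
    """Extract page titles from one decoded JSON response."""
    return [page["title"] for page in data.get("query", {}).get("embeddedin", [])]


def list_transclusions(template):
    """Follow continuation until the whole list has been collected (uses the network)."""
    titles, cont = [], None
    while True:
        url = API_URL + "?" + urllib.parse.urlencode(embeddedin_params(template, cont))
        req = urllib.request.Request(url, headers={"User-Agent": "drugbox-list-sketch/0.1"})
        with urllib.request.urlopen(req) as resp:
            data = json.load(resp)
        titles.extend(titles_from_response(data))
        cont = data.get("continue")
        if not cont:
            return titles
```

Calling `list_transclusions("Template:Drugbox")` would then return a plain list of titles. If the infobox is reached through a redirect or has been renamed, the template title would need adjusting.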
If that's too hard, you could download templatelinks.sql.gz from
http://download.wikimedia.org/enwiki/latest/
and load it into a MySQL database, and then use that to get the list
of articles. But it's a big file and it's out of date. Either way, you
should get a list of articles and then download them in small batches
(say 10 articles at a time) using Special:Export. This may require a
small amount of scripting.
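The batching step might be scripted roughly as follows. This is only a sketch: the `index.php` URL, the output file naming, the one-second pause, and the function names are assumptions for illustration. The `pages` and `curonly` form fields are the ones Special:Export is documented to accept, but check them against the wiki you are using.

```python
import time
import urllib.parse
import urllib.request

# Assumed script path for English Wikipedia.
EXPORT_URL = "https://en.wikipedia.org/w/index.php"


def batches(titles, size=10):
    """Yield consecutive slices of at most `size` titles."""
    for i in range(0, len(titles), size):
        yield titles[i:i + size]


def export_form(batch):
    """Form fields for one Special:Export request (current revisions only)."""
    return {
        "title": "Special:Export",
        "pages": "\n".join(batch),  # one title per line
        "curonly": "1",
    }


def export_all(titles, size=10, pause=1.0):
    """Download each batch to export_00000.xml, export_00001.xml, ... (uses the network)."""
    for n, batch in enumerate(batches(titles, size)):
        body = urllib.parse.urlencode(export_form(batch)).encode("utf-8")
        req = urllib.request.Request(
            EXPORT_URL, data=body,  # supplying data makes this a POST
            headers={"User-Agent": "drugbox-export-sketch/0.1"},
        )
        with urllib.request.urlopen(req) as resp, open(f"export_{n:05d}.xml", "wb") as out:
            out.write(resp.read())
        time.sleep(pause)  # be gentle with the servers between batches
```

Here `titles` is any plain list of article titles, however it was obtained. Each output file is an XML dump of up to ten pages.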
-- Tim Starling