Since the archives are fairly opaque on the subject, does anyone know if it's possible to spider the database dumps offline through the use of WikiFilter? Or do you have to be running the full MediaWiki software locally?
Thanks; got a catch-the-mis-categorization project in mind.
Regards, David R
David Reynolds wrote:
> Since the archives are fairly opaque on the subject, does anyone know if it's possible to spider the database dumps offline through the use of WikiFilter? Or do you have to be running the full MediaWiki software locally?
I don't know what WikiFilter is so I couldn't say what it can or can't do.
But you are certainly free to process the dumps however you like; importing them into a MediaWiki instance is only one of the options.
See http://meta.wikimedia.org/wiki/Data_dumps for some more information on the dumps we provide and the format.
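As a rough illustration of processing a dump without a MediaWiki install, here is a minimal Python sketch that streams an XML page dump and lists the category links found in each page's wikitext. It is only a sketch under stated assumptions: the function name `iter_page_categories`, the regular expression, and the sample document are made up for the example, and it assumes the dump uses `<page>`, `<title>` and `<text>` elements and that categories appear as literal `[[Category:...]]` links (categories added through templates would be missed).

```python
import io
import re
import xml.etree.ElementTree as ET

# Hypothetical pattern for illustration: matches literal [[Category:Name]] or
# [[Category:Name|sortkey]] links and captures only the category name.
CATEGORY_RE = re.compile(r"\[\[\s*Category\s*:\s*([^\]|]+)", re.IGNORECASE)


def local_name(tag):
    """Drop the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def iter_page_categories(source):
    """Yield (title, [category, ...]) for each <page> in an XML dump.

    `source` is a filename or a binary file object. The dump is streamed,
    so the whole file is never held in memory at once.
    """
    title, text = None, ""
    for _event, elem in ET.iterparse(source, events=("end",)):
        name = local_name(elem.tag)
        if name == "title":
            title = elem.text
        elif name == "text":
            # With several revisions per page, the last one seen wins.
            text = elem.text or ""
        elif name == "page":
            yield title, [c.strip() for c in CATEGORY_RE.findall(text)]
            title, text = None, ""
            elem.clear()  # release the finished page's subtree


# Tiny made-up dump fragment; the namespace URI is an assumption and is
# ignored by the parser code above anyway.
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page>
    <title>Example</title>
    <revision><text>Body text. [[Category:Fruit]] [[Category:Food|sort]]</text></revision>
  </page>
</mediawiki>"""

if __name__ == "__main__":
    for page_title, categories in iter_page_categories(io.BytesIO(SAMPLE)):
        print(page_title, categories)
```

For a real compressed dump, something like `iter_page_categories(bz2.open(path, "rb"))` from the standard `bz2` module should work in the same way, since the function only needs a binary file object.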
-- brion vibber (brion @ pobox.com)
wikitech-l@lists.wikimedia.org