Hi Jamie,
On 17.02.2010 11:41, Jamie Morken wrote:
I am sure that the intentions are good for what they are doing; people just want to protect Wikipedia (including me). I am most interested in learning how to make a local, searchable backup of the full Wikipedia; it seems a bit tricky with the XML format. I plan on making an install of apache/lucene/mysql/php/mediawiki and see how it works! :)
Have you ever looked into openZIM? The ZIM file format offers highly compressed storage and includes a full-text search index.
Currently, Kiwix is available as a GUI reader application and zimreader as a web server, along with special readers for embedded devices etc.
The Wikimedia Foundation is working on providing regular ZIM dumps of their wikis along with the SQL and XML dumps.
/Manuel