A few months ago I successfully downloaded the November 2006 HTML version of Wikipedia (about 6GB expanding to 90GB) and the October 2008 xml.bz2 file (4.1GB converted to 7.1GB Wikitaxi format). I have just downloaded the June 2008 HTML version in .tar.7z format and extracted into .tar format (14.3GB.to 230GB). I now have no idea what to do next. I ran WinRAR on it and it gave up after more than 6 million files. 1. How do I actually access all this information? I use the Wikitaxi version, but only the HTML version allows access to, for instance, categories, so the latest version would be useful. 2. Is there any way to recompress it to a reasonable size such that I can still access it without it occupying nearly all my disk?3. Or, failing that, is there any way to access the original .tar.7z file, as BzReader can access .xml.bz2 files?
wikitech-l@lists.wikimedia.org