On 03/12/2011 13:18, Tim Starling wrote:
On 03/12/11 08:58, Platonides wrote:
On 02/12/11 22:33, Khalida BEN SIDI AHMED wrote:
Hello, I need an html dump of Wikipedia but the link http://static.wikipedia.org/ does not work. I'd appreciate any explanation or suggestion.
Regards Ben Sidi Ahmed
Why do oyu need an html dump of Wikipedia?
It's a huge task to set up MediaWiki in precisely the same way as it is on Wikimedia, to import an XML dump and to generate HTML. It takes a serious amount of hardware and software development resources. That's why I spent so much time making HTML dump scripts. It's just a pity that nobody cared enough about it to keep the project going.
The DumpHTML Mediawiki extension is an essential piece of software: https://www.mediawiki.org/wiki/Extension:DumpHTML
This is IMO the good approach and the only way to do high-quality static dumps. I have been using it since many years and all ZIM files I made were done using Tim's Mediawiki DumpHTML extension. http://download.kiwix.org/zim/0.9/
At Kiwix we currently pretty much focus on the end-user software but we still want to do everything necessary for having an open/efficient/handful toolchain to create static dumps from Mediawiki instances (in particular in the ZIM format).
That is the reason why we have an small action plan to improve DumpHTML http://www.kiwix.org/index.php/Mediawiki_DumpHTML_extension_improvement Any comment or critic is welcome.
If hackers are interested in working on DumpHTML, please let me know ; we currently work to get a grant for that, and this is on the good way.
Emmanuel