Hello,
I have been trying for a few days to download the xml wikipedia dump (pages-articles.xml.bz2) but the official links point to a file that is less than 1k and bzip2 says the file is corrupted. I can't seem to find one good version of either the standard wikipedia xml dump or the one with comments and user pages. Either will do, but I am getting desperate. Could someone please help? If you could point me to an older dump, anything at all, I would really appreciate it. I have trying everything that came to mind, but i can't even get someone to tell me if I am doing something stupid and that they can download the fie at http://download.wikimedia.org/enwiki/20070402/.
If you could help me I would be eternally grateful.
Thank you very much,
Vasco Calais Pedro
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
vasco@cs.cmu.edu wrote:
I have been trying for a few days to download the xml wikipedia dump
(pages-articles.xml.bz2) but the official links point to a file that is less than 1k and bzip2 says the file is corrupted.
Most likely your download utility, or perhaps a proxy on your end, is broken and does not handle files larger than 2 gigabytes correctly.
- -- brion vibber (brion @ wikimedia.org)