You might find it worth trying a command line tool for extraction. Some
of the Windows GUI tools seem to have problems with substantial files.
It would be great to get all the historical dumps on torrents too.
On 26/02/2011 15:43, White Cat wrote:
http://dumps.wikimedia.org/enwiki/20110115/
Hi, has anyone got plans to create individual torrents for "All pages
with complete page edit history (.bz2)" ? I downloaded them and turns
out I have several files that seem to be corrupted. I am unable
to re-download them but feel the torrent would be able to fix the
corrupted parts. All of the individual parts for the dumps except
1st,8th,9th,10th ones are complete.
I need these dumps because I will analyse revisions in hopes of better
identifying vandalism on the wikis through machine learning. I however
need the database to process this soon as my assignment is due in
about a month.
_______________________________________________
Xmldatadumps-l mailing list
Xmldatadumps-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l