You might find it worth trying a command line tool for extraction.  Some of the Windows GUI tools seem to have problems with substantial files.

It would be great to get all the historical dumps on torrents too.

On 26/02/2011 15:43, White Cat wrote:
http://dumps.wikimedia.org/enwiki/20110115/

Hi, has anyone got plans to create individual torrents for "All pages with complete page edit history (.bz2)" ? I downloaded them and turns out I have several files that seem to be corrupted. I am unable to re-download them but feel the torrent would be able to fix the corrupted parts. All of the individual parts for the dumps except 1st,8th,9th,10th ones are complete.

I need these dumps because I will analyse revisions in hopes of better identifying vandalism on the wikis through machine learning. I however need the database to process this soon as my assignment is due in about a month.
_______________________________________________ Xmldatadumps-l mailing list Xmldatadumps-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l