I downloaded the whole english wikipedia from this link http://static.wikipedia.org/downloads/2008-06/en/wikipedia-en-html.tar.7z. When I tried to unzip the file using 7-zip, its giving a 'File is Broken' error. Any suggestion, or alternate method to get the english static html wikipedia dump will be appreciated.
Thanking you in anticipation, D.Raghuram.
From: "raghu0891" raghu0891@gmail.com
I downloaded the whole english wikipedia from this link http://static.wikipedia.org/downloads/2008-06/en/wikipedia-en-html.tar.7z. When I tried to unzip the file using 7-zip, its giving a 'File is Broken' error. Any suggestion, or alternate method to get the english static html wikipedia dump will be appreciated.
All methods available should be on a web page. FTP is more reliable (and restartable) if it's available. If it doesn't work the second time, then maybe it is corrupt at source. That compression format is totally new to me, so bugz might be in it.
2008/11/13 raghu0891 raghu0891@gmail.com:
I downloaded the whole english wikipedia from this link http://static.wikipedia.org/downloads/2008-06/en/wikipedia-en-html.tar.7z. When I tried to unzip the file using 7-zip, its giving a 'File is Broken' error. Any suggestion, or alternate method to get the english static html wikipedia dump will be appreciated.
The file is about 14GB. Maybe you didn't download it all ?
/Martin
Martin, The downloaded file shows exactly 14.3GB.
Martin Møller Skarbiniks Pedersen wrote:
2008/11/13 raghu0891 raghu0891@gmail.com:
I downloaded the whole english wikipedia from this link http://static.wikipedia.org/downloads/2008-06/en/wikipedia-en-html.tar.7z. When I tried to unzip the file using 7-zip, its giving a 'File is Broken' error. Any suggestion, or alternate method to get the english static html wikipedia dump will be appreciated.
The file is about 14GB. Maybe you didn't download it all ?
/Martin
WikiEN-l mailing list WikiEN-l@lists.wikimedia.org To unsubscribe from this mailing list, visit: https://lists.wikimedia.org/mailman/listinfo/wikien-l