Alfio Puglisi wrote in gmane.science.linguistics.wikipedia.technical:
On Sun, 29 May 2005, Kate Turner wrote:
please let me know about any problems with these files, particularly if they don't extract correctly.
Test using http://dumps.wikimedia.org/images/wikipedia/fi/20050530_upload.tar
GNU tar on Cygwin extracts all files correctly, except for the last ones (outside any subdirectory), where I get lots of "tar: Skipping to next header" errors, and a bunch of invalid gif and png files (button_bold.gif, button_bold.png and so on).
thanks.
these files are symlinks in the image directory. there seems to be a bug where symlinks are not handled correctly when creating the tar file. nonetheless, since the desired behaviour is that they are not included at all, this shouldn't be an issue (assuming the rest of the files are extracted okay). i'll try to fix this for the next dump.
pax doesn't seem to be available for Cygwin. I found with surprise that there's one included with windows, but it doesn't work:
pax: - : This doesn't look like a tar archive pax: - : Skipping to next file...
it's possible this pax doesn't understand the newer POSIX format. do you know where it comes from?
only the first 832 files are extracted, out of 3000.
Alfio
kate.