Tim Starling wrote:
Gregory Maxwell wrote:
On Sat, Sep 20, 2008 at 9:38 PM, Tim Starling <tstarling@wikimedia.org> wrote:
Yes. The bug itself deleted the file and put a directory entry in its place. I wrote a shell script to remove the directory entry and then run wget to fetch the file from the squid cache. Wget created a zero-length file for every cache miss. Some of those files were subsequently deleted.
So does that mean there are more files which were not included in the prior list of missing files that I should check for?
The list of missing files was derived from the initial scan for directory entries where files should have been. I'm not sure why files would have been missing from that list. We'll probably have to check the whole file repository against the DB. You could write a script for that if you feel like doing something to help.
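A rough sketch of what such a check could look like, in Python. It assumes you already have the expected relative paths exported from the DB (how that list is produced is not covered here), and the function name and status labels are made up for illustration. It flags the three cases mentioned in this thread: a directory where a file should be, a file that is gone, and a zero-length file left behind by a cache miss.

```python
import os

def classify(root, expected_paths):
    """Return {relative_path: status} for each path the DB says should exist.

    Status is one of 'ok', 'missing', 'directory' or 'empty'.
    The list of expected paths is an assumption: it has to come
    from some export of the image table, which is not shown here.
    """
    report = {}
    for rel in expected_paths:
        # Strip a leading slash so os.path.join does not discard root.
        full = os.path.join(root, rel.lstrip("/"))
        if os.path.isdir(full):
            report[rel] = "directory"   # directory entry where a file should be
        elif not os.path.isfile(full):
            report[rel] = "missing"     # nothing there at all
        elif os.path.getsize(full) == 0:
            report[rel] = "empty"       # e.g. a zero-length file from a failed fetch
        else:
            report[rel] = "ok"
    return report
```

Anything not reported as 'ok' would then be a candidate for the missing-files list.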
Just running something like "find -type d" on the image directory and filtering out the expected legitimate entries would be a good start.
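For anyone who would rather do the filtering in a script than in a shell pipeline, here is a minimal Python sketch of the same idea. The notion of "legitimate" is an assumption on my part: one- or two-character lowercase hex names (the hashed layout), plus "archive", are treated as expected, and "thumb" is skipped entirely because per-file directories are normal there. Adjust both sets to match the real tree.

```python
import os
import re

# Assumption: hashed subdirectories are one or two lowercase hex characters.
HASH_DIR = re.compile(r"[0-9a-f]{1,2}")
EXPECTED_DIRS = {"archive"}   # assumed legitimate, and descended into
SKIPPED_DIRS = {"thumb"}      # assumed to hold per-file directories legitimately

def suspicious_dirs(root):
    """Yield directories under root whose names are not expected entries."""
    for dirpath, dirnames, _filenames in os.walk(root):
        keep = []
        for name in sorted(dirnames):
            if name in SKIPPED_DIRS:
                continue  # neither reported nor descended into
            if HASH_DIR.fullmatch(name) or name in EXPECTED_DIRS:
                keep.append(name)
            else:
                yield os.path.join(dirpath, name)
        # Prune in place so os.walk only descends into expected directories.
        dirnames[:] = keep
```

Calling `list(suspicious_dirs(path_to_image_directory))` gives the candidates; since "archive" is descended into, directories sitting where archived files should be are reported as well.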
By the way, there also seem to be plenty of these under the "archive" directory, e.g. /wikipedia/en/archive/0/00/20060414204303!Uakari_male.jpg/, which probably has something to do with the problems we've been having with thumbnail generation in image histories.