the Verein
separately spent ~20'000 EUR on a server to replicate
Wikimedia uploads in Amsterdam.
Oh! Is that server accessible from the toolserver cluster? It would make a few
applications quite a bit easier/faster (I have a bot that detects animated
GIFs, and one that extracts GPS data from the EXIF block in files (no, its
not in the DB!))
Dschwen
Gmaxwell has a node with a copy of the images. Ask him for access.
I download there all commons uploads, if you need to be processing new
files I could extract the data for you at the same time.