Daniel Schwen wrote:
the Verein separately spent ~20'000 EUR on a server to replicate Wikimedia uploads in Amsterdam.
Oh! Is that server accessible from the toolserver cluster? It would make a few applications quite a bit easier and faster: I have one bot that detects animated GIFs, and another that extracts GPS data from the EXIF block in files (no, it's not in the DB!).
Dschwen
Gmaxwell has a node with a copy of the images. Ask him for access.
I download all Commons uploads there; if you need to process new files, I could extract the data for you at the same time.
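
For anyone wondering what the animated-GIF check mentioned above might involve when run against a local copy of the files, here is a minimal sketch. It is not the actual bot's code: the function names `count_gif_frames` and `is_animated_gif` are made up for illustration, and it assumes the whole file fits in memory. It walks the GIF block structure and counts image descriptors; more than one frame is treated as "animated".

```python
def _skip_sub_blocks(data: bytes, pos: int) -> int:
    """Skip a chain of GIF data sub-blocks; return the offset after the terminator."""
    while True:
        size = data[pos]
        pos += 1
        if size == 0:          # a zero-length sub-block ends the chain
            return pos
        pos += size


def count_gif_frames(data: bytes) -> int:
    """Count image descriptors (frames) in a GIF byte string (hypothetical helper)."""
    if data[:6] not in (b"GIF87a", b"GIF89a"):
        raise ValueError("not a GIF file")
    packed = data[10]          # packed field of the logical screen descriptor
    pos = 13                   # 6-byte header + 7-byte screen descriptor
    if packed & 0x80:          # global color table present
        pos += 3 * (2 << (packed & 0x07))
    frames = 0
    while pos < len(data):
        block = data[pos]
        pos += 1
        if block == 0x3B:      # trailer: end of the GIF stream
            break
        if block == 0x21:      # extension: label byte, then sub-blocks
            pos += 1
            pos = _skip_sub_blocks(data, pos)
        elif block == 0x2C:    # image descriptor: one frame
            frames += 1
            packed = data[pos + 8]
            pos += 9           # left, top, width, height, packed field
            if packed & 0x80:  # local color table present
                pos += 3 * (2 << (packed & 0x07))
            pos += 1           # LZW minimum code size
            pos = _skip_sub_blocks(data, pos)
        else:
            raise ValueError(f"unexpected block type 0x{block:02X}")
    return frames


def is_animated_gif(data: bytes) -> bool:
    """Treat any GIF with more than one frame as animated (an assumption)."""
    return count_gif_frames(data) > 1
```

Typical use would be `is_animated_gif(open(path, "rb").read())`. A truncated file raises `IndexError` here rather than being handled gracefully, so a real bot would want to catch that.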