Maarten Dammers wrote:
On 19-5-2010 13:45, Lars Aronsson wrote:
I tried to parse the year, I was successful for 3.5 million files. (Maybe I didn't try very hard.)
I guess you used a regex. Which one exactly? Or did you publish your code somewhere?
No, I did not publish my code or regex, and I don't intend to. This was a quick hack, and I know I might have missed lots of files. For example, just one random image from the huge Bundesarchiv image donation has a "Date=0-00-00", http://commons.wikimedia.org/wiki/File:Bundesarchiv_Bild_147-0435,_Wolfgang_...
(It's a mystery to me why this is displayed as "november 1999".)
Then again, another random Bundesarchiv image has "Date=1950-07-05", which should be covered by my hack, http://commons.wikimedia.org/wiki/File:Bodo_Uhse.jpg
We would have far fewer images from the 1950s if it weren't for this donation.
I want to encourage others to invent their own regex and see whether they get different results from mine. My numbers are posted on the talk page of the graph.
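
A minimal starting point for such an experiment might look like the following Python sketch. It is not the regex behind the posted numbers. The pattern, the function name parse_year and the year bounds are all assumptions made for illustration. It only handles a "Date=" field that begins with a four-digit year, so a value such as "Date=0-00-00" is reported as unparsed.

```python
import re

# Hypothetical pattern (an assumption, not the regex used for the posted numbers):
# "Date", optional spaces, "=", optional spaces, then exactly four digits.
DATE_YEAR = re.compile(r"\bdate\s*=\s*(\d{4})\b", re.IGNORECASE)


def parse_year(wikitext, min_year=1000, max_year=2010):
    """Return the year from the first Date= field, or None if it cannot be parsed.

    min_year and max_year are illustrative bounds for rejecting implausible values.
    """
    match = DATE_YEAR.search(wikitext)
    if match is None:
        return None
    year = int(match.group(1))
    return year if min_year <= year <= max_year else None


if __name__ == "__main__":
    for sample in ("Date=1950-07-05", "Date=0-00-00"):
        print(sample, "->", parse_year(sample))
```

Real file description pages write dates in many more ways (free text, other languages, templates), so a pattern this strict will miss many files, which is exactly why different regexes may produce different counts.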