Hi MZMcBride,
On 20-2-2012 23:24, MZMcBride wrote:
Maarten Dammers wrote:
Some time ago I played around with computer vision to get images categorized on Commons. I documented this at https://commons.wikimedia.org/wiki/User:Multichill/Using_OpenCV_to_categoriz... files . I don't think I'm going to spend time on it soon, but the results were quite promising, so maybe someone else feels like working on this? Would probably be a pretty nice student project or just fun to do.
Neat! :-)
I've had some thoughts about the uses of this, particularly doing fuzzy image matching to spot duplicates, rather than relying on the primitive hash of the contents of the file.
If you want to find duplicate images you might want to play around with http://svn.wikimedia.org/viewvc/pywikipedia/trunk/pywikipedia/match_images.p...
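To make the difference between a content hash and fuzzy matching concrete, here is a minimal sketch in plain Python. It uses a simple "average hash" as a stand-in technique; it is not taken from match_images.py and may well differ from what that script does. The function names, the toy 4x4 grayscale grids, and the threshold of 2 bits are all made up for illustration. A real version would first scale each image down to a small grayscale thumbnail.

```python
import hashlib


def exact_hash(pixels):
    """SHA-1 over the raw pixel bytes: any changed byte gives a new digest."""
    flat = bytes(value for row in pixels for value in row)
    return hashlib.sha1(flat).hexdigest()


def average_hash(pixels):
    """One bit per pixel: 1 if the pixel is brighter than the image mean."""
    flat = [value for row in pixels for value in row]
    mean = sum(flat) / len(flat)
    return [1 if value > mean else 0 for value in flat]


def hamming_distance(bits_a, bits_b):
    """Number of positions where the two bit lists differ."""
    return sum(a != b for a, b in zip(bits_a, bits_b))


def is_probable_duplicate(pixels_a, pixels_b, max_distance=2):
    """Hypothetical rule: treat images as duplicates if few bits differ."""
    distance = hamming_distance(average_hash(pixels_a), average_hash(pixels_b))
    return distance <= max_distance


# Toy 4x4 grayscale "thumbnails" (values 0-255), invented for this example.
original = [
    [10, 20, 200, 210],
    [15, 25, 205, 215],
    [12, 22, 202, 212],
    [11, 21, 201, 211],
]
# The same picture, slightly brightened (e.g. after re-encoding).
brighter = [[min(255, value + 3) for value in row] for row in original]
# A different picture: every row mirrored left to right.
different = [list(reversed(row)) for row in original]

if __name__ == "__main__":
    print("same SHA-1 as brighter copy:", exact_hash(original) == exact_hash(brighter))
    print("fuzzy match with brighter copy:", is_probable_duplicate(original, brighter))
    print("fuzzy match with mirrored image:", is_probable_duplicate(original, different))
```

The brightened copy gets a different SHA-1 digest, so a hash of the file contents misses it, while its average-hash bits stay identical to the original's. The mirrored image differs in every bit and is rejected.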
Maarten