On 4 December 2014 at 09:08, James Heald <j.heald(a)ucl.ac.uk> wrote:
As well as exact duplicates, there may often also be different versions of
the same painting with different lighting, or scans of slightly different
reproductions of the same work. I don't know whether the algorithm is
permissive enough to pick all of these up, but as many as can be picked up
would be good to tag as "other versions" of the same underlying image.
Careful here - algorithms that spot almost-duplicates will happily
flag different shots from the same shoot. Definitely not something to
act upon without close human inspection.
In general, we probably wouldn't *remove* duplicate images, but we would
want to identify them as versions of each other.
Oh yeah, this'll be useful.
- d.