Erik Moeller wrote:
> Using the Flickr API, I am building a database of free photos from Flickr, and users can apply for access to the frontend to review slices of 1,000 photos each. After a slice is finished, I review it and run the upload bot to upload the selected photos to the Commons. See the above page for more information.
This sounds fantastic, but I worry about a few things.
How accurate is the metadata at Flickr? Presumably for photos that people take themselves and upload, it is 100% accurate by definition.
But I worry about copyvios at Flickr leaking into Commons.
One of the things that prevents rampant copyvios at Wikimedia projects generally is community reputation. It is essentially impossible to imagine any prominent contributor uploading copyvios and lying about the license data to Wikipedia itself. And if we ever caught someone doing so, we would quickly review all of their contributions and nuke them all.
But if we're importing large quantities of questionably-licensed data from Flickr, and then Flickr bans the person for doing something wrong, how do we know about it?
This is not an insurmountable problem, of course.
Reviewing things 1,000 at a time sounds reasonable, but we need to be pretty rigorous somehow.
> Please help by applying for access to a slice of Flickr. Best send me a private email with a link to your username so I can look at your past contributions.
I'm known as user Jimbo Wales in most projects. Most of my edits are in English Wikipedia, though there are still not that many. I think if you ask around, though, despite my weak history of editing, a lot of people know me and will tell you that I'm OK. :-)
--Jimbo