GWT should prevent the upload of duplicates. Not enough users are working on the backlog...
https://bugzilla.wikimedia.org/show_bug.cgi?id=64831
Date: Sat, 3 May 2014 04:29:12 -0300
From: bawolff@gmail.com
To: glamtools@lists.wikimedia.org
Subject: Re: [Glamtools] Advice on uploading a batch from a GLAM when individuals have already uploaded some of that GLAMs images?
On May 2, 2014 5:40 AM, "Fæ" faewik@gmail.com wrote:
I have had many issues around this in the past. If the images are the same in quality/resolution, then avoid duplicating what is currently on Commons. However, if your versions are, in your view, better quality, then there is no problem uploading them, as they are not true duplicates. Digitally identical duplicates should be rejected automatically at upload because the files have matching SHA-1 checksums.
That's for normal upload. GWToolset may be different. Anyway, they can be dealt with after the fact too if they are exactly identical, as it's easy to detect later.
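A minimal sketch of the SHA-1 check discussed above, assuming the Commons API's `list=allimages` query with its `aisha1` parameter. The helper names `sha1_hex` and `duplicate_lookup_url` are made up for illustration, and this is not necessarily how the upload check or GWToolset does it. It only builds the lookup URL; fetching it and reading the JSON is left out.

```python
import hashlib
from urllib.parse import urlencode

# Assumed endpoint: the public MediaWiki API on Commons.
COMMONS_API = "https://commons.wikimedia.org/w/api.php"


def sha1_hex(data: bytes) -> str:
    """Return the hex SHA-1 digest of the raw file bytes."""
    return hashlib.sha1(data).hexdigest()


def duplicate_lookup_url(sha1: str) -> str:
    """Build a query URL listing files on Commons with this exact SHA-1."""
    params = {
        "action": "query",
        "list": "allimages",
        "aisha1": sha1,  # hex digest; only byte-identical files match
        "format": "json",
    }
    return COMMONS_API + "?" + urlencode(params)


if __name__ == "__main__":
    # Hypothetical stand-in for the bytes of a local image file.
    digest = sha1_hex(b"abc")
    print(duplicate_lookup_url(digest))
```

A file that has been re-encoded, resized or had its metadata changed gets a different digest, so this only catches exact duplicates.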
For most of my batch uploads I do a check of whatever unique ID is suitable to see if there are matches. This can be highly useful when re-running uploads, as a check of matching filenames or image page text takes far less processing and data volume than downloading the image file to create SHA-1 values.
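The unique-ID check above could be sketched roughly like this. It assumes you already have the titles and page text of the existing files (for example from a category listing) and that each record in the batch carries an identifier such as an accession number. The function name, record fields and sample values are all hypothetical.

```python
def split_batch(records, existing_pages):
    """Separate a batch into records to upload and records that already match.

    records: list of dicts with at least an "id" key (hypothetical layout).
    existing_pages: dict mapping file page title -> page wikitext.
    Returns (to_upload, already_there), where already_there maps each
    matched id to the list of page titles it was found on.
    """
    to_upload = []
    already_there = {}
    for rec in records:
        # Plain substring match on title or page text: cheap, but it can
        # give false positives (e.g. "SA 1" also matches "SA 10").
        hits = [
            title
            for title, text in existing_pages.items()
            if rec["id"] in title or rec["id"] in text
        ]
        if hits:
            already_there[rec["id"]] = hits
        else:
            to_upload.append(rec)
    return to_upload, already_there


if __name__ == "__main__":
    # Made-up sample data, not real Commons pages.
    batch = [{"id": "SA 7352"}, {"id": "SA 9999"}]
    pages = {
        "File:Portrait (SA 7352).jpg": "some description",
        "File:Other.jpg": "accession number = SA 1234",
    }
    print(split_batch(batch, pages))
```

Matches found this way still want a look by eye, since the existing file may be a lower-quality version that is worth keeping alongside or superseding.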
PS this feels like "advanced class" techniques. Apologies if I'm a crappy teacher. :-)
Fae
On 1 May 2014 13:46, David Haskiya david.haskiya@europeana.eu wrote:
Hi,
On behalf of the Amsterdam Museum I'm prepping a batch upload of about 300 images of their collection of paintings. They've made the selection.
I note that a number of their paintings have already been uploaded by individual Commonists to here https://commons.wikimedia.org/wiki/Category:Paintings_in_the_Amsterdam_Museu...
Question: Should I upload all images in "my" batch anyway even though this risks duplicating images? Is there a best practise for cases like this?
Cheers,
David Haskiya
Product Development Manager
T: +31 (0)70 314 0696
M: +31 (0)64 217 2542
E: david.haskiya@europeana.eu
Skype: davidhaskiya
Glamtools mailing list
Glamtools@lists.wikimedia.org
--
faewik@gmail.com https://commons.wikimedia.org/wiki/User:Fae
Personal and confidential, please do not circulate or re-quote.
_______________________________________________
Glamtools mailing list
Glamtools@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/glamtools