On May 2, 2014 5:40 AM, "Fæ" faewik@gmail.com wrote:
I have had many issues around this in the past. If the images are the
same in quality/resolution, then avoid duplicating what is currently on Commons. However, if your versions are, in your view, of better quality, then there is no problem uploading them, as they are not true duplicates. Digitally identical duplicates should be rejected automatically at upload, as the files have matching SHA-1 checksums.
That's from normal upload. GWToolset may be different. Anyway, they can also be dealt with after the fact if they are exactly identical, as it's easy to detect later.
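For anyone wanting to check for exact duplicates themselves, here is a minimal sketch in Python, not a description of what the upload form or GWToolset does internally. It hashes a local file and builds the parameters for a MediaWiki API query (`list=allimages` with `aisha1`) that should list any file on the wiki with the same SHA-1. The function names are made up for illustration, and sending the request to https://commons.wikimedia.org/w/api.php is left to whatever HTTP client you prefer.

```python
import hashlib


def sha1_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-1 digest of a local file, reading it in chunks."""
    digest = hashlib.sha1()
    with open(path, "rb") as handle:
        # Read in fixed-size chunks so large scans need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def duplicate_query_params(sha1_hex):
    """Build API parameters asking for files whose SHA-1 equals sha1_hex.

    Assumption: the target wiki exposes the standard list=allimages module.
    """
    return {
        "action": "query",
        "list": "allimages",
        "aisha1": sha1_hex,
        "format": "json",
    }
```

An empty `allimages` list in the response would suggest that no byte-identical file exists yet. A re-encoded or resized copy of the same painting has a different hash and will not be caught this way.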
For most of my batch uploads I do a check of whatever unique ID is
suitable to see if there are matches. This can be highly useful when re-running uploads, as checking for matching filenames or image page text takes far less processing and data transfer than downloading the image file to create SHA-1 values.
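A rough sketch of that kind of pre-check in Python, assuming each record in the batch carries an identifier (for example an inventory number) and that you have already fetched the titles of the files in the relevant category. The record layout, the function names and the sample IDs in the usage note are hypothetical, and this is not Fae's actual script.

```python
import re


def normalise(text):
    """Lower-case and treat underscores and runs of whitespace as one space."""
    return re.sub(r"[\s_]+", " ", text).strip().lower()


def split_batch(records, existing_titles):
    """Split records into (to_upload, already_present).

    A record counts as already present when its ID appears as a whole
    token in any existing title, so "SA 73" does not match "SA 7352".
    """
    titles = [normalise(title) for title in existing_titles]
    to_upload, already_present = [], []
    for record in records:
        needle = re.escape(normalise(record["id"]))
        pattern = re.compile(r"(?<!\w)" + needle + r"(?!\w)")
        if any(pattern.search(title) for title in titles):
            already_present.append(record)
        else:
            to_upload.append(record)
    return to_upload, already_present
```

With the made-up title `File:Some painting - SA_7352 - Amsterdam Museum.jpg`, a record with ID `SA 7352` lands in `already_present` and one with ID `SA 73` stays in `to_upload`. Matches are only candidates for review, since an existing file may be a lower-quality version worth superseding.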
PS this feels like "advanced class" techniques. Apologies if I'm a crappy
teacher. :-)
Fae
On 1 May 2014 13:46, David Haskiya david.haskiya@europeana.eu wrote:
Hi, On behalf of the Amsterdam Museum I'm prepping a batch upload of about
300 images of their collection of paintings. They've made the selection.
I note that a number of their paintings have already been uploaded by
individual Commonists to here https://commons.wikimedia.org/wiki/Category:Paintings_in_the_Amsterdam_Museu...
Question: Should I upload all images in "my" batch anyway even though
this risks duplicating images? Is there a best practice for cases like this?
Cheers, David Haskiya
David Haskiya
Product Development Manager
T: +31 (0)70 314 0696 M: +31 (0)64 217 2542 E: david.haskiya@europeana.eu
Skype: davidhaskiya
Europeana makes Europe’s culture available for all, across borders and
generations and for creative re-use – follow how at #AllezCulture
Glamtools mailing list Glamtools@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/glamtools
-- faewik@gmail.com https://commons.wikimedia.org/wiki/User:Fae Personal and confidential, please do not circulate or re-quote.