GWT should prevent the upload of duplicates. Not enough users are working on the backlog...

https://bugzilla.wikimedia.org/show_bug.cgi?id=64831


Date: Sat, 3 May 2014 04:29:12 -0300
From: bawolff@gmail.com
To: glamtools@lists.wikimedia.org
Subject: Re: [Glamtools] Advice on uploading a batch from a GLAM when individuals have already uploaded some of that GLAMs images?


On May 2, 2014 5:40 AM, "Fæ" <faewik@gmail.com> wrote:
>
> I have had many issues around this in the past. If the images are the same in quality/resolution then avoid duplicating what is currently on Commons. However, if your versions are, in your view, better quality then there is no problem uploading them, as they are not true duplicates. Digitally identical duplicates should be rejected automatically at upload because the files have matching SHA-1 checksums.

That's from the normal upload process. GWToolset may behave differently. In any case, exact duplicates can also be dealt with after the fact, since they are easy to detect later.
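
For illustration, here is a minimal sketch of how such an after-the-fact check for exact duplicates could look. It assumes the MediaWiki API's list=allimages query with its aisha1 parameter is used against Commons; the helper names (sha1_hex, duplicate_query_url) are hypothetical and this is not how GWToolset itself works. It only builds the lookup URL; fetching it and reading the JSON response is left out.

```python
import hashlib
from urllib.parse import urlencode

# Commons API endpoint (used here for illustration).
COMMONS_API = "https://commons.wikimedia.org/w/api.php"


def sha1_hex(data: bytes) -> str:
    """Return the hexadecimal SHA-1 digest of the file contents."""
    return hashlib.sha1(data).hexdigest()


def duplicate_query_url(data: bytes) -> str:
    """Build a query URL that lists existing files with the same SHA-1.

    Assumption: list=allimages with aisha1=<hex digest> returns the
    files whose content hash matches exactly.
    """
    params = {
        "action": "query",
        "list": "allimages",
        "aisha1": sha1_hex(data),
        "format": "json",
    }
    return COMMONS_API + "?" + urlencode(params)
```

If the response lists any files, the local file is byte-for-byte identical to something already on Commons. A re-encoded or resized copy has a different digest and would not be caught this way.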

> For most of my batch uploads I check whatever unique ID is suitable to see if there are matches. This can be highly useful when re-running uploads, as checking for matching filenames or image page text takes far less processing and data transfer than downloading the image file to compute SHA-1 values.
>
> PS this feels like "advanced class" techniques. Apologies if I'm a crappy teacher. :-)
>
> Fae
>
>
> On 1 May 2014 13:46, David Haskiya <david.haskiya@europeana.eu> wrote:
>>
>> Hi,
>> On behalf of the Amsterdam Museum I'm prepping a batch upload of about 300 images of their collection of paintings. They've made the selection.
>>
>> I note that a number of their paintings have already been uploaded by individual Commonists to here https://commons.wikimedia.org/wiki/Category:Paintings_in_the_Amsterdam_Museum 
>>
>> Question: Should I upload all images in "my" batch anyway even though this risks duplicating images? Is there a best practice for cases like this?
>>
>> Cheers,
>> David Haskiya
>> Product Development Manager
>> T: +31 (0)70 314 0696
>> M: +31 (0)64 217 2542
>> E: david.haskiya@europeana.eu
>>
>> Skype: davidhaskiya
>>
>> _______________________________________________
>> Glamtools mailing list
>> Glamtools@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/glamtools
>>
>
>
>
> --
> faewik@gmail.com https://commons.wikimedia.org/wiki/User:Fae
> Personal and confidential, please do not circulate or re-quote.