GWT should prevent the upload of duplicates. Not enough users are working on the
backlog...
https://bugzilla.wikimedia.org/show_bug.cgi?id=64831
Date: Sat, 3 May 2014 04:29:12 -0300
From: bawolff(a)gmail.com
To: glamtools(a)lists.wikimedia.org
Subject: Re: [Glamtools] Advice on uploading a batch from a GLAM when individuals have
already uploaded some of that GLAMs images?
On May 2, 2014 5:40 AM, "Fæ" <faewik(a)gmail.com> wrote:
I have had many issues around this in the past. If the
images are the same in quality/resolution then avoid duplicating what is currently on
Commons. However if your versions are, in your view, better quality then there is no
problem uploading them as they are not true duplicates. Digitally identical duplicates
should be rejected automatically at upload because the files have matching SHA-1 checksums.
That's for normal uploads. GWToolset may be different. Anyway, exact duplicates can also be
dealt with after the fact, since they are easy to detect later.
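A minimal sketch of the SHA-1 comparison mentioned above, in Python. It hashes a file's bytes and builds a lookup URL for the Commons API. The use of `list=allimages` with the `aisha1` parameter is an assumption to verify against the MediaWiki API documentation, and the helper names are hypothetical. Nothing here shows how GWToolset itself behaves.

```python
import hashlib
from urllib.parse import urlencode

# Assumed API endpoint for Wikimedia Commons.
COMMONS_API = "https://commons.wikimedia.org/w/api.php"


def sha1_hex(data: bytes) -> str:
    """Return the hexadecimal SHA-1 digest of the given file contents."""
    return hashlib.sha1(data).hexdigest()


def duplicate_lookup_url(digest: str) -> str:
    """Build a query URL asking for files whose SHA-1 equals `digest`.

    Assumption: list=allimages accepts an `aisha1` filter.
    """
    params = {
        "action": "query",
        "list": "allimages",
        "aisha1": digest,
        "format": "json",
    }
    return COMMONS_API + "?" + urlencode(params)


if __name__ == "__main__":
    digest = sha1_hex(b"abc")  # stand-in for the bytes of an image file
    print(digest)
    print(duplicate_lookup_url(digest))
```

Fetching that URL and checking whether the returned list is empty would indicate whether a byte-identical file already exists. A re-encoded or resized copy has a different digest and would not be caught.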
For most of my batch uploads I do a check of whatever
unique ID is suitable to see if there are matches. This can be highly useful when
re-running uploads, as a check of matching filenames or image page text takes a lot less
processing and data volume than downloading the image file to create SHA-1 values.
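A rough sketch of that kind of unique-ID check, in Python. It assumes you already have a list of existing file titles (or page text) and a list of the institution's identifiers for the batch. The function name and the example IDs and titles are hypothetical.

```python
from typing import Iterable, List, Tuple


def split_by_existing_id(
    batch_ids: Iterable[str], existing_titles: Iterable[str]
) -> Tuple[List[str], List[str]]:
    """Split batch IDs into (already present, still to upload).

    An ID counts as present if it appears, case-insensitively, as a
    substring of any existing title. This is a plain substring match,
    so a short ID can falsely match a longer one.
    """
    titles = [title.casefold() for title in existing_titles]
    already, todo = [], []
    for item_id in batch_ids:
        needle = item_id.casefold()
        if any(needle in title for title in titles):
            already.append(item_id)
        else:
            todo.append(item_id)
    return already, todo


if __name__ == "__main__":
    # Hypothetical data for illustration only.
    existing = ["File:Portrait (Amsterdam Museum SA 7352).jpg"]
    batch = ["SA 7352", "SA 9999"]
    print(split_by_existing_id(batch, existing))
```

Because no image data is downloaded, this can be run over a whole batch cheaply. Matches are only candidates and still need a manual look, since the existing file may be a lower-quality version.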
PS this feels like "advanced class"
techniques. Apologies if I'm a crappy teacher. :-)
Fae
On 1 May 2014 13:46, David Haskiya
<david.haskiya(a)europeana.eu> wrote:
>
> Hi,
> On behalf of the Amsterdam Museum I'm prepping a batch upload of about 300 images of
> their collection of paintings. They've made the selection.
>
> I note that a number of their paintings have already been uploaded by individual
> Commonists to here
> https://commons.wikimedia.org/wiki/Category:Paintings_in_the_Amsterdam_Muse…
>
> Question: Should I upload all images in "my" batch anyway even though this risks
> duplicating images? Is there a best practice for cases like this?
>
> Cheers,
> David Haskiya
>
> _______________________________________________
> Glamtools mailing list
> Glamtools(a)lists.wikimedia.org
>
--
Personal and confidential, please do not circulate or
re-quote.
_______________________________________________
Glamtools mailing list
Glamtools(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/glamtools