Hi Lizzy, handling of metadata is quite problematic in Flickr approach causing that "housekeeping". We had to deal with it in our GLAM uploads and so we made a small tool called Flickr2GWToolset.
The idea is to chop data in right places BEFORE uploading with gwtoolset. For me, this was easier than doing that afterwards since I'm not familiar with tools used in commons housekeeping.
It worked quite well for us. It *might* be useful also for you if you are going to follow Flickr-route.
http://wikihacks.opendimension.org/flickr2gwtoolset/
Regards, Ari
Sent from my Debian. http://www.opendimension.org/
2015-09-28 20:23 GMT+03:00 Lizzy Jongma L.Jongma@rijksmuseum.nl:
The Flickr2Commons sounds interesting and I will def look into that.
I also think it is better to cut up the job a bit (more): 500 or 1000 objects per upload is spreading the eggs a bit more and is easier to keep track off. So I think my solution for next time will be to upload a set in smaller pieces. But if there is a pattern or unexplained drop-out: I will post it immediately!
Best wishes, Lizzy
On 28 Sep 2015, at 16:12, Fæ faewik@gmail.com wrote:
I have seen unexplained drop-outs for my direct uploads (not GWT) since 25 Sept and I see 4 files of mine created today with empty image text pages. I'd agree with leaving it for a day and trying again rather than spending a lot of time investigating.
If there is a pattern of failures of the same type, then the best thing will be to raise a Phabricator task request to analyse the cause.
Perhaps the learning point is that GWT is not a guaranteed service, something to keep in mind if working to a schedule. If the backup plan is to find an unpaid volunteer with experience in alternative batch upload methods, there are only a handful active at any one time, and most of us are over-committed with a backlog of ideas and projects.
Alternative easy workflows like Flickr2Commons are worth knowing about, as in a pinch you can "drag & drop" to a single purpose Flickr account and then mass upload so that the Files can go live to schedule, however this is likely to leave a problematic amount of "housekeeping".
Fae
On 28 September 2015 at 13:27, Hans Muller j.m.muller@hccnet.nl wrote:
Dear all,
Lizzy: i'd say: restart your job, i did the same at 13:48 h (11:48 wikipedia time).
My latest upload before that was at 11:04, then nothing so the job had gone somehow. Now my small job to test the waters runs ok again.
Groetjes, hansmuller
-- faewik@gmail.com https://commons.wikimedia.org/wiki/User:Fae
Glamtools mailing list Glamtools@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/glamtools
Glamtools mailing list Glamtools@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/glamtools