The common-scanning bot doesn’t need to work on new uploads. In principle you can do a
query to get all id’s where there is a picture for, that is easy doable by query on the
database, get the id’s without a picture from wikidata, and then check if you have a match
between the two. That is how it worked before as well. The only thing you need is regulary
updating wikidata about the pictures in the lists.
Regards,
André
On 27 jul. 2015, at 11:47, Jane Darnell <jane023(a)gmail.com> wrote:
Magnus,
Wonderful set of links, thanks! And of course Mix-n-Match is a great idea - maybe the
whole monuments database can go in there? The ecosystem is not well documented (or at
least no where central I think). My gut feeling is that Listeria should be used to
generate the lists on WIkipedia, though this should maybe be a two-step process (see the
previous discussion about sorting issues and units and so forth). I agree with the generic
bit, as I see lots of overlap with something like "Wiki Loves Art" for
example.
As for the Commons-scanning bot that looks at recent WLM uploads, my feeling on that one
is that we need to catch those with the current upload-campaign tooling somehow. All of
that was built prior to the birth of Wikidata as well.
Jane
On Mon, Jul 27, 2015 at 11:26 AM, Magnus Manske <magnusmanske(a)googlemail.com>
wrote:
Hi all,
I just subscribed to this list, after it was mentioned to me. I'd like to give a
brief overview of what I can offer in terms of WLM and Wikidata. I skimmed the July
mailing list archive about Wikidata, but please forgive me if I repeat something already
said.
Last year, I wrote the official WLM tool for the UK. As part of that exercise, I also
created Wikidata items for all grade I and II* monuments in the UK on Wikidata.
As a "side effect", I also created a tool to browse any monument with an ID
number on Wikidata:
https://tools.wmflabs.org/wlmuk/index_wd.html
I do not consider this "production grade" at the moment, but it could be seen
as a prototype. And while it currently requires a country-specific property (or multiple),
it could be extended to also use anything with P1435 ("heritage status") within
a country. That would abolish the need for unique, stable IDs, though it will make
management of these objects a little harder of Wikidata.
As for syncing with Wikipedia(s), I can offer a list-generating and -updating bot already
in in production on English Wikipedia and Wikidata, as described here:
http://magnusmanske.de/wordpress/?p=301
This could be used as an automated to-do-list (e.g. a list of all monuments in Lower
Saxony without an image on Wikidata), or a total overview of these items (with image where
available). This will require an image to be set in the Wikidata item, and it will only
show one image, so it might not be suited as a show-the-latest-image-of-X, and it will not
search Commons on its own.
For synchronizing a list of monuments with Wikidata, I can offer my trusty
mix'n'match tool:
https://tools.wmflabs.org/mix-n-match/
I can upload any list that consists of names and IDs (which can be fake), optionally a
brief description and a URL (pointing to an external catalog). The tool offers various
ways to match entries to existing Wikidata items, or declare entries as "not on
Wikidata yet", so they can be batch-created.
To find existing images for monuments, I can offer WD-FIST:
https://tools.wmflabs.org/fist/wdfist/index.html
For a set of items, it can scan Wikipedia (and other projects) for images, and present
them for one-click addition to Wikidata. It can even do Commons free text search, though
there tend to be quite a few false positives. At the very least, this could be a
"from home" activity for WLM enthusiasts.
A missing piece in this "ecosystem" appears to be a Commons-scanning bot that
looks at recent WLM uploads, and recognizes IDs of monuments, or maybe just names, and
generates a list of suggested matches between Commons and Wikidata (and, optionally,
Wikipedia). Such a bot could surely be arranged, given detailed enough specifications.
While I can not promise to devote as much time as last year on this, I'll help where
I can. One point I would like to emphasize is that, whatever we come up with, I would like
it to be generic enough to work for all, or at least most, sets of monuments, or even
beyond that; every set of items with coordinates and/or "in admin unit" property
should work out-of-the-box, and some parts (e.g. WD-FIST, Listeria) already work fine for
biographies, taxons, etc.
Cheers,
Magnus
_______________________________________________
Wiki Loves Monuments mailing list
WikiLovesMonuments(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikilovesmonuments
http://www.wikilovesmonuments.org
_______________________________________________
Wiki Loves Monuments mailing list
WikiLovesMonuments(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikilovesmonuments
http://www.wikilovesmonuments.org