On Tue, 31 Oct 2017, at 04:04 PM, Nicolas VIGNERON wrote:
2017-10-31 5:21 GMT+01:00 Sam Wilson sam@samwilson.id.au:
I've just been experimenting with adding a quick-statements generator to https://tools.wmflabs.org/ws-search — search for something, with 'linked to wikidata = no', and then each row of the results has a popup with code you can copy and paste into quickstatements.
I don't know if that sort of thing is useful. It's not working with many fields yet, but I'm finding it faster than entering the basics manually.
—sam.
Interesting.
It's a good start but I see a lot of things that need heavy improvements. I tried for br and fr Wikisources.
Most importantly not everything on Wikisource is an edition (there is disambig page and editions pages, for instance fr:s:Accroupissements a a work not an edition, it should detect that and change the P31 accordingly) Then the description is put on « Dfr "edition of Accroupissements by , Arthur Rimbaud" », it should be Den not Dfr, there shouldn't be a comma here, and ideally we should add description in multiple languages (maybe with an external bot ?). Some others data could probably easily be added (P996[https://www.wikidata.org/wiki/Property:P996] and P1957[https://www.wikidata.org/wiki/Property:P1957] at least and maybe P1476[https://www.wikidata.org/wiki/Property:P1476] too)
Yes it's certainly a first draft!! :-) Thanks for trying it out.
With the disambig pages, can you suggest how to detect them?
Ah, there's a couple of other bugs here:
The page https://fr.wikisource.org/wiki/Accroupissements actually already has a WIkidata ID, but the ws-search database didn't know about it :-( probably because it was failing for a while on some weird problems. I've re-run the scraper, and now that work is showing up with it's proper Q-number: https://tools.wmflabs.org/ws-search/?title=Accroupissements&author=&...
The idea with the quickstatements is that it'll only show it for works that are *not yet* linked to wikidata. This is where the disambig problem comes in, because there doesn't seem to be a simple way to determine what's an edition and what's a work without resorting to Wikidata. We could look at categories? Is it a truth universally acknowledged that pages in the categories defined as https://www.wikidata.org/wiki/Q15939659 are all disambiguation pages? That could work...
:-) Thanks again! I'll fix up the i18n stuff. The tool isn't doing anything at all yet, but I could register it on translatewiki if it's thought to be useful enough?
—sam