I'm at the Crossref LIVE 16 event in London where I just gave a presentation on WikiCite and Wikidata targeted at scholarly publishers.

Besides the Crossref and DataCite people, I talked to a bunch of folks interested in collaborating on Wikidata integration, particularly from PLOS, Hindawi and Springer Nature. I started an interesting discussion with Andrew Smeall, who runs strategic projects at Hindawi, and I wanted to open it up to everyone on the lists.

Andrew asked me whether – aside from efforts like ContentMine and StrepHit – there are any recommendations for publishers (especially OA publishers) on how to mark up their content to facilitate information extraction and entity matching, or even to push triples to Wikidata to be considered for ingestion.

I don't think we have a recommended workflow for data providers to suggest triples to Wikidata, other than leveraging the Primary Sources Tool. However, aligning keywords and terms with the corresponding Wikidata items via ID mapping sounds like a good first step. I pointed Andrew to Mix'n'Match as a handy way of mapping identifiers, but if you have other ideas on how best to support two-way integration of Wikidata with scholarly content, please chime in.
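
To make the "aligning keywords" step a bit more concrete, here is a rough sketch of how a publisher could look up candidate Wikidata items for a keyword using the public wbsearchentities API. This is only an illustration under my own assumptions, not a recommended workflow: the function names and the User-Agent string are made up for the example, and the candidates it returns would still need human review (or a tool like Mix'n'Match) before any mapping is recorded.

```python
import json
import urllib.parse
import urllib.request

# Public Wikidata API endpoint; wbsearchentities searches items by label/alias.
API = "https://www.wikidata.org/w/api.php"


def search_url(term, language="en", limit=5):
    """Build a wbsearchentities request URL for one keyword."""
    params = {
        "action": "wbsearchentities",
        "search": term,
        "language": language,
        "limit": limit,
        "format": "json",
    }
    return API + "?" + urllib.parse.urlencode(params)


def candidates(response):
    """Reduce a decoded API response to (item id, label, description) tuples."""
    return [
        (hit["id"], hit.get("label", ""), hit.get("description", ""))
        for hit in response.get("search", [])
    ]


def lookup(term):
    """Fetch candidate items for a keyword (needs network access)."""
    request = urllib.request.Request(
        search_url(term),
        # Hypothetical User-Agent; replace with one identifying your own tool.
        headers={"User-Agent": "keyword-mapping-sketch/0.1 (example)"},
    )
    with urllib.request.urlopen(request, timeout=10) as resp:
        return candidates(json.load(resp))
```

Calling `lookup("open access")` would return a short list of candidate items to choose from; the descriptions are what make it possible to disambiguate between items that share a label.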

Dario

Dario Taraborelli
Head of Research, Wikimedia Foundation
wikimediafoundation.org • nitens.org • @readermeter