The event built a lot of momentum around the definition of data models, workflows and technology needed to better represent source and citation data from Wikimedia projects, Wikidata in particular.
While we're still drafting a human-readable
report, I thought I'd share a preview of the notes from the various workgroups, to give you a sense of what we worked on and to let everyone join the discussion:
Main workgroups
Discuss and draft data models to represent different types of sources as Wikidata items
Design or improve tools to extract identifiers and bibliographic data from Wikipedia citation templates, look up and retrieve metadata
Discuss how to express the citation of a source in a Wikimedia artifact (such as a Wikipedia article, a Wikidata statements etc.) and review alternative ways to represent them
Improve tools for semi-automated statement and reference creation (StrepHit, ContentMine)
Identify use cases for SPARQL queries involving source metadata. Obtain a small open licensed bibliographic and citation graph dataset to build a proof-of-concept of the querying and visualization potential of source metadata in Wikidata.
Additional workgroups
Add license information to Wikidata to make Wikidata the central hub on license information on databases
Merge groups working on citation structure and source metadata models and integrate their recommendations
Extend Citoid to write source metadata into Wikidata
We're opening up the
wikicite-discuss@wikimedia.org mailing list to anyone interested in interacting with the participants in the event (we encouraged them to use the official wikidata list for anything of interest to the broader community). Phabricator also has a
dedicated tag for related initiatives.
The event was generously
funded by the Alfred P. Sloan Foundation, the Gordon and Betty Moore Foundation, and Crossref. We'll be exploring the feasibility of a follow-up event in the next 6-12 months to continue the work we started in Berlin and bring in more people than we could host due to funding/capacity.
Best,
Dario
on behalf of the organizers