Hey folks,
Dario and I just updated the scholarly citations dataset to include Digital
Object Identifiers. We found 742k citations (524k unique DOIs) in 172k
articles. Our spot checking suggests that 98% of these DOIs resolve. The
remaining 2% were extracted correctly, but they appear to be typos.
http://dx.doi.org/10.6084/m9.figshare.1299540
Like the dataset that we released for PubMed Identifiers, this dataset includes
the first known occurrence of a DOI citation in an English Wikipedia
article and the associated revision metadata, based on the most recent
complete content dump of English Wikipedia.
Feel free to share this with anyone interested via:
https://twitter.com/WikiResearch/status/564908585008627712
We'll be organizing our own work and analysis of these citations here:
https://meta.wikimedia.org/wiki/Research:Scholarly_article_citations_in_Wik…
-Aaron