Hey folks,
Dario and I just updated the scholarly citations dataset to include Digital Object Identifiers. We found 742k citations (524k unique DOIs) in 172k articles. Our spot checking suggests that 98% of these DOIs resolve. The remaining 2% were extracted correctly, but they appear to be typos.
http://dx.doi.org/10.6084/m9.figshare.1299540
Like the dataset that we released for PubMed Identifiers, this dataset includes the first known occurrence of a DOI citation in an English Wikipedia article and the associated revision metadata, based on the most recent complete content dump of English Wikipedia.
Feel free to share this with anyone interested via: https://twitter.com/WikiResearch/status/564908585008627712
We'll be organizing our own work and analysis of these citations here: https://meta.wikimedia.org/wiki/Research:Scholarly_article_citations_in_Wiki...
-Aaron