Hey folks,
Dario and I just updated the scholarly citations dataset to include Digital
Object Identifiers. We found 742k citations (524k unique DOIs) in 172k
articles. Our spot checks suggest that 98% of these DOIs resolve. The
remaining 2% were extracted correctly, but they appear to be typos in the
source articles.
http://dx.doi.org/10.6084/m9.figshare.1299540
Like the dataset that we released for PubMed Identifiers, this dataset includes
the first known occurrence of a DOI citation in an English Wikipedia
article and the associated revision metadata, based on the most recent
complete content dump of English Wikipedia.
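For anyone wondering what "first known occurrence" means in practice, here is a minimal sketch in Python. It is an illustration only and not necessarily how the dataset was built: the regular expression, the function names, and the (rev_id, timestamp, text) revision tuples are all assumptions made for the example.

```python
import re

# Hypothetical pattern (an assumption, not the dataset's extractor):
# "10.", a 4-9 digit registrant code, "/", then a suffix that runs until
# whitespace or a common wikitext delimiter.
DOI_RE = re.compile(r'\b10\.\d{4,9}/[^\s|\]}<>"]+')


def extract_dois(text):
    """Return the set of DOI-like strings found in a piece of wikitext."""
    # Strip trailing sentence punctuation that is unlikely to belong to the DOI.
    return {m.group(0).rstrip(".,;") for m in DOI_RE.finditer(text)}


def first_occurrences(revisions):
    """Map each DOI to the (rev_id, timestamp) of the earliest revision citing it.

    `revisions` is an iterable of (rev_id, timestamp, text) tuples for one
    article, ordered oldest first.
    """
    first_seen = {}
    for rev_id, timestamp, text in revisions:
        for doi in extract_dois(text):
            # setdefault keeps the earliest revision and ignores later ones.
            first_seen.setdefault(doi, (rev_id, timestamp))
    return first_seen


# Made-up revision history for a single article.
revisions = [
    (1, "2010-01-01T00:00:00Z", "No citations yet."),
    (2, "2011-06-15T12:00:00Z", "{{cite journal |doi=10.1000/xyz123 |title=A}}"),
    (3, "2012-03-02T08:30:00Z",
     "{{cite journal |doi=10.1000/xyz123}} {{cite journal|doi=10.2000/abc.def}}"),
]
result = first_occurrences(revisions)
```

Walking the revisions oldest first means each DOI is recorded once, alongside the metadata of the revision that introduced it.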
Feel free to share this with anyone interested via:
https://twitter.com/WikiResearch/status/564908585008627712
We'll be organizing our own work and analysis of these citations here:
https://meta.wikimedia.org/wiki/Research:Scholarly_article_citations_in_Wik…
-Aaron
Hey all,
We just released a dataset of scholarly citations in the English Wikipedia, identified by PubMed ID (PMID) or PubMed Central ID (PMCID).
http://dx.doi.org/10.6084/m9.figshare.1299540
The dataset currently includes the first known occurrence of a PMID or PMCID citation in an English Wikipedia article and the associated revision metadata, based on the most recent complete content dump of English Wikipedia. We’re planning to expand this dataset to include other types of scholarly identifiers soon.
Feel free to share this with anyone interested or spread the word via: https://twitter.com/WikiResearch/status/562422538613956608
Dario and Aaron