Today I wrote a small script https://github.com/nemobis/bots/blob/master/doi-doai-openaccess.py that finds, among existing DOI links, those which are available in open access via DOAI.io.
I'm now running the script for the ~40 most visited Wikipedias, but here is the output for the Italian Wikipedia (430 DOIs): https://it.wikipedia.org/wiki/Progetto:Coordinamento/Bibliografia_e_fonti/DO...
I've asked for those links to be added to, or to replace, the existing ones; I think the same should be done on other wikis as well: https://it.wikipedia.org/w/index.php?title=Wikipedia%3ABot%2FRichieste&t...
The next step will be to search for DOIs which are mentioned in the articles but not linked, or that are not linked via DOI.org, or that are indicated via their handle instead; and, even harder, to find DOIs corresponding to citations which don't mention the DOI at all. What's the best reusable code/tool for this? I remember https://github.com/CrossRef/baleen and https://github.com/edsu/linkypedia but they're not quite the same thing.
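As a rough illustration of the first part of that search (DOIs that are mentioned but not linked via DOI.org), a minimal Python sketch might look like the following. The regular expression, the `find_dois` name and the sample wikitext are all assumptions made for illustration; they are not taken from the script linked above, and the pattern will miss or over-match some DOIs.

```python
import re

# Pragmatic DOI pattern (an assumption, not an official grammar):
# "10.", a 4-9 digit registrant code, "/", then a suffix that stops at
# whitespace or at characters that usually delimit wikitext.
DOI_RE = re.compile(r'10\.\d{4,9}/[^\s"<>|\]}]+')

# Matches when the text right before a DOI ends with a doi.org URL prefix.
LINK_PREFIX_RE = re.compile(r'(?:https?:)?//(?:dx\.)?doi\.org/$', re.IGNORECASE)


def find_dois(wikitext):
    """Return (linked, unlinked) sets of DOI strings found in wikitext.

    "linked" means the DOI is directly preceded by a doi.org or dx.doi.org
    URL prefix; everything else (template parameters, "doi:" mentions,
    handle URLs, ...) is reported as unlinked.
    """
    linked, unlinked = set(), set()
    for match in DOI_RE.finditer(wikitext):
        # Drop sentence punctuation that the pattern may have swallowed.
        doi = match.group(0).rstrip('.,;')
        # Look only at the few characters immediately before the match.
        prefix = wikitext[max(0, match.start() - 20):match.start()]
        if LINK_PREFIX_RE.search(prefix):
            linked.add(doi)
        else:
            unlinked.add(doi)
    # A DOI that is linked anywhere in the text is not reported as unlinked.
    return linked, unlinked - linked


sample = (
    "See [https://doi.org/10.1000/abc123 paper] and "
    "{{cite journal|doi=10.1234/xyz.5678}}; also doi:10.5555/foo."
)
linked, unlinked = find_dois(sample)
print(sorted(linked))
print(sorted(unlinked))
```

The harder case, citations that never mention a DOI, is not covered here; it would need a bibliographic lookup rather than pattern matching.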
Nemo
Federico, you should join wikicite-discuss@wikimedia.org and forward this there
On Jun 2, 2016, at 12:21 PM, Federico Leva (Nemo) nemowiki@gmail.com wrote:
Dario Taraborelli, 02/06/2016 12:32:
Federico, you should join wikicite-discuss@wikimedia.org and forward this there
The ML doesn't appear to exist: https://lists.wikimedia.org/mailman/listinfo/
Nemo
Hey Nemo,
Please check out https://en.wikipedia.org/wiki/WP:OABOT
I think we have actually crafted a more robust and scalable solution that includes DOAI, and we want the next step to be a bot (eventually working on multiple languages).
Would you like to be involved?
Best,
Jake Orlowitz
The Wikipedia Library
On Thu, Jun 2, 2016 at 3:21 AM Federico Leva (Nemo) nemowiki@gmail.com wrote:
OpenAccess mailing list OpenAccess@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/openaccess
Jake Orlowitz, 02/06/2016 23:51:
Please check out https://en.wikipedia.org/wiki/WP:OABOT
I already did and I commented on the talk page; the proposal there is to act on a specific dialect of cite templates, while DOI links are used in many more ways.
Nemo
After a few tweaks, I ended up with a list of about 180k DOIs. https://zenodo.org/record/54799
Nemo
Federico Leva (Nemo), 02/06/2016 12:21: