On 10 February 2015 at 00:40, Dario Taraborelli dtaraborelli@wikimedia.org wrote:
Do DoIs not incude check digits?
they don’t, validation can be done via the CrossRef API or the DOI resolver. This method is not 100% reliable, especially when DOIs include special characters. CrossRef advised to use a 200 HTTP response code from the resolver with a noredirect flag (e.g. http://dx.doi.org/%7Bdoi%7D?noredirect=true) as an indication that the DOI is valid and resolves.
We should test for tehse in citation templates. Does your data show which templates (if any) the broken DoIs were in?
we haven’t checked if these errors occur systematically within specific templates, but we know that the code extracted them correctly with no parsing errors.
Thank you. I was thinking more of ensuring that check digits were verified by the template code; but that's clearly not possible. It's also not possible for a template to check a DOI by an http request; though a bot could do so.
We’ll share the list of broken DOIs so they can be reviewed and fixed.
Thank you.