Hi Andy,
Thanks, there seems to be quite a lot of work to do in this area indeed!
On 20/02/2018 19:49, Andy Mabbett wrote:
As an example, I created 'KoreaMed Unique
Identifier':
https://www.wikidata.org/wiki/Q47489994
How could we improve that? What additional properties might we need?
What properties already exist, that we could make use of?
I have recently proposed to create a "number of records" property to
store the number of identifiers in a given scheme:
https://www.wikidata.org/wiki/Wikidata:Property_proposal/number_of_records
This property could typically apply here. The idea behind this property
is that we could compare its values to the number of uses of the
corresponding property in Wikidata.
One other thing I would love to see happening on Wikidata is keeping
track of the links between identifier schemes. If identifier X and
identifier Y are both used by the same database Z, then we can probably
use Z to "match" X to Y and conversely.
If we had many "uses (P2283)" and "used by (P1535)" statements to
link
identifiers to databases, we could then draw a graph of identifiers and
databases using them. Given two identifiers, we could analyze the paths
between these two identifiers…
For now the graph is a bit sparse:
http://tinyurl.com/y89u3enx
(And you can already see one issue: even if we have a path from ORCID to
ISNI, that does not mean that we can convert an ORCID id to an ISNI for
the same person via this path, as GRID contains ISNIs for organizations
only…)
Thanks a lot Andy for adding such statements on
https://www.wikidata.org/wiki/Q43649390 by the way!
Also, this query:
http://tinyurl.com/y6wdrbhd
returns over 5000 instances/ subclasses of "unique identifier"
(Q6545185) but includes both /types/ of identifiers (like the example
above) and individual identifier values, like ".ar" as an internet TLD
(domain name itself - Q32635 - is a subclass, not an instance, of UID)
- how should we distinguish between the two classes?
Urgh, that's messy. I think I would just change the ontology:
"domain name" (Q32635) should not be a subclass of "unique
identifier"
(Q6545185), but rather an instance of it. (Actually the uniqueness is
debatable, I don't think DNS is meant to enforce any uniqueness at all,
as it is very common for a website to have multiple domain names. So
maybe just "domain name" "instance of" "identifier
(Q853614)" would do).
Antonin