I guess this question for me is how do we do this in practice? How do we make sure Wikidata stays up to date/synced with external databases we think are important?

On 7 September 2017 at 20:51, Marco Fossati <fossati@spaziodati.eu> wrote:
Hi everyone,

As a data quality addict, I've been investigating the coverage of external identifiers linked to Wikidata items about people.

Given the numbers on SQID [1] and some SPARQL queries [2, 3], it seems that even the second most used ID (VIAF) only covers *25%* of people items circa.
Then, there is a long tail of IDs that are barely used at all.

So here is my question:
*which external identifiers deserve an effort to achieve exhaustive coverage?*

Looking forward to your valuable feedback.
Cheers,

Marco

[1] https://tools.wmflabs.org/sqid/#/browse?type=properties "Select datatype" set to "ExternalId", "Used for class" set to "human Q5"
[2] total people: http://tinyurl.com/ybvcm5uw
[3] people with a VIAF link: http://tinyurl.com/ya6dnpr7

_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata