It's certainly an idea to keep track of what's in Wikidata, and what types of categories and infoboxes have or have not had information transferred.
There are some queries that give top-level round numbers for the UK and Ireland at https://www.wikidata.org/wiki/Wikidata:WikiProject_UK_and_Ireland#Stats, which could be adapted straightforwardly to other countries, plus queries to examine some of the more obvious gaps and anomalies at
https://www.wikidata.org/wiki/Wikidata_talk:WikiProject_UK_and_Ireland#To-do...
Looking through the results of the Autolist queries shows that some really quite odd subclasses are getting into the trees of these top-level classes; in particular, the subclass tree of "event" looks to need considerable cleaning up.
I've been meaning to take the counts down a few more levels, to see which of the immediate subclasses of the top-level classes seem to be most populated (and/or most under-populated), but haven't had a moment to do that yet.
-- J.
On 11/03/2015 14:08, Markus Krötzsch wrote:
Hi Andrew,
This is a great idea! It would help data consumers to know what to expect and community members to know what to put in (or where help with imports would be appreciated). Moreover, the discussion about this list would be a great way to structure our work in general (by having documented discussions about our goals for certain types of data). I feel that the bot rights approval process is not the best place to decide whether we strive to have all streets or all lighthouses in.
For things that are not complete in Wikidata (yet or ever), it would further help to provide pointers to other, more complete data sources (and the properties we might have to link to them).
The question is how best to organise this list. Your initial example setup already shows that it tends to become very diverse (not to say chaotic). One could link it from the related class items (e.g., lighthouses or paintings), but having it as yet another load on the talk page would maybe not be so ideal either. After all, this could be one of the first things that newcomers to Wikidata want to get an idea about.
Cheers,
Markus
On 11.03.2015 14:07, Andrew Gray wrote: ...
I wonder if it would be useful to have a centralised list of "classes of things in Wikidata". For example:
Things entirely in Wikidata
- MEPs
- County-level administrative divisions of all countries
- All artworks by the following people (list)
- Cultural heritage sites in the following countries (list)
- All people listed in the following biographical databases (list)
- (etc)
Things not yet entirely in Wikidata (but probably will be eventually)
- All national-level elected representatives
- All species
- Lighthouses
- All artworks by the following people (list)
- Cultural heritage sites in the following countries (list)
- All people listed in the following biographical databases (list)
Things which will never be complete in Wikidata
- All local politicians
- Streets worldwide
- All businesses
This would be a very useful adjunct to the notability page, as it would give concrete examples to work from for the sort of things we feel are appropriate.
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l