After glancing at https://www.wikidata.org/wiki/User:Ladsgroup/Kian/Possible_mistakes/frFilm, it doesn't appear to me that either Wikidata type hierarchy or Wikipedia category hierarchy is being considered when evaluating type mismatches. Is that intentional?
For example
Grave of the Fireflies (Q274520) https://www.wikidata.org/wiki/Q274520NoYes (0.731427666987)
is an instance of animated film which is a subtype of film.
Conversely, this telefilm d'horreur
Le Collectionneur de cerveaux (Q579355) https://www.wikidata.org/wiki/Q579355YesNo (0.239868037957
is part of a subcategory of film d'horreur -> film de fiction
The one other that I glanced at, https://www.wikidata.org/wiki/User:Ladsgroup/Kian/Possible_mistakes/frHuman, seems to have systematic issues with correct classification of Wikipedia pages about multiple people (e.g. brothers) which Wikidata correctly identifies as not people.
It also, strangely, seems to think that Wikidata atomic elements are humans and I can't see why:
calcium (Q706) https://www.wikidata.org/wiki/Q706YesNo (0.0225392419603)
Have you considered using other signals as inputs to your models? For example, Freebase types should be a pretty reliable signal for things like humans and films.
Tom
On Sun, Aug 30, 2015 at 11:56 AM, Amir Ladsgroup ladsgroup@gmail.com wrote:
Thanks Nemo!
I added new reports: https://www.wikidata.org/wiki/User:Ladsgroup/Kian/Possible_mistakes
If you check them, you can easily find tons of errors, some of them are mis-categorization in Wikipedia, some of them are mistake in connecting article from Wikipedia to wrong item, some of them are vandalism in Wikidata, some of them are mistakes by bots or Widar users. Please check them if you want to have better quality in Wikidata
Best
On Sun, Aug 30, 2015 at 12:16 PM Federico Leva (Nemo) nemowiki@gmail.com wrote:
Amir Ladsgroup, 28/08/2015 20:17:
Another thing I did is reporting possible mistakes, when Wikipedia and Wikidata don't agree on one statement,
Nice, with this Wikidata has better quality control systems than Wikipedia. ;-)
Nemo
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata