After glancing at https://www.wikidata.org/wiki/User:Ladsgroup/Kian/Possible_mistakes/frFilm, it doesn't appear to me that either the Wikidata type hierarchy or the Wikipedia category hierarchy is being considered when evaluating type mismatches. Is that intentional?
For example,

Grave of the Fireflies (Q274520) No Yes (0.731427666987)
is an instance of animated film, which is a subtype of film.

Conversely, this telefilm d'horreur

Le Collectionneur de cerveaux (Q579355) Yes No (0.239868037957)
is part of a subcategory of film d'horreur -> film de fiction.

The one other that I glanced at, https://www.wikidata.org/wiki/User:Ladsgroup/Kian/Possible_mistakes/frHuman, seems to have systematic issues with correct classification of Wikipedia pages about multiple people (e.g. brothers), which Wikidata correctly identifies as not people.
It also, strangely, seems to think that Wikidata atomic elements are humans and I can't see why:
calcium (Q706) Yes No (0.0225392419603)
Have you considered using other signals as inputs to your models? For example, Freebase types should be a pretty reliable signal for things like humans and films.
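
To make the hierarchy point concrete, here is a minimal sketch of the kind of check I have in mind. It is not Kian's actual code: the hard-coded "subclass of" (P279) and "instance of" (P31) data and the function name is_instance_of are made up for illustration, and a real version would read these statements from Wikidata instead.

```python
from collections import deque

# Toy "subclass of" (P279) edges, hard-coded for illustration only.
SUBCLASS_OF = {
    "animated film": {"film"},
    "horror film": {"film"},
}

# Toy "instance of" (P31) statements, keyed by item ID.
INSTANCE_OF = {
    "Q274520": {"animated film"},  # Grave of the Fireflies
}


def is_instance_of(item, target):
    """Return True if item is an instance of target or of any
    class that reaches target by following subclass-of edges."""
    queue = deque(INSTANCE_OF.get(item, ()))
    seen = set()
    while queue:
        cls = queue.popleft()
        if cls == target:
            return True
        if cls in seen:
            continue  # guard against cycles in the hierarchy
        seen.add(cls)
        queue.extend(SUBCLASS_OF.get(cls, ()))
    return False


# A direct comparison of P31 against "film" would flag a mismatch,
# but walking the hierarchy shows the item is a film after all.
print(is_instance_of("Q274520", "film"))
```

With a check like this, an item typed as animated film would not be reported as disagreeing with a Wikipedia page categorized as a film.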
Tom

On Sun, Aug 30, 2015 at 11:56 AM, Amir Ladsgroup <ladsgroup@gmail.com> wrote:
Thanks Nemo!
I added new reports:
If you check them, you can easily find tons of errors: some of them are miscategorizations in Wikipedia, some are mistakes in connecting a Wikipedia article to the wrong item, some are vandalism in Wikidata, and some are mistakes by bots or Widar users. Please check them if you want to have better quality in Wikidata.
Best

On Sun, Aug 30, 2015 at 12:16 PM Federico Leva (Nemo) <nemowiki@gmail.com> wrote:
Amir Ladsgroup, 28/08/2015 20:17:
>
> Another thing I did is reporting possible mistakes, when Wikipedia and
> Wikidata don't agree on one statement,
Nice, with this Wikidata has better quality control systems than
Wikipedia. ;-)
Nemo
_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata