This is the result for German Wikipedia:
I ran the bot for German and I wanted to add P31:5 but it seems more than
90% of Wikidata items have P31 statement (how?) and there was nothing that
I could do, so I got list of articles in German Wikipedia that doesn't have
item in Wikidata. There were 16K articles and output of the bot for each
one of them is this <https://tools.wmflabs.org/dexbot/kian_res2.txt>. If
you plot it, you would have this
<https://tools.wmflabs.org/dexbot/kian2.png>. When the number is below
0.50 it is obvious that they are not human. Between 0.50-0.61 there are 78
articles that the bot can't determine whether it's a human or not [1] and
articles with more than 0.61 is definitely human. I used 0.62 just to be
sure and created 3600 items with P31:5 in them.
Imagine if I do something like that for English Wikipedia.
[1]: They are probably about a cat or tree with categories of humans in
them.
Best
On Sun, Mar 8, 2015 at 3:07 AM, Amir Ladsgroup <ladsgroup(a)gmail.com> wrote:
On Sat, Mar 7, 2015 at 9:19 PM, Jeroen De Dauw <jeroendedauw(a)gmail.com>
wrote:
Hey,
Yay, neural nets are definitely fun! Am I right in understanding this is
a software you created for the specific purpose of doing tasks in Wikidata?
Yes, in Wikidata and Wikipedia.
Congratulations for this bold step towards the
Singularity :-)
Don't worry, it'll be some time before AI can actually ingest Wikidata,
see
https://dl.dropboxusercontent.com/u/7313450/entropy/aitraining.png
Cheers
--
Jeroen De Dauw -
http://www.bn2vs.com
Software craftsmanship advocate
Evil software architect at Wikimedia Germany
~=[,,_,,]:3
_______________________________________________
Wikidata-l mailing list
Wikidata-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l
--
Amir