Hi Simon,
thanks for the paper, interesting findings!
Let me play the devil's advocate a bit here with your example: "For a
politician, for instance, the political party is generally much more
important than music instruments played."
Now let's compare the following two did-you-know facts:
- Did you know that Bill Clinton is a famous politician from the Democrat
Party.
- Did you know that Bill Clinton is a famous politician who is also a
saxophonist.
To me, the second is more interesting :)
But here, the interestingness is related to the degree of being unusual.
Any thoughts on this?
Regards,
Fariz
On Sat, Aug 26, 2017 at 7:48 PM, Simon Razniewski <srazniew(a)gmail.com>
wrote:
Hello,
I wanted to make you aware of our new paper "Doctoral Advisor or Medical
Condition: Towards Entity-specific Rankings of Knowledge Base Properties",
which deals with the problem of determining the interestingness of Wikidata
properties for individual entities.
In the paper we develop a dataset of 350 random (entity, property1,
property2) records, and use human judgments to determine the more
interesting property in each record.
We then show that state-of-the-art techniques (Wikidata Property
Suggestor, Google search) achieve 61% precision on predicting the winner in
high-agreement records, which can be lifted to 74% by using linguistic
similarity, but remains still significantly below human performance (87.5%
precision).
Paper:
http://www.simonrazniewski.com/2017_ADMA.pdf (to appear at ADMA
2017).
Dataset:
https://www.kaggle.com/srazniewski/wikidatapropertyranking
Best wishes,
Simon Razniewski
_______________________________________________
Wikidata mailing list
Wikidata(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata