Hi Simon,
thanks for the paper, interesting findings!
Let me play the devil's advocate a bit here with your example: "For a politician, for instance, the political party is generally much more important than music instruments played."
Now let's compare the following two did-you-know facts: - Did you know that Bill Clinton is a famous politician from the Democrat Party. - Did you know that Bill Clinton is a famous politician who is also a saxophonist.
To me, the second is more interesting :)
But here, the interestingness is related to the degree of being unusual. Any thoughts on this?
Regards, Fariz
On Sat, Aug 26, 2017 at 7:48 PM, Simon Razniewski srazniew@gmail.com wrote:
Hello,
I wanted to make you aware of our new paper "Doctoral Advisor or Medical Condition: Towards Entity-specific Rankings of Knowledge Base Properties", which deals with the problem of determining the interestingness of Wikidata properties for individual entities.
In the paper we develop a dataset of 350 random (entity, property1, property2) records, and use human judgments to determine the more interesting property in each record. We then show that state-of-the-art techniques (Wikidata Property Suggestor, Google search) achieve 61% precision on predicting the winner in high-agreement records, which can be lifted to 74% by using linguistic similarity, but remains still significantly below human performance (87.5% precision).
Paper: http://www.simonrazniewski.com/2017_ADMA.pdf (to appear at ADMA 2017). Dataset: https://www.kaggle.com/srazniewski/wikidatapropertyranking
Best wishes, Simon Razniewski
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata