Hi Simon,

thanks for the paper, interesting findings!

Let me play the devil's advocate a bit here with your example: "For a politician, for instance, the political party is generally much more important than music instruments played."

Now let's compare the following two did-you-know facts:
- Did you know that Bill Clinton is a famous politician from the Democrat Party.
- Did you know that Bill Clinton is a famous politician who is also a saxophonist.

To me, the second is more interesting :)

But here, the interestingness is related to the degree of being unusual. Any thoughts on this?

Regards,
Fariz

On Sat, Aug 26, 2017 at 7:48 PM, Simon Razniewski <srazniew@gmail.com> wrote:
Hello,

I wanted to make you aware of our new paper "Doctoral Advisor or Medical Condition: Towards Entity-specific Rankings of Knowledge Base Properties", which deals with the problem of determining the interestingness of Wikidata properties for individual entities.

In the paper we develop a dataset of 350 random (entity, property1, property2) records, and use human judgments to determine the more interesting property in each record.
We then show that state-of-the-art techniques (Wikidata Property Suggestor, Google search) achieve 61% precision on predicting the winner in high-agreement records, which can be lifted to 74% by using linguistic similarity, but remains still significantly below human performance (87.5% precision).

Paper: http://www.simonrazniewski.com/2017_ADMA.pdf (to appear at ADMA 2017).
Dataset: https://www.kaggle.com/srazniewski/wikidatapropertyranking

Best wishes,
Simon Razniewski


_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata