Agreed, as always: it's a classification problem.
So which claims/statements should I rule out? Or which claims/statements should I rule in when I want to return only "real" entities? Can someone help me identify the negative claims/statements I am looking for?
So far, I have only got:
1. Filtering out any entry with P31:Q13406463 should omit most of them from your results.
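For anyone who wants to apply that filter outside of a query, here is a rough sketch in Python. It assumes the entities are already loaded as dicts in the standard Wikidata JSON layout (claims keyed by property, each statement carrying a `mainsnak`). The names `instance_of_ids`, `drop_excluded` and `EXCLUDED_CLASSES` are made up for illustration, and the exclusion set holds only the one class mentioned so far, so treat it as a starting point rather than a complete list.

```python
# Classes whose instances are not "real" entities.
# Only the class from item 1 is here; extend this set as more are identified.
EXCLUDED_CLASSES = frozenset({"Q13406463"})


def instance_of_ids(entity):
    """Return the set of item IDs that the entity's P31 statements point to."""
    ids = set()
    for statement in entity.get("claims", {}).get("P31", []):
        datavalue = statement.get("mainsnak", {}).get("datavalue")
        if datavalue is None:
            # "novalue" / "somevalue" snaks carry no datavalue; skip them.
            continue
        ids.add(datavalue["value"]["id"])
    return ids


def drop_excluded(entities, excluded=EXCLUDED_CLASSES):
    """Keep only entities with no P31 value in the excluded set."""
    return [e for e in entities if not (instance_of_ids(e) & excluded)]
```

Entities with no P31 statement at all pass through this filter untouched, which is one reason a rule-out list alone will probably not catch everything.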
Freebase simply decided not to keep Wikipedia topic pages that merely held "lists of entities"; instead, Freebase preferred to generate those same lists with queries. There was no need for hand-coded lists in Freebase. It was a graph database, so it could generate all kinds of lists programmatically for a user and keep those lists as views (stored user queries) against our user profile, for easy tweaking or reuse whenever we wanted.