On Wed, Aug 15, 2018 at 7:20 AM, Stas Malyshev smalyshev@wikimedia.org wrote:
a Qnumber or Pnumber in there (extra incentive for people to add labels in their language). Probably also everything duplicated in the text
That presents a problem. While you see "instance of": "human", the data is P31:Q5. We can, of course, put "instance of": "human" in the index. But what if the label for Q5 changes? Now we have to re-index 10 million records.
I haven't thought this through, but would it be possible to index just Q5, and then, when someone searches for "human", look up all the items with the label "human", so that the search becomes "human OR Q5"?
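
To make the idea concrete, here is a rough sketch in Python. It is not a description of how the real search index works: the two dictionaries are toy in-memory stand-ins, and build_label_index, expand_query and search are names I made up for illustration. The point is only that the item documents hold IDs, so a label change touches the small label lookup and nothing else.

```python
from collections import defaultdict

# Toy stand-in for a label lookup: entity ID -> current label.
labels = {"Q5": "human", "Q146": "house cat"}

# Toy stand-in for the item index: each item is indexed only by the IDs
# of its statement values, never by their labels.
# "Q_EXAMPLE_CAT" is a made-up document ID for illustration.
docs = {"Q42": {"Q5"}, "Q80": {"Q5"}, "Q_EXAMPLE_CAT": {"Q146"}}


def build_label_index(labels):
    """Invert the label table: lowercased label -> set of entity IDs."""
    index = defaultdict(set)
    for entity_id, label in labels.items():
        index[label.lower()].add(entity_id)
    return index


def expand_query(term, label_index):
    """Turn 'human' into {'human', 'Q5'}, i.e. the query 'human OR Q5'."""
    return {term} | label_index.get(term.lower(), set())


def search(term, label_index, docs):
    """Return the IDs of items whose indexed tokens match any expanded term."""
    terms = expand_query(term, label_index)
    return sorted(doc_id for doc_id, tokens in docs.items() if tokens & terms)


print(search("human", build_label_index(labels), docs))

# Relabel Q5: only the label lookup is rebuilt, docs is left untouched.
labels["Q5"] = "person"
print(search("person", build_label_index(labels), docs))
```

Both searches find the same two items, and the second one works without re-indexing any item documents. The cost moves to query time, where each search term needs one extra lookup against the labels.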