On 14/04/2016 20:31, Andrew Gray wrote:
Hi all,
Writing to this list because I swear I've seen it discussed here before...
I'm curious to know how many cases we have where two people share both a name and a birthdate *but are confirmed to be different people*. I vaguely remember this being discussed as part of the duplicate checking work, but can't put my finger on where this was. I've been able to find markers onwiki for "person with the same name as another person" and "person with the same birth and death dates", but neither of these are quite the same. Does this discussion ring a bell with anyone?
Alternatively, if someone can take this as a challenge to make a really creative SPARQL query, I'd be grateful :-)
User:Jura has been steadily investigating matching paired birth+death dates as they come up.
Completed investigations can be found here:
https://www.wikidata.org/wiki/Wikidata:Database_reports/identical_birth_and_...
while a bot scans for new matches which it writes to
https://www.wikidata.org/wiki/Wikidata:Database_reports/identical_birth_and_...
Unfortunately, because of the limits on template expansion, it's hard to see the corresponding names. But there certainly are some confirmed twins in there with quite similar names.
There are also some queries narrowed to particular first names or particular professions at
https://www.wikidata.org/wiki/Wikidata:WikiProject_Data_cleaning/matched_bir...
That page hasn't been updated since the SPARQL service was very very new, so the counts are now all probably way off.
One early case I remember was a sportsman that we had separate articles on as a cricketer and as a footballer, until his career was cut short by World War I. But I think he has now been merged on en-wiki.
I see we also currently have https://en.wikipedia.org/wiki/Jack_Jenkins_(rugby_player) https://en.wikipedia.org/wiki/John_Jenkins_(rugby_league)
-- though the dates on the latter have been questioned.
Hope this helps,
James.