Can you explain what "indexing" means in this context?  Is there some type of matching process?  How are duplicates resolved, if at all? Was the Wikidata info extracted from a dump or one of the APIs?

When I looked at the first person I picked at random, Pierre Berdoy (ID: 269710), I saw that both Wikidata and Wikipedia say he was born in Biarritz, while the NYPL database says he was born in Nashua, NH. So either there are two different people with the same name, born in different places, or one of the recorded birthplaces is wrong.

http://mgiraldo.github.io/pic/?&biography.TermID=2028247&Location=269710|42.7575,-71.4644
https://www.wikidata.org/wiki/Q3383941
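
A conflict like this could be flagged automatically by comparing the birthplace coordinates held by each source. The following is only a rough sketch, not a description of how PIC actually works: the Nashua coordinates are taken from the PIC URL above, the Biarritz coordinates are approximate values assumed for illustration, and the function names and the 50 km threshold are hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two lat/lon points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def birthplaces_conflict(place_a, place_b, threshold_km=50.0):
    """Return True when two (lat, lon) pairs are farther apart than the threshold.

    The 50 km default is an arbitrary assumption; a real pipeline would
    need to tune it (or compare place identifiers instead of coordinates).
    """
    return haversine_km(*place_a, *place_b) > threshold_km


# Coordinates from the PIC URL above (Nashua, NH).
nypl_birthplace = (42.7575, -71.4644)
# Approximate coordinates for Biarritz, assumed for this example.
wikidata_birthplace = (43.48, -1.56)

if birthplaces_conflict(nypl_birthplace, wikidata_birthplace):
    print("Birthplaces disagree: possible mismatch or a different person")
```

A record flagged this way would still need a human to decide whether the match is wrong or one of the sources has bad data.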

Tom




On Tue, Dec 8, 2015 at 7:10 PM, David Lowe <davidlowe@nypl.org> wrote:
Hello all,
The Photographers' Identities Catalog (PIC) is an ongoing project of visualizing photo history through the lives of photographers and photo studios. I have information on 115,000 photographers and studios as of tonight. It is still under construction, but as I've almost completed an initial indexing of the ~12,000 photographers in Wikidata, I thought I'd share it with you. We (the New York Public Library) hope to launch it officially in mid to late January. This represents about 12 years' worth of my work researching in NYPL's photography collection, censuses, and business directories, and scraping or indexing trusted websites, databases, and published biographical dictionaries pertaining to photo history.
Again, please bear in mind that our programmer is still hard at work (and I continue to refine and add to the data*), but we welcome your feedback, questions, critiques, etc. To see the WikiData photographers, select WikiData from the Source dropdown. Have fun!

PIC

Thanks,
David

*Tomorrow, for instance, I'll start mining Wikidata for birth & death locations.

_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata