On Sun, Jan 18, 2015 at 9:11 AM, Maarten Dammers <maarten@mdammers.nl> wrote:
Hi Tom,

Tom Morris schreef op 17-1-2015 om 17:17:
2015-01-17 4:27 GMT-05:00 Lydia Pintscher <lydia.pintscher@wikimedia.de>:

The log is at https://meta.wikimedia.org/wiki/IRC_office_hours/Office_hours_2015-01-16
for anyone who couldn't make it.

Denny discusses importing all missing VIAF keys from Freebase using "multichill" (unclear what that is from the context) on the assumption that the error rate is low.  It would be worth checking assumptions like that with folks who are familiar with the Freebase data before acting on them.
I guess you are refereing to "18:57:55 <vrandecic> If you ask me, I am happy with just letting multichill to upload the VIAFs that are still missing"

That would be me. VIAF is a very good starting point for getting more authority data. If you have viaf, you can add other authority control data based on that. So getting more links to viaf would be nice. Not sure how many are still missing. I recently did that for ULAN and NTA ( https://www.wikidata.org/w/index.php?title=Q120609&diff=182583270&oldid=180304990 / https://www.wikidata.org/w/index.php?title=Q1610938&diff=182636686&oldid=182321267 ). Was able to add over 100.000 new links. I still have to do this for other types of authority control. The more tightly connected things get, the easier it gets to find problems or duplicates.

Maarten

Hello All,

It was me that originally imported the about 400,000 VIAF links into Wikidata. The way that they were matched was using by using a name-and-date-of-birth-matching algorithm with an English Wikipedia dump, and it was done by a team that works for VIAF.org . Then then those matches were imported into English Wikipedia. Additionally other languages like Italian Wikipedia and Commons had done some manual matching. After about a year of manual correcting in Wikipedias, I took the authority control from (I think about 9) different Wikis, and imported it into Wikidata. I also later did some of what Maarten/multichill is doing now which is to do some lookups on VIAF and import subsequent data over, like sex/gender and alternative names. I'm going to ping the people at VIAF (which is part of OCLC, for whom I used to work, but no longer) about this to see if they have any thoughts to add as well.


Make a great day,
Max Klein ‽ http://notconfusing.com/ 

_______________________________________________
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l