Hi,
thanks for all of these information!
I wanted to clarify the "quality guarantee" in YAGO. It is true that we did not evaluate every statement individually. Rather, we proceeded as follows: For each relation in YAGO, we have taken a random sample, and evaluated it manually (wrt Wikipedia). From the ratio of correct statements on the sample, we have used statistic techniques to estimated the ratio of correct statements for the whole of YAGO. This ratio will usually be lower than the ratio on the sample. Thus, more precisely, the guarantee reads: With a probability of 95%, the ratio of correct statements for "actedInMovie" is in the interval 97.36%+/-2.64%.
I also wanted to ask again on the relationship between Freebase and Wikidata: Freebase was bootstrapped from the infoboxes of Wikipedia, but I think its main selling point is that volunteers can add and correct data. Thus, my understanding is that, both in Wikidata and in Freebase, volunteers would fill up structured, factual information. Is that right? My intuition is that Wikidata will have a more principled approach, because it can build on the Wikipedia/Wikimedia culture. Other comments are appreciated.
Thanks
Fabian
On 4/17/12 8:33 AM, Fabian M. Suchanek wrote:
I also wanted to ask again on the relationship between Freebase and Wikidata: Freebase was bootstrapped from the infoboxes of Wikipedia, but I think its main selling point is that volunteers can add and correct data. Thus, my understanding is that, both in Wikidata and in Freebase, volunteers would fill up structured, factual information. Is that right? My intuition is that Wikidata will have a more principled approach, because it can build on the Wikipedia/Wikimedia culture. Other comments are appreciated.
Yes, in a nutshell.
Freebase is a kind of Data Wiki just like Wikidata. Folks will create and manage data objects in both data spaces. Mutual benefits will arise from object co- and cross-references across these data spaces, thanks to the Web :-)
I see something like this:
[WikiData] -- (ProducingbetterRefStructuredDataFor) --> [Wikipedia] [Rest of the Web] .
[Freebase] -- (consumingContentFrom) -- [Wikipedia].
[DBpedia] -- (consumingContentFrom) -- [Wikipedia].
[DBpedia] -- (crossReferences) -- [Freebase] .
[Freebase] -- (crossReferences) -- [DBpedia] .
[DBpedia] -- (crossReferences) -- [YAGO] .
[YAGO] -- (crossReferences) -- [DBpedia] .
[WikiData] -- (leverages) -- [All of the Above] .
++