From: JFC Morfin jefsey@jefsey.com Thank you for this detailed explanation. How do you see the integration/impact of Wikidata on both projects?
My intuition is that the impact could be mutual: * for YAGO and DBpedia, the impact would be immediate, because Wikidata could essentially provide cleaner infobox data for these projects. Yet, we have to see how Wikidata will position itself to Freebase, which seems to pursue a similar goal: http://en.wikipedia.org/wiki/Freebase (If you have thoughts on distinguishing Wikidata from Freebase, we'd be happy to know)
* for Wikidata, there could be some leverage in the ontologies as well, possibly for bootstrapping. - YAGO, e.g., has mappings of infobox data to relations with domains and ranges, with a quality guarantee. One naive idea is that these could contribute to filling up Wikidata initially, because it seems easier for humans to correct or complete data than to insert it from scratch. - Another aspect is that YAGO connects the Wikipedia categories and pages to WordNet, the major digital lexicon of English (http://wordnet.princeton.edu/). This could contribute a strict semantic typing / class hierarchy / taxonomy to Wikidata, which is so far absent in Wikipedia. - Last, YAGO has the connection to Geonames (providing data about geographical entities), and also the connection to the Universal Wordnet (providing translations of class names and entity names to 200 other languages -- basically a cleaned and expanded version of the Wikipedia interlanguage links). - DBpedia, too, could contribute, because its hub position in the cloud of linked data connects it to many other resources.
Cheers
Fabian
On Mon, Apr 16, 2012 at 10:54 AM, Fabian M. Suchanek f.m.suchanek@gmail.com wrote:
From: JFC Morfin jefsey@jefsey.com Thank you for this detailed explanation. How do you see the integration/impact of Wikidata on both projects?
My intuition is that the impact could be mutual:
- for YAGO and DBpedia, the impact would be immediate, because
Wikidata could essentially provide cleaner infobox data for these projects. Yet, we have to see how Wikidata will position itself to Freebase, which seems to pursue a similar goal: http://en.wikipedia.org/wiki/Freebase (If you have thoughts on distinguishing Wikidata from Freebase, we'd be happy to know)
I don't speak for Freebase, but I view Freebase, DBpedia, and YAGO as all occupying comparable positions relative to Wikipedia/Wikidata. They currently attempt to reverse engineer structured data out of Wikipedia infoboxes and if Wikidata is successful in providing the data source for the Wikipedia infoboxes, it'll eliminate a lot of troublesome, error-prone parsing code.
Some of the ways that Freebase is different include:
- it's editable by anyone, so you don't need to go back to Wikipedia to correct mistakes. - it doesn't have a notability requirement like Wikipedia. If it's factual and non-spammy, you can include it. - infobox mappings aren't public and can only be modified by Google employees - a relatively small number of popular infoboxes are mined (nowhere near DBpedia's coverage) - the refresh cycle is every couple of weeks (ie much faster than DBpedia but much slower than DBpedia live) - it includes a large amount of non-Wikipedia data from MusicBrainz, OpenLibrary, Geonames, etc, as well as being linked to a number of other sources of strong identifiers such as the New York Times, IMDB, NNDB, U.S. Library of Congress Name Authority File and Subject Headings, etc.
As far as positioning between Wikidata and Freebase goes, there's really no way that Freebase (or any other non-Wikimedia Foundation effort) could ever compete with Wikidata in the context of providing data to Wikipedia. The Wikipedia culture is just too insular. Instead I would expect Freebase to stop parsing infoboxes and consume data directly from Wikidata in the same way that I would expect DBpedia, YAGO and other consumers to.
Before that happens though, Wikidata not only needs to get the technical infrastructure in place, but also change the culture of Wikipedia editors so that they're not anti-data and care about the semantics as well as the presentation of the information. A lot of today's quality problems are social, not technical.
- YAGO, e.g., has mappings of infobox data to relations with domains
and ranges, with a quality guarantee.
Guarantee? My understanding of the previous post was that a very small sample of YAGO data had been measured for precision (with good results), not that there was 100% curation or any type of quality "guarantee."
Freebase has a stated 99% quality goal, but actual quality (as well as coverage) varies greatly from domain to domain.
Tom
On 4/16/12 2:54 PM, Tom Morris wrote:
- the refresh cycle is every couple of weeks (ie much faster than
DBpedia but much slower than DBpedia live)
Why do you make the comment above? Are you not aware of the DBpedia-Live editions have existed for a few years now?
Fundamentally, Freebase addresses some of what DBpedia covers and some of what Wikidata (a Data Wiki) covers. Of course, all of these -- plus YAGO -- are inherently mutually beneficial.
Links:
1. http://live.dbpedia.org/LiveStats/ -- DBpedia-Live hosted by University of Leipzig. 2. http://dbpedia-live.openlinksw.com/live/ -- DBpedia-Live edition we host .
On Mon, Apr 16, 2012 at 5:19 PM, Kingsley Idehen kidehen@openlinksw.com wrote:
On 4/16/12 2:54 PM, Tom Morris wrote:
- the refresh cycle is every couple of weeks (ie much faster than
DBpedia but much slower than DBpedia live)
Why do you make the comment above? Are you not aware of the DBpedia-Live editions have existed for a few years now?
I think my text that you quoted answers the question since I reference Live -- or do I get points off for incorrect capitalization/punctuation?
three months >> two weeks >> minutes DBpedia >> Freebase >> DBpedia-Live (phew! spelled it correctly this time)
By my calculations though, availability is actually 10 months, not "a few years." http://blog.aksw.org/2011/official-dbpedia-live-release/
On Tue, Apr 17, 2012 at 8:33 AM, Fabian M. Suchanek f.m.suchanek@gmail.com wrote:
I also wanted to ask again on the relationship between Freebase and Wikidata: Freebase was bootstrapped from the infoboxes of Wikipedia,
Wikipedia based data from infoboxes is updated on a regular basis. It wasn't just a one time bootstrap.
but I think its main selling point is that volunteers can add and correct data. Thus, my understanding is that, both in Wikidata and in Freebase, volunteers would fill up structured, factual information. Is that right?
I outlined most of the major differences that come to mind. I don't think there's any one particular "selling point" and, in particular, the Freebase team has never really attempted to do much in the way of "selling" at all. I don't really think that there's any overlap or competition between the two projects. If Wikidata is successful, Freebase rips out their infoboxes parsers and gets cleaner Wikipedia data to import with less effort.
My intuition is that Wikidata will have a more principled approach, because it can build on the Wikipedia/Wikimedia culture.
To the extent that the Wikidata project is unsuccessful in changing the current Wikipedia culture, they'll inherit both the good and bad points of the existing culture. Personally, I could do with a few less "deletionists" and petty tyrants ruling "their" corner of Wikipedia.
Tom
On 4/17/12 10:32 AM, Tom Morris wrote:
On Mon, Apr 16, 2012 at 5:19 PM, Kingsley Idehenkidehen@openlinksw.com wrote:
On 4/16/12 2:54 PM, Tom Morris wrote:
- the refresh cycle is every couple of weeks (ie much faster than
DBpedia but much slower than DBpedia live)
Why do you make the comment above? Are you not aware of the DBpedia-Live editions have existed for a few years now?
I think my text that you quoted answers the question since I reference Live -- or do I get points off for incorrect capitalization/punctuation?
three months>> two weeks>> minutes DBpedia>> Freebase>> DBpedia-Live (phew! spelled it correctly this time)
DBpedia-Live is instant.
By my calculations though, availability is actually 10 months, not "a few years." http://blog.aksw.org/2011/official-dbpedia-live-release/
Well you missed the memo re. the fact that we've actually had DBpedia-Live for much longer than 10 months [1].
Link:
1. http://lists.w3.org/Archives/Public/public-lod/2010Jun/0262.html -- random post from a Google search .
Kingsley
On Tue, Apr 17, 2012 at 8:33 AM, Fabian M. Suchanek f.m.suchanek@gmail.com wrote:
I also wanted to ask again on the relationship between Freebase and Wikidata: Freebase was bootstrapped from the infoboxes of Wikipedia,
Wikipedia based data from infoboxes is updated on a regular basis. It wasn't just a one time bootstrap.
but I think its main selling point is that volunteers can add and correct data. Thus, my understanding is that, both in Wikidata and in Freebase, volunteers would fill up structured, factual information. Is that right?
I outlined most of the major differences that come to mind. I don't think there's any one particular "selling point" and, in particular, the Freebase team has never really attempted to do much in the way of "selling" at all. I don't really think that there's any overlap or competition between the two projects. If Wikidata is successful, Freebase rips out their infoboxes parsers and gets cleaner Wikipedia data to import with less effort.
My intuition is that Wikidata will have a more principled approach, because it can build on the Wikipedia/Wikimedia culture.
To the extent that the Wikidata project is unsuccessful in changing the current Wikipedia culture, they'll inherit both the good and bad points of the existing culture. Personally, I could do with a few less "deletionists" and petty tyrants ruling "their" corner of Wikipedia.
Tom
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
Shall we create a Wikidata vs {Freebase, DBpedia, YAGO} comparison table on meta (or enwiki)? There's a lot of valuable information in this thread (and I was not familiar with YAGO – thanks Fabian) but it's hardly readable. We would do a huge favor to the press and the non-technical community if we had a single place where the differences are documented. Currently, there's only one page about the relation between Wikidata and DBpedia. [1]
Lydia, any thoughts?
Dario
[1] http://meta.wikimedia.org/wiki/Wikidata/Notes/DBpedia_and_Wikidata
On Apr 17, 2012, at 8:08 AM, Kingsley Idehen wrote:
On 4/17/12 10:32 AM, Tom Morris wrote:
On Mon, Apr 16, 2012 at 5:19 PM, Kingsley Idehenkidehen@openlinksw.com wrote:
On 4/16/12 2:54 PM, Tom Morris wrote:
- the refresh cycle is every couple of weeks (ie much faster than
DBpedia but much slower than DBpedia live)
Why do you make the comment above? Are you not aware of the DBpedia-Live editions have existed for a few years now?
I think my text that you quoted answers the question since I reference Live -- or do I get points off for incorrect capitalization/punctuation?
three months>> two weeks>> minutes DBpedia>> Freebase>> DBpedia-Live (phew! spelled it correctly this time)
DBpedia-Live is instant.
By my calculations though, availability is actually 10 months, not "a few years." http://blog.aksw.org/2011/official-dbpedia-live-release/
Well you missed the memo re. the fact that we've actually had DBpedia-Live for much longer than 10 months [1].
Link:
- http://lists.w3.org/Archives/Public/public-lod/2010Jun/0262.html -- random post from a Google search .
Kingsley
On Tue, Apr 17, 2012 at 8:33 AM, Fabian M. Suchanek f.m.suchanek@gmail.com wrote:
I also wanted to ask again on the relationship between Freebase and Wikidata: Freebase was bootstrapped from the infoboxes of Wikipedia,
Wikipedia based data from infoboxes is updated on a regular basis. It wasn't just a one time bootstrap.
but I think its main selling point is that volunteers can add and correct data. Thus, my understanding is that, both in Wikidata and in Freebase, volunteers would fill up structured, factual information. Is that right?
I outlined most of the major differences that come to mind. I don't think there's any one particular "selling point" and, in particular, the Freebase team has never really attempted to do much in the way of "selling" at all. I don't really think that there's any overlap or competition between the two projects. If Wikidata is successful, Freebase rips out their infoboxes parsers and gets cleaner Wikipedia data to import with less effort.
My intuition is that Wikidata will have a more principled approach, because it can build on the Wikipedia/Wikimedia culture.
To the extent that the Wikidata project is unsuccessful in changing the current Wikipedia culture, they'll inherit both the good and bad points of the existing culture. Personally, I could do with a few less "deletionists" and petty tyrants ruling "their" corner of Wikipedia.
Tom
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
--
Regards,
Kingsley Idehen Founder& CEO OpenLink Software Company Web: http://www.openlinksw.com Personal Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca handle: @kidehen Google+ Profile: https://plus.google.com/112399767740508618350/about LinkedIn Profile: http://www.linkedin.com/in/kidehen
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
On Tue, Apr 17, 2012 at 5:32 PM, Dario Taraborelli dtaraborelli@wikimedia.org wrote:
Shall we create a Wikidata vs {Freebase, DBpedia, YAGO} comparison table on meta (or enwiki)? There's a lot of valuable information in this thread (and I was not familiar with YAGO – thanks Fabian) but it's hardly readable. We would do a huge favor to the press and the non-technical community if we had a single place where the differences are documented. Currently, there's only one page about the relation between Wikidata and DBpedia. [1]
Lydia, any thoughts?
Yes, definitely. I've started working on that but it's just a start and I need some more input from people before this is in a publishable state. If anyone wants to help please let me know.
Cheers Lydia
At 17:32 17/04/2012, Dario Taraborelli wrote:
Shall we create a Wikidata vs {Freebase, DBpedia, YAGO} comparison table on meta (or enwiki)? There's a lot of valuable information in this thread (and I was not familiar with YAGO thanks Fabian) but it's hardly readable. We would do a huge favor to the press and the non-technical community if we had a single place where the differences are documented. Currently, there's only one page about the relation between Wikidata and DBpedia. [1]
My main concern is the information loop. The more there are such systems mutually relying on the others how will they protect themselves against adequately entered hoaxes? jfc
On 4/17/12 11:32 AM, Dario Taraborelli wrote:
Shall we create a Wikidata vs {Freebase, DBpedia, YAGO} comparison table on meta (or enwiki)? There's a lot of valuable information in this thread (and I was not familiar with YAGO – thanks Fabian) but it's hardly readable. We would do a huge favor to the press and the non-technical community if we had a single place where the differences are documented. Currently, there's only one page about the relation between Wikidata and DBpedia. [1]
Lydia, any thoughts?
Not a "versus" style table. That sends the wrong signals when these services are fundamentally complimentary .
Kingsley
Dario
[1] http://meta.wikimedia.org/wiki/Wikidata/Notes/DBpedia_and_Wikidata
On Apr 17, 2012, at 8:08 AM, Kingsley Idehen wrote:
On 4/17/12 10:32 AM, Tom Morris wrote:
On Mon, Apr 16, 2012 at 5:19 PM, Kingsley Idehenkidehen@openlinksw.com wrote:
On 4/16/12 2:54 PM, Tom Morris wrote:
- the refresh cycle is every couple of weeks (ie much faster than
DBpedia but much slower than DBpedia live)
Why do you make the comment above? Are you not aware of the DBpedia-Live editions have existed for a few years now?
I think my text that you quoted answers the question since I reference Live -- or do I get points off for incorrect capitalization/punctuation?
three months>> two weeks>> minutes DBpedia>> Freebase>> DBpedia-Live (phew! spelled it correctly this time)
DBpedia-Live is instant.
By my calculations though, availability is actually 10 months, not "a few years." http://blog.aksw.org/2011/official-dbpedia-live-release/
Well you missed the memo re. the fact that we've actually had DBpedia-Live for much longer than 10 months [1].
Link:
- http://lists.w3.org/Archives/Public/public-lod/2010Jun/0262.html -- random post from a Google search .
Kingsley
On Tue, Apr 17, 2012 at 8:33 AM, Fabian M. Suchanek f.m.suchanek@gmail.com wrote:
I also wanted to ask again on the relationship between Freebase and Wikidata: Freebase was bootstrapped from the infoboxes of Wikipedia,
Wikipedia based data from infoboxes is updated on a regular basis. It wasn't just a one time bootstrap.
but I think its main selling point is that volunteers can add and correct data. Thus, my understanding is that, both in Wikidata and in Freebase, volunteers would fill up structured, factual information. Is that right?
I outlined most of the major differences that come to mind. I don't think there's any one particular "selling point" and, in particular, the Freebase team has never really attempted to do much in the way of "selling" at all. I don't really think that there's any overlap or competition between the two projects. If Wikidata is successful, Freebase rips out their infoboxes parsers and gets cleaner Wikipedia data to import with less effort.
My intuition is that Wikidata will have a more principled approach, because it can build on the Wikipedia/Wikimedia culture.
To the extent that the Wikidata project is unsuccessful in changing the current Wikipedia culture, they'll inherit both the good and bad points of the existing culture. Personally, I could do with a few less "deletionists" and petty tyrants ruling "their" corner of Wikipedia.
Tom
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
--
Regards,
Kingsley Idehen Founder& CEO OpenLink Software Company Web: http://www.openlinksw.com Personal Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca handle: @kidehen Google+ Profile: https://plus.google.com/112399767740508618350/about LinkedIn Profile: http://www.linkedin.com/in/kidehen
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
On Tue, Apr 17, 2012 at 10:23 PM, Kingsley Idehen kidehen@openlinksw.com wrote:
On 4/17/12 11:32 AM, Dario Taraborelli wrote:
Shall we create a Wikidata vs {Freebase, DBpedia, YAGO} comparison table on meta (or enwiki)? There's a lot of valuable information in this thread (and I was not familiar with YAGO – thanks Fabian) but it's hardly readable. We would do a huge favor to the press and the non-technical community if we had a single place where the differences are documented. Currently, there's only one page about the relation between Wikidata and DBpedia. [1]
Lydia, any thoughts?
Not a "versus" style table. That sends the wrong signals when these services are fundamentally complimentary .
Yes. I'd prefer a short text - something we can add to the existing FAQ. (That's also what I have so far.)
Cheers Lydia
On 4/17/12 4:27 PM, Lydia Pintscher wrote:
On Tue, Apr 17, 2012 at 10:23 PM, Kingsley Idehen kidehen@openlinksw.com wrote:
On 4/17/12 11:32 AM, Dario Taraborelli wrote:
Shall we create a Wikidata vs {Freebase, DBpedia, YAGO} comparison table on meta (or enwiki)? There's a lot of valuable information in this thread (and I was not familiar with YAGO – thanks Fabian) but it's hardly readable. We would do a huge favor to the press and the non-technical community if we had a single place where the differences are documented. Currently, there's only one page about the relation between Wikidata and DBpedia. [1]
Lydia, any thoughts?
Not a "versus" style table. That sends the wrong signals when these services are fundamentally complimentary .
Yes. I'd prefer a short text - something we can add to the existing FAQ. (That's also what I have so far.)
Cheers Lydia
How about fleshing this out?
[WikiData] -- (ProducingbetterRefStructuredDataFor) --> [Wikipedia] [Rest of the Web] .
[Freebase] -- (consumingContentFrom) --> [Wikipedia].
[DBpedia] -- (consumingContentFrom) --> [Wikipedia].
[DBpedia] -- (crossReferences) --> [Freebase] .
[Freebase] -- (crossReferences) --> [DBpedia] .
[DBpedia] -- (crossReferences) --> [YAGO] .
[YAGO] -- (crossReferences) --> [DBpedia] .
[WikiData] -- (leverages) --> [All of the Above] .
++
Hi,
+1 for a wiki/FAQ describing the Wikipedia ecosystem and the relationships between Wikidata and the other semantic projects around Wikipedia. This is a legitimate question for the general public and the press. And it's also probably useful for many persons in this field, as illustrated in this thread.
We could start with a general overview of Wikipedia and its "semantic" ecosystem, as proposed by Kingsley. Then we could have a summary of each project, with a highlight of its key difference with Wikipedia, Wikidata and other projects. And we could finish with a table summarizing the goal and difference of each project, with links to project pages
Or is it too verbose? (in which case we might just want the introduction and the final table...)
-Nicolas.
On Apr 17, 2012, at 1:51 PM, Kingsley Idehen wrote:
On 4/17/12 4:27 PM, Lydia Pintscher wrote:
On Tue, Apr 17, 2012 at 10:23 PM, Kingsley Idehen kidehen@openlinksw.com wrote:
On 4/17/12 11:32 AM, Dario Taraborelli wrote:
Shall we create a Wikidata vs {Freebase, DBpedia, YAGO} comparison table on meta (or enwiki)? There's a lot of valuable information in this thread (and I was not familiar with YAGO – thanks Fabian) but it's hardly readable. We would do a huge favor to the press and the non-technical community if we had a single place where the differences are documented. Currently, there's only one page about the relation between Wikidata and DBpedia. [1]
Lydia, any thoughts?
Not a "versus" style table. That sends the wrong signals when these services are fundamentally complimentary .
Yes. I'd prefer a short text - something we can add to the existing FAQ. (That's also what I have so far.)
Cheers Lydia
How about fleshing this out?
[WikiData] -- (ProducingbetterRefStructuredDataFor) --> [Wikipedia] [Rest of the Web] .
[Freebase] -- (consumingContentFrom) --> [Wikipedia].
[DBpedia] -- (consumingContentFrom) --> [Wikipedia].
[DBpedia] -- (crossReferences) --> [Freebase] .
[Freebase] -- (crossReferences) --> [DBpedia] .
[DBpedia] -- (crossReferences) --> [YAGO] .
[YAGO] -- (crossReferences) --> [DBpedia] .
[WikiData] -- (leverages) --> [All of the Above] .
++
--
Regards,
Kingsley Idehen Founder& CEO OpenLink Software Company Web: http://www.openlinksw.com Personal Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca handle: @kidehen Google+ Profile: https://plus.google.com/112399767740508618350/about LinkedIn Profile: http://www.linkedin.com/in/kidehen
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
+1 for Nicolas Torzec proposition.
Isabelle Ayel Tel/Fax: +34 976 088 566 Email: isabelle.ayel@gmail.com My blogs: isayel.net | pouce-café http://zfr.posterous.com | zaragoza día día http://zaragozadiadia.posterous.com Networking with: http://www.facebook.com/isayel?ref=name Facebookhttp://www.facebook.com/isayel?ref=name http://twitter.com/nanouk Twitter http://twitter.com/nanouk https://plus.google.com/111878932945338226153 Google Plushttps://plus.google.com/111878932945338226153 http://es.linkedin.com/in/isabelleayel LinkedInhttp://es.linkedin.com/in/isabelleayel Contact me: [image: Google Talk] isabelle.ayel [image: Skype] aspmaetb [image: Twitter] http://twitter.com/nanouk Latest tweet: How We Pay Taxes: 11 Charts (USA) http://t.co/0bz4RmbF (la même analyse en #France #elections? Follow @nanouk http://twitter.com/nanouk Reply http://twitter.com/?status=@nanouk%20&in_reply_to_status_id=192547379755499520&in_reply_to=nanouk Retweet http://twitter.com/?status=RT%20%40nanouk%3A%20How%20We%20Pay%20Taxes%3A%2011%20Charts%20%20(USA)%20http%3A%2F%2Ft.co%2F0bz4RmbF%20(la%20m%C3%AAme%20analyse%20en%20%23France%20%23elections%3F 11:37 Apr-18http://twitter.com/nanouk/statuses/192547379755499520 Get this email app! http://www.wisestamp.com/apps/twitter?utm_source=extension&utm_medium=email&utm_term=twitter&utm_campaign=apps
On Wed, Apr 18, 2012 at 09:40, Nicolas Torzec nicolas.torzec@gmail.comwrote:
Hi,
+1 for a wiki/FAQ describing the Wikipedia ecosystem and the relationships between Wikidata and the other semantic projects around Wikipedia. This is a legitimate question for the general public and the press. And it's also probably useful for many persons in this field, as illustrated in this thread.
We could start with a general overview of Wikipedia and its "semantic" ecosystem, as proposed by Kingsley. Then we could have a summary of each project, with a highlight of its key difference with Wikipedia, Wikidata and other projects. And we could finish with a table summarizing the goal and difference of each project, with links to project pages
Or is it too verbose? (in which case we might just want the introduction and the final table...)
-Nicolas.
On Apr 17, 2012, at 1:51 PM, Kingsley Idehen wrote:
On 4/17/12 4:27 PM, Lydia Pintscher wrote:
On Tue, Apr 17, 2012 at 10:23 PM, Kingsley Idehen kidehen@openlinksw.com wrote:
On 4/17/12 11:32 AM, Dario Taraborelli wrote:
Shall we create a Wikidata vs {Freebase, DBpedia, YAGO} comparison
table
on meta (or enwiki)? There's a lot of valuable information in this
thread
(and I was not familiar with YAGO – thanks Fabian) but it's hardly
readable.
We would do a huge favor to the press and the non-technical community
if we
had a single place where the differences are documented. Currently,
there's
only one page about the relation between Wikidata and DBpedia. [1]
Lydia, any thoughts?
Not a "versus" style table. That sends the wrong signals when these
services
are fundamentally complimentary .
Yes. I'd prefer a short text - something we can add to the existing FAQ. (That's also what I have so far.)
Cheers Lydia
How about fleshing this out?
[WikiData] -- (ProducingbetterRefStructuredDataFor) --> [Wikipedia]
[Rest of the Web] .
[Freebase] -- (consumingContentFrom) --> [Wikipedia].
[DBpedia] -- (consumingContentFrom) --> [Wikipedia].
[DBpedia] -- (crossReferences) --> [Freebase] .
[Freebase] -- (crossReferences) --> [DBpedia] .
[DBpedia] -- (crossReferences) --> [YAGO] .
[YAGO] -- (crossReferences) --> [DBpedia] .
[WikiData] -- (leverages) --> [All of the Above] .
++
--
Regards,
Kingsley Idehen Founder& CEO OpenLink Software Company Web: http://www.openlinksw.com Personal Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca handle: @kidehen Google+ Profile: https://plus.google.com/112399767740508618350/about LinkedIn Profile: http://www.linkedin.com/in/kidehen
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
On 4/18/12 3:40 AM, Nicolas Torzec wrote:
Hi,
+1 for a wiki/FAQ describing the Wikipedia ecosystem and the relationships between Wikidata and the other semantic projects around Wikipedia. This is a legitimate question for the general public and the press. And it's also probably useful for many persons in this field, as illustrated in this thread.
We could start with a general overview of Wikipedia and its "semantic" ecosystem, as proposed by Kingsley. Then we could have a summary of each project, with a highlight of its key difference with Wikipedia, Wikidata and other projects. And we could finish with a table summarizing the goal and difference of each project, with links to project pages
Or is it too verbose? (in which case we might just want the introduction and the final table...)
-Nicolas.
So we have this rough draft categorization:
Content Consumers & Structured Data Transformers:
1. Freebase 2. DBpedia .
Data Wikis:
1. Freebase 2. Wikidata
*DBpedia isn't a Data Wiki because edits occur via the Wikipedia content Wiki*
Data Wiki Platforms:
1. OntoWiki 2. Semantic MediaWiki.
Content Wikis:
1. Wikipedia
Content Wiki Platforms:
1. MediaWiki 2. Other Wiki platforms .
Kingsley
On Apr 17, 2012, at 1:51 PM, Kingsley Idehen wrote:
On 4/17/12 4:27 PM, Lydia Pintscher wrote:
On Tue, Apr 17, 2012 at 10:23 PM, Kingsley Idehen kidehen@openlinksw.com wrote:
On 4/17/12 11:32 AM, Dario Taraborelli wrote:
Shall we create a Wikidata vs {Freebase, DBpedia, YAGO} comparison table on meta (or enwiki)? There's a lot of valuable information in this thread (and I was not familiar with YAGO – thanks Fabian) but it's hardly readable. We would do a huge favor to the press and the non-technical community if we had a single place where the differences are documented. Currently, there's only one page about the relation between Wikidata and DBpedia. [1]
Lydia, any thoughts?
Not a "versus" style table. That sends the wrong signals when these services are fundamentally complimentary .
Yes. I'd prefer a short text - something we can add to the existing FAQ. (That's also what I have so far.)
Cheers Lydia
How about fleshing this out?
[WikiData] -- (ProducingbetterRefStructuredDataFor) --> [Wikipedia] [Rest of the Web] .
[Freebase] -- (consumingContentFrom) --> [Wikipedia].
[DBpedia] -- (consumingContentFrom) --> [Wikipedia].
[DBpedia] -- (crossReferences) --> [Freebase] .
[Freebase] -- (crossReferences) --> [DBpedia] .
[DBpedia] -- (crossReferences) --> [YAGO] .
[YAGO] -- (crossReferences) --> [DBpedia] .
[WikiData] -- (leverages) --> [All of the Above] .
++
--
Regards,
Kingsley Idehen Founder& CEO OpenLink Software Company Web: http://www.openlinksw.com Personal Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca handle: @kidehen Google+ Profile: https://plus.google.com/112399767740508618350/about LinkedIn Profile: http://www.linkedin.com/in/kidehen
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
On Apr 18, 2012, at 09:18 AM, Kingsley Idehen wrote:
On 4/18/12 3:40 AM, Nicolas Torzec wrote:
Hi,
+1 for a wiki/FAQ describing the Wikipedia ecosystem and the relationships between Wikidata and the other semantic projects around Wikipedia. This is a legitimate question for the general public and the press. And it's also probably useful for many persons in this field, as illustrated in this thread.
We could start with a general overview of Wikipedia and its "semantic" ecosystem, as proposed by Kingsley. Then we could have a summary of each project, with a highlight of its key difference with Wikipedia, Wikidata and other projects. And we could finish with a table summarizing the goal and difference of each project, with links to project pages
Or is it too verbose? (in which case we might just want the introduction and the final table...)
-Nicolas.
So we have this rough draft categorization:
Content Consumers & Structured Data Transformers:
- Freebase
- DBpedia .
Data Wikis:
- Freebase
- Wikidata
*DBpedia isn't a Data Wiki because edits occur via the Wikipedia content Wiki*
Data Wiki Platforms:
- OntoWiki
- Semantic MediaWiki.
Content Wikis:
- Wikipedia
Content Wiki Platforms:
- MediaWiki
- Other Wiki platforms .
One element I see missing from (or at least, unclear in) the discussion thus far, is how corrections or other changes are made to the data therein, and how changes made in one set are fed (back) to the others. In other words --
1. When an error is discovered in data on service "x", where are edits made to correct that data?
2. If edits are made locally (on service "x") to content which originated on another service ("y"), do those edits also get applied to the original source, and if so, how (e.g., automatically by service "x"; manually by the user; manually by service "x" admin team; etc.)?
I think information like this is needed throughout, and will help a lot in demonstrating the complementary nature of these various services.
DBpedia content, for instance, is not edited directly, but gets all its changes by digesting edits made to Wikipedia.
Freebase also digests changes made to Wikipedia (but it's not clear to me exactly how these are then acted on), but is also edited directly -- and I don't see a mechanism that routes such direct Freebase edits back to Wikipedia (or elsewhere).
Regards,
Ted
-- A: Yes. http://www.guckes.net/faq/attribution.html | Q: Are you sure? | | A: Because it reverses the logical flow of conversation. | | | Q: Why is top posting frowned upon?
Ted Thibodeau, Jr. // voice +1-781-273-0900 x32 Senior Support & Evangelism // mailto:tthibodeau@openlinksw.com // http://twitter.com/TallTed OpenLink Software, Inc. // http://www.openlinksw.com/ 10 Burlington Mall Road, Suite 265, Burlington MA 01803 Weblog -- http://www.openlinksw.com/blogs/ LinkedIn -- http://www.linkedin.com/company/openlink-software/ Twitter -- http://twitter.com/OpenLink Google+ -- http://plus.google.com/100570109519069333827/ Facebook -- http://www.facebook.com/OpenLinkSoftware Universal Data Access, Integration, and Management Technology Providers
On Wed, Apr 18, 2012 at 3:50 PM, Ted Thibodeau Jr tthibodeau@openlinksw.com wrote:
One element I see missing from (or at least, unclear in) the discussion thus far, is how corrections or other changes are made to the data therein, and how changes made in one set are fed (back) to the others. In other words --
- When an error is discovered in data on service "x", where
are edits made to correct that data?
- If edits are made locally (on service "x") to content which
originated on another service ("y"), do those edits also get applied to the original source, and if so, how (e.g., automatically by service "x"; manually by the user; manually by service "x" admin team; etc.)?
I think information like this is needed throughout, and will help a lot in demonstrating the complementary nature of these various services.
DBpedia content, for instance, is not edited directly, but gets all its changes by digesting edits made to Wikipedia.
Freebase also digests changes made to Wikipedia (but it's not clear to me exactly how these are then acted on), but is also edited directly -- and I don't see a mechanism that routes such direct Freebase edits back to Wikipedia (or elsewhere).
Regards,
Ted
Editing will be possible in Wikidata directly. These changes will be visible in whatever gets its data from Wikidata, like Wikipedia. The rest is a bit outside the scope of this list.
Cheers Lydia
On 4/18/12 9:50 AM, Ted Thibodeau Jr wrote:
On Apr 18, 2012, at 09:18 AM, Kingsley Idehen wrote:
On 4/18/12 3:40 AM, Nicolas Torzec wrote:
Hi,
+1 for a wiki/FAQ describing the Wikipedia ecosystem and the relationships between Wikidata and the other semantic projects around Wikipedia. This is a legitimate question for the general public and the press. And it's also probably useful for many persons in this field, as illustrated in this thread.
We could start with a general overview of Wikipedia and its "semantic" ecosystem, as proposed by Kingsley. Then we could have a summary of each project, with a highlight of its key difference with Wikipedia, Wikidata and other projects. And we could finish with a table summarizing the goal and difference of each project, with links to project pages
Or is it too verbose? (in which case we might just want the introduction and the final table...)
-Nicolas.
So we have this rough draft categorization:
Content Consumers& Structured Data Transformers:
- Freebase
- DBpedia .
Data Wikis:
- Freebase
- Wikidata
*DBpedia isn't a Data Wiki because edits occur via the Wikipedia content Wiki*
Data Wiki Platforms:
- OntoWiki
- Semantic MediaWiki.
Content Wikis:
- Wikipedia
Content Wiki Platforms:
- MediaWiki
- Other Wiki platforms .
One element I see missing from (or at least, unclear in) the discussion thus far, is how corrections or other changes are made to the data therein, and how changes made in one set are fed (back) to the others. In other words --
When an error is discovered in data on service "x", where are edits made to correct that data?
If edits are made locally (on service "x") to content which originated on another service ("y"), do those edits also get applied to the original source, and if so, how (e.g., automatically by service "x"; manually by the user; manually by service "x" admin team; etc.)?
I think information like this is needed throughout, and will help a lot in demonstrating the complementary nature of these various services.
DBpedia content, for instance, is not edited directly, but gets all its changes by digesting edits made to Wikipedia.
Freebase also digests changes made to Wikipedia (but it's not clear to me exactly how these are then acted on), but is also edited directly -- and I don't see a mechanism that routes such direct Freebase edits back to Wikipedia (or elsewhere).
Regards,
Ted
Provenance data is one functionality realm I expect Wikidata to accentuate. Basically, this is has to be a critical component of any Data Wiki oriented initiative.
Kingsley
-- A: Yes. http://www.guckes.net/faq/attribution.html | Q: Are you sure? | | A: Because it reverses the logical flow of conversation. | | | Q: Why is top posting frowned upon?
Ted Thibodeau, Jr. // voice +1-781-273-0900 x32 Senior Support& Evangelism // mailto:tthibodeau@openlinksw.com // http://twitter.com/TallTed OpenLink Software, Inc. // http://www.openlinksw.com/ 10 Burlington Mall Road, Suite 265, Burlington MA 01803 Weblog -- http://www.openlinksw.com/blogs/ LinkedIn -- http://www.linkedin.com/company/openlink-software/ Twitter -- http://twitter.com/OpenLink Google+ -- http://plus.google.com/100570109519069333827/ Facebook -- http://www.facebook.com/OpenLinkSoftware Universal Data Access, Integration, and Management Technology Providers
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
On Tue, Apr 17, 2012 at 4:27 PM, Lydia Pintscher lydia.pintscher@wikimedia.de wrote:
On Tue, Apr 17, 2012 at 10:23 PM, Kingsley Idehen kidehen@openlinksw.com wrote:
On 4/17/12 11:32 AM, Dario Taraborelli wrote:
Shall we create a Wikidata vs {Freebase, DBpedia, YAGO} comparison table on meta (or enwiki)?
Not a "versus" style table. That sends the wrong signals when these services are fundamentally complimentary .
Yes. I'd prefer a short text - something we can add to the existing FAQ. (That's also what I have so far.)
I actually like tables myself and don't see them as being adversarial, but I'm happy to contribute to the comparison in whatever form it takes.
Having a matrix/table: a) focuses people on choosing a few important dimensions to summarize and b) makes any holes in the comparison matrix obvious so they can be filled in (which is much harder when parsing text).
Tom
On 4/18/12 10:25 AM, Tom Morris wrote:
On Tue, Apr 17, 2012 at 4:27 PM, Lydia Pintscher lydia.pintscher@wikimedia.de wrote:
On Tue, Apr 17, 2012 at 10:23 PM, Kingsley Idehen kidehen@openlinksw.com wrote:
On 4/17/12 11:32 AM, Dario Taraborelli wrote:
Shall we create a Wikidata vs {Freebase, DBpedia, YAGO} comparison table on meta (or enwiki)?
Not a "versus" style table. That sends the wrong signals when these services are fundamentally complimentary .
Yes. I'd prefer a short text - something we can add to the existing FAQ. (That's also what I have so far.)
I actually like tables myself and don't see them as being adversarial, but I'm happy to contribute to the comparison in whatever form it takes.
Having a matrix/table: a) focuses people on choosing a few important dimensions to summarize and b) makes any holes in the comparison matrix obvious so they can be filled in (which is much harder when parsing text).
Tom
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
Tom,
To clarify the context of my comment.
I meant "versus" is adversarial. I have nothing against tables and the coherence of tabular style presentation.