Hi list
Where can one find the mapping between Wikidata and Wikipedia. I can see the links online in both Wikidata and Wikipedia, but I cannot find the links in the RDF-dump f.ex. of Wikidata, nor in the XML-dump of Wikipedia.
With regards, Rune Stilling
Hello Rune,
They are encoded in the RDF dumps of Wikidata using the http://schema.org/about property e.g.:
<https://en.wikipedia.org/wiki/Douglas_Adams https://en.wikipedia.org/wiki/Douglas_Adams> schema:about < http://www.wikidata.org/entity/Q42%3E
Best,
Thomas
Le mer. 22 nov. 2017 à 16:21, Rune Stilling subs@rdfined.dk a écrit :
Hi list
Where can one find the mapping between Wikidata and Wikipedia. I can see the links online in both Wikidata and Wikipedia, but I cannot find the links in the RDF-dump f.ex. of Wikidata, nor in the XML-dump of Wikipedia.
With regards, Rune Stilling
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Hi Thomas
Thank you - But I simply cannot find this data in the latest-truths.nt-file.
F.ex. the following grep command returns nothing:
grep ^<https://en%5C.wikipedia%5C.org latest-truthy-BETA.nt
Also when I query my local triplestore installed with the RDF-data I cannot find this relation.
Best, Rune
Den 22. nov. 2017 kl. 16.40 skrev Thomas Pellissier Tanon thomas@pellissier-tanon.fr:
Hello Rune,
They are encoded in the RDF dumps of Wikidata using the http://schema.org/about http://schema.org/about property e.g.:
<https://en.wikipedia.org/wiki/Douglas_Adams https://en.wikipedia.org/wiki/Douglas_Adams> schema:about <http://www.wikidata.org/entity/Q42 http://www.wikidata.org/entity/Q42>
Best,
Thomas
Le mer. 22 nov. 2017 à 16:21, Rune Stilling <subs@rdfined.dk mailto:subs@rdfined.dk> a écrit : Hi list
Where can one find the mapping between Wikidata and Wikipedia. I can see the links online in both Wikidata and Wikipedia, but I cannot find the links in the RDF-dump f.ex. of Wikidata, nor in the XML-dump of Wikipedia.
With regards, Rune Stilling
Wikidata mailing list Wikidata@lists.wikimedia.org mailto:Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata https://lists.wikimedia.org/mailman/listinfo/wikidata
Hi Thomas
Thank you - But I simply cannot find this data in the latest-truths.nt-file.
F.ex. the following grep command returns nothing:
grep ^<https://en%5C.wikipedia%5C.org https://en/.wikipedia/.org latest-truthy-BETA.nt
Also when I query my local triplestore installed with the RDF-data I cannot find this relation.
Best, Rune
Den 22. nov. 2017 kl. 16.40 skrev Thomas Pellissier Tanon <thomas@pellissier-tanon.fr mailto:thomas@pellissier-tanon.fr>:
Hello Rune,
They are encoded in the RDF dumps of Wikidata using the http://schema.org/about http://schema.org/about property e.g.:
<https://en.wikipedia.org/wiki/Douglas_Adams https://en.wikipedia.org/wiki/Douglas_Adams> schema:about <http://www.wikidata.org/entity/Q42 http://www.wikidata.org/entity/Q42>
Best,
Thomas
Le mer. 22 nov. 2017 à 16:21, Rune Stilling <subs@rdfined.dk mailto:subs@rdfined.dk> a écrit : Hi list
Where can one find the mapping between Wikidata and Wikipedia. I can see the links online in both Wikidata and Wikipedia, but I cannot find the links in the RDF-dump f.ex. of Wikidata, nor in the XML-dump of Wikipedia.
With regards, Rune Stilling
Wikidata mailing list Wikidata@lists.wikimedia.org mailto:Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata https://lists.wikimedia.org/mailman/listinfo/wikidata
I neither can find the triple. I have searched with
$ pv wikidata-20170613-truthy-BETA.nt.bz2 | bzcat | grep "https://en.wikipedia.org/wiki/Wikidata" 5.12GiB 0:36:11 [2.41MiB/s] [====================================================================================================>] 100%
and nothing comes up.
The truthy dumps seem not have have that particular type of triple, while it is available in the full triple format, e.g., at https://www.wikidata.org/entity/Q2013.nt
I think you need to download the big one: https://dumps.wikimedia.org/wikidatawiki/entities/latest-all.ttl.gz
The "all" one has the "schema:about" triple. For instance:
https://en.wikipedia.org/wiki/Northern_Ireland a schema:Article ; schema:about wd:Q26 ; schema:inLanguage "en" ; schema:isPartOf https://en.wikipedia.org/ ; schema:name "Northern Ireland"@en .
--- Finn Årup Nielsen http://people.compute.dtu.dk/faan/
On 11/23/2017 08:46 AM, Rune Stilling wrote:
Hi Thomas
Thank you - But I simply cannot find this data in the latest-truths.nt-file.
F.ex. the following grep command returns nothing:
grep ^<https://en%5C.wikipedia%5C.org https://en%5C.wikipedia%5C.org latest-truthy-BETA.nt
Also when I query my local triplestore installed with the RDF-data I cannot find this relation.
Best, Rune
Den 22. nov. 2017 kl. 16.40 skrev Thomas Pellissier Tanon <thomas@pellissier-tanon.fr mailto:thomas@pellissier-tanon.fr>:
Hello Rune,
They are encoded in the RDF dumps of Wikidata using the http://schema.org/about property e.g.:
<https://en.wikipedia.org/wiki/Douglas_Adams https://en.wikipedia.org/wiki/Douglas_Adams> schema:about http://www.wikidata.org/entity/Q42
Best,
Thomas
Le mer. 22 nov. 2017 à 16:21, Rune Stilling <subs@rdfined.dk mailto:subs@rdfined.dk> a écrit :
Hi list Where can one find the mapping between Wikidata and Wikipedia. I can see the links online in both Wikidata and Wikipedia, but I cannot find the links in the RDF-dump f.ex. of Wikidata, nor in the XML-dump of Wikipedia. With regards, Rune Stilling _______________________________________________ Wikidata mailing list Wikidata@lists.wikimedia.org <mailto:Wikidata@lists.wikimedia.org> https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Hi Thomas
I have found the links in the ttl-file.
Another question. What happens if a Wikipedia title/url changes. How do you handle this? Do you update Wikidata manually (the link from Wikidata to Wikipedia)?
Best, Rune
Den 23. nov. 2017 kl. 08.46 skrev Rune Stilling subs@rdfined.dk:
Hi Thomas
Thank you - But I simply cannot find this data in the latest-truths.nt-file.
F.ex. the following grep command returns nothing:
grep ^<https://en%5C.wikipedia%5C.org https://en/.wikipedia/.org latest-truthy-BETA.nt
Also when I query my local triplestore installed with the RDF-data I cannot find this relation.
Best, Rune
Den 22. nov. 2017 kl. 16.40 skrev Thomas Pellissier Tanon <thomas@pellissier-tanon.fr mailto:thomas@pellissier-tanon.fr>:
Hello Rune,
They are encoded in the RDF dumps of Wikidata using the http://schema.org/about http://schema.org/about property e.g.:
<https://en.wikipedia.org/wiki/Douglas_Adams https://en.wikipedia.org/wiki/Douglas_Adams> schema:about <http://www.wikidata.org/entity/Q42 http://www.wikidata.org/entity/Q42>
Best,
Thomas
Le mer. 22 nov. 2017 à 16:21, Rune Stilling <subs@rdfined.dk mailto:subs@rdfined.dk> a écrit : Hi list
Where can one find the mapping between Wikidata and Wikipedia. I can see the links online in both Wikidata and Wikipedia, but I cannot find the links in the RDF-dump f.ex. of Wikidata, nor in the XML-dump of Wikipedia.
With regards, Rune Stilling
Wikidata mailing list Wikidata@lists.wikimedia.org mailto:Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata https://lists.wikimedia.org/mailman/listinfo/wikidata
Rune Stilling, 11/12/2017 17:55:
Another question. What happens if a Wikipedia title/url changes. How do you handle this? Do you update Wikidata manually (the link from Wikidata to Wikipedia)?
If the page is renamed, the same action also updates the linked Wikidata entity, so there is no additional work for editors most of the time.
I don't know what you're supposed to do as a data consumer: if you produce links and want to ensure they keep working, then I guess you should convert to page IDs and use a link of the form https://en.wikipedia.org/?curid=8091
Federico