On Wed, Aug 5, 2015 at 8:32 PM, Vladimir Alexiev <vladimir.alexiev@ontotext.com> wrote:
Hi folks!

We need a full mapping of WD item -> enwiki sitelinks.

1. We extracted from Dbpedia 2015-04 all statements of the form
  <http://dbpedia.org/resource/Northern_Ireland>
<http://www.w3.org/2002/07/owl#sameAs> <http://wikidata.org/entity/Q26>
And the count is 5882410

2. Checked with WDQ:
https://wdq.wmflabs.org/api?q=link[enwiki]&noitems=1
"items":6263098
6.08% are missing from DBpedia. That's a lot

Hi Vladimir,

The reason might be that the DBpedia dumps are based on Wikidata dumps from March
If you care to give it a try you can try running the extraction framework with "wikidata" and use only the "WikidataSameAsExtractor" extractor.

Cheers,
Dimitris
 
How to get them from Wikidata?

3. WDQ doesn't seem to return sitelinks.
https://wdq.wmflabs.org/api?q=link[enwiki]&props=enwiki returns just item
numbers

4. The SPARQL endpoint doesn't seem to have them:
http://wdqs-beta.wmflabs.org/

prefix schema: <http://schema.org/>
select * {?x schema:about ?y}

returns nothing.
https://www.mediawiki.org/wiki/Wikibase/Indexing/RDF_Dump_Format#WDQS_data_d
ifferences
says "5. Depending on the instance of the service, multi-language labels and
sitelinks may or may not be supported."
I think this service doesn't have sitelinks: is there one that has them?



_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata



--
Kontokostas Dimitris