Here is a short program that solves your problem:
It is in Java, so you need that (and Maven) to run it, but that's the
only technical challenge ;-). You can run the program in various ways,
as described in the README:
The program I wrote puts everything into a CSV file, but you can of
course also write RDF triples if you prefer, or any other format you
wish. The code should be easy to modify.
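If you change the output format, one detail worth keeping is proper CSV quoting, since Wikipedia page titles can contain commas or quotes. A minimal sketch of the escaping logic (the helper names and the column layout - item id, GND id, page title - are my own illustration, not necessarily what the program uses):

```java
import java.util.List;

public class CsvRow {

    /** Quotes a field per RFC 4180 if it contains a comma, quote, or newline. */
    static String escape(String field) {
        if (field.contains(",") || field.contains("\"") || field.contains("\n")) {
            return "\"" + field.replace("\"", "\"\"") + "\"";
        }
        return field;
    }

    /** Joins one output row, e.g. item id, GND id, Wikipedia page title. */
    static String row(List<String> fields) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < fields.size(); i++) {
            if (i > 0) sb.append(',');
            sb.append(escape(fields.get(i)));
        }
        return sb.toString();
    }
}
```

With this, a title like "Berlin, Germany" round-trips safely through any standard CSV reader.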
On a first run, the tool will download the current Wikidata dump, which
takes a while (it's about 6 GB), but after this you can find and serialise
all results in less than half an hour (at a processing rate of around
10K items/second). A regular laptop is enough to run it.
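Processing the dump line by line is what makes this fast: the JSON dump contains one entity per line, so even a plain text scan gets you a quick estimate of the result size before running the full extraction. A crude stdlib-only sketch (this textual pre-filter is my own illustration, not how the program itself works - it is not a JSON parser):

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;

public class GndLineCounter {

    /**
     * Counts dump lines mentioning P227 (the GND identifier property).
     * Works on the gzipped JSON dump, one entity per line; a textual
     * match is only a rough upper bound, not a real parse.
     */
    public static long countMatches(InputStream gzipped) throws IOException {
        long matches = 0;
        try (BufferedReader r = new BufferedReader(new InputStreamReader(
                new GZIPInputStream(gzipped), StandardCharsets.UTF_8))) {
            String line;
            while ((line = r.readLine()) != null) {
                if (line.contains("\"P227\"")) {
                    matches++;
                }
            }
        }
        return matches;
    }
}
```

Pointing this at a local copy of the dump gives you the approximate number of items with a GND mapping in a single pass.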
On 11.02.2016 01:34, Stas Malyshev wrote:
I am trying to extract all mappings from Wikidata to
the GND authority file,
along with the corresponding Wikipedia pages, expecting roughly 500,000 to
1M triples as a result.
As a starting note, I don't think extracting 1M triples is the best
way to use the query service. If you need to do processing that returns such
big result sets - in the millions - maybe processing the dump - e.g. with
Wikidata Toolkit at https://github.com/Wikidata/Wikidata-Toolkit - would
be a better idea?
However, with various calls, I get far fewer
triples (about 2,000 to
10,000). The output seems to be truncated in the middle of a statement, e.g.
It may be some kind of timeout because of the quantity of data being
sent. How long does such a request take?