Hi Wikidata community,
Somebody pointed me to the following issue: https://phabricator.wikimedia.org/T179681 Unfortunately, I'm not able to log in to Phabricator, so I cannot edit the issue directly. I'm sending this email instead.
The issue seems to be stalled because it is supposedly not possible to create HDT files that contain more than 2B triples. However, this is possible with a specific 64-bit branch of hdt-cpp, which is how I created the downloadable version I sent a few days ago. As indicated, I can create these files for the community if there is a use case.
--- Cheers, Wouter.
Email: wouter@triply.cc WWW: http://triply.cc Tel: +31647674624
On Tue, Dec 12, 2017 at 11:24 AM, Wouter Beek wouter@triply.cc wrote:
Hi list,
I'm sorry, I was under the impression that I had already shared this resource with you earlier, but I haven't...
On 7 Nov I created an HDT file based on the then-current download link from https://dumps.wikimedia.org/wikidatawiki/entities/latest-all.ttl.gz
You can download this HDT file and its index from the following locations:
- http://lod-a-lot.lod.labs.vu.nl/data/wikidata.hdt (~45GB)
- http://lod-a-lot.lod.labs.vu.nl/data/wikidata.hdt.index.v1-1 (~28GB)
Because the file contains more than 2B triples (4,579,973,187, to be exact), you may need to compile hdt-cpp with 64-bit support in order to read it: https://github.com/rdfhdt/hdt-cpp/tree/develop-64
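To make the size constraint concrete, here is a small arithmetic sketch. It assumes that the "2B" limit corresponds to counters or identifiers held in 32-bit integers in a default build; that is my reading of the limit, not something I have verified in the hdt-cpp source. The constant names are only for illustration.

```python
# Triple count quoted in this email.
TRIPLE_COUNT = 4_579_973_187

# Largest values representable in common fixed-width integer types.
INT32_MAX = 2**31 - 1    # signed 32-bit, roughly 2.1 billion
UINT32_MAX = 2**32 - 1   # unsigned 32-bit, roughly 4.3 billion
INT64_MAX = 2**63 - 1    # signed 64-bit


def fits(value, limit):
    """Return True if a non-negative value can be stored under the given limit."""
    return 0 <= value <= limit


def min_bits(value):
    """Number of bits needed to represent a non-negative integer."""
    return value.bit_length()


if __name__ == "__main__":
    print("fits in signed 32-bit:  ", fits(TRIPLE_COUNT, INT32_MAX))
    print("fits in unsigned 32-bit:", fits(TRIPLE_COUNT, UINT32_MAX))
    print("fits in signed 64-bit:  ", fits(TRIPLE_COUNT, INT64_MAX))
    print("bits required:          ", min_bits(TRIPLE_COUNT))
```

Under that assumption, the count overflows both the signed and the unsigned 32-bit range, so a wider integer type is needed regardless of which of the two a given build uses.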
PS: If this resource turns out to be useful to the community, we can offer an updated HDT file at an interval to be determined.
Cheers, Wouter Beek.
Email: wouter@triply.cc WWW: http://triply.cc Tel: +31647674624
On Tue, Nov 7, 2017 at 6:31 PM, Laura Morales lauretas@mail.com wrote:
drops `a wikibase:Item` and `a wikibase:Statement` types
off topic but... why drop `a wikibase:Item`? Without this it seems impossible to retrieve a list of items.
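A toy illustration of the concern, using plain Python tuples rather than a real triple store. The three triples are a hand-made sample, not taken from the dump, and the helper name items_by_type is hypothetical. It only shows that a lookup by type finds nothing once the type triples are gone.

```python
WD = "http://www.wikidata.org/entity/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
WIKIBASE_ITEM = "http://wikiba.se/ontology#Item"

# Hand-made sample: one item (Q42) and one property (P31).
triples_full = {
    (WD + "Q42", RDF_TYPE, WIKIBASE_ITEM),
    (WD + "Q42", RDFS_LABEL, "Douglas Adams"),
    (WD + "P31", RDFS_LABEL, "instance of"),
}

# The same data with the `a wikibase:Item` triples dropped.
triples_stripped = {
    t for t in triples_full
    if not (t[1] == RDF_TYPE and t[2] == WIKIBASE_ITEM)
}


def items_by_type(triples):
    """Return the subjects explicitly typed as wikibase:Item, sorted."""
    return sorted(
        s for (s, p, o) in triples
        if p == RDF_TYPE and o == WIKIBASE_ITEM
    )


if __name__ == "__main__":
    print(items_by_type(triples_full))      # Q42 is found via its type
    print(items_by_type(triples_stripped))  # nothing left to match on
```

Without the type triples, a consumer would have to fall back on some other signal to tell items apart from other subjects, which is the difficulty raised above.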
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata