Hi 

I was able to reduce the load time to approximately 9.1 hours (32890338 msec) in Virtuoso 7.
I used 6 SSDs of 1T each in RAID 0 (mdadm software RAID; I have not tried hardware RAID).
The virtuoso.ini I used, tuned for 256G of RAM, is at
https://gist.github.com/asanchez75/58d5aed504051c7fbf9af0921c3c9130
I downloaded the dump from 
https://dumps.wikimedia.org/wikidatawiki/entities/latest-all.ttl.gz 
on August 30th.
The dump is 387G uncompressed and the resulting virtuoso.db file is 362G. The total number of triples is 9,470,700,617.
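
As a quick sanity check on these figures, here is a minimal Python sketch that converts the reported load time to hours and derives an average load rate. The variable names are only illustrative, and the comparison with the 43-hour load quoted further down in this thread is rough at best, since that run used different hardware (128G RAM) and an earlier dump.

```python
# Figures reported in this message.
load_time_msec = 32_890_338
total_triples = 9_470_700_617

# Load time quoted further down in this thread (different machine and dump).
previous_load_hours = 43

load_time_sec = load_time_msec / 1000
load_time_hours = load_time_sec / 3600

# Average rate over the whole load; the instantaneous rate will vary.
triples_per_sec = total_triples / load_time_sec

# Rough ratio only: the two runs are not a controlled comparison.
speedup = previous_load_hours / load_time_hours

print(f"load time: {load_time_hours:.2f} hours")
print(f"average rate: {triples_per_sec:,.0f} triples/sec")
print(f"rough speed-up vs. 43 hours: {speedup:.1f}x")
```

This gives a load time of about 9.1 hours, consistent with the number above, and an average rate in the high 200,000s of triples per second.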
Have a look at the simple patch here (it is just a workaround):
https://github.com/asanchez75/virtuoso-opensource/commit/5d7b1b9b29e53cb8a25bed69f512a150f9f05d50
You can build your own Docker image with that patch using
https://github.com/asanchez75/docker-virtuoso/tree/brendan
See the Dockerfile, which retrieves the patch from my fork of the Virtuoso git repository:
https://github.com/asanchez75/docker-virtuoso/blob/brendan/Dockerfile


Best,




On Sun, Sep 1, 2019 at 13:38, Edgar Meij <edgar.meij@gmail.com> wrote:
Thanks for this, Kingsley.

Based on https://docs.google.com/spreadsheets/d/1-stlTC_WJmMU3xA_NxA1tSLHw6_sbpjff-5OITtrbFw/edit#gid=1799898600 (copy-pasted below), it seems that it takes 43 hours to load. Is that correct?

Also, what is the "patch for geometry" mentioned there? I'm assuming that is the patch meant to address https://github.com/openlink/virtuoso-opensource/issues/295 and https://community.openlinksw.com/t/non-terrestrial-geo-literals/359, correct? Is it simply disabling the data validation code? Can you share the patch?

Thanks,
Edgar


Other Information

Architecture:          x86_64
CPU op-mode(s):        32-bit, 64-bit
Byte Order:            Little Endian
CPU(s):                12.00
On-line CPU(s) list:   0-11
Thread(s) per core:    2.00
Core(s) per socket:    6.00
Socket(s):             1.00
NUMA node(s):          1.00
Vendor ID:             GenuineIntel
CPU family:            6.00
Model:                 63.00
Model name:            Intel(R) Xeon(R) CPU E5-1650 v3 @ 3.50GHz
Stepping:              2.00
CPU MHz:               1,199.92
CPU max MHz:           3,800.00
CPU min MHz:           1,200.00
BogoMIPS:              6,984.39
Virtualization:        VT-x
L1d cache:             32K
L1i cache:             32K
L2 cache:              256K
L3 cache:              15360K
NUMA node0 CPU(s):     0-11

RAM:                              128G
wikidata-20190610-all-BETA.ttl:   383G
Virtuoso version:                 07.20.3230 (with patch for geometry)
Time to load:                     43 hours
virtuoso.db:                      340G

On Wed, Aug 14, 2019 at 12:10 AM Kingsley Idehen <kidehen@openlinksw.com> wrote:

Hi Everyone,

A little FYI.

We have loaded Wikidata into a Virtuoso instance accessible via SPARQL [1]. One benefit is helping to understand Wikidata using our Faceted Browsing Interface for Entity Relationship Types [2][3].

Links:

[1] http://wikidata.demo.openlinksw.com/sparql -- SPARQL endpoint

[2] http://wikidata.demo.openlinksw.com/fct -- Faceted Browsing Interface

[3] About New York

Enjoy!

Feedback always welcome too :)

-- 
Regards,

Kingsley Idehen	      
Founder & CEO 
OpenLink Software   
Home Page: http://www.openlinksw.com
Community Support: https://community.openlinksw.com
Weblogs (Blogs):
Company Blog: https://medium.com/openlink-software-blog
Virtuoso Blog: https://medium.com/virtuoso-blog
Data Access Drivers Blog: https://medium.com/openlink-odbc-jdbc-ado-net-data-access-drivers

Personal Weblogs (Blogs):
Medium Blog: https://medium.com/@kidehen
Legacy Blogs: http://www.openlinksw.com/blog/~kidehen/
              http://kidehen.blogspot.com

Profile Pages:
Pinterest: https://www.pinterest.com/kidehen/
Quora: https://www.quora.com/profile/Kingsley-Uyi-Idehen
Twitter: https://twitter.com/kidehen
Google+: https://plus.google.com/+KingsleyIdehen/about
LinkedIn: http://www.linkedin.com/in/kidehen

Web Identities (WebID):
Personal: http://kingsley.idehen.net/public_home/kidehen/profile.ttl#i
        : http://id.myopenlink.net/DAV/home/KingsleyUyiIdehen/Public/kingsley.ttl#this

_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata