Hopefully this is the right mailing list for my topic.
The German Verein für Computergenealogie is the largest genealogical
society in Germany with more than 3,700 members. We are currently
considering whether Wikibase is a suitable system for us. Most
interesting is the use for our *propographical data*.
Prosopographical data can be divided into three classes:
a) well-known and well-studied personalities, typically authors
b) lesser-known but well-studied personalities that can be clearly and
easily identified in historical sources
c) persons whose identifiability in various sources (such as church
records, civil record, city directory) has to be established using
(mostly manual) record linkage
Data from (a) can be found in the GND of the German National Libarary.
For data from class (b) systems such as FactGrid exists. The Verein für
Computergenealogie mostly works with data from class (c). We have a huge
amount of that kind of data, more than 40 million records. Currently it
is stored in several MySQL and MongoDB databases.
This leads me to the crucial question: Is the performance of Wikibase
sufficient for such an amount of data? One record for a person will
typically result in maybe ten statements in Wikibase. Using
QuickStatements or the WDI library I have not been able to insert more
than two or three statements per second. It would take month to import
the data.
Another question is whether the edit history of the entries can be
preserved. For some data set the edit history goes back to 2004.
I hope someone can give me hints on these questions.
Best wishes
Jesper
_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata