Hello Stas,
+1 for .nt RDF dump of WD due to (as you also said) easier processing!
Regards, Fariz
On Fri, Aug 26, 2016 at 10:52 PM, Stas Malyshev smalyshev@wikimedia.org wrote:
Hi!
Of course if providing both is easy, then there's no reason not to provide both.
Technically it's quite easy - you just run the same script with different options. So the only question is what is useful.
It is useful in such applications to know the online RDF documents in which a triple can be found. The document could be the entity, or it could be a physical location like:
That's where the tricky part is: many triples won't have specific document there since they may appear in many documents. Of course, if you merge all these documents in a dump, the triple would appear only once (we have special deduplication code to take care of that) but it's impossible to track it back to a specific document then. So I understand the idea, and see how it may be useful, but I don't see a real way to implement it now.
-- Stas Malyshev smalyshev@wikimedia.org
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata