Hi Stats,
out of curiosity, can you give an example of triples that do not originate
from a single wikidata item / property?
for me turtle dumps are process-able only by RDF tools while nt-like dumps
both by rdf tools and other kind of scripts and I fild the former redundant
On Fri, Aug 26, 2016 at 11:52 PM, Stas Malyshev <smalyshev(a)wikimedia.org>
wrote:
Hi!
Of course if providing both is easy, then
there's no reason not to
provide both.
Technically it's quite easy - you just run the same script with
different options. So the only question is what is useful.
It is useful in such applications to know the
online RDF documents in
which a triple can be found. The document could be the entity, or it
could be a physical location like:
http://www.wikidata.org/entity/Q13794921.ttl
That's where the tricky part is: many triples won't have specific
document there since they may appear in many documents. Of course, if
you merge all these documents in a dump, the triple would appear only
once (we have special deduplication code to take care of that) but it's
impossible to track it back to a specific document then. So I understand
the idea, and see how it may be useful, but I don't see a real way to
implement it now.
--
Stas Malyshev
smalyshev(a)wikimedia.org
_______________________________________________
Wikidata mailing list
Wikidata(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata
--
Kontokostas Dimitris