Hello Stas,

+1 for an .nt (N-Triples) RDF dump of WD since, as you also said, it is easier to process!
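
To illustrate what "easier processing" can mean here, below is a minimal sketch that assumes only the general N-Triples property of one triple per line, with a subject and predicate that contain no whitespace. The function name and the example.org sample data are made up for illustration, and this is not a full N-Triples parser (it does not validate terms or unescape literals).

```python
import io
from collections import Counter


def count_predicates(lines):
    """Count predicate usage by streaming an N-Triples source line by line."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        # Skip blank lines and comment lines.
        if not line or line.startswith("#"):
            continue
        # Subject and predicate contain no whitespace, so two splits are
        # enough; the remainder holds the object and the closing " .".
        _subject, predicate, _rest = line.split(None, 2)
        counts[predicate] += 1
    return counts


# Made-up sample data; a real dump would be read from a (possibly
# compressed) file object in exactly the same streaming fashion.
sample = io.StringIO(
    '<http://example.org/a> <http://example.org/p> <http://example.org/b> .\n'
    '<http://example.org/a> <http://example.org/q> "some text with spaces" .\n'
    '<http://example.org/b> <http://example.org/p> <http://example.org/c> .\n'
)
predicate_counts = count_predicates(sample)
```

Since every line stands alone, the same loop works on a dump of any size without loading it into memory, and the file can be split at line boundaries for parallel processing.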

Regards,
Fariz

On Fri, Aug 26, 2016 at 10:52 PM, Stas Malyshev <smalyshev@wikimedia.org> wrote:
Hi!

> Of course if providing both is easy, then there's no reason not to
> provide both.

Technically it's quite easy - you just run the same script with
different options. So the only question is what is useful.

> It is useful in such applications to know the online RDF documents in
> which a triple can be found. The document could be the entity, or it
> could be a physical location like:
>
> http://www.wikidata.org/entity/Q13794921.ttl

That's where the tricky part is: many triples won't have a single
specific document, since they may appear in many documents. Of course,
if you merge all these documents into a dump, each triple would appear
only once (we have special deduplication code to take care of that), but
then it's impossible to trace it back to a specific document. So I
understand the idea, and see how it may be useful, but I don't see a
real way to implement it now.
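
The problem can be sketched with toy data. This is only an illustration under stated assumptions, not the actual deduplication code: the document names, the `ex:` triples, and both function names are hypothetical.

```python
from collections import defaultdict


def merge_dedup(docs):
    """Merge per-entity documents into one dump, keeping each triple once."""
    seen = set()
    dump = []
    for _doc, triples in docs.items():
        for triple in triples:
            if triple not in seen:
                seen.add(triple)
                dump.append(triple)
    return dump


def triple_sources(docs):
    """Map each triple to every document that contains it."""
    index = defaultdict(set)
    for doc, triples in docs.items():
        for triple in triples:
            index[triple].add(doc)
    return index


# Hypothetical toy documents: both repeat the same triple about ex:C.
docs = {
    "doc-A.ttl": [
        ("ex:A", "ex:instanceOf", "ex:C"),
        ("ex:C", "ex:label", '"shared"'),
    ],
    "doc-B.ttl": [
        ("ex:B", "ex:instanceOf", "ex:C"),
        ("ex:C", "ex:label", '"shared"'),
    ],
}

dump = merge_dedup(docs)
sources = triple_sources(docs)
shared = ("ex:C", "ex:label", '"shared"')
```

In this toy case `dump` holds the shared triple once with no record of where it came from, while `sources[shared]` lists both documents. So even with extra bookkeeping, the answer for such a triple is a set of documents, not one document.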

--
Stas Malyshev
smalyshev@wikimedia.org

_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata