Am 01.07.2016 um 01:42 schrieb Nuria Ruiz:
> Is this data always requested via http from an api endpoint that will hit a
> varnish cache? (Daniel can probably answer this)
Yes. Special:EntityData is a regular special page, and action=wbgetentities is a
regular MW web API request, as your example shows.
> If the data you are interested in can be inferred from these requests there is
> no additional data gathering needed.
Yay!
> Nor does it tell us how
> often statements/RDF triples show up in the Wikidata Query Service.
I'm no expert on the query service, adding Stas for that. As far as I know,
SPARQL queries go through Varnish directly to BlazeGraph. In any case, they are
not processed by MediaWiki at all. Tracking how often an entity is mentioned in
a GET request to the SPARQL service should be possible based on the varnish
request logs, with a bit of regex magic. POST requests are more tricky, I suppose.
However, I don't think we are logging the contents of responses at all. I
suppose that would have to be build into BlazeGraph somehow. And even if we did
that, that would only tell use which entities were present in a result, not
which entities were used to answer a query. E.g. if you list all instances of a
class (including subclasses), the entities representing the classes are
essential to answering the query, but they are not present in the result (and
only the top-most class is present in the query).
--
Daniel Kinzler
Senior Software Developer
Wikimedia Deutschland
Gesellschaft zur Förderung Freien Wissens e.V.