On 2014-04-27 08:45, ENWP Pine wrote:
* I think it would be desirable to add an https option
to
stats.grok.se so that viewers' interests in page readership statistics
are more private.
Hm, why not? I'll request a certificate and start serving https also. I
hadn't really thought of page readership statistics as something all
that sensitive, but I don't see any downside to also serving https.
* There is an issue in the statistics given at [1]. As
you can see
from [2] editors created and edited the project page on days when
stats.grok.se said there were no pageviews. This may be the result of
a page move [3] [4] and the pre-move views were not integrated into
the results shown in [1]. Is this the expected and desired behavior?
From my point of view as a Signpost author, this is undesirable as we
try to track our readership statistics. I think it is the case that if
page A is moved to new page B then the statistics for page A should be
integrated into those for page B and a notice should be given to the
viewer that the statistics for page B includes those from page A which
was moved on date X. This problem may affect other pages that are the
subject of mergers. I think it would be the case that if page A is
merged into page B then we would want some notice to appear on
stats.grok.se alerting the viewer that there was a merger, the date of
the merger, and offering the viewer a way to select statistics with
and without the historical information from page A as they look at the
viewership statistics for page B.
That would indeed be better. However, the statistics data tracks URLs
rather than pages and it's computationally expensive to look up the page
history and what URLs it has been accessible through. The average
throughput of view statistics is that some tens of thousands of entries
are added per minute 24/7. One could perhaps do it as the data was
requested, but that would mean making several round-trips to the WMF
servers to look up the history of all moves and correlate URLs across
time. It's certainly possible to build a tool that does that on top of
the stats.grok.se and wikipedia APIs though.
-henrik