On Thu, Jun 4, 2009 at 10:53 AM, David Gerard dgerard@gmail.com wrote:
I understand the problem with stats before was that the stats server would melt under the load. Leon's old wikistats page sampled 1:1000. The current stats (on dammit.lt and served up nicely on http://stats.grok.se) are every hit, but I understand (Domas?) that it was quite a bit of work to get the firehose of data in such a form as not to melt the receiving server trying to process it.
OK, then the problem becomes: how to set up something like stats.grok.se feasibly internally for all the other data gathered from a hit? (Modulo stuff that needs to be blanked per privacy policy.)
What exactly are people looking for that isn't available from stats.grok.se that isn't a privacy concern?
I had assumed that people kept installing these bugs because they wanted source network break downs per-article and other clear privacy violations.