(cc-ing Tim starling who is credited on your dataset page and might know more about this)
>
I would like to ask for your comments about compiling a similar (updated) data set and making it public.
As far as I can see the prior dataset contained the following:
Counter, timestamp, url, save flag
I can see how we could get a dataset with timestamp and url and adding a counter is something it can be done (on our actual system though ordering of requests is not guranteed in logs). Now, I really do not know whether it is possible to add a flag of whether the request was a save or not. As far as I know that is not information we have on our current system and it seems that it will require tapping into the cache lookups to get that info. Meaning that you would need to get that info from varnish lookups as requests are happening which is before analytics systems get any of the data.
Anyways I hope other folks can chime in on how/whether this can be done somewhat easily, it certainly requires access to other parts of the stack besides analytics infrastructure.
Thanks,
Nuria