In fact the idea of publishing a subset or censored view of EventLogging logs is something we’ve been tinkering with for a while (I think Erik M brought it up on this list a while ago).

Replicating a subset of the logs stored on s1.log on labs db sounds like the best way to approach the problem. The added bonus is that by having this (public / censored) data exposed via labs db, wikimetrics would be able to access it out of the box.

Copying Sean to see if this seems even remotely possible.

D

On Jan 24, 2014, at 4:24 PM, Steven Walling <swalling@wikimedia.org> wrote:


On Fri, Jan 24, 2014 at 4:07 PM, Dario Taraborelli <dtaraborelli@wikimedia.org> wrote:
Steven, thanks for the heads up. Is the expectation that you want to make these logs publicly available? I guess we could explore this possibility for these 3 schemas (they encode already public data, although in a cleaner format), but not the other two logs that you mention.

No. Not intending to make these public, though I don't think they contain any data that isn't already public. 

--
Steven Walling,
Product Manager
_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics