Hi Steven,
On Tue, Jul 29, 2014 at 11:56:31AM -0700, Steven Walling wrote:
Growth team needs some data removed from [...] the raw logs [...]
Thanks for caring to clean up no longer needed data. It's greatly appreciated.
However, we typically do not scrub or clean the raw logs [1].
People are using those files for debugging and pushed back when we asked about whether we should clean them up.
Those raw files are only available to a limited set of people, so it is typically less of an issue.
Is it ok to just remove the data from databases (Thanks to Sean!) and let the data sit on the raw logs, or is there a hard requirement to scrub the raw logs clean too?
Have fun, Christian
[1] See 'Raw client and server side log files' item in http://lists.wikimedia.org/pipermail/analytics/2014-June/002256.html