Gerco:
On May 16th we lowered the sampling rate of Media Viewer events because the
event rate was ~170 events per second. It looks like, as of about a week and
a half ago, we are back at that rate.
Please see:
https://ganglia.wikimedia.org/latest/graph.php?r=month&z=xlarge&c=M…
This means that Media Viewer is generating about 15 million rows a day in
the EL database, a data flow that seems quite high for our capacity to
analyze it.
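As a rough check on the 15 million figure, here is a minimal sketch that
assumes the ~170 events per second rate holds steady for a full day and that
each event becomes one row (the function name is only for illustration):

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def rows_per_day(events_per_second: float) -> float:
    """Estimate daily rows, assuming one row per event at a constant rate."""
    return events_per_second * SECONDS_PER_DAY


# 170 events/s over a full day
print(f"{rows_per_day(170):,.0f}")  # prints 14,688,000
```

That is roughly 14.7 million rows, consistent with the "about 15 million"
estimate above.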
Is this a mistake? Should sampling rates be lowered again?
So you know, right now Media Viewer is sampling more than twice as much as
the rest of the teams at the foundation combined. If every team sampled at
this rate, the system would go down. At this time EventLogging is not at
risk of going down, but replication is affected.
Thanks,
Nuria