On Mon, Feb 16, 2009 at 6:20 PM, Platonides <Platonides(a)gmail.com> wrote:
> People did complain about long job queue on #wikimedia-tech. I don't
> think they were taken too seriously.

Yes, because they're not a bot that a) we know is reporting a real
problem rather than subjective impressions, and b) spams the
complaint on an ongoing basis like nagios does.

Part of the problem is that the measure of job queue length we really
care about is "when was the most recently executed job queued?", not
"how many jobs are in the queue?". If we added a job_timestamp column
and put an index on it, we could replace (or supplement) the
poor-quality estimate we have now with a probably more useful and
certainly more accurate one.
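
To make the job_timestamp idea concrete, here is a rough sketch of what I have in mind. It is not the real schema: it uses an in-memory SQLite table with a cut-down, hypothetical set of columns, and the helper names (make_queue, push_job, pop_job, queue_lag) are made up for illustration. It reads the lag as "how long has the oldest still-queued job been waiting", which is one interpretation of the measure; the indexed MIN(job_timestamp) lookup is the part that would carry over to the real table.

```python
import sqlite3
import time


def make_queue():
    """Create a toy job table (hypothetical, simplified schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE job ("
        " job_id INTEGER PRIMARY KEY AUTOINCREMENT,"
        " job_cmd TEXT NOT NULL,"
        " job_timestamp INTEGER NOT NULL)"  # time the job was queued
    )
    # The index lets MIN(job_timestamp) be answered without a full scan.
    conn.execute("CREATE INDEX job_timestamp_idx ON job (job_timestamp)")
    return conn


def push_job(conn, cmd, now=None):
    """Queue a job, stamping it with the insertion time."""
    ts = int(time.time()) if now is None else now
    conn.execute(
        "INSERT INTO job (job_cmd, job_timestamp) VALUES (?, ?)", (cmd, ts)
    )


def pop_job(conn):
    """Remove and return the command of the oldest job, or None if empty."""
    row = conn.execute(
        "SELECT job_id, job_cmd FROM job ORDER BY job_id LIMIT 1"
    ).fetchone()
    if row is None:
        return None
    conn.execute("DELETE FROM job WHERE job_id = ?", (row[0],))
    return row[1]


def queue_lag(conn, now=None):
    """Seconds the oldest queued job has been waiting (0 if queue is empty)."""
    ts = int(time.time()) if now is None else now
    (oldest,) = conn.execute("SELECT MIN(job_timestamp) FROM job").fetchone()
    return 0 if oldest is None else max(0, ts - oldest)
```

A monitoring bot could then alert on queue_lag() exceeding some threshold instead of on the raw row count, so a large but fast-draining queue would not trigger it while a small but stuck one would.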