On Mon, Feb 16, 2009 at 6:20 PM, Platonides <Platonides@gmail.com> wrote:
> People did complain about long job queue on #wikimedia-tech. I don't think they were taken too seriously.
Yes, because they're not a bot that a) we know is reporting a real problem rather than a subjective impression, and b) spams the complaint on an ongoing basis the way nagios does.
Part of the problem is that the measure of job queue length we really care about is "how long did the last executed job wait in the queue?", not "how many jobs are in the queue?". If we added a job_timestamp column and put an index on it, we could replace (or supplement) the cruddy estimate we have now with a probably more useful and certainly more accurate one.
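
As a rough sketch of the idea, and not MediaWiki's actual schema or code: the snippet below uses Python with an in-memory SQLite database as a stand-in for the real job table. Only the job_timestamp column comes from the proposal above; the simplified table layout, the index name, and the helpers make_queue, push_job, pop_job and queue_lag are all hypothetical names made up for illustration.

```python
import sqlite3


def make_queue():
    """Create an in-memory stand-in for the job table (simplified, hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE job ("
        " job_id INTEGER PRIMARY KEY AUTOINCREMENT,"
        " job_cmd TEXT NOT NULL,"
        " job_timestamp INTEGER NOT NULL)"  # proposed column: Unix time the job was queued
    )
    # The index keeps MIN(job_timestamp) and ORDER BY job_timestamp cheap.
    conn.execute("CREATE INDEX job_timestamp_idx ON job (job_timestamp)")
    return conn


def push_job(conn, cmd, now):
    """Queue a job, recording when it was inserted."""
    conn.execute(
        "INSERT INTO job (job_cmd, job_timestamp) VALUES (?, ?)", (cmd, now)
    )


def pop_job(conn):
    """Remove the oldest job and return its queue timestamp, or None if the queue is empty."""
    row = conn.execute(
        "SELECT job_id, job_timestamp FROM job"
        " ORDER BY job_timestamp, job_id LIMIT 1"
    ).fetchone()
    if row is None:
        return None
    conn.execute("DELETE FROM job WHERE job_id = ?", (row[0],))
    return row[1]


def queue_lag(conn, now):
    """Seconds the oldest pending job has been waiting; 0 for an empty queue."""
    (oldest,) = conn.execute("SELECT MIN(job_timestamp) FROM job").fetchone()
    return 0 if oldest is None else now - oldest
```

With jobs queued at times 1000 and 1600, queue_lag(conn, 2000) reports 1000 seconds; after pop_job runs the older job it drops to 400. A lag figure like this is something a monitoring bot could alert on, whereas a raw row count says little about how stale the queue is.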