On Mon, Feb 16, 2009 at 9:18 AM, Tim Starling
<tstarling(a)wikimedia.org> wrote:
I've deleted all the slow refreshLinks2 jobs
which have apparently been
preventing the job queue from making any headway for the last few months.
Some people report that they have received hundreds of edit notification
emails in the last few hours, due to the months of backlog now being cleared.
So are there no alarm bells that go off when the job queue is
unreasonably long, or do people just not listen to them? Perhaps we
could have a bot in #wikimedia-tech that would complain every hour if
the oldest job in the queue is more than X days old?
If you check the server admin log, you'll find that this is the latest in
a long series of attempts to fix this problem. I don't think it's
completely fixed yet.
I'm not sure what good a complaining bot would do, any more than a
complaining user which we seem to have plenty of. Deleting the jobs was
not a solution, and can't really be repeated without breaking things.
There's still a fair bit more programming to do.
-- Tim Starling