Aryeh Gregor wrote:
On Mon, Feb 16, 2009 at 9:18 AM, Tim Starling tstarling@wikimedia.org wrote:
I've deleted all the slow refreshLinks2 jobs which have apparently been preventing the job queue from making any headway for the last few months. Some people report that they have received hundreds of edit notification emails in the last few hours, due to the months of backlog now being cleared.
So are there no alarm bells that go off when the job queue is unreasonably long, or do people just not listen to them? Perhaps we could have a bot in #wikimedia-tech that would complain every hour if the oldest job in the queue is more than X days old?
If you check the server admin log, you'll find that this is the latest in a long series of attempts to fix this problem. I don't think it's completely fixed yet.
I'm not sure what good a complaining bot would do, any more than a complaining user which we seem to have plenty of. Deleting the jobs was not a solution, and can't really be repeated without breaking things. There's still a fair bit more programming to do.
-- Tim Starling