Problem with one of the database servers? - Cloud

18 Dec 2017


One of tools.dplbot's daily tasks has been having repeated problems
since yesterday. A script that ran without errors and completed in about
10 minutes on Friday ran for over 90 minutes on Saturday, and died with
a "MySQL server has gone away" error.  There were no edits to the script
between Friday and Saturday, so I have to assume that something
changed on the server side.

The script reads from enwiki.analytics.db.svc.eqiad.wmflabs, and both
reads from and writes to tools.labsdb.  All of the errors occurred on
writes to the user database. I was able to work around the errors by
dropping the database connection and opening a new one immediately
before writing (I have no idea why this works, since the timeout setting
on the database for inactive connections is 8 hours, and this script was
not even running for two hours; but it did work). However, the script
continues to run for an order of magnitude longer than it did on Friday
(~100 minutes vs. ~10 minutes).  Is anyone else experiencing similar
issues?
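
For anyone who wants to try the same workaround, it amounts to something
like the sketch below. This is not the actual dplbot code: the function
name and its arguments are made up for illustration, and it assumes only a
generic Python DB-API driver, where `connect` is any zero-argument callable
that returns a new connection (for example, a lambda wrapping the driver's
connect call).

```python
from contextlib import closing


def write_with_fresh_connection(connect, old_conn, sql, params=()):
    """Drop old_conn, open a new connection, and run a single write on it.

    connect  -- hypothetical zero-argument factory returning a DB-API connection
    old_conn -- the connection used so far, or None
    Returns the new connection so the caller can keep using it.
    """
    if old_conn is not None:
        try:
            old_conn.close()
        except Exception:
            # The old connection may already be dead ("server has gone
            # away"); we are discarding it anyway, so ignore the failure.
            pass

    conn = connect()  # reconnect immediately before the write
    with closing(conn.cursor()) as cur:
        cur.execute(sql, params)
    conn.commit()
    return conn
```

The caller would replace its stored connection with the returned one after
each write. This only avoids the error; it does nothing about the slowdown.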
-- 
  Russell Blau
  russblau@imapmail.org