The biggest issue I see is the lack of any good logging, monitoring, and alerting tools,
things like Icinga, Logstash, and Grafana: the kind of tools that are standard for
supporting any production system. I've raised this before, so I won't belabor the
point here.
And https://phabricator.wikimedia.org/T256426 continues to be an everyday pain in my
side. The related https://phabricator.wikimedia.org/T127367 is triaged as high
priority. It's been open for 6-1/2 years.
On Sep 7, 2022, at 10:17 AM, Slavina Stefanova
<sstefanova(a)wikimedia.org> wrote:
On a side note, I'd be interested in hearing what you dislike about Toolforge, if
you'd like to share. We (the cloud services team) are working on improving Toolforge
and don't always get as much feedback, good or bad, as we'd want.