Hello!
Here is what I know.
- Over the next several hours systemd timers (cron jobs) across a lot of the fleet stopped running.
- 2023-02-22T17:34 UTC -
This patch reverted the change
After this jobs began to run again. However, because the webrequest dataset is so huge, it took hours for ingestion of it to catch up. Downstream jobs that use webrequest as input (including pageviews computation) began to timeout while waiting for input.
- We have been slowly restarting and recovering jobs now that webrequest ingestion has caught up again.
I don't know exactly how long data is delayed or when it will be fully available, but I'd guess: soon / today?
-Andrew Otto
WMF