Hello all,
This message is for those of you who do deployments to the WMF cluster.
On the [[How to deploy code]] wikitech page, there is a section on Testing your live code: https://wikitech.wikimedia.org/wiki/How_to_deploy_code#Test_your_code_live
That's a pretty basic overview of it and it could be greatly improved with information like: * How to monitor specific parts of the cluster that are relevant to what you deployed * What general monitoring should be looked at after you deploy
I know many of you already do much of this after you deploy, but the lack of documentation on *how* to do it was a recurring theme in the initial interviews I did with engineering teams when I first started. https://wikitech.wikimedia.org/wiki/Deployments/Features_Process/General_Fee...
== "The Ask" ==
I'm asking you ("you" being those of you who have experience doing post-deploy monitoring) to please add more documentation to this section of the How to deploy code page: https://wikitech.wikimedia.org/wiki/How_to_deploy_code#Test_your_code_live
I expect people from both engineering and ops will have feedback here.
Also, those of you who don't know how to monitor/log things post deploy but you have specific questions, please ask here so that someone who does know can answer on the wiki.
Thanks,
Greg