On Tue, 20 Oct 2020, Tobias Klausmann wrote:
In our quest to make the GPU-equipped machines in analytics ever more useful,
we are going to update the rocm software suite and driver on stat1005 and
stat1008 to the latest version, 3.8.0.
Maintenance today revealed that the kernel module shipped with rocm (rock-dkms)
is not fully compatible with the kernel version we use. I have put stat1005
back into the state it was in (using rocm33). The planned update of stat1008 is
postponed until we can make the driver and our kernels work together.
If anything is broken on stat1005, let us know.
Tobias Klausmann, SRE, Wikimedia Foundation