Tomasz Finc wrote:
Platonides wrote:
Tomasz Finc wrote:
All worker threads were stuck on the write() system call as NFS had started to flap around the time of our outage.
So the dumps run on a pool of worker servers that write to storage via NFS?
It may be more efficient to save locally and transfer asynchronously to the storage node.
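A minimal sketch of that "save locally, transfer asynchronously" idea, in Python. It is not how the dump scripts work; the function names, the queue-based hand-off and the ".partial" suffix are all assumptions made for illustration. The worker writes its output to local disk, then a background thread copies the finished file to the storage directory, so the worker itself never blocks in write() on the network mount.

```python
import queue
import shutil
import tempfile
import threading
from pathlib import Path


def write_dump_locally(local_dir: Path, name: str, data: bytes) -> Path:
    """Write one finished dump file to local disk (hypothetical helper)."""
    path = local_dir / name
    path.write_bytes(data)
    return path


def transfer_worker(jobs: queue.Queue, storage_dir: Path) -> None:
    """Copy queued local files to storage_dir until a None sentinel arrives.

    storage_dir stands in for the storage node (for example an NFS mount).
    Error handling and retries are left out of this sketch.
    """
    while True:
        local_path = jobs.get()
        try:
            if local_path is None:
                return
            # Copy under a temporary name, then rename, so a reader of the
            # storage directory never sees a half-written file.
            partial = storage_dir / (local_path.name + ".partial")
            shutil.copyfile(local_path, partial)
            partial.replace(storage_dir / local_path.name)
            local_path.unlink()  # free local disk once the copy is complete
        finally:
            jobs.task_done()


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as local, tempfile.TemporaryDirectory() as storage:
        jobs: queue.Queue = queue.Queue()
        mover = threading.Thread(target=transfer_worker, args=(jobs, Path(storage)))
        mover.start()

        # The dump worker only touches local disk, then hands the file off.
        jobs.put(write_dump_locally(Path(local), "example-dump.xml", b"<page/>"))

        jobs.put(None)  # tell the transfer thread to stop
        mover.join()
        print(sorted(p.name for p in Path(storage).iterdir()))
```

The trade-off is that each worker needs enough local disk to hold the files that have not yet been transferred, and a storage outage delays publication instead of stalling the dump itself.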
It turns out that NFS is likely not the root cause of the issue.
That was a general reflection on how to improve the setup, not on the cause of this failure.
We've been debugging it in bug #23264 as we make progress.
-tomasz
I'm subscribed to it. However, I fail to see from there what the core issue was.