Tomasz Finc wrote:
Conrad Irwin wrote:
I notice the dumps seem currently frozen, is this the best place to ask for information, or is it publicly available somewhere else? (in which case sorry for pestering).
Conrad
Thanks Conrad,
I just got into the office and will be taking a look today to see why were halted.
All worker threads were stuck on the write() system call as NFS had started to flap around the time of our outage.
The storage node itself is showing WARNINGS trace back from the kernel
I've mailed out ops crew to start digging at this and hopefully our new dataset1 node can be cleared for production use so that we don't have to worry about this anymore.
And ... I've kicked all the old threads to stop since they weren't going to do anything useful.
Were now seeing work go through the system :D
--tomasz