Tomasz Finc wrote:
Conrad Irwin wrote:
I notice the dumps seem currently frozen, is this
the best place to ask
for information, or is it publicly available somewhere else? (in which
case sorry for pestering).
Conrad
Thanks Conrad,
I just got into the office and will be taking a look today to see why
were halted.
All worker threads were stuck on the write() system call as NFS had
started to flap around the time of our outage.
The storage node itself is showing WARNINGS trace back from the kernel
http://pastebin.com/mjKiCsF2
I've mailed out ops crew to start digging at this and hopefully our new
dataset1 node can be cleared for production use so that we don't have to
worry about this anymore.
And ... I've kicked all the old threads to stop since they weren't going
to do anything useful.
Were now seeing work go through the system :D
--tomasz