hard deadlock(?) on -current; some debugging info, need help
Peter Jeremy
PeterJeremy at optushome.com.au
Thu May 26 01:09:37 PDT 2005
On Wed, 2005-May-25 17:18:06 -0700, Ted Faber wrote:
>The system slowly grinds to a halt, and the lockup seems to invlove the
>disk system.
Nothing is waiting on physical I/O, but there are lots of locked vnodes.
I notice there's a sh(? - pid 10715) blocked on nfsreq. Can you reproduce
the problem without the NFS mounted filesystems?
> I have not found a sequence that triggers them (other than
>trying to write mail to the list to report them), and I know how
>difficult that makes things. It is common to have 2-5 a day. Even when
>I can get to the debugger during a lockup, I cannot generate a crash
>dump - the kernel reports starting the dump and moves no bytes.
Not nice. That suggests something below the filesystem is sick
because a filesystem deadlock won't affect the crashdump.
>I've attached a dmesg from a -v boot and the kernel config (the dmesg is
>not from the lockup run). Last friday when the system locked I had a
>digital camera with me and took pictures of the ps output in the hopes
>that someone could look at them. These images are at
>
>http://www.isi.edu/~faber/tmp/deadlock/DSCN04{75,76,77,78,79,80,81,82}.JPG
The other information we need is "show lockedvnods". This will hopefully
point to the process that started the problem.
--
Peter Jeremy
More information about the freebsd-current
mailing list