Race in NFS in 6.0-RC1?

Frank Mayhar frank at exit.com
Mon Oct 24 23:34:36 PDT 2005


I've started using NFS in 6.0 a little more heavily lately, as since the
em(4) wedge has been fixed I can actually use it reliably.
Unfortunately there appears to be a problem.  Twice, now, in less than
24 hours the client has paniced under load.  Both times it was building
OpenOffice in an NFS-mounted /usr/ports.  In case it matters, it's a
soft mount from another 6.0 box over an em(4) interface with an MTU of
9000.

Both times it was a panic from vnlru while trying to flush a vnode and
both times it was a null-pointer dereference in nfs_putpages() at
nfs_bio.c:301.  In both cases vp->v_data was null.  The vnode itself
looks fine to my eyes, although there may well be FreeBSD-specific
subtleties that I'm missing.  I've just entered a PR for this problem,
kern/87967.  I'll keep the cores around; if anyone wants more
information from them, let me know.  As may be apparent, I can reproduce
this fairly easily, although it takes a few minutes for it to trigger.

The worrying thing about this is, in fact, its reproducibility.
-- 
Frank Mayhar frank at exit.com     http://www.exit.com/
Exit Consulting                 http://www.gpsclock.com/
                                http://www.exit.com/blog/frank/


More information about the freebsd-current mailing list