7.2 Stable Crash - possibly related to if_re

Pyun YongHyeon pyunyh at gmail.com
Sat Oct 31 21:21:05 UTC 2009


On Fri, Oct 30, 2009 at 06:23:51PM -0700, Norbert Papke wrote:
> On October 30, 2009, Pyun YongHyeon wrote:
> > On Thu, Oct 29, 2009 at 09:56:19PM -0700, Norbert Papke wrote:
> > > This occurred shortly after "scp"ing from a VirtualBox VM to the host. 
> > > The file transfer got stuck.  The "re" interface stopped working. 
> > > Shortly afterwards, the host crashed.  The "re" interface was used by the
> > > host, the guest was using a different NIC in bridged mode.
> > >
> > >
> > > FreeBSD proven.lan 7.2-STABLE FreeBSD 7.2-STABLE #5 r198666: Thu Oct 29
> > > 18:36:57 PDT 2009
> > >
> > > Fatal trap 12: page fault while in kernel mode
> > > cpuid = 0; apic id = 00
> > > fault virtual address   = 0x18
> >
> > It looks like a NULL pointer dereference, possibly mbuf related
> > one.
> >
> > > fault code              = supervisor write data, page not present
> > > instruction pointer     = 0x8:0xffffffff80d476ee
> > > stack pointer           = 0x10:0xffffff8000078ae0
> > > frame pointer           = 0x10:0xffffff8000078b40
> > > code segment            = base 0x0, limit 0xfffff, type 0x1b
> > >                         = DPL 0, pres 1, long 1, def32 0, gran 1
> > > processor eflags        = interrupt enabled, resume, IOPL = 0
> > > current process         = 18 (swi5: +)
> > > Physical memory: 8177 MB
> > >
> > >
> 
> > > #9  0xffffffff807e710e in calltrap ()
> > > at /usr/public/freebsd/sources/stable/sys/amd64/amd64/exception.S:218
> > > #10 0xffffffff80d476ee in re_rxeof () from /boot/kernel/if_re.ko
> >
> > Hmm, I think there is a missing information here. Not sure where it
> > dereferenced a NULL pointer in re_rxeof(). 
> 
> >> #11 0xffffffff80d4a481 in re_int_task (arg=Variable "arg" is not available.
> >> ) 
> >> 
> at /usr/public/freebsd/sources/stable/sys/modules/re/../../dev/re/if_re.c:2191  
> 
> I am not sure how much I trust frame 10.  The instruction 
> at "0xffffffff80d476ee" is the one after the "retq" from re_rxeof().  Frame 
> 11 seems OK to me.  The "struct rl_softc*", in particular, looks plausible 
> but I don't know enough to say for sure.
> 
> > Because that this is the 
> > first report for NULL pointer dereference in Rx handler I need more
> > information how to reproduce it with minimal configuration. Can you
> > also reproduce the issues without virtual box?
> 
> I am trying but no luck so far.
> 
> > By chance, did you stop the re0 interface with ifconfig when you
> > noticed the file transfer got stuck?
> 
> It is possible.  I had it happen twice.  The first time I definitely tried 
> to "down" re.  I cannot recall what I did the second time.  The crash dump is 
> from the second time.
> 

Ok, then would you try attached patch?
-------------- next part --------------
A non-text attachment was scrubbed...
Name: re.rxeof.patch
Type: text/x-diff
Size: 469 bytes
Desc: not available
Url : http://lists.freebsd.org/pipermail/freebsd-stable/attachments/20091031/00d3e348/re.rxeof.bin


More information about the freebsd-stable mailing list