Recent problems with 6-STABLE...

John Baldwin jhb at freebsd.org
Thu Jan 31 04:16:24 PST 2008


On Thursday 31 January 2008 04:37:13 am gnn at freebsd.org wrote:
> At Tue, 29 Jan 2008 11:57:39 -0500,
> John Baldwin wrote:
> > 
> > On Tuesday 29 January 2008 07:32:16 am gnn at freebsd.org wrote:
> > > Hi,
> > > 
> > > I have two boxes running 6-STABLE, post 6.3 release, which have both
> > > spontaneously rebooted, one under load and one not under load.  I have
> > > attached dmesg and some traceback information, from the one trace that
> > > looked interesting.  Any thoughts or hints would be apprecated.
> > > 
> > > To save you scanning all the dmesg first these are dual processor XEON
> > > boxes, each processor has 4 cores.
> > 
> > Can you do 'x/i 0xffffffff80296642' to show which instruction faulted?
> 
> (kgdb) x/i 0xffffffff80296642
> 0xffffffff80296642 <pfs_exit+114>:      cmp    %ecx,0x8(%rdx)

Hmm, and rdx from your last post was:

> printf "%x\n" 32491047111385957
736e6f69746365

> echo "0x73 0x6e 0x6f 0x69 0x74 0x63 0x65" | dh
snoitce

so it appears you have a data corruption issue.  You could check the hardware 
(RAM, etc.) but if that is ok you might want to see if you can isolate it to 
a specific driver if a driver has a bug (or hardware has an errata we don't 
work around yet).  Do you have any custom drivers for hardware that does DMA?  
If not, which storage driver (including pciconf output if ATA) and NIC(s) 
does this box have?  Also, how much RAM?

-- 
John Baldwin


More information about the freebsd-amd64 mailing list