"Fatal trap 12: page fault while in kernel mode" on 7.1/amd64,
but not 7.0
Kostik Belousov
kostikbel at gmail.com
Sat Mar 7 10:25:27 PST 2009
On Fri, Mar 06, 2009 at 05:01:12PM -0500, Boris Kochergin wrote:
> Gavin Atkinson wrote:
> >On Thu, 2009-03-05 at 19:55 -0500, Boris Kochergin wrote:
> >
> >>Ahoy. I recently upgraded an amd64 machine to 7.1-RELEASE, and started
> >>getting a bunch of these at a pretty high frequency (a few hours to a
> >>day apart):
> >>
> >>http://acm.poly.edu/~spawk/IMG00033.jpg
> >>
> >>The "current process" is always httpd. They're particularly annoying
> >>because the machine doesn't actually ever reboot, requiring manual
> >>intervention. Reverting the kernel back to 7.0 makes the panic go away,
> >>and the machine had been happily running 7.0 for about a year
> >>beforehand. I realize that the photo hardly contains any useful
> >>debugging information, but I was hoping it might look familiar to
> >>someone. If not, I guess I'll come back with a backtrace.
> >>
> >
> >A backtrace will almost certainly be necessary to figure out what this
> >issue is, although there is a possibility that the output of
> >"addr2line -e /boot/kernel/kernel.symbols 0x8:0xffffffff802d7010"
> >might help, assuming you've not recompiled your kernel yet. (That
> >number should be the same as the "instruction pointer" shown by the
> >panic, but as the photo is quite blurred there's a chance I've got it
> >wrong, if you have a better picture of it or wrote it down then use
> >that)
> >
> >Gavin
> >_______________________________________________
> >freebsd-stable at freebsd.org mailing list
> >http://lists.freebsd.org/mailman/listinfo/freebsd-stable
> >To unsubscribe, send any mail to "freebsd-stable-unsubscribe at freebsd.org"
> >
> Here it is, with some additional information afterward:
>
> Unread portion of the kernel message buffer:
> kernel trap 12 with interrupts disabled
>
>
> Fatal trap 12: page fault while in kernel mode
> cpuid = 1; apic id = 01
> fault virtual address = 0x30
> fault code = supervisor read data, page not present
> instruction pointer = 0x8:0xffffffff80293faf
> stack pointer = 0x10:0xffffffff9cbaea70
> frame pointer = 0x10:0xffffff000fc14000
> code segment = base 0x0, limit 0xfffff, type 0x1b
> = DPL 0, pres 1, long 1, def32 0, gran 1
> processor eflags = resume, IOPL = 0
> current process = 881 (httpd)
> trap number = 12
> panic: page fault
> cpuid = 1
> Uptime: 1m51s
> Physical memory: 8185 MB
> Dumping 328 MB: 313 297 281 265 249 233 217 201 185 169 153 137 121 105
> 89 73 57 41 25 9
>
> #0 doadump () at pcpu.h:195
> 195 pcpu.h: No such file or directory.
> in pcpu.h
> (kgdb) where
> #0 doadump () at pcpu.h:195
> #1 0xffffff000fc14000 in ?? ()
> #2 0xffffffff8025eba9 in boot (howto=260) at
> /usr/src-7.1/sys/kern/kern_shutdown.c:418
> #3 0xffffffff8025efb2 in panic (fmt=0x104 <Address 0x104 out of
> bounds>) at /usr/src-7.1/sys/kern/kern_shutdown.c:574
> #4 0xffffffff803df5c3 in trap_fatal (frame=0xffffff000fc14000,
> eva=Variable "eva" is not available.
> ) at /usr/src-7.1/sys/amd64/amd64/trap.c:764
> #5 0xffffffff803e018f in trap (frame=0xffffffff9cbae9c0) at
> /usr/src-7.1/sys/amd64/amd64/trap.c:290
> #6 0xffffffff803c5c4e in calltrap () at
> /usr/src-7.1/sys/amd64/amd64/exception.S:209
> #7 0xffffffff80293faf in turnstile_broadcast (ts=0x0, queue=0) at
> /usr/src-7.1/sys/kern/subr_turnstile.c:836
> #8 0xffffffff8025256a in _mtx_unlock_sleep (m=0xffffffff80593538,
> opts=Variable "opts" is not available.
> ) at /usr/src-7.1/sys/kern/kern_mutex.c:619
> #9 0xffffffff80275ed3 in __umtx_op_cv_wait (td=0x1ee, uap=Variable
> "uap" is not available.
> ) at /usr/src-7.1/sys/kern/kern_umtx.c:312
> #10 0xffffffff803dfb78 in syscall (frame=0xffffffff9cbaec80) at
> /usr/src-7.1/sys/amd64/amd64/trap.c:907
> #11 0xffffffff803c5e5b in Xfast_syscall () at
> /usr/src-7.1/sys/amd64/amd64/exception.S:330
> #12 0x0000000800f5354c in ?? ()
> Previous frame inner to this frame (corrupt stack?)
> (kgdb)
>
> The dump was difficult to acquire--the system would often lock up after
> dumping only a portion of the memory it wanted to save. I can also now
> trigger the panic pretty reliably using this bit of script:
>
> #!/usr/local/bin/bash
>
> for i in {1..900}
> do
> wget --quiet -O /dev/null http://acm.poly.edu/wiki/Hosting &
> done
>
> ...where the URL is a MediaWiki installation on the afflicted machine.
Can you, please, recompile the kernel with debugging options, and
provoke the panic on it ?
We need at least options INVARIANTS, INVARIANT_SUPPORT and WITNESS.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 195 bytes
Desc: not available
Url : http://lists.freebsd.org/pipermail/freebsd-stable/attachments/20090307/e1bb2ebb/attachment.pgp
More information about the freebsd-stable
mailing list