Hang on boot with 4.9-STABLE
Kevin Oberman
oberman at es.net
Mon Nov 24 10:30:27 PST 2003
> Date: Fri, 21 Nov 2003 14:24:24 -0500
> From: Peter Radcliffe <pir at pir.net>
> Sender: owner-freebsd-stable at freebsd.org
>
> Doug White <dwhite at gumbysoft.com> probably said:
> > Well, until you can spend some time on them to provoke crashdumps or find
> > the date things go bad or whatever, there isn't much we can do.
>
> I installed the spare remote controlled power strip today and managed
> to persuade the owner of the systems to give me some time to move the
> power to it and using one to debug for a bit.
>
>
> Switching to acpi (or simply removing apm from the config) does seem
> to work around the problem, but experimental acpi in 4.x doesn't give
> me warm fuzzy feelings.
>
> Don't seem to be able to provoke a crashdump, even with loader.conf
> setting where to dump to, but with RELENG_4 source from
> date=2003.10.15.05.00.00 I do get a kernel trap;
>
> apm0: <APM BIOS> on motherboard
> kernel trap 12 with interrupts disabled
>
> Fatal trap 12: page fault while in kernel mode
> mp_lock = 00000006; cpuid = 0; lapic.id = 00000000
> fault virtual address = 0x36
> fault code = supervisor write, page not present
> instruction pointer = 0x8:0xc020d05a
> stack pointer = 0x10:0xc03fad66
> frame pointer = 0x10:0xc03fae06
> code segment = base 0x0, limit 0xfffff, type 0x1b
> = DPL 0, pres 1, def32 1, gran 1
> processor eflags = interrupt enabled, resume, IOPL = 0
> current process = 0 (swapper)
> interrupt mask = net tty bio cam <- SMP: XXX
> kernel: type 12 trap, code=0
> Stopped at vm_fault+0x132: lock addw %ax,0x36(%edx)
> db> trace
> vm_fault(c0354e2c,c0000000,2,0,c) at vm_fault+0x132
> trap_pfault(c03fae82,0,c00004d8,ffffffff,0) at trap_pfault+0xda
> trap(18,70000010,60,1c,0) at trap+0x377
> calltrap() at calltrap+0x17
> --- trap 0xc, eip = 0x6096, esp = 0xc03faec2, ebp = 0xc03faec8 ---
> gd_idlestack(aedc0058,0,530e0102,80202,5061aa) at 0x6096
>
> I've limited the time of the problem being introduced to between
> date=2003.10.10.05.00.00 and date=2003.10.15.05.00.00 and am working
> on finding a more exact range.
Peter,
I've been on travel, so I'm just catching up.
The crash is the result of a long-standing bug in the apm code that
Peter Wemm fixed back on 10/16. Make sure that locore.s is 1.132.2.13.
This should not be happening on 4.9 unless there has been a
regression.
--
R. Kevin Oberman, Network Engineer
Energy Sciences Network (ESnet)
Ernest O. Lawrence Berkeley National Laboratory (Berkeley Lab)
E-mail: oberman at es.net Phone: +1 510 486-8634
More information about the freebsd-stable
mailing list