freebsd-5.4-stable panics

Rob Watt rob at hudson-trading.com
Wed Sep 28 11:08:50 PDT 2005


Robert,

On Tue, 27 Sep 2005, Robert Watson wrote:

> Great.  As mentioned I'll be offline for about the next 48 hours, but back
> after then.  If we can get a nice clean crash out of this, would really be
> best.  If it's top panicking, it could well be due to a bug in the process
> monitoring code, in kern_proc.  We've run into bugs a few times there in
> the past, generally associated with threading or races in process
> creation/teardown, in which partially initialized (or torn down) processes
> are accessed by another thread and are in an unexpected state.

We re-compiled the kernel with 'options KDB_STOP_NMI', and were able to
get a much more full analysis of what was happening on the 6-BETA5 crash.

We crashed in top again, and it does look like we may have hit a kern_proc
bug.

in the attached file type3-core.txt you can see that it hits an exception
in:

0xffffffff802b897a is in fill_kinfo_thread
(/usr/src/sys/kern/kern_proc.c:736).
731                     }
732
733                     kg = td->td_ksegrp;
734
735                     /* things in the KSE GROUP */
736                     kp->ki_estcpu = kg->kg_estcpu;
737                     kp->ki_slptime = kg->kg_slptime;
738                     kp->ki_pri.pri_user = kg->kg_user_pri;
739                     kp->ki_pri.pri_class = kg->kg_pri_class;
740
(kgdb) frame 8
#8  0xffffffff802b897a in fill_kinfo_thread (td=0xffffff0063311260,
kp=0xffffffffb62d8510)
    at /usr/src/sys/kern/kern_proc.c:733
733                     kg = td->td_ksegrp;
(kgdb) p kg->kg_estcpu
Cannot access memory at address 0x173
(kgdb) p td->td_ksegrp
$1 = (struct ksegrp *) 0x0
(kgdb) p kp->ki_estcpu
$2 = 0
(kgdb) p kg
$4 = (struct ksegrp *) 0x12b

it seems that kg is an invalid pointer.

We have started our tests again without running top.

Hope you have a great vacation.

-
Rob Watt
-------------- next part --------------
A non-text attachment was scrubbed...
Name: type3-core.txt
Type: application/octet-stream
Size: 7525 bytes
Desc: 
Url : http://lists.freebsd.org/pipermail/freebsd-amd64/attachments/20050928/ab3fa7d1/type3-core.obj
-------------- next part --------------
Copyright (c) 1992-2005 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights reserved.
FreeBSD 6.0-BETA5 #1: Tue Sep 27 17:38:32 EDT 2005
    root at quotetest2:/usr/obj/usr/src/sys/LOCAL-DEBUG-NMI
WARNING: WITNESS option enabled, expect reduced performance.
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Dual Core AMD Opteron(tm) Processor 275 (2190.05-MHz K8-class CPU)
  Origin = "AuthenticAMD"  Id = 0x20f12  Stepping = 2
  Features=0x178bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2,HTT>
  Features2=0x1<SSE3>
  AMD Features=0xe2500800<SYSCALL,NX,MMX+,<b25>,LM,3DNow+,3DNow>
  Hyperthreading: 2 logical CPUs
real memory  = 3942580224 (3759 MB)
avail memory = 3807399936 (3631 MB)
ACPI APIC Table: <A M I  OEMAPIC >
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  1
 cpu2 (AP): APIC ID:  2
 cpu3 (AP): APIC ID:  3
MADT: Forcing active-low polarity and level trigger for SCI
ioapic0 <Version 1.1> irqs 0-23 on motherboard
ioapic1 <Version 1.1> irqs 24-27 on motherboard
ioapic2 <Version 1.1> irqs 28-31 on motherboard
acpi0: <A M I OEMRSDT> on motherboard
acpi0: Power Button (fixed)
pci_link0: <ACPI PCI Link LNKA> irq 10 on acpi0
pci_link1: <ACPI PCI Link LNKB> irq 5 on acpi0
pci_link2: <ACPI PCI Link LNKC> irq 11 on acpi0
pci_link3: <ACPI PCI Link LNKD> irq 9 on acpi0
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x1008-0x100b on acpi0
cpu0: <ACPI CPU> on acpi0
acpi_throttle0: <ACPI CPU Throttling> on cpu0
cpu1: <ACPI CPU> on acpi0
cpu2: <ACPI CPU> on acpi0
cpu3: <ACPI CPU> on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
pcib1: <ACPI PCI-PCI bridge> at device 6.0 on pci0
pci3: <ACPI PCI bus> on pcib1
ohci0: <OHCI (generic) USB controller> mem 0xfeafc000-0xfeafcfff irq 19 at device 0.0 on pci3
ohci0: [GIANT-LOCKED]
usb0: OHCI version 1.0, legacy support
usb0: <OHCI (generic) USB controller> on ohci0
usb0: USB revision 1.0
uhub0: AMD OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 3 ports with 3 removable, self powered
ohci1: <OHCI (generic) USB controller> mem 0xfeafd000-0xfeafdfff irq 19 at device 0.1 on pci3
ohci1: [GIANT-LOCKED]
usb1: OHCI version 1.0, legacy support
usb1: <OHCI (generic) USB controller> on ohci1
usb1: USB revision 1.0
uhub1: AMD OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 3 ports with 3 removable, self powered
pci3: <display, VGA> at device 6.0 (no driver attached)
fxp0: <Intel 82551 Pro/100 Ethernet> port 0xbc00-0xbc3f mem 0xfeafb000-0xfeafbfff,0xfeaa0000-0xfeabffff irq 18 at device 8.0 on pci3
miibus0: <MII bus> on fxp0
inphy0: <i82555 10/100 media interface> on miibus0
inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:e0:81:31:89:1c
isab0: <PCI-ISA bridge> at device 7.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <AMD 8111 UDMA133 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xffa0-0xffaf at device 7.1 on pci0
ata0: <ATA channel 0> on atapci0
ata1: <ATA channel 1> on atapci0
pci0: <serial bus, SMBus> at device 7.2 (no driver attached)
pci0: <bridge> at device 7.3 (no driver attached)
pcib2: <ACPI PCI-PCI bridge> at device 10.0 on pci0
pci2: <ACPI PCI bus> on pcib2
em0: <Intel(R) PRO/1000 Network Connection, Version - 2.1.7> port 0x8880-0x88bf mem 0xfc8c0000-0xfc8dffff,0xfc800000-0xfc83ffff irq 26 at device 2.0 on pci2
em0: Ethernet address: 00:04:23:ba:d0:42
em0:  Speed:N/A  Duplex:N/A
em1: <Intel(R) PRO/1000 Network Connection, Version - 2.1.7> port 0x8c00-0x8c3f mem 0xfc8e0000-0xfc8fffff,0xfc880000-0xfc8bffff irq 27 at device 2.1 on pci2
em1: Ethernet address: 00:04:23:ba:d0:43
em1:  Speed:N/A  Duplex:N/A
em2: <Intel(R) PRO/1000 Network Connection, Version - 2.1.7> port 0x8480-0x84bf mem 0xfc780000-0xfc79ffff,0xfc740000-0xfc77ffff irq 27 at device 3.0 on pci2
em2: Ethernet address: 00:04:23:ae:60:0a
em2:  Speed:1000 Mbps  Duplex:Full
em3: <Intel(R) PRO/1000 Network Connection, Version - 2.1.7> port 0x8800-0x883f mem 0xfc7a0000-0xfc7bffff irq 24 at device 3.1 on pci2
em3: Ethernet address: 00:04:23:ae:60:0b
em3:  Speed:1000 Mbps  Duplex:Full
bge0: <Broadcom BCM5704C Dual Gigabit Ethernet, ASIC rev. 0x2003> mem 0xfc6c0000-0xfc6cffff,0xfc6b0000-0xfc6bffff irq 24 at device 9.0 on pci2
miibus1: <MII bus> on bge0
brgphy0: <BCM5704 10/100/1000baseTX PHY> on miibus1
brgphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto
bge0: Ethernet address: 00:e0:81:31:8f:80
bge1: <Broadcom BCM5704C Dual Gigabit Ethernet, ASIC rev. 0x2003> mem 0xfc6f0000-0xfc6fffff,0xfc6e0000-0xfc6effff irq 25 at device 9.1 on pci2
miibus2: <MII bus> on bge1
brgphy1: <BCM5704 10/100/1000baseTX PHY> on miibus2
brgphy1:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto
bge1: Ethernet address: 00:e0:81:31:8f:81
pci0: <base peripheral, interrupt controller> at device 10.1 (no driver attached)
pcib3: <ACPI PCI-PCI bridge> at device 11.0 on pci0
pci1: <ACPI PCI bus> on pcib3
aac0: <Adaptec SCSI RAID 2230S> mem 0xfb800000-0xfbffffff,0xf0000000-0xf7ffffff irq 28 at device 3.0 on pci1
aac0: [FAST]
aac0: Enabling 64-bit address support
aacp0: <SCSI Passthrough Bus> on aac0
aacp1: <SCSI Passthrough Bus> on aac0
pci0: <base peripheral, interrupt controller> at device 11.1 (no driver attached)
acpi_button0: <Power Button> on acpi0
atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A
sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
fdc0: <floppy drive controller (FDE)> port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0
fdc0: [FAST]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
ppc0: <Standard parallel printer port> port 0x378-0x37f irq 7 on acpi0
ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
ppbus0: <Parallel port bus> on ppc0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
ppi0: <Parallel I/O> on ppbus0
orm0: <ISA Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xcc7ff,0xcc800-0xcd7ff,0xcd800-0xcefff,0xcf000-0xd07ff,0xd0800-0xd17ff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounters tick every 1.000 msec
ipfw2 (+ipv6) initialized, divert loadable, rule-based forwarding disabled, default to deny, logging unlimited
acd0: CDROM <SONY CD-ROM CDU5215/7YS1> at ata1-master UDMA33
aacd0: <RAID 5> on aac0
aacd0: 209922MB (429920256 sectors)
(probe9:aacp0:0:10:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe9:aacp0:0:10:0): ILLEGAL REQUEST asc:55,3
(probe9:aacp0:0:10:0): Reserved ASC/ASCQ pair
(probe0:aacp0:0:0:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe0:aacp0:0:0:0): ILLEGAL REQUEST asc:55,3
(probe0:aacp0:0:0:0): Reserved ASC/ASCQ pair
(probe1:aacp0:0:1:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe1:aacp0:0:1:0): ILLEGAL REQUEST asc:55,3
(probe1:aacp0:0:1:0): Reserved ASC/ASCQ pair
(probe2:aacp0:0:2:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe2:aacp0:0:2:0): ILLEGAL REQUEST asc:55,3
(probe2:aacp0:0:2:0): Reserved ASC/ASCQ pair
(probe3:aacp0:0:3:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe3:aacp0:0:3:0): ILLEGAL REQUEST asc:55,3
(probe3:aacp0:0:3:0): Reserved ASC/ASCQ pair
(probe4:aacp0:0:4:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe4:aacp0:0:4:0): ILLEGAL REQUEST asc:55,3
(probe4:aacp0:0:4:0): Reserved ASC/ASCQ pair
(probe5:aacp0:0:5:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe5:aacp0:0:5:0): ILLEGAL REQUEST asc:55,3
(probe5:aacp0:0:5:0): Reserved ASC/ASCQ pair
(probe6:aacp0:0:6:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe6:aacp0:0:6:0): ILLEGAL REQUEST asc:55,3
(probe6:aacp0:0:6:0): Reserved ASC/ASCQ pair
(probe7:aacp0:0:8:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe7:aacp0:0:8:0): ILLEGAL REQUEST asc:55,3
(probe7:aacp0:0:8:0): Reserved ASC/ASCQ pair
(probe8:aacp0:0:9:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe8:aacp0:0:9:0): ILLEGAL REQUEST asc:55,3
(probe8:aacp0:0:9:0): Reserved ASC/ASCQ pair
(probe10:aacp0:0:11:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe10:aacp0:0:11:0): ILLEGAL REQUEST asc:55,3
(probe10:aacp0:0:11:0): Reserved ASC/ASCQ pair
(probe11:aacp0:0:12:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe11:aacp0:0:12:0): ILLEGAL REQUEST asc:55,3
(probe11:aacp0:0:12:0): Reserved ASC/ASCQ pair
(probe12:aacp0:0:13:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe12:aacp0:0:13:0): ILLEGAL REQUEST asc:55,3
(probe12:aacp0:0:13:0): Reserved ASC/ASCQ pair
(probe13:aacp0:0:14:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe13:aacp0:0:14:0): ILLEGAL REQUEST asc:55,3
(probe13:aacp0:0:14:0): Reserved ASC/ASCQ pair
(probe14:aacp0:0:15:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe14:aacp0:0:15:0): ILLEGAL REQUEST asc:55,3
(probe14:aacp0:0:15:0): Reserved ASC/ASCQ pair
(probe15:aacp1:0:0:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe15:aacp1:0:0:0): ILLEGAL REQUEST asc:55,3
(probe15:aacp1:0:0:0): Reserved ASC/ASCQ pair
(probe16:aacp1:0:1:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe16:aacp1:0:1:0): ILLEGAL REQUEST asc:55,3
(probe16:aacp1:0:1:0): Reserved ASC/ASCQ pair
(probe17:aacp1:0:2:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe17:aacp1:0:2:0): ILLEGAL REQUEST asc:55,3
(probe17:aacp1:0:2:0): Reserved ASC/ASCQ pair
(probe18:aacp1:0:3:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe18:aacp1:0:3:0): ILLEGAL REQUEST asc:55,3
(probe18:aacp1:0:3:0): Reserved ASC/ASCQ pair
(probe19:aacp1:0:4:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe19:aacp1:0:4:0): ILLEGAL REQUEST asc:55,3
(probe19:aacp1:0:4:0): Reserved ASC/ASCQ pair
(probe20:aacp1:0:5:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe20:aacp1:0:5:0): ILLEGAL REQUEST asc:55,3
(probe20:aacp1:0:5:0): Reserved ASC/ASCQ pair
(probe21:aacp1:0:6:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe21:aacp1:0:6:0): ILLEGAL REQUEST asc:55,3
(probe21:aacp1:0:6:0): Reserved ASC/ASCQ pair
(probe22:aacp1:0:8:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe22:aacp1:0:8:0): ILLEGAL REQUEST asc:55,3
(probe22:aacp1:0:8:0): Reserved ASC/ASCQ pair
(probe23:aacp1:0:9:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe23:aacp1:0:9:0): ILLEGAL REQUEST asc:55,3
(probe23:aacp1:0:9:0): Reserved ASC/ASCQ pair
(probe24:aacp1:0:10:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe24:aacp1:0:10:0): ILLEGAL REQUEST asc:55,3
(probe24:aacp1:0:10:0): Reserved ASC/ASCQ pair
(probe25:aacp1:0:11:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe25:aacp1:0:11:0): ILLEGAL REQUEST asc:55,3
(probe25:aacp1:0:11:0): Reserved ASC/ASCQ pair
(probe26:aacp1:0:12:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe26:aacp1:0:12:0): ILLEGAL REQUEST asc:55,3
(probe26:aacp1:0:12:0): Reserved ASC/ASCQ pair
(probe27:aacp1:0:13:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe27:aacp1:0:13:0): ILLEGAL REQUEST asc:55,3
(probe27:aacp1:0:13:0): Reserved ASC/ASCQ pair
(probe28:aacp1:0:14:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe28:aacp1:0:14:0): ILLEGAL REQUEST asc:55,3
(probe28:aacp1:0:14:0): Reserved ASC/ASCQ pair
(probe29:aacp1:0:15:0): INQUIRY. CDB: 12 0 0 0 24 0 
(probe29:aacp1:0:15:0): ILLEGAL REQUEST asc:55,3
(probe29:aacp1:0:15:0): Reserved ASC/ASCQ pair
SMP: AP CPU #1 Launched!
SMP: AP CPU #2 Launched!
SMP: AP CPU #3 Launched!
Trying to mount root from ufs:/dev/aacd0s1a


More information about the freebsd-amd64 mailing list