kern/37043: Latest stable causes SCSI bus freeze on sym0 when running SMP

Alan Lew alan at canweb.ca
Mon Dec 29 08:59:11 PST 2003


Gentlemen,

Since upgrading our kernels to 4.x, we’ve noticed this problem. After
installing Gérard’s Sym workaround
(http://docs.freebsd.org/cgi/mid.cgi?200208222210.g7MMABwT084798), the
issue seems to have vanished on 3 of our 5 affected servers. The 5
servers are all Asus AP1400 boxes (CUR-DLSR mobo, ServerWorks 3 LE
chipset) running various stable 4.x kernels, all containing the LSI
Logic 53C1010-33 chipset. Of the two remaining affected boxes, one runs
merrily for a few weeks with minimal load (serving two static web pages)
and then dies, spitting out “sym0:0:control msgout 80 22 25d” like
messages. The other box runs with no load (completely idle) and hard
locks after 2-3 days with no error or panic messages written anywhere.
The latter affected box was recently sent to Asus in Taiwan for repairs
to the SCSI backplane board, apparently a known ASUS AP1400 issue (don’t
know if this info helps, but...)

Below is the dmesg output of the two affected boxes (as described above,
in order) as well as the “pciconf -l –v” output. I hope this helps.
We’re also offering a chance if anyone wishes to mess around with one of
these affected boxes; we will make it available remotely for testing.

Any thoughts?

Regards,
...alan lew
alan at canweb.ca

-- SNIP –-

[ Box #1 DMESG ]
Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights
reserved.
FreeBSD 4.7-RELEASE-p3 #3: Tue Jan  7 11:33:58 EST 2003
    alan at rigel.canweb.ca:/usr/obj/usr/src/sys/RIGEL
Timecounter "i8254"  frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (1000.04-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x686  Stepping = 6

Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,
MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory  = 1073721344 (1048556K bytes)
avail memory = 1041727488 (1017312K bytes)
Changing APIC ID for IO APIC #1 from 3 to 1 in MP table
Changing APIC ID for IO APIC #1 from 3 to 1 on chip
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
Programming 16 pins in IOAPIC #1
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id:  3, version: 0x00040011, at 0xfee00000
 cpu1 (AP):  apic id:  0, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id:  2, version: 0x000f0011, at 0xfec00000
 io1 (APIC): apic id:  1, version: 0x000f0011, at 0xfec01000
Preloaded elf kernel "kernel" at 0xc0356000.
Pentium Pro MTRR support enabled
md0: Malloc disk
Using $PIR table, 8 entries at 0xc00f1010
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
IOAPIC #1 intpin 6 -> irq 2
IOAPIC #1 intpin 7 -> irq 10
IOAPIC #1 intpin 22 -> irq 11
pci0: <PCI bus> on pcib0
fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xd800-0xd83f mem
0xfd800000-0xfd8fffff,0xfe000000-0xfe000fff irq 2 at device 2.0 on pci0
fxp0: Ethernet address 00:e0:18:0a:b1:14
inphy0: <i82555 10/100 media interface> on miibus0
inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pci0: <ATI Mach64-GR graphics accelerator> at 7.0
fxp1: <Intel Pro 10/100B/100+ Ethernet> port 0xd000-0xd03f mem
0xfa800000-0xfa8fffff,0xfb000000-0xfb000fff irq 10 at device 8.0 on pci0
fxp1: Ethernet address 00:e0:18:0a:b1:15
inphy1: <i82555 10/100 media interface> on miibus1
inphy1:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
isab0: <ServerWorks IB6566 PCI to ISA bridge> at device 15.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <ServerWorks ROSB4 ATA33 controller> port 0xb800-0xb80f at
device 15.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
ohci0: <OHCI (generic) USB controller> mem 0xfa000000-0xfa000fff irq 11
at device 15.2 on pci0
usb0: OHCI version 1.0, legacy support
usb0: <OHCI (generic) USB controller> on ohci0
usb0: USB revision 1.0
uhub0: (0x1166) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 4 ports with 4 removable, self powered
ufm0: GemTek Corp USB FM Radio, rev 1.00/4.10, addr 2
pcib1: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
IOAPIC #1 intpin 8 -> irq 12
IOAPIC #1 intpin 9 -> irq 15
pci1: <PCI bus> on pcib1
sym0: <1010-33> port 0xb400-0xb4ff mem
0xf9000000-0xf9001fff,0xf9800000-0xf98003ff irq 12 at device 5.0 on pci1
sym0: Symbios NVRAM, ID 7, Fast-80, LVD, parity checking
sym0: open drain IRQ line driver, using on-chip SRAM
sym0: using LOAD/STORE-based firmware.
sym0: handling phase mismatch from SCRIPTS.
sym1: <1010-33> port 0xb000-0xb0ff mem
0xf8000000-0xf8001fff,0xf8800000-0xf88003ff irq 15 at device 5.1 on pci1
sym1: Symbios NVRAM, ID 7, Fast-80, LVD, parity checking
sym1: open drain IRQ line driver, using on-chip SRAM
sym1: using LOAD/STORE-based firmware.
sym1: handling phase mismatch from SCRIPTS.
orm0: <Option ROMs> at iomem 0xc0000-0xca7ff,0xcc000-0xcffff on isa0
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on
isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on
isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio0: configured irq 4 not in bitmap of probed irqs 0
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 8250
sio1: configured irq 3 not in bitmap of probed irqs 0
ppc0: parallel port not found.
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0
intpin 2
APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0
IP packet filtering initialized, divert disabled, rule-based forwarding
enabled, default to deny, logging disabled
ata0-slave: ATAPI identify retries exceeded
SMP: AP CPU #1 Launched!
acd0: CDROM <CD-224E> at ata0-master PIO4
Waiting 15 seconds for SCSI devices to settle
(noperiph:sym0:0:-1:-1): SCSI BUS reset delivered.
(noperiph:sym1:0:-1:-1): SCSI BUS reset delivered.
(probe10:sym0:0:11:0): phase change 6-7 6 at 0c918d8c resid=4.
pass1 at sym0 bus 0 target 11 lun 0
pass1: <SDR GEM318 0> Fixed Processor SCSI-2 device
pass1: 3.300MB/s transfers
da0 at sym0 bus 0 target 0 lun 0
da0: <FUJITSU MAN3184MC 0107> Fixed Direct Access SCSI-3 device
da0: 160.000MB/s transfers (80.000MHz, offset 62, 16bit), Tagged
Queueing Enabled
da0: 17522MB (35885448 512 byte sectors: 255H 63S/T 2233C)
Mounting root from ufs:/dev/da0s1a

[ Box #2 DMESG ]
Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights
reserved.
FreeBSD 4.7-RELEASE-p17 #2: Mon Sep 22 14:16:48 EDT 2003
    root at newaltair.canweb.ca:/usr/obj/usr/src/sys/ALTAIR
Timecounter "i8254"  frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (1000.04-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x686  Stepping = 6

Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,
MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory  = 1073721344 (1048556K bytes)
avail memory = 1041502208 (1017092K bytes)
Changing APIC ID for IO APIC #1 from 3 to 1 in MP table
Changing APIC ID for IO APIC #1 from 3 to 1 on chip
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
Programming 16 pins in IOAPIC #1
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id:  3, version: 0x00040011, at 0xfee00000
 cpu1 (AP):  apic id:  0, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id:  2, version: 0x000f0011, at 0xfec00000
 io1 (APIC): apic id:  1, version: 0x000f0011, at 0xfec01000
Preloaded elf kernel "kernel" at 0xc038d000.
Pentium Pro MTRR support enabled
md0: Malloc disk
Using $PIR table, 8 entries at 0xc00f1010
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
IOAPIC #1 intpin 6 -> irq 2
IOAPIC #1 intpin 7 -> irq 3
pci0: <PCI bus> on pcib0
fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xd800-0xd83f mem
0xfd800000-0xfd8fffff,0xfe000000-0xfe000fff irq 2 at device 2.0 on pci0
fxp0: Ethernet address 00:e0:18:0a:89:88
inphy0: <i82555 10/100 media interface> on miibus0
inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pci0: <ATI Mach64-GR graphics accelerator> at 7.0
fxp1: <Intel Pro 10/100B/100+ Ethernet> port 0xd000-0xd03f mem
0xfa800000-0xfa8fffff,0xfb000000-0xfb000fff irq 3 at device 8.0 on pci0
fxp1: Ethernet address 00:e0:18:0a:89:89
inphy1: <i82555 10/100 media interface> on miibus1
inphy1:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
isab0: <ServerWorks IB6566 PCI to ISA bridge> at device 15.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <ServerWorks ROSB4 ATA33 controller> port 0xb800-0xb80f at
device 15.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
pcib1: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
IOAPIC #1 intpin 0 -> irq 7
IOAPIC #1 intpin 8 -> irq 10
IOAPIC #1 intpin 9 -> irq 11
pci1: <PCI bus> on pcib1
pci1: <unknown card> (vendor=0x9004, dev=0x7178) at 2.0 irq 7
sym0: <1010-33> port 0xb000-0xb0ff mem
0xf8800000-0xf8801fff,0xf9000000-0xf90003ff irq 10 at device 5.0 on pci1
sym0: Symbios NVRAM, ID 7, Fast-80, LVD, parity checking
sym0: open drain IRQ line driver, using on-chip SRAM
sym0: using LOAD/STORE-based firmware.
sym0: handling phase mismatch from SCRIPTS.
sym1: <1010-33> port 0xa800-0xa8ff mem
0xf7800000-0xf7801fff,0xf8000000-0xf80003ff irq 11 at device 5.1 on pci1
sym1: Symbios NVRAM, ID 7, Fast-80, LVD, parity checking
sym1: open drain IRQ line driver, using on-chip SRAM
sym1: using LOAD/STORE-based firmware.
sym1: handling phase mismatch from SCRIPTS.
orm0: <Option ROMs> at iomem
0xc0000-0xca7ff,0xcc000-0xcffff,0xd0000-0xd27ff on isa0
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on
isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on
isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1: configured irq 3 not in bitmap of probed irqs 0
ppc0: parallel port not found.
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0
intpin 2
APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0
IP packet filtering initialized, divert disabled, rule-based forwarding
enabled, default to deny, logging limited to 100 packets/entry by
default
ata0-slave: ATAPI identify retries exceeded
acd0: MODE_SENSE_BIG trying to write on read buffer
SMP: AP CPU #1 Launched!
acd0: MODE_SENSE_BIG - NO SENSE asc=0x00 ascq=0x00 error=0x04
acd0: CDROM <CD-224E> at ata0-master PIO4
Waiting 15 seconds for SCSI devices to settle
(noperiph:sym0:0:-1:-1): SCSI BUS reset delivered.
(noperiph:sym1:0:-1:-1): SCSI BUS reset delivered.
(probe10:sym0:0:11:0): phase change 6-7 6 at 0c936d8c resid=4.
pass1 at sym0 bus 0 target 11 lun 0
pass1: <SDR GEM318 0> Fixed Processor SCSI-2 device
pass1: 3.300MB/s transfers
da0 at sym0 bus 0 target 0 lun 0
da0: <FUJITSU MAN3184MC 0107> Fixed Direct Access SCSI-3 device
da0: 160.000MB/s transfers (80.000MHz, offset 62, 16bit), Tagged
Queueing Enabled
da0: 17522MB (35885448 512 byte sectors: 255H 63S/T 2233C)
Mounting root from ufs:/dev/da0s1a

[ PCICONF –l –V output ]
chip0 at pci0:0:0:	class=0x060000 card=0x00000000 chip=0x00091166 rev=0x05
hdr=0x00
    vendor   = 'Reliance Computer Corp./ServerWorks'
    device   = 'NB6536 (CNB20HE) AGP interface'
    class    = bridge
    subclass = HOST-PCI
chip1 at pci0:0:1:	class=0x060000 card=0x00000000 chip=0x00091166 rev=0x05
hdr=0x00
    vendor   = 'Reliance Computer Corp./ServerWorks'
    device   = 'NB6536 (CNB20HE) AGP interface'
    class    = bridge
    subclass = HOST-PCI
fxp0 at pci0:2:0:	class=0x020000 card=0x100c8086 chip=0x12298086 rev=0x08
hdr=0x00
    vendor   = 'Intel Corporation'
    device   = '82557/8/9 EtherExpress PRO/100(B) Ethernet Adapter'
    class    = network
    subclass = ethernet
none0 at pci0:7:0:	class=0x030000 card=0x802b1043 chip=0x47521002 rev=0x27
hdr=0x00
    vendor   = 'ATI Technologies'
    device   = 'Rage XL PCI'
    class    = display
    subclass = VGA
fxp1 at pci0:8:0:	class=0x020000 card=0x100c8086 chip=0x12298086 rev=0x08
hdr=0x00
    vendor   = 'Intel Corporation'
    device   = '82557/8/9 EtherExpress PRO/100(B) Ethernet Adapter'
    class    = network
    subclass = ethernet
isab0 at pci0:15:0:	class=0x060100 card=0x02001166 chip=0x02001166
rev=0x4f hdr=0x00
    vendor   = 'Reliance Computer Corp./ServerWorks'
    device   = 'OSB4 PCI to ISA Bridge'
    class    = bridge
    subclass = PCI-ISA
atapci0 at pci0:15:1:	class=0x01018a card=0x00000000 chip=0x02111166
rev=0x00 hdr=0x00
    vendor   = 'Reliance Computer Corp./ServerWorks'
    device   = 'OSB4 PCI EIDE Controller'
    class    = mass storage
    subclass = ATA
ohci0 at pci0:15:2:	class=0x0c0310 card=0x02201166 chip=0x02201166
rev=0x04 hdr=0x00
    vendor   = 'Reliance Computer Corp./ServerWorks'
    device   = 'OSB4 OpenHCI Compliant USB Controller'
    class    = serial bus
    subclass = USB
sym0 at pci1:5:0:	class=0x010000 card=0x00000000 chip=0x00201000 rev=0x01
hdr=0x00
    vendor   = 'LSI Logic (Was: Symbios Logic, NCR)'
    device   = 'LSI53C1010-33 PCI to Dual Channel Ultra160 SCSI
Multifunction Controller'
    class    = mass storage
    subclass = SCSI
sym1 at pci1:5:1:	class=0x010000 card=0x00000000 chip=0x00201000 rev=0x01
hdr=0x00
    vendor   = 'LSI Logic (Was: Symbios Logic, NCR)'
    device   = 'LSI53C1010-33 PCI to Dual Channel Ultra160 SCSI
Multifunction Controller'
    class    = mass storage
    subclass = SCSI

-- SNIP –-





More information about the freebsd-bugs mailing list