CALL FOR TESTERS! [Re: 6.2 SHOWSTOPPER - em completely unusable on 6.2]
Philippe Pegon
Philippe.Pegon at crc.u-strasbg.fr
Thu Sep 28 13:29:15 PDT 2006
Mike Jakubik wrote:
> Scott Long wrote:
>> All,
>>
>> Attached is my first cut at addressing the problems described in this
>> thread. As I discussed earlier, the VM syncer thread is likely starving
>> the USB interrupt thread. This causes the shared usb+network
>> interrupt to remain masked, preventing network interrupts from being
>> delivered,
>> and thus triggering watchdog timeouts.
>
> Just to be clear, has it been established that the problem only occurs
> when em is sharing an interrupt? I have a lot of production machines
> using the PDSMi board, which is one of the boards that the problem was
> noticed on, however i do not share any irqs, i always disable USB in the
> BIOS.
On many of our servers we have bge cards, and I see a lot of
watchdog timeouts on them as well. We always disable USB in the BIOS,
and the cards do not share an IRQ.
>
> # vmstat -i
> interrupt total rate
> irq16: em0 13001181 7
> irq19: atapci0 76559511 42
> cpu0: timer 3643365617 1999
> cpu1: timer 3643365610 1999
> Total 7376291919 4048
For example, on our FTP server (ftp8.fr.freebsd.org), an HP DL360 G4 SMP:
# vmstat -i
interrupt                  total    rate
irq1: atkbd0                1576       0
irq4: sio0                     3       0
irq6: fdc0                    12       0
irq14: ata0                   57       0
irq24: ciss1            17181184       8
irq25: bge0            841821262     402
irq26: bge1            674342644     322
irq72: ciss0            24194679      11
cpu0: timer           4180478365    1999
cpu1: timer           4180886439    1999
Total                 9918906221    4743
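
To check a vmstat -i listing for shared interrupt lines without reading it by eye, a short Python sketch along these lines could help. It assumes that devices sharing an IRQ appear on the same row after the irqN: label, separated by whitespace; that layout and the function name shared_irqs are assumptions for illustration, not something confirmed in this thread.

```python
import re
from collections import defaultdict

# Matches rows such as "irq25: bge0 841821262 402".
# Assumption: the last two fields are always the total and the rate.
IRQ_LINE = re.compile(r"^(irq\d+):\s+(.+?)\s+(\d+)\s+(\d+)$")


def shared_irqs(vmstat_output):
    """Return {irq: [devices]} for every IRQ row that names more than one device."""
    devices = defaultdict(list)
    for line in vmstat_output.splitlines():
        match = IRQ_LINE.match(line.strip())
        if match:
            irq, names = match.group(1), match.group(2)
            # Assumption: device names on a shared row are whitespace-separated.
            devices[irq].extend(names.split())
    return {irq: devs for irq, devs in devices.items() if len(devs) > 1}
```

Run against the listing above, this returns an empty dict, which is consistent with the bge cards not sharing an IRQ on this machine.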
# bzgrep watchdog /var/log/messages*
/var/log/messages:Sep 23 02:47:06 anubis kernel: bge1: watchdog timeout -- resetting
/var/log/messages.0.bz2:Sep 12 22:22:48 anubis kernel: bge1: watchdog timeout -- resetting
/var/log/messages.0.bz2:Sep 17 15:22:01 anubis kernel: bge1: watchdog timeout -- resetting
/var/log/messages.0.bz2:Sep 20 12:13:07 anubis kernel: bge1: watchdog timeout -- resetting
/var/log/messages.1.bz2:Sep 6 08:33:54 anubis kernel: bge1: watchdog timeout -- resetting
/var/log/messages.3.bz2:Aug 29 12:09:36 anubis kernel: bge0: watchdog timeout -- resetting
/var/log/messages.4.bz2:Aug 22 15:44:00 anubis kernel: bge0: watchdog timeout -- resetting
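
For anyone collecting the same data on other machines, here is a minimal Python sketch that tallies watchdog timeouts per interface from log text like the above. The function name watchdog_counts is hypothetical, and the pattern only assumes the message format shown in these log lines ("<ifname>: watchdog timeout").

```python
import re
from collections import Counter

# Matches the kernel message "bge1: watchdog timeout -- resetting".
# \s+ also tolerates a line wrap between "watchdog" and "timeout".
WATCHDOG = re.compile(r"kernel: (\w+): watchdog\s+timeout")


def watchdog_counts(log_text):
    """Count watchdog timeout messages per interface name."""
    return Counter(WATCHDOG.findall(log_text))
```

For the seven messages above this gives 5 for bge1 and 2 for bge0.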
# pciconf -lv
hostb0 at pci0:0:0: class=0x060000 card=0x32000e11 chip=0x35908086 rev=0x0a hdr=0x00
    vendor = 'Intel Corporation'
    device = 'E752x Server Memory Controller Hub'
    class = bridge
    subclass = HOST-PCI
pcib1 at pci0:2:0: class=0x060400 card=0x00000050 chip=0x35958086 rev=0x0a hdr=0x01
    vendor = 'Intel Corporation'
    device = 'E752x Memory Controller Hub PCI Express Port A0'
    class = bridge
    subclass = PCI-PCI
pcib2 at pci0:4:0: class=0x060400 card=0x00000050 chip=0x35978086 rev=0x0a hdr=0x01
    vendor = 'Intel Corporation'
    device = 'E752x Memory Controller Hub PCI Express Port B0'
    class = bridge
    subclass = PCI-PCI
pcib5 at pci0:6:0: class=0x060400 card=0x00000050 chip=0x35998086 rev=0x0a hdr=0x01
    vendor = 'Intel Corporation'
    device = 'E752x Memory Controller Hub PCI Express Port C0'
    class = bridge
    subclass = PCI-PCI
pcib6 at pci0:28:0: class=0x060400 card=0x00000050 chip=0x25ae8086 rev=0x02 hdr=0x01
    vendor = 'Intel Corporation'
    device = '6300ESB Hub Interface to PCI-X Bridge'
    class = bridge
    subclass = PCI-PCI
pcib7 at pci0:30:0: class=0x060400 card=0x00000000 chip=0x244e8086 rev=0x0a hdr=0x01
    vendor = 'Intel Corporation'
    device = '82801BA/CA/DB/DBL/EB/ER/FB (ICH2/3/4/4/5/5/6), 6300ESB Hub Interface to PCI Bridge'
    class = bridge
    subclass = PCI-PCI
isab0 at pci0:31:0: class=0x060100 card=0x00000000 chip=0x25a18086 rev=0x02 hdr=0x00
    vendor = 'Intel Corporation'
    device = '6300ESB LPC Interface Bridge'
    class = bridge
    subclass = PCI-ISA
atapci0 at pci0:31:1: class=0x01018a card=0x32010e11 chip=0x25a28086 rev=0x02 hdr=0x00
    vendor = 'Intel Corporation'
    device = '6300ESB IDE Controller'
    class = mass storage
    subclass = ATA
pcib3 at pci6:0:0: class=0x060400 card=0x00000044 chip=0x03298086 rev=0x09 hdr=0x01
    vendor = 'Intel Corporation'
    device = '6700PXH PCI Express-to-PCI Express Bridge A'
    class = bridge
    subclass = PCI-PCI
pcib4 at pci6:0:2: class=0x060400 card=0x00000044 chip=0x032a8086 rev=0x09 hdr=0x01
    vendor = 'Intel Corporation'
    device = '6700PXH PCI Express-to-PCI Express Bridge B'
    class = bridge
    subclass = PCI-PCI
ciss0 at pci10:1:0: class=0x010400 card=0x409b0e11 chip=0x00460e11 rev=0x01 hdr=0x00
    vendor = 'Compaq Computer Corp (Now owned by Hewlett-Packard)'
    device = 'Smart Array 64xx Controller'
    class = mass storage
    subclass = RAID
ciss1 at pci2:1:0: class=0x010400 card=0x40910e11 chip=0x00460e11 rev=0x01 hdr=0x00
    vendor = 'Compaq Computer Corp (Now owned by Hewlett-Packard)'
    device = 'Smart Array 64xx Controller'
    class = mass storage
    subclass = RAID
bge0 at pci2:2:0: class=0x020000 card=0x00d00e11 chip=0x164814e4 rev=0x10 hdr=0x00
    vendor = 'Broadcom Corporation'
    device = 'BCM5704 NetXtreme Dual Gigabit Adapter'
    class = network
    subclass = ethernet
bge1 at pci2:2:1: class=0x020000 card=0x00d00e11 chip=0x164814e4 rev=0x10 hdr=0x00
    vendor = 'Broadcom Corporation'
    device = 'BCM5704 NetXtreme Dual Gigabit Adapter'
    class = network
    subclass = ethernet
none0 at pci1:3:0: class=0x030000 card=0x001e0e11 chip=0x47521002 rev=0x27 hdr=0x00
    vendor = 'ATI Technologies Inc'
    device = 'Rage XL PCI'
    class = display
    subclass = VGA
none1 at pci1:4:0: class=0x088000 card=0xb2060e11 chip=0xb2030e11 rev=0x01 hdr=0x00
    vendor = 'Compaq Computer Corp (Now owned by Hewlett-Packard)'
    device = 'iLo Integrated Lights Out Processor'
    class = base peripheral
none2 at pci1:4:2: class=0x088000 card=0xb2060e11 chip=0xb2040e11 rev=0x01 hdr=0x00
    vendor = 'Compaq Computer Corp (Now owned by Hewlett-Packard)'
    device = 'iLo Integrated Lights Out Processor'
    class = base peripheral
--
Philippe Pegon