kern/168217: Watchdog timeouts with bce(4) on BCM5716

Xin LI delphij at FreeBSD.org
Tue May 22 05:30:11 UTC 2012


>Number:         168217
>Category:       kern
>Synopsis:       Watchdog timeouts with bce(4) on BCM5716
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    freebsd-bugs
>State:          open
>Quarter:        
>Keywords:       
>Date-Required:
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Tue May 22 05:30:10 UTC 2012
>Closed-Date:
>Last-Modified:
>Originator:     Xin LI
>Release:        FreeBSD 9.0-STABLE i386
>Organization:
The FreeBSD Project
>Environment:
System: FreeBSD sanji.chzip.com 9.0-RELEASE FreeBSD 9.0-RELEASE #1: Tue May 22 01:53:17 CST 2012     delphij at sanji.chzip.com:/usr/obj/usr/src/sys/GENERIC  amd64
>Description:
	The system sometimes stop to respond network traffic with the following
in system log:


May 22 07:40:01 sanji kernel: Limiting closed port RST response from 235 to 200 packets/sec
May 22 09:41:41 sanji kernel: bce1: /usr/src/sys/dev/bce/if_bce.c(7628): Watchdog timeout occurred, resetting!
May 22 09:41:41 sanji kernel: bce1: link state changed to DOWN
May 22 09:41:43 sanji kernel: bce1: discard frame w/o leading ethernet header (len 0 pkt len 0)

	This also sometimes lead to panic:

May 22 12:42:21 sanji kernel: bce0: discard frame w/o leading ethernet header (len 0 pkt len 0)
May 22 12:42:21 sanji kernel: bce0: discard frame w/o leading ethernet header (len 0 pkt len 0)
May 22 12:42:21 sanji kernel:
May 22 12:42:21 sanji kernel:
May 22 12:42:21 sanji kernel: Fatal trap 12: page fault while in kernel mode
May 22 12:42:21 sanji kernel: cpuid = 13; apic id = 13
May 22 12:42:21 sanji kernel: fault virtual address     = 0x18
May 22 12:42:21 sanji kernel: fault code                = supervisor read data, page not present
May 22 12:42:21 sanji kernel: instruction pointer       = 0x20:0xffffffff80403a46
			(this is RELENG_9_0 sys/dev/bce/bce.c:6449)
May 22 12:42:21 sanji kernel: stack pointer             = 0x28:0xffffff84601e5a80
May 22 12:42:21 sanji kernel: frame pointer             = 0x28:0xffffff84601e5b40
May 22 12:42:21 sanji kernel: code segment              = base 0x0, limit 0xfffff, type 0x1b
May 22 12:42:21 sanji kernel: = DPL 0, pres 1, long 1, def32 0, gran 1
May 22 12:42:21 sanji kernel: processor eflags  = interrupt enabled, resume, IOPL = 0
May 22 12:42:21 sanji kernel: current process           = 12 (irq256: bce0)

	Note that the system does not have jumbo enabled, however, split_hdr is enabled.
Looking at the code, it was a hardwired value.

	The system used to run stock FreeBSD 8.2-RELEASE and now run a patched 9.0-RELEASE.

	The host system is a Dell PowerEdge R410.


bce0: <Broadcom NetXtreme II BCM5716 1000Base-T (C0)> mem 0xd6000000-0xd7ffffff irq 36 at device 0.0 on pci1
miibus0: <MII bus> on bce0
brgphy0: <BCM5709 10/100/1000baseT PHY> PHY 1 on miibus0
brgphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-master, 1000baseT-FDX, 1000baseT-FDX-master, auto, auto-flow
bce0: Ethernet address: 78:2b:cb:74:82:a6
bce0: ASIC (0x57092008); Rev (C0); Bus (PCIe x4, 2.5Gbps); B/C (5.2.3); Bufs (RX:2;TX:2;PG:0); Flags (MSI|MFW); MFW (NCSI 2.0.11)
bce1: <Broadcom NetXtreme II BCM5716 1000Base-T (C0)> mem 0xd8000000-0xd9ffffff irq 48 at device 0.1 on pci1
miibus1: <MII bus> on bce1
brgphy1: <BCM5709 10/100/1000baseT PHY> PHY 1 on miibus1
brgphy1:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-master, 1000baseT-FDX, 1000baseT-FDX-master, auto, auto-flow
bce1: Ethernet address: 78:2b:cb:74:82:a7
bce1: ASIC (0x57092008); Rev (C0); Bus (PCIe x4, 2.5Gbps); B/C (5.2.3); Bufs (RX:2;TX:2;PG:0); Flags (MSI|MFW); MFW (NCSI 2.0.11)
bce2: <Broadcom NetXtreme II BCM5709 1000Base-T (C0)> mem 0xda000000-0xdbffffff irq 38 at device 0.0 on pci3
miibus2: <MII bus> on bce2
brgphy2: <BCM5709 10/100/1000baseT PHY> PHY 1 on miibus2
brgphy2:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-master, 1000baseT-FDX, 1000baseT-FDX-master, auto, auto-flow
bce2: Ethernet address: 00:10:18:bc:f6:30
bce2: ASIC (0x57092003); Rev (C0); Bus (PCIe x4, 2.5Gbps); B/C (5.2.3); Bufs (RX:2;TX:2;PG:0); Flags (MSI)
bce3: <Broadcom NetXtreme II BCM5709 1000Base-T (C0)> mem 0xdc000000-0xddffffff irq 45 at device 0.1 on pci3
miibus3: <MII bus> on bce3
brgphy3: <BCM5709 10/100/1000baseT PHY> PHY 1 on miibus3
brgphy3:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-master, 1000baseT-FDX, 1000baseT-FDX-master, auto, auto-flow
bce3: Ethernet address: 00:10:18:bc:f6:32
bce3: ASIC (0x57092003); Rev (C0); Bus (PCIe x4, 2.5Gbps); B/C (5.2.3); Bufs (RX:2;TX:2;PG:0); Flags (MSI)
bce1: Gigabit link up!
bce1: Gigabit link up!
bce0: Gigabit link up!
bce0: Gigabit link up!

bce0 at pci0:1:0:0:        class=0x020000 card=0x028c1028 chip=0x163b14e4 rev=0x20 hdr=0x00
    vendor     = 'Broadcom Corporation'
    device     = 'NetXtreme II BCM5716 Gigabit Ethernet'
    class      = network
    subclass   = ethernet
bce1 at pci0:1:0:1:        class=0x020000 card=0x028c1028 chip=0x163b14e4 rev=0x20 hdr=0x00
    vendor     = 'Broadcom Corporation'
    device     = 'NetXtreme II BCM5716 Gigabit Ethernet'
    class      = network
    subclass   = ethernet
mpt0 at pci0:2:0:0:        class=0x010000 card=0x1f0f1028 chip=0x00581000 rev=0x08 hdr=0x00
    vendor     = 'LSI Logic / Symbios Logic'
    device     = 'SAS1068E PCI-Express Fusion-MPT SAS'
    class      = mass storage
    subclass   = SCSI
bce2 at pci0:3:0:0:        class=0x020000 card=0x090714e4 chip=0x163914e4 rev=0x20 hdr=0x00
    vendor     = 'Broadcom Corporation'
    device     = 'NetXtreme II BCM5709 Gigabit Ethernet'
    class      = network
    subclass   = ethernet
bce3 at pci0:3:0:1:        class=0x020000 card=0x090714e4 chip=0x163914e4 rev=0x20 hdr=0x00
    vendor     = 'Broadcom Corporation'
    device     = 'NetXtreme II BCM5709 Gigabit Ethernet'
    class      = network
    subclass   = ethernet

	More information will be available upon request.

>How-To-Repeat:
>Fix:

	Not known at this time.  We are testing with split_hdr disabled.

>Release-Note:
>Audit-Trail:
>Unformatted:


More information about the freebsd-bugs mailing list