[Bug 254341] igb hung every 5-20 hours. ifconfig down+up solves the problem for next N hours

bugzilla-noreply at freebsd.org bugzilla-noreply at freebsd.org
Tue Mar 16 18:29:24 UTC 2021


https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=254341

            Bug ID: 254341
           Summary: igb hung every 5-20 hours. ifconfig down+up solves the
                    problem for next N hours
           Product: Base System
           Version: 12.2-RELEASE
          Hardware: arm64
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: kern
          Assignee: bugs at FreeBSD.org
          Reporter: vlad.shabanov at gmail.com

Created attachment 223330
  --> https://bugs.freebsd.org/bugzilla/attachment.cgi?id=223330&action=edit
full dmesg.boot

I have a problem with FreeBSD 12.2p1. The problem started on FreeBSD 12.0 and
persists until now.

The network hangs just after boot. Running ifconfig down+up solves the problem
for the next 5-20 hours. I don’t know how to force the NIC to hang immediately;
the only way to reproduce it is to run ifconfig down/up and wait N hours.

MBoard: SuperMicro X10SRI-F
NIC: <Intel(R) PRO/1000 PCI-Express Network Driver> port 0xe020-0xe03f mem
0xfb120000-0xfb13ffff,0xfb144000-0xfb147fff irq 43 at device 0.0 on pci5
firewall: pf 
There are no VLANs, only jails running on 10.0.0.1/16

For now I use a simple watchdog script that runs every minute. The script pings
the gateway and does ifconfig down/up on error.
The script dumps these values every minute:
        /sbin/pfctl -si > /root/WATCHDOG/failure.txt
        /usr/bin/netstat -m >> /root/WATCHDOG/failure.txt
        /sbin/sysctl -a | /usr/bin/grep dev.igb >> /root/WATCHDOG/failure.txt
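
For illustration, a minimal sketch of what such a watchdog could look like. This
is not the reporter's actual script: the function names, the GATEWAY placeholder
address, the ping count/timeout and the 2-second pause are all assumptions; only
the three dump commands and the down/up reset come from the report.

```shell
#!/bin/sh
# Hypothetical watchdog sketch; values below are assumptions, not from the report.
IFACE="${IFACE:-igb0}"
GATEWAY="${GATEWAY:-192.0.2.1}"          # placeholder address, replace with the real gateway
OUT="${OUT:-/root/WATCHDOG/failure.txt}"

# Dump pf, mbuf and igb driver state (the same three commands as in the report).
dump_state() {
    /sbin/pfctl -si > "$OUT"
    /usr/bin/netstat -m >> "$OUT"
    /sbin/sysctl -a | /usr/bin/grep dev.igb >> "$OUT"
}

# Succeeds if the gateway answers; on FreeBSD ping, -t is the overall timeout in seconds.
gateway_alive() {
    /sbin/ping -c 3 -t 5 "$GATEWAY" > /dev/null 2>&1
}

# Bounce the interface, which is the workaround described in the report.
reset_iface() {
    /sbin/ifconfig "$IFACE" down
    sleep 2
    /sbin/ifconfig "$IFACE" up
}

# One watchdog pass: always dump state, reset only if the gateway is unreachable.
watchdog_once() {
    dump_state
    if ! gateway_alive; then
        reset_iface
    fi
}
```

To use it, a final line calling watchdog_once would be added and the script run
from cron once a minute. Note that each pass overwrites the dump file, so a copy
would have to be saved before the reset to keep the "before" and "after" states.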

I have two files: one from before the hang and one from just after it. I hope
that some of the values inside can help.


———————————————————————————————
- from dmesg.boot
igb0: <Intel(R) PRO/1000 PCI-Express Network Driver> port 0xe020-0xe03f mem
0xfb120000-0xfb13ffff,0xfb144000-0xfb147fff 
igb0: Using 1024 TX descriptors and 1024 RX descriptors
igb0: Using an MSI interrupt
igb0: Ethernet address: ac:1f:6b:02:8a:c4
igb0: netmap queues/slots: TX 1/1024, RX 1/1024

———————————————————————————————
$ ifconfig igb0
igb0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
        options=e507bb<RXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU,VLAN_HWCSUM,TSO4,TSO6,LRO,VLAN_HWFILTER,VLAN_HWTSO,RXCSUM_IPV6,TXCSUM_IPV6>
        ether ac:1f:6b:02:8a:c4
        inet IP-ADDR netmask 0xffffff00 broadcast IP-ADDR
        media: Ethernet 100baseTX <full-duplex>
        status: active
        nd6 options=29<PERFORMNUD,IFDISABLED,AUTO_LINKLOCAL>

———————————————————————————————
- /etc/rc.conf:
zfs_enable="YES"
sshd_enable="YES"
ntpd_enable="YES"
powerd_enable="YES"
local_unbound_enable="YES"
hostname="..."
ifconfig_igb0="inet IP-ADDR netmask 255.255.255.0"
defaultrouter="IP-ADDR"
ifconfig_lo0_alias0="inet 10.0.0.1 netmask 0xffff0000"
pf_enable="YES"
pflog_enable="YES"


