em(4) duplex problems with 82541EI on RELENG_8, -CURRENT on PowerEdge 1850

Brian A. Seklecki bseklecki at collaborativefusion.com
Tue Jun 1 19:28:42 UTC 2010


= Re-posted from freebsd-hardware@, since this is more of a bug
 report than a hardware comparability inquiry / buying strategy
 discussion. ==

All:

  Has anyone upgraded their PowerEdge 1850s to 8.0-PL or 
  RELENG_8 -stable?  We're seeing problems where 7.2-PL and
  6.3-PL were not affected on the same hardware.

  The problem is that forcing the duplex 100/full on both 
  sides no longer functions.

  Configuration:
 
   - A variety of Cisco L2/L3 switches over the last decade:
   -- 2848G-L3
   -- 2950
   -- 2960s
   -- 3550-12Ts
   -- 3550XLs
   -- Duplex forced 100/full on Cisco side
   - FreeBSD/amd64 RELENG_8 or 9-CURRENT with duplex
     forced '100baseTX mediaopt full-duplex',
   - This configuration has worked since FreeBSD 5.4
  
  When connected to PowerEdge 1850r1/r2, with the onboard Intel 
  82541EI, the parenthesis show an actual media speed/duplex of:

  media: Ethernet 100baseTX <full-duplex>  (100baseTX <half-duplex>)

  The same configuration using a Dell-sold Intel dual port
  82546EB, in the same system, on the same switch, works fine.


-----------------
ifconfig(8):
-----------------
em3: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX, \r
                 MULTICAST> metric 0 mtu 1500
options=9b<RXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,VLAN_HWCSUM>
ether 00:13:72:4f:70:81
inet 192.168.97.20 netmask 0xffffff80 broadcast 192.168.97.127
media: Ethernet 100baseTX <full-duplex> (100baseTX <half-duplex>)
status: active
-----------------
em0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,\
                 MULTICAST> metric 0 mtu 1500>
options=9b<RXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,VLAN_HWCSUM>
ether 00:04:23:c8:fe:ac
media: Ethernet 100baseTX <full-duplex>
status: active
-----------------
-----------------
pciconf(8):
-----------------
em3 at pci0:7:8:0:	class=0x020000 card=0x016d1028 chip=0x10768086 
                rev=0x05 hdr=0x00
    vendor     = 'Intel Corporation'
    device     = 'Gigabit Ethernet Controller (82541EI)'
    class      = network
    subclass   = ethernet
em0 at pci0:3:11:0:	class=0x020000 card=0x10128086 chip=0x10108086 rev=0x01
hdr=0x00
   vendor     = 'Intel Corporation'
   device  = 'Dual Port Gigabit Ethernet Controller (Copper) (82546EB)'
   class      = network
   subclass   = ethernet

-----------------

rc.conf(5) for shits & giggles:

ifconfig_em0="inet X netmask Y media 100baseTX mediaopt full-duplex"
ifconfig_em3="inet Z netmask F media 100baseTX mediaopt full-duplex"

--------

Example IOS switch config:
 interface FastEthernet0/39
  description I hate Dell
  switchport access vlan 100
  switchport mode access
  speed 100
  duplex full
  spanning-tree portfast
 end
--------

I've been clearing interface counters on the switch side, but I'll send
'netstat -i', 'show interface counters', and 'sudo sysctl -w
dev.em.3.stats=1' ASAP to illustrate connectivity errors soon.

Are we being punished for patronizing Dell?

Is it possible that ifconfig(8) output has simply changed?  Are the
values in the parenthesis on the right the Ethernet auto-sense desired
values where as outside the parenthesis the current active values?

In 6.3/7.2, once you forced a speed/duplex, the values in parenthesis
went away entirely. 

The only way I've been able to make that happen is to #define in
src/sys/dev/e1000/if_em.h:

  #define DO_AUTO_NEG 0
  /*
   * This parameter control whether or not the driver will wait for
   * autonegotiation to complete.
   * 1 - Wait for autonegotiation to complete
   * 0 - Don't wait for autonegotiation to complete
  */

Also seems odd that some ICs are affected but not others.

Its also possible that my problems are pf(4) + setfib(8) related and I
that this is a separate issue.

Two new notes since the original post:

 - I have confirmed this problem on two revisions of the Dell
   8th gen hardware in two different datacenters
 - The problem persists on -CURRENT from 05/2010
 - RELENG_7 does not seem to be impacted
 - More stats below.


Thanks,
    ~BAS
 
---------------



em1: link state changed to DOWN
em1: link state changed to UP
em1: link state changed to DOWN
em1: link state changed to UP
em1: link state changed to DOWN
em1: link state changed to UP
em1: link state changed to DOWN
em1: link state changed to UP
em1: link state changed to DOWN
em1: link state changed to UP
em1: link state changed to DOWN

em0: Excessive collisions = 0
em0: Sequence errors = 0
em0: Defer count = 0
em0: Missed Packets = 0
em0: Receive No Buffers = 0
em0: Receive Length Errors = 0
em0: Receive errors = 0
em0: Crc errors = 0
em0: Alignment errors = 0
em0: Collision/Carrier extension errors = 0
em0: RX overruns = 0
em0: watchdog timeouts = 0
em0: RX MSIX IRQ = 0 TX MSIX IRQ = 0 LINK MSIX IRQ = 0
em0: XON Rcvd = 0
em0: XON Xmtd = 0
em0: XOFF Rcvd = 0
em0: XOFF Xmtd = 0
em0: Good Packets Rcvd = 1319916
em0: Good Packets Xmtd = 1070646
em0: TSO Contexts Xmtd = 0
em0: TSO Contexts Failed = 0
em1: Excessive collisions = 0
em1: Sequence errors = 0
em1: Defer count = 0
em1: Missed Packets = 0
em1: Receive No Buffers = 0
em1: Receive Length Errors = 0
em1: Receive errors = 0
em1: Crc errors = 0
em1: Alignment errors = 0
em1: Collision/Carrier extension errors = 0
em1: RX overruns = 0
em1: watchdog timeouts = 0
em1: RX MSIX IRQ = 0 TX MSIX IRQ = 0 LINK MSIX IRQ = 0
em1: XON Rcvd = 0
em1: XON Xmtd = 0
em1: XOFF Rcvd = 0
em1: XOFF Xmtd = 0
em1: Good Packets Rcvd = 251348
em1: Good Packets Xmtd = 204160
em1: TSO Contexts Xmtd = 0
em1: TSO Contexts Failed = 0

--------------------


as0# sh int fa0/43
FastEthernet0/43 is up, line protocol is up (connected)
Hardware is Fast Ethernet, address is 0015.c683.51ab (bia
0015.c683.51ab)
Description: X-Server EM1
MTU 1500 bytes, BW 100000 Kbit, DLY 100 usec, 
reliability 255/255, txload 1/255, rxload 1/255
Encapsulation ARPA, loopback not set
Keepalive set (10 sec)
Full-duplex, 100Mb/s, media type is 100BaseTX
input flow-control is unsupported output flow-control is unsupported 
ARP type: ARPA, ARP Timeout 04:00:00
Last input never, output 00:00:08, output hang never
Last clearing of "show interface" counters 6d03h
Input queue: 0/75/0/0 (size/max/drops/flushes); Total output drops: 0
Queueing strategy: fifo
Output queue: 0/40 (size/max)
5 minute input rate 0 bits/sec, 0 packets/sec
5 minute output rate 1000 bits/sec, 3 packets/sec
291422 packets input, 131521274 bytes, 0 no buffer
Received 798 broadcasts (0 multicast)
0 runts, 0 giants, 0 throttles
0 input errors, 0 CRC, 0 frame, 0 overrun, 0 ignored
0 watchdog, 99 multicast, 0 pause input
0 input packets with dribble condition detected
651929 packets output, 73550092 bytes, 0 underruns
0 output errors, 0 collisions, 4 interface resets
0 babbles, 0 late collision, 0 deferred
0 lost carrier, 0 no carrier, 0 PAUSE output
0 output buffer failures, 0 output buffers swapped out


-- 
Brian A. Seklecki <bseklecki at collaborativefusion.com>
Collaborative Fusion, Inc.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 198 bytes
Desc: This is a digitally signed message part
Url : http://lists.freebsd.org/pipermail/freebsd-stable/attachments/20100601/61dd8ecf/attachment.pgp


More information about the freebsd-stable mailing list