[Bug 202680] Silent data corruption on em(4) interfaces
bugzilla-noreply at freebsd.org
bugzilla-noreply at freebsd.org
Wed Aug 26 16:35:06 UTC 2015
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=202680
Bug ID: 202680
Summary: Silent data corruption on em(4) interfaces
Product: Base System
Version: 10.2-STABLE
Hardware: Any
OS: Any
Status: New
Severity: Affects Some People
Priority: ---
Component: kern
Assignee: freebsd-bugs at FreeBSD.org
Reporter: KOT at MATPOCKuH.Ru
I got silent data corruption when transferring data via em(4) interfaces on
10.2-STABLE r286912.
Immediately after reboot does not occur - after reboot I can transfer files
without any errors.
Problem occurs *only* after few hours of uptime. I checked on system with
uptime > 1 day and got:
1. broken large file transferred via ftp (MD5 checksum mismatched);
For example:
Original MD5(eis-dvd.iso)= 3705908d84f3c0f9f922a00e6e9332d2
Downloaded MD5 (eis-dvd.iso) = 1f7de1fdc682ae9ca26ec0621b377e89
2. SSH disconnects when transferring large data with message:
Corrupted MAC on input.
Disconnecting: Packet corrupt
I using Sun Fire X4100 server with 4 build in ethernet ports Intel 82546EB:
em0 at pci0:1:1:0: class=0x020000 card=0x10118086 chip=0x10108086 rev=0x03
hdr=0x00
vendor = 'Intel Corporation'
device = '82546EB Gigabit Ethernet Controller (Copper)'
class = network
subclass = ethernet
This server works under solaris without any problems.
I tried to use:
1. lagg0.2 (lagg0 with vlan tag) over em0 and em2 interfaces:
cloned_interfaces="lagg0 lagg0.2"
ifconfig_em0="up"
ifconfig_em2="up"
create_args_lagg0="laggproto lacp laggport em0 laggport em2"
2. tagged em1.2 interface via manual ifconfig em1.2 create 192.168.X.Y
And I got same problem in both cases.
*One* time when transferring file I got this messages:
em0: Interface stopped DISTRIBUTING, possible flapping
em0: Watchdog timeout -- resetting
em2: Interface stopped DISTRIBUTING, possible flapping
em2: Watchdog timeout -- resetting
But this problem occurred only one time.
netstat -in does not see any problems on interfaces:
Name Mtu Network Address Ipkts Ierrs Idrop Opkts Oerrs
Coll
em0 1500 <Link#1> 00:14:4f:01:3f:7a 6689452 0 0 146720 0
0
em1 1500 <Link#2> 00:14:4f:01:3f:7b 5732168 0 0 2865912 0
0
em2 1500 <Link#3> 00:14:4f:01:3f:7c 501817 0 0 3392333 0
0
TCP_OFFLOAD was disabled in kernel's config.
System is not in production use, and I can try to roll back to pre-r269196 code
or try to collect any required additional diagnosis information, but I can't
provide remote access to this system.
--
You are receiving this mail because:
You are the assignee for the bug.
More information about the freebsd-bugs
mailing list