Problems with em interfaces on FreeBSD 6.1

Landon Fuller landonf at opendarwin.org
Fri Aug 11 16:48:04 UTC 2006


On Aug 11, 2006, at 09:22, Jack Vogel wrote:

> On 8/11/06, Gleb Smirnoff <glebius at freebsd.org> wrote:
>>   Daniel,
>>
>> On Fri, Aug 11, 2006 at 01:42:32PM +0200, Daniel Ryslink wrote:
>> D> We have started to use the em driver only recently, after the  
>> upgrade to
>> D> gigabit connectivity (100 MBit NICs from Intel used the fxp  
>> driver).
>> D>
>> D> As for the frequency of the incidents, here is a grep of the  
>> messages:
>> D>
>> D> ~~~~~~~~~~~~ <slash> ~~~~~~~~~~~~~~
>> D> Aug  4 22:35:23 b2 kernel: em0: watchdog timeout -- resetting
>> D> Aug  5 00:09:20 b2 kernel: em1: watchdog timeout -- resetting
>> D> Aug  5 06:08:59 b2 kernel: em1: watchdog timeout -- resetting
>> D> Aug  6 12:38:16 b2 kernel: em1: watchdog timeout -- resetting
>> D> Aug  6 20:39:47 b2 kernel: em0: watchdog timeout -- resetting
>> D> Aug  7 18:37:29 b2 kernel: em1: watchdog timeout -- resetting
>> D> Aug  8 07:27:48 b2 kernel: em0: watchdog timeout -- resetting
>> D> Aug  8 09:38:17 b2 kernel: em0: watchdog timeout -- resetting
>> D> Aug  8 12:54:54 b2 kernel: em1: watchdog timeout -- resetting
>> D> Aug  8 22:41:17 b2 kernel: em1: watchdog timeout -- resetting
>> D> Aug  9 05:17:24 b2 kernel: em1: watchdog timeout -- resetting
>> D> Aug  9 10:56:10 b2 kernel: em1: watchdog timeout -- resetting
>> D> Aug  9 20:10:06 b2 kernel: em1: watchdog timeout -- resetting
>> D> Aug 11 08:41:44 b2 kernel: em0: watchdog timeout -- resetting
>> D> Aug 11 10:35:43 b2 kernel: em0: watchdog timeout -- resetting
>> D> ~~~~~~~~~~~~ <slash> ~~~~~~~~~~~~~~
>> D>
>> D> The driver used is version 3.2.18 (I wanted to use the Intel  
>> 6.1.4 as a
>> D> module, but I have found out that I made a mistake and  
>> accidentally loaded
>> D> the old 3.2.18 driver).
>> D>
>> D> I have dilemma now - which new driver to try? The 6.0.5  
>> submitted to the
>> D> current FreeBSD 6.1 branch (modified by you, I believe, on 8th  
>> August), or
>> D> the newest driver from Intel 6.1.4? Do you think one of these  
>> drivers
>> D> could solve my problems?
>>
>> I'm not sure whether new driver will solve your problems. You  
>> should give
>> a try to 6.1-STABLE which has 6.0.5 in it. The difference between  
>> 6.1.4 and
>> 6.0.5 is quite small, I doubt that 6.1.4 worth a try in your case.
>
> Gleb is right, the difference between my 6.0.5 and 6.1.4 driver are  
> minor
> and don't seem to have anything to do with your problem.
>
> I am happy Gleb got my code merged with tip of STABLE and would take
> that driver code if I were you, it will become 6.2 before long :)
>
> Watchdogs happen because of transmit cleanup failing, your instances
> are pretty widely seperated, it looks like some external network  
> problem
> perhaps?

We saw this issue here on SMP systems running 6.1; I've been meaning  
to set up a reproduction case in the lab and dig into the issue further.
Disabling the mpsafe network stack (debug.mpsafenet=0) is our  
temporary work-around; rwatson mentioned that this has the effect of  
forcing the interrupt handler for if_em to not run in parallel with  
the transmit code, which is likely what caused the problem to disappear.

-landonf
-------------- next part --------------
A non-text attachment was scrubbed...
Name: PGP.sig
Type: application/pgp-signature
Size: 186 bytes
Desc: This is a digitally signed message part
Url : http://lists.freebsd.org/pipermail/freebsd-net/attachments/20060811/09c91b42/PGP.pgp


More information about the freebsd-net mailing list