interrupt storm on MSI IXP600 based motherboards

Dan Langille dan at langille.org
Fri Jan 23 20:21:05 PST 2009


On Jan 23, 2009, at 2:34 PM, Marat N.Afanasyev wrote:

> Dan Langille wrote:
>> On Jan 22, 2009, at 11:38 AM, Dan Langille wrote:
>>> Victor Balada Diaz wrote:
>>>> On Wed, Jan 21, 2009 at 07:22:06PM +0300, Marat N.Afanasyev wrote:
>>>>>>> The trouble with onboard re(4) was resolved in -CURRENT and
>>>>>>> -STABLE, but storms are not limited to Ethernet. A storm may
>>>>>>> appear on any device: if a device generates a high enough
>>>>>>> interrupt rate, a storm will follow.
>>>>>> Yes, I just got another storm, on my ATA controller this time. Ah
>>>>>> well, so much for the idea of disabling unneeded devices!
>>>>>>
>>>>>> -pete.
>>>>>>
>>>>> It's a kind of magic, really. I built a new kernel with KDB and
>>>>> DDB, and after 1 day, 13:15 of uptime I'm still waiting for a storm
>>>>> to arrive. I also added hw.acpi.osname="Linux" to
>>>>> /boot/loader.conf.
>>>> Try doing lots of IO and you will get the problem soon. You might
>>>> want to try:
>>>> while true; do dd if=/dev/zero of=BAH bs=1M count=1024; sync; done
>>>
>>> FWIW, last night I changed the I/O address of the COM port in my
>>> BIOS. Then I ran the Bacula regression test suite (lots of I/O).
>>> On my machine, once the interrupt storm starts, it continues. I
>>> do not know if that happens to everyone.
>>>
>>> Since changing the address, I have had no interrupt storms. I have  
>>> been running the above IO loop for about ten minutes.
>>>
>>> No storm yet (knock on wood).
>> And it's back:
>>
>> Jan 22 17:21:46 polo kernel: interrupt storm detected on "irq22:"; throttling interrupt source
>> Jan 22 17:23:19 polo kernel: interrupt storm detected on "irq22:"; throttling interrupt source
>> Jan 22 17:28:20 polo kernel: interrupt storm detected on "irq22:"; throttling interrupt source
>> Jan 22 17:33:20 polo kernel: interrupt storm detected on "irq22:"; throttling interrupt source
>> Jan 22 17:38:20 polo kernel: interrupt storm detected on "irq22:"; throttling interrupt source
>>
>> I shall try the hw.acpi.osname="Linux" option now.
>>
>> From dmesg:
>> Jan 22 18:10:07 polo kernel: ACPI: Overriding _OS definition with "Linux"
>
> It has been working for me for 3 days, 16:27, with still no sign of an
> interrupt storm, and emu10kx0 generates as many as 93 interrupts per
> second without trouble.
>
> What is your situation?

The box has rebooted twice tonight. The first time, it was running
the "while true; do dd..." script. The second time, it was not.

The box is now up and responding to pings, but I cannot ssh to it. I
can't get to the console until Monday.
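
For anyone comparing notes, here is a rough way to tally how often each
IRQ has been flagged. This is only a sketch: the function name
count_storms is made up for illustration, and it assumes the kernel
messages have the same form as the "interrupt storm detected" lines
quoted earlier in this thread.

```shell
#!/bin/sh
# count_storms: hypothetical helper, not part of FreeBSD.
# Reads syslog lines on stdin and prints one line per interrupt source,
# in the form "<source> <number of storm messages>".
count_storms() {
    # Keep only storm messages and extract the quoted source name.
    sed -n 's/.*interrupt storm detected on "\([^"]*\)".*/\1/p' |
        sort | uniq -c |
        # uniq -c prints "count name"; swap to "name count".
        awk '{ print $2, $1 }'
}
```

It could be fed from wherever kernel messages are logged on the
machine, for example "count_storms < /var/log/messages" if that is
where syslog writes them.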

-- 
Dan Langille
http://langille.org/





More information about the freebsd-stable mailing list