System hangs during heavy sequential write to mfi device
Ireneusz Pluta
ipluta at wp.pl
Wed Jun 30 10:09:23 UTC 2010
Antony Mawer pisze:
> On Tue, Jun 29, 2010 at 4:09 AM, John Baldwin <jhb at freebsd.org> wrote:
>
>> On Monday 28 June 2010 1:57:27 pm Ireneusz Pluta wrote:
>>
>>> John Baldwin pisze:
>>>
>>>> On Monday 28 June 2010 12:00:06 pm Ireneusz Pluta wrote:
>>>>
>>>>
>>>>> John Baldwin pisze:
>>>>>
>>>>>
>>>>>> On Friday 25 June 2010 4:59:57 pm Ireneusz Pluta wrote:
>>>>>>
>>>>>>
>>>>>>
>>>>>>> John Baldwin pisze:
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>> Hmmm. You might have a hardware issue. OTOH, you can try seeing if
>>>>>>>>
>> you
>>
>>>> have
>>>>
>>>>
>>>>>>>> a BIOS option to disable PCIE error logging.
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>> is it one of them?:
>>>>>>>
>>>>>>> Assert NMI on SERR
>>>>>>> Assert NMI on PERR
>>>>>>>
>>>>>>> (pdf page 109 of: ->
>>>>>>>
>>>>>>>
>>>>>>>
>> http://download.intel.com/support/motherboards/server/s5520hc/sb/e39529013_s5520hc_s5500hcv_s5520hct_tps_r1_9.pdf)
>>
>>>>>>>
>>>>>> Well, that will turn off the NMIs. Not sure if it will affect the event
>>>>>> logging, but it is worth a shot.
>>>>>>
>>>>>>
>>>>>>
>>>>> Per BIOS setup documentation:
>>>>>
>>>>> On SERR, generate an NMI and log an error.
>>>>> Note: [Enabled] must be selected for the Assert NMI
>>>>> on PERR setup option to be visible.
>>>>>
>>>>> and:
>>>>>
>>>>> On PERR, generate an NMI and log an error.
>>>>> Note: This option is only active if the Assert NMI on
>>>>> SERR option is [Enabled] selected.
>>>>>
>>>>> However, disabling them did not change anything.
>>>>>
>>>>>
>>>> Is it still logging errors and sending NMIs with them disabled?
>>>>
>>>>
>>> with the options I mentioned disabled. They do not have to be the only
>>> sources of NMIs, do they?
>>>
>> Well, they should be the sources of the log messages you found in your system
>> event log. There is a good chance that you have some broken hardware
>> somewhere, I'm not sure how easy it is for you to debug that via swapping out
>> components, but the RAID controller is the first thing I would try.
>>
>
> You might want to try a BIOS update to see if it resolves these
> problems ... I seem to remember some mention of these sorts of errors
> in the change log of one of the recent Intel server board BIOSes.
>
Yeah, I tried, and the problem dissappeared.
I really could do that initially but none of the entries found in the
BIOS changelog from Intel seemed to address my particular issue.
More information about the freebsd-hardware
mailing list