'stray irq7's cause hang?
Kris Kennaway
kris at FreeBSD.org
Mon Jul 28 19:50:19 UTC 2008
Steve Franks wrote:
> I've got a new system that hangs after about 2 hours - no
> ctrl-alt-esc, not ctrl-alt-Fn, no ctrl-alt-delete.
>
> I tried hints.0.apic.disabled="YES" (that's apic, not acpi) (or
> whatever the correct syntax from the handbook is), but I still get the
> hang, and the stray irq 7's. As far as I can see, there's no other
> dmesg output related.
The stray interrupts may be a red herring. "Stray" means that no driver
is handling them, and so there is no driver to screw up :)
I see straq irq 7's on a HP proliant blade system, and also the hard
hangs (it doesn't even reply to a NMI; this means it is almost certainly
a hardware error). However I am now fairly certain the hangs are
associated to disk failure. Several of the blades that were hanging
went on to develop DMA errors from ATA, and after I validated the
remaining systems with smartctl and took offline yet more blades that
failed the self-tests, I have not had the problem recur.
Kris
More information about the freebsd-questions
mailing list