PLEASE TEST: IPI deadlock avoidance patch
Garance A Drosihn
drosih at rpi.edu
Thu Aug 26 11:16:04 PDT 2004
At 10:37 AM -0700 8/26/04, Doug White wrote:
>On Mon, 23 Aug 2004, Garance A Drosihn wrote:
>
> > ... No panics, but in the second set I
> > did have one buildworld (-j9) which failed with four processes
>> apparently getting a "*** Signal 6". I am not sure what that
>> was about. Note that all these builds were done with a `make'
>> that was compiled to USE_KQUEUE , so maybe that's where the
>> Signal's came from. In any case, the machine is still running
>> fine, even after that heavy pounding.
>
>Signal 6 is SIGABRT, which is usually intentional. You'd have to
>check the output for a specific process that abended. I'd also
>have to scan the make code for any abort() calls.
I have not pinned this down yet, but I have determined that it is
not related to the USE_KQUEUE `make'. It is only happening if I do
several -j buildworlds in a row, and (at least so far) only if I
also have the folding-at-home client running as a background task.
This combination is not something that I have done before, so I
no way of knowing if it is related to the IPI-fix that you wrote.
I do know that I have not been able to trigger the panic that I
got before, even though I have been hammering the machine MUCH
harder than I usually do.
The "Signal 6"s seem to be hitting various gcc processes, and I
did notice that gcc has plenty of calls to abort() in it. At the
moment I do not have time to pursue this any farther. (Our first
day of classes here @RPI is next Monday, and I have about a hundred
things that need to get done before then!).
--
Garance Alistair Drosehn = gad at gilead.netel.rpi.edu
Senior Systems Programmer or gad at freebsd.org
Rensselaer Polytechnic Institute or drosih at rpi.edu
More information about the freebsd-current
mailing list