[PATCH] Netdump for review and testing -- preliminary version

Attilio Rao attilio at freebsd.org
Thu Oct 14 15:20:23 UTC 2010


2010/10/14 Robert N. M. Watson <rwatson at freebsd.org>:
>
> On 14 Oct 2010, at 15:10, Attilio Rao wrote:
>
>>> My concern is less about occasional lost dumps than about destabilising the dumping process: calls into the memory allocator can currently trigger a lot of interesting behaviours, such as further calls back into the VM system, which can then trigger calls into other subsystems. What I'm suggesting is that if we want the mbuf allocator to be useful in this context, we need to teach it about things not to do in the dumping / crash / ... context, which probably means helping UMA out a bit in that regard. And a watchdog to make sure the dump is making progress.
>>
>> I think that this would be way too complicated just to cope with a panic
>> within the VM/UMA (I'm not sure which other subsystems you are referring
>> to as being called). Besides, if we have a panic in the VM I'm
>> sure that normal dumps could also be affected.
>> When dealing with netdump, I'm not trying to fix all the bugs related
>> to our dumping infrastructure because, as we already
>> discussed, we know there are quite a few of them; I'm trying at least
>> to keep the same level of fragility that we have today.
>> And again, while I think the "watchdog" idea is good, I think it still
>> applies to normal dumps too; it is not specific to netdump.
>
> No, what I'm saying is: UMA needs to not call its drain handlers, and ideally not call into VM to fill slabs, from the dumping context. That's easy to implement and will cause the dump to fail rather than causing the system to hang.

Ok.
My point is, however, still the same: that should not happen just for
the netdump-specific case but for all the dumping/KDB/panic cases (I
know it is unlikely that the current non-netdump code calls into UMA, but
that is not an established prerequisite, and some code added later may
still do so).
I still see this as a weakness in the infrastructure, independent
of netdump. I can see that your point is that it is vital to netdump's
correct behaviour though, so I wonder whether it is worth fixing now or
later.
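
To make the idea under discussion concrete, here is a minimal user-space
sketch of an allocator that refuses to take its slow paths once a generic
"restricted context" flag is set. It is only an illustration, not UMA code:
the flag in_restricted_ctx, the toy_zone structure and every toy_* function
are hypothetical names invented for this example, and malloc() merely
stands in for a call into the VM to fill a slab. The check is deliberately
not netdump-specific, so any dumping/KDB/panic path would get the same
behaviour.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical flag: set once the system enters a dump/KDB/panic context. */
static bool in_restricted_ctx = false;

/* Counters that record which slow paths actually ran. */
static int drain_calls = 0;
static int vm_refill_calls = 0;

#define CACHE_SLOTS 4
#define ITEM_SIZE   64

/* Toy zone: a small cache of preallocated items. */
struct toy_zone {
	void *cache[CACHE_SLOTS];
	int ncached;
};

/* Stand-in for running drain handlers to reclaim memory. */
static void
toy_zone_drain(void)
{
	drain_calls++;
}

/* Stand-in for calling into the VM to fill a slab. */
static void *
toy_vm_refill(void)
{
	vm_refill_calls++;
	return (malloc(ITEM_SIZE));
}

/* Fill the cache ahead of time, while the system is still healthy. */
static int
toy_zone_prefill(struct toy_zone *z, int n)
{
	while (z->ncached < n && z->ncached < CACHE_SLOTS) {
		void *item = malloc(ITEM_SIZE);
		if (item == NULL)
			break;
		z->cache[z->ncached++] = item;
	}
	return (z->ncached);
}

/*
 * Allocate one item. In the restricted context only the cache is used:
 * when it is empty the allocation fails instead of draining or calling
 * into the (simulated) VM, so the caller can abort the dump cleanly.
 */
static void *
toy_zone_alloc(struct toy_zone *z)
{
	if (z->ncached > 0)
		return (z->cache[--z->ncached]);
	if (in_restricted_ctx)
		return (NULL);
	toy_zone_drain();
	return (toy_vm_refill());
}
```

With two items prefilled and the flag set, the first two allocations
succeed from the cache and the third returns NULL without touching the
drain or refill paths. The dump then fails instead of hanging, which is
the trade-off argued for above.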

More comments from other people would be appreciated.

Thanks,
Attilio


-- 
Peace can only be achieved by understanding - A. Einstein

