[Bug 284743] System reproducably livelocks after a couple of hours in poudriere bulk -a

From: <bugzilla-noreply_at_freebsd.org>
Date: Thu, 08 May 2025 15:36:18 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=284743

--- Comment #23 from Olivier Certner <olce@FreeBSD.org> ---
(In reply to Robert Clausecker from comment #21)

I'd say it's quite unlikely.

There's somewhere (but right now don't remember where) a mention that the
reason for not allowing every users to use idprio is the risk of deadlocks. 
I've reviewed quite a lot of code related to idprio (and also have lots of
changes, yet uncommitted), and that convinced me that deadlocks should not be
possible as if the kernel has to sleep to obtain some resource, it will
normally boost its priority above the idle class and will eventually make
progress even on a loaded machine (if it's not hold by realtime processes), so
even if it holds another resource itself, it should eventually release it.  I
might have missed some problems though (or maybe I've already "fixed" some I've
forgotten about in some now-old uncommitted code).  Did you tweak
`kern.sched.static_boost`, or other scheduler tunables?

I would anyway follow Mark's advice: Try without idprio and see if you can
reproduce the deadlock/livelock, to determine whether this peculiar
configuration plays a role here.

-- 
You are receiving this mail because:
You are the assignee for the bug.