[Bug 284743] System reproducably livelocks after a couple of hours in poudriere bulk -a

From: <bugzilla-noreply_at_freebsd.org>
Date: Thu, 08 May 2025 18:49:08 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=284743

--- Comment #24 from Robert Clausecker <fuz@FreeBSD.org> ---
(In reply to Mark Johnston from comment #22)

> Have you been able to reproduce the hang without idprio?

I'll try that next.  Once again, there are no kernel dumps as the system is
completely stuck in a live lock, not even reacting to CR ~ ^B in the serial
console.  I can only reboot it by turning off power and then turning it back
on.

> Did you tweak `kern.sched.static_boost`, or other scheduler tunables?

/etc/sysctl.conf only has this entry:

security.bsd.unprivileged_idprio=1

/boot/loader.conf has:

kern.geom.label.disk_ident.enable="0"
kern.geom.label.gptid.enable="0"
cryptodev_load="YES"
zfs_load="YES"
radeonkms_load="YES"
kern.vty="vt"
hint.uart.1.disabled="1"

which should not affect anything.

It unfortunately takes very long for me to have another shot after a hang as
the machine needs to be physically power cycled and the admin of the datacenter
it is colocated in is a bit fed up with doing so, so he'll hold it off until
the next time he has to go down into the datacenter (about once a week).

-- 
You are receiving this mail because:
You are the assignee for the bug.