13-STABLE high idprio load gives poor responsiveness and excessive CPU time per task
- Reply: vester.thacker_a_fastmail.fm: "Re: 13-STABLE high idprio load gives poor responsiveness and excessive CPU time per task"
- Reply: Eugene Grosbein : "Re: 13-STABLE high idprio load gives poor responsiveness and excessive CPU time per task"
- Go to: [ bottom of page ] [ top of archives ] [ this month ]
Date: Tue, 27 Feb 2024 00:24:54 UTC
Currently trying to do port builds with
poudriere-devel-3.4.99.20240122 on 13.3-STABLE FreeBSD 13.3-STABLE
stable/13-n257396-134580c103b4 GENERIC amd64 and have had poor system
responsiveness on this and a -STABLE that was likely at least 2 months
before it with `/usr/bin/nice -n 18 /usr/sbin/idprio 31 poudriere bulk
-J2:12 -j local -p local -f /root/installed-port-list -f
/root/prime-origins` and also launching it with no nice change and just
starting with 'idprio 31' (in case it would react different with the
builtin idprio. It used to be that under just idprio 31 I could continue
to use the machine during builds with minor impact on responsiveness
with MAKE_JOBS_NUMBER=4 on a 2 core + hyperthreading i7 which lead to an
oversaturation of 8 build processes across the 4 virtual cores yet
stayed mostly smooth.
Responsiveness seems to have strange effects of laggy mouse/keyboard
input and thought I recall logs saying system couldn't keep up with the
mouse communication, programs intermittently freezing/unfreezing, and
even building becomes unstable with electron28 failing as runaway
process after over an hour when killed as runaway process during extract
phase. During the lag, programs seem to take many times as long to
respond while also getting high cpu use during that time such as tmux
switching between windows; ctrl b+ctrl+p getting almost 100% cpu core
time for about 5 seconds added to it under top. blacklistd has take
about 24 minutes of CPU time for < 3 days uptime per top while managing
a usual slow bot break-in attempt on ssh.
More recently looked and see top showing threads+system processes
shows I have one core getting 100% cpu for kernel{arc_prune} which has
21.2 hours over a 2 hour 23 minute uptime. Previously I know I have seen
higher system % time than I'd expected but not always sure when it is
justified or not. I started looking to see if
https://www.freebsd.org/security/advisories/FreeBSD-EN-23:18.openzfs.asc
was available as a fix for 13 but it is not (and doesn't quite sound
like it was supposed to apply to this issue). Would a kernel thread time
at 100% cpu for only 1 core explain the system becoming unusually
unresponsive? cron was stopped after last boot so shouldn't be throwing
up any unexpected background work.
System has 32GB ddr3 RAM with i7-3820 processor with only 2 enabled
cores + hyperthreading in BIOS. Issue appears specifically when CPU load
is high and idle% in top is 0.0% but it is only sometimes present under
that condition. I usually use ccache and WTIH_META_MODE to try to speed
up compiling base and ports and have zstd at best compression for
packages in hopes of faster extraction at the tradeoff of more disk space.
I haven't tried yet but considered trying OpenZFS from ports. Any
suggestions of what else to look at or watch for?
Thanks,
Edward Sanford Sutton, III