Re: Hang in umount in poudriere run

From: Craig Leres <leres_at_freebsd.org>
Date: Tue, 19 Aug 2025 17:53:14 UTC

On 7/20/25 01:51, Kurt Jaeger wrote:
> Hello,
> 
> I have hanging umount processes in the last two poudriere runs
> on a 14.3p1 server.
> 
> See https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=288345
> 
> The umount processes can not be killed:
> 
> 20104  2  D+       0:00.00 umount -f /pou/data/.m/143-default/21
> 20156  6- T+       0:00.00 umount -f /pou/data/.m/143-default/21/.p
> 
> Any ideas what I can do to debug this ?

I've been seeing random processes getting stuck in D state since 
14.3-RELEASE (I'm currently on p2) on my poudriere build server. This 
happens about twice a month and only while my daily ~1000 package build 
is in progress. I'll either get Nagios alerts from things that have 
wedged or notice that the build hasn't completed and has been running 
for more than 2X the normal build time (7 hours). Today ntpd (and 
others) got stuck. When this happens the only remedy is to reboot 
(including an IPMI reset unless I'm willing to wait a LONG time) and 
then clean up the ZFS /.m/ filesystems...

I'm also curious what info I might collect before rebooting.
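
As a starting point, here is a rough sketch of what I might run before 
the next reboot. It is not a tested procedure. It assumes "ps -ax" output 
in the PID TT STAT TIME COMMAND layout shown in the quoted lines above, 
picks out the processes whose state starts with D, and then asks 
procstat -kk for their kernel stacks. The function names are made up for 
illustration, and procstat itself may hang if the system is wedged badly 
enough.

```python
import subprocess


def stuck_pids(ps_lines):
    """Return (pid, command) for every ps line whose STAT starts with 'D'.

    Assumes the column layout PID TT STAT TIME COMMAND, as in the
    quoted ps output; header and malformed lines are skipped.
    """
    stuck = []
    for line in ps_lines:
        fields = line.split(None, 4)
        if len(fields) < 5 or not fields[0].isdigit():
            continue  # header line or something unexpected
        pid, _tty, state, _time, command = fields
        if state.startswith("D"):
            stuck.append((int(pid), command))
    return stuck


def dump_kernel_stacks(pids, timeout=10):
    """Run 'procstat -kk <pid>' for each pid and return the output per pid.

    The timeout is an arbitrary guess so the collection script does not
    hang along with everything else.
    """
    stacks = {}
    for pid in pids:
        try:
            result = subprocess.run(
                ["procstat", "-kk", str(pid)],
                capture_output=True, text=True, timeout=timeout,
            )
            stacks[pid] = result.stdout
        except subprocess.TimeoutExpired:
            stacks[pid] = "procstat timed out"
    return stacks
```

To use it, I would feed the lines of "ps -ax" to stuck_pids(), pass the 
resulting pids to dump_kernel_stacks(), and save the output somewhere 
off the affected pool before resetting the box.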

		Craig