[Bug 263908] Something spawning many "sh" process, system no longer boots, in single user /var/log empty

From: <bugzilla-noreply_at_freebsd.org>
Date: Thu, 12 May 2022 02:08:44 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=263908

--- Comment #1 from Greg <greg@teamworkweb.com> ---
I have made some progress, and I have to report that something is up with zfsd!
Cannot say for sure that is what was spawning all the "sh" processes, but I
suspect that to be the case.

After commenting out the following from /etc/rc.conf I am now able to boot
normally:

#zfsd_enable="YES"
#service zfsd start

This was after figuring out how to zfs set readonly=off and mount -a my zroot
pool in single user mode. And trying all of this first:

- Comment out everything in /etc/crontab
- Remove all the sysctl and other tunable tweaks I had customized

Now it is complaining about my dRAID test pool not being available, and that
appears to still be listed in zpool.cache. But it was having this issue with
not booting, prior to pulling one of the HBAs (LSI 9361 mentioned previously),
so this issue with zfsd existed while that pool was still available.

I will double check, but I am fairly sure there was nothing wrong with that
pool. Regardless, I cannot imagine it is intended behavior for zfsd to prevent
a system from booting, regardless of the state any zpools are in? Beyond
perhaps serious issues with zroot, which doesn't appear to be the case here (it
passes a scrub no issues).

If anyone is interested in getting more debugging, while I still have the test
case and hardware setup for this, please let me know. Willing to put a little
more effort into figuring this out.

Again, same setup under 13.0 was not having this issue. Same benchmarks run
back to back for days on end. Same dRAID design. Same used of zfsd.

Thanks!

-Greg-

-- 
You are receiving this mail because:
You are the assignee for the bug.