ZFS repeatable reboot 8.0-RC1
grarpamp
grarpamp at gmail.com
Wed Oct 14 22:02:34 UTC 2009
Hi.
I'm running i386 on i386, single P4 cpu, 1GiB RAM.
SiI 3114 -> SATA [single disk] -> GELI [AES-128] -> ZFS [sha256]
Straight RELENG_8 as of cvsup Oct 12 14:49:00 aka 8.0-RC1 plus.
ZFS pool is at v13, ZFS fs is at v3.
Hardware seems stable.
The only modification to config defaults is:
loader.conf.local: vfs.zfs.arc_max=100663296
After boot -v, geli, zpool import, xf86, browser, etc my mem looks like:
Mem: 33M Active, 22M Inact, 105M Wired, 676K Cache, 37M Buf, 827M Free
When putting load on ZFS it usually grows to about:
Mem: 95M Active, 22M Inact, 302M Wired, 468K Cache, 37M Buf, 569M Free
Ls -l in one of the dirs takes 10min plus and I get:
PID USERNAME PRI NICE SIZE RES STATE TIME WCPU COMMAND
11 root 171 ki31 0K 8K RUN 21:24 47.27% idle
1092 user 76 0 77328K 76116K zio->i 3:25 37.89% ls
802 root -8 - 0K 8K geli:w 1:42 8.98% g_eli[0] ad6
9 root -8 - 0K 128K arc_re 0:23 4.88% {arc_reclaim_thre}
I did not watch these during rm.
I have 1 parent dir holding 4 subdirs. The file count in each subdir
is respectively: 256363, 254086, 256017, 178054
Two thirds of files are about 14KiB in size, not many are more than
a few MiB nor less than 1KiB though a third are 1 byte.
I issue rm -r <parent_dir> and after maybe 30 seconds the machine
reboots. No syslog, panic or console messages. Dmesg from the prior
boot is still present in ram to prove kernel didn't emit any message.
memtest86 passes. There are maybe 10 seconds of complete GUI hangup
before the reboot occurs.
I also see it when make release'ing. Usually during what I _think_
is distributeworld or rolling up the tarballs under /R.
This is a big repeatable problem. How can I debug or fix it? Can
someone else create some mega sized dirs as above and replicate?
Thanks.
More information about the freebsd-stable
mailing list