UFS2 and/or sparse file bug causing copy process to land in 'D''
state?
Robert Watson
rwatson at FreeBSD.org
Mon Feb 23 02:26:16 PST 2009
On Sun, 22 Feb 2009, Carl wrote:
> I've come across what I'm thinking may be a bug in the context of FreeBSD
> 7.0 with a pair of gmirrored drives and gjournaled partitions when copying a
> large number of files into a file-backed memory device.
>
> The consequence of this problem is that a process enters the 'D' state
> (process in disk) indefinitely, cannot be killed, and the system cannot be
> shutdown. The only solution is to cold reboot the system, which is a really
> big problem for remote systems. This is happening to me intermittently with
> the standard tar-tar pipeline form of copying, but has happened with the
> rsync 3.0.4 port as well.
It would be interesting to get kernel stack traces of the involved
processes/threads; there are various ways to do this, such as using DDB. If
you have a kernel.symbols for the kernel, then you can run kgdb on
kernel.symbols and /dev/mem to generate traces without interrupting operation
(although if the system is in the throes of deadlocking, that may not be a
concern or even possible). You can also use procstat -kk to retrieve kernel
stack traces, with a bit less information (such as no arguments) to help
narrow things down more.
Unfortunately, debugging this type of problem, as you've intuited, is best
done with serial console access and a local box so that the debugging
information can be extracted. It would be interesting to know if you can
force a crashdump on the box to get the information for post-mortem debugging.
This may be possible using "reboot -d" -- I've never used this, but have every
reason to think it will work.
Robert N M Watson
Computer Laboratory
University of Cambridge
>
> I would appreciate it if some of you would see if you can repeat this
> problem. Here is a sequence of tcsh shell commands which manifest the problem
> (on occasion but not every time), which I will refer to as the "truncate
> sequence" (depends on fully populated /usr/src tree as data set):
>
> # truncate -s 671088640 target
> # mdconfig -f target -S 512 -y 255 -x 63 -u 7
> # bsdlabel -w /dev/md7 auto
> # newfs -O2 -m 0 -o space /dev/md7a
> # mount /dev/md7a /media
> # tar -cvf - -C /usr/src . | tar -xvpof - -C /media
> # umount /media ; mdconfig -d -u 7 ; rm target
>
> An alternate version has yet to fail for me and involves replacing the first
> line with this one:
>
> # dd if=/dev/zero of=target bs=1M count=640
>
> I'll call that the "dd sequence". Here is an ordered series of tests I just
> completed:
>
> a) Repeated truncate sequence 7 times - 1st, 5th, and 7th failed.
> b) Repeated dd sequence 7 times - no failures.
> c) Repeated truncate sequence 6 time - no failures.
> d) Used following sequence to ensure all disk caches flushed:
>
> # dd if=/dev/random of=target bs=1M count=4096
> # dd if=target of=/dev/null bs=1M
> # rm target
>
> e) Repeated truncate sequence 4 times - no failures.
> f) Performed orderly reboot.
> g) Repeated truncate sequence 2 times - 2nd failed.
> h) Performed orderly reboot.
> i) Repeated dd sequence 7 times - no failures.
>
> All failures involve the second tar in the pipeline hanging in the 'D' state.
> In each case I do a cold reboot before proceeding with the next test.
>
> It's tempting to speculate that a bug exists in code related to handling
> sparse files specifically, but perhaps it just raises the probability of
> tripping a bug that would eventually manifest in the dd sequence as well.
> OTOH, I don't know how to rule out a physical disk or disk firmware problem.
>
> This problem has occurred with different data sets and different sized memory
> disks, but only with the source and destination filesystems being UFS2. I
> have done similar sequences with EXT2 and FAT16 destinations with no failures
> thus far, but the memory disks and data sets were smaller so it's conceivable
> that probability worked against me.
>
> I should note that the drives are Seagate ST31000340AS Barracudas, but both
> drives have been upgraded to firmware version SD1A and are therefore
> supposedly free of the infamous little horror Seagate inflicted on so many of
> us. smartctl tells me that both disks still have a raw value of 0 for
> Reallocated_Sector_Ct and both pass the "short" self test.
>
> Carl / K0802647
>
> _______________________________________________
> freebsd-fs at freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-fs
> To unsubscribe, send any mail to "freebsd-fs-unsubscribe at freebsd.org"
>
More information about the freebsd-fs
mailing list