Many processes stuck in zfs
Pawel Jakub Dawidek
pjd at FreeBSD.org
Tue Mar 9 12:58:23 UTC 2010
On Tue, Mar 09, 2010 at 01:57:07PM +0100, Borja Marcos wrote:
>
> On Mar 9, 2010, at 1:29 PM, Pawel Jakub Dawidek wrote:
>
> > On Tue, Mar 09, 2010 at 10:15:53AM +0100, Stefan Bethke wrote:
> >> Over the past couple of months, I've more or less regularly observed machines having more and more processes stuck in the zfs wchan. The processes never recover from that, and trying to reboot only gets the entire system stuck, without any console messages. I can enter the debugger, and I have saved a couple of dumps.
> >>
> >> The situation seems to be triggered by zfs receive'ing snapshots from the sister machine (both synchronize their active ZFS filesystems to each other, using zfs send and zfs receive). It appears it's the receiving causing trouble.
> >>
> >> Both machines run 8-stable from mid-February, with a single-disk ZFS pool, with ARC limited to 512M, prefetch and ZIL disabled via loader.conf.
> >>
> >> What should I be looking at to further diagnose?
> >
> > What kind of hardware do you have there? There is 3-way deadlock I've a
> > fix for which would be hard to trigger on single or dual core machines.
> >
> > Feel free to try the fix:
> >
> > http://people.freebsd.org/~pjd/patches/zfs_3way_deadlock.patch
>
> Maybe related to the deadlock I reported when I was receiving an incremental snapshot while the target dataset was being read?
Could be. This deadlock is in general related to zfs recv functionality.
--
Pawel Jakub Dawidek http://www.wheelsystems.com
pjd at FreeBSD.org http://www.FreeBSD.org
FreeBSD committer Am I Evil? Yes, I Am!
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 196 bytes
Desc: not available
Url : http://lists.freebsd.org/pipermail/freebsd-stable/attachments/20100309/f2876fc1/attachment.pgp
More information about the freebsd-stable
mailing list