another? NFS deadlock on 9.2-PRERELEASE

Daniel Braniss danny at cs.huji.ac.il
Wed Aug 28 06:20:30 UTC 2013


> Daniel Braniss wrote:
> > > Daniel Braniss wrote:
> > > > > Daniel Braniss wrote:
> > > > > > I upgraded our web server, and only after 3 hours it hung :-(
> > > > > > (as a side note, I have 2 other web servers, also running 9.2
> > > > > > doing
> > > > > > great :-)
> > > > > > go figure.
> > > > > > 
> > > > > > anyways, in
> > > > > > 	ftp://ftp.cs.huji.ac.il/users/danny/freebsd/core.txt/0
> > > > > > 
> > > > > > is the info after a forced panic.
> > > > > > 
> > > > > Looks like the same hang to me. Several threads are sleeping on
> > > > > "pgrbwt"
> > > > > and lots are waiting for an NFS vnode lock.
> > > > > 
> > > > > It should be fixed in RC3 (or revert r250907). If it still
> > > > > hangs
> > > > > with
> > > > > RC3 (or r250907 reverted), email again.
> > > > > 
> > > > im following stable, hence it's till calling itself
> > > > 9.2-PRERELEASE,
> > > > but
> > > > I did a sync this morning - local time, after rc3 was anounced.
> > > > but after 3.45 minutes is hung, data in
> > > > 	ftp://ftp.cs.huji.ac.il/users/danny/freebsd/core.txt/1
> > > > 
> > > > I can't easely revert r250907, since i'm using mercuriall, but if
> > > > someone
> > > > can send me the pre r250907 files, i'll try.
> > > > 
> The pre-r250907 version of uipc_syscalls is at:
>  http://people.freebsd.org/~rmacklem/uipc_syscalls.c
> in case you want to try it.
> 
thanks, I think I have a kernel pre r250907.
In the meantime I did 2 things:
	1- made the /(root) local - as opposed to nfs'ed
	2- have a watchdog to reboot in case of hang

the host has been up for more than 14hs ( i doubt it's because of 2 -)

lets see how things develope

thanks,
	danny

> rick
> 
> > > r254947, which was committed to stable/9 a few hours ago is
> > > believed to
> > > fix the problem. Please update your stable/9 to post-r254947 and
> > > try it.
> > > 
> > the current kernel has that fix (sys/kern/uipc_syscalls.c)
> > and if you check the core.txt/1 you will see no pgrbwt, only newnsf
> > ...
> > 
> > danny
> > 
> > > rick
> > > 
> > > > thanks,
> > > > 	danny
> > > > 
> > > > > rick
> > > > > 
> > > > > > my guts say its running out of resources - mainly network
> > > > > > related,
> > > > > > but
> > > > > > can't pinpoint it.
> > > > > > 
> > > > > > any help will be most welcomed
> > > > > > 
> > > > > > cheers,
> > > > > > 	danny
> > > > > > 
> > > > > > 
> > > > > > 
> > > > > > 
> > > > > > 
> > > > 
> > > > 
> > > > _______________________________________________
> > > > freebsd-stable at freebsd.org mailing list
> > > > http://lists.freebsd.org/mailman/listinfo/freebsd-stable
> > > > To unsubscribe, send any mail to
> > > > "freebsd-stable-unsubscribe at freebsd.org"
> > > > 
> > 
> > 
> > 




More information about the freebsd-stable mailing list