SU+J: 185 processes in state "suspfs" for >8 hrs. ... not good, right?

Kirk McKusick mckusick at mckusick.com
Thu May 1 18:28:12 UTC 2014


> Date: Thu, 1 May 2014 11:20:57 -0700
> From: David Wolfskill <david at catwhisker.org>
> To: Kirk McKusick <mckusick at mckusick.com>
> Cc: fs at freebsd.org
> Subject: Re: SU+J: 185 processes in state "suspfs" for >8 hrs. ... not good,
>  right?
> 
> On Thu, May 01, 2014 at 09:51:43AM -0700, Kirk McKusick wrote:
> 
>> Let me know if it helps your problem. If it does, I will MFC it to 9.
>> There have been several other fixes made to SU+J that are more likely
>> to be the cause of your problem, but they are not easily back-ported
>> to stable/9. So if this does not fix your problem my only suggestions
>> are to turn off journaling or move to running on stable/10.
>> ...
> 
> Hrrrmmm...  Looks as if the above reflects stable/10's r251171 (in
> particular, "Convert the bufobj lock to rwlock.") -- stable/9 doesn't
> seem to know about BO_LOCKPTR(), and gcc makes some assumptions.  That
> doesn't turn out well.
> 
> I think that migrating to stable/10 might make more sense than figuring
> out how to fix this, especially if there are other causes of the
> observed failure that are fixed in stable/10.
> 
> Thanks....
> 
> Peace,
> david
> --
> David H. Wolfskill				david at catwhisker.org
> Taliban: Evil cowards with guns afraid of truth from a 14-year old girl.
> 
> See http://www.catwhisker.org/~david/publickey.gpg for my public key.

I think that you have now discovered why Jeff did not MFC to stable/9.
You are correct that putting in this fix requires seriously more work.
Sorry about sending you down that path.

	Kirk McKusick


More information about the freebsd-fs mailing list