cvs commit: src/sys/vm vm_fault.c

Alan Cox alc at cs.rice.edu
Sat Aug 21 23:54:00 PDT 2004


On Sun, Aug 22, 2004 at 01:35:35AM -0400, Brian Fundakowski Feldman wrote:
> On Sun, Aug 22, 2004 at 12:27:31AM -0500, Alan Cox wrote:
> > On Sun, Aug 22, 2004 at 01:01:55AM -0400, Brian Fundakowski Feldman wrote:
> > > On Sat, Aug 21, 2004 at 11:49:36PM -0500, Alan Cox wrote:
> > > > On Sun, Aug 22, 2004 at 12:18:49AM -0400, Robert Watson wrote:
> > > > > 
> > > > > On Sun, 22 Aug 2004, Brian Fundakowski Feldman wrote:
> > > > > 
> > > > > > Also, it was the system_map lock, so it was a mutex, not an sx. 
> > > > > > 
> > > > > > /Goes back to trying to figure out wtf portupgrade -rR kde\* keeps
> > > > > > causing
> > > > > >  hangs, but only in X, not at the command line, and not seemingly
> > > > > >  dependant on AGP/X driver. 
> > > > > 
> > > > > I believe mutexes are alright also as long as you're careful about
> > > > > sleeping and lock order.  I.e., don't sleep while holding a
> > > > > non-funnel-like mutex, etc.
> > > > > 
> > > > 
> > > > We should never page fault on a system map.  If that happens to you,
> > > > it indicates an error.  (In-kernel maps on which we do page fault,
> > > > such as the pipe submap, are treated the same as user maps and utilize
> > > > an sx lock.)
> > > 
> > > vm_fault
> > > vm_fault_wire
> > > vm_map_wire
> > > kmem_alloc
> > > vm_ksubmap_init
> > > cpu_startup
> > > mi_startup
> > > 
> > 
> > This is a call to vm_fault(), but not a page fault.  This works
> > without error because kmem_alloc() has preallocated and wired all of
> > the required pages before calling vm_map_wire() (which calls
> > vm_fault()).  Specifically, the preallocation and wiring guarantees
> > that vm_fault() will not follow any of the code paths on which it
> > could sleep with the system map mutex held.  (In fact, I believe that
> > we are guaranteed that it will not sleep under any circumstances.)
> 
> I know, but that has nothing to do with the fact that it's locking the
> system map mutex, then Giant, which causes a lock order reversal.  Do
> you run WITNESS?  If you do, you should see this in your startup.
> 

Yes, I do.  And, there is no complaint from WITNESS.  Giant is already
held by the initialization code.  So, the acquisition of Giant inside
of vm_fault() is recursive and thus not a lock-order violation.  The
bug was, however, causing Giant to be acquired once and released twice
by vm_fault().  So, the first call to vm_fault() by vm_map_wire()
would release the initialization code's hold on Giant.  Once Giant
became unlocked, the next call to vm_fault() by vm_map_wire() would
tigger a lock-order reversal report.

Alan


More information about the cvs-src mailing list