blockable sleep lock (sleep mutex) 16
John Baldwin
jhb at freebsd.org
Thu Feb 5 08:51:41 PST 2009
On Wednesday 04 February 2009 11:05:02 am Nikola Knežević wrote:
> On 2 Feb 2009, at 19:09 , Julian Elischer wrote:
>
> >>> It says "non-sleepable locks", yet it classifies click_instance
> >>> as sleep mutex. I think witness code should emit messages which
> >>> are more clear.
> >> It is confusing, but you can't do an M_WAITOK malloc while holding
> >> a mutex. Basically, sleeping actually means calling "*sleep()
> >> (such as mtx_sleep()) or cv_*wait*()". Blocking on a mutex is not
> >> sleeping, it's "blocking". Some locks (such as sx(9)) do "sleep"
> >> when you contest them. In the scheduler, sleeping and blocking are
> >> actually quite different (blocking uses turnstiles that handle
> >> priority inversions via priority propagation, sleeping uses sleep
> >> queues which do not do any of that). The underyling idea is that
> >> mutexes should be held for "short" periods of time, and that any
> >> sleeps are potentially unbounded. Holding a mutex while sleeping
> >> could result in a mutex being held for a long time.
> >
> >
> > the locking overview page
> > man 9 locking
> > tries to explain this..
> > I've been pestering John to proofread it and make suggestiosn for a
> > while now.
>
>
> Thanks John and Julian. I agree, man pages should be more clear :)
>
> I've switched from using mtx to sx locks, since they offer sleeping
> while hold.
>
> Unfortunately, I've ran into something really weird now, when I unload
> the module:
> ---8<---
> #0 doadump () at pcpu.h:195
> #1 0xffffffff8049ef98 in boot (howto=260) at /usr/src/sys/kern/
> kern_shutdown.c:418
> #2 0xffffffff8049f429 in panic (fmt=Variable "fmt" is not available.
> ) at /usr/src/sys/kern/kern_shutdown.c:574
> #3 0xffffffff8075cd26 in trap_fatal (frame=0xc, eva=Variable "eva" is
> not available.
> ) at /usr/src/sys/amd64/amd64/trap.c:764
> #4 0xffffffff8075da62 in trap (frame=0xffffffff87699940) at /usr/src/
> sys/amd64/amd64/trap.c:290
> #5 0xffffffff80743bfe in calltrap () at /usr/src/sys/amd64/amd64/
> exception.S:209
> #6 0xffffffff8052a411 in strcmp (s1=0xffffffff80824a0c "sigacts",
> s2=0xffffffff877cd3a9 <Address 0xffffffff877cd3a9 out of bounds>)
> at /usr/src/sys/libkern/strcmp.c:45
> #7 0xffffffff804d7c61 in enroll (description=0xffffffff80824a0c
> "sigacts", lock_class=0xffffffff80a19fe0)
> at /usr/src/sys/kern/subr_witness.c:1439
> #8 0xffffffff804d7fb1 in witness_init (lock=0xffffff00016f4ca8) at /
> usr/src/sys/kern/subr_witness.c:618
> #9 0xffffffff8049fd31 in sigacts_alloc () at /usr/src/sys/kern/
> kern_sig.c:3280
> #10 0xffffffff80481121 in fork1 (td=0xffffff0001384a50, flags=20,
> pages=Variable "pages" is not available.
> ) at /usr/src/sys/kern/kern_fork.c:453
> #11 0xffffffff80481450 in fork (td=0xffffff0001384a50, uap=Variable
> "uap" is not available.
> ) at /usr/src/sys/kern/kern_fork.c:106
> #12 0xffffffff8075d260 in syscall (frame=0xffffffff87699c80) at /usr/
> src/sys/amd64/amd64/trap.c:907
> #13 0xffffffff80743e0b in Xfast_syscall () at /usr/src/sys/amd64/amd64/
> exception.S:330
> #14 0x0000000800ca0a6c in ?? ()
> --->8---
>
> and in fra 7:
> (kgdb) p *w
> $5 = {w_name = 0xffffffff877cd3a9 <Address 0xffffffff877cd3a9 out of
> bounds>, w_class = 0xffffffff80a19fe0, w_list = {
> stqe_next = 0xffffffff80accce0}, w_typelist = {stqe_next =
> 0xffffffff80accce0}, w_children = 0x0,
> w_file = 0xffffffff877d1fa0 <Address 0xffffffff877d1fa0 out of
> bounds>, w_line = 307, w_level = 0, w_refcount = 2,
> w_Giant_squawked = 0 '\0', w_other_squawked = 0 '\0',
> w_same_squawked = 0 '\0', w_displayed = 0 '\0'}
> (kgdb) p *w->w_class
> $6 = {lc_name = 0xffffffff808564e0 "sleep mutex", lc_flags = 9,
> lc_ddb_show = 0xffffffff80492e6b <db_show_mtx>,
> lc_lock = 0xffffffff804938be <lock_mtx>, lc_unlock =
> 0xffffffff804933fc <unlock_mtx>}
>
> This happens after modevent exists.
>
> What puzzles me here is w_refcount of 2, while w_name is out of
> bounds. Locks I've created I properly destroyed (at least I think I
> did :)).
You are probably missing some sx_destroy()'s. You need to destroy each lock
you create with sx_init().
--
John Baldwin
More information about the freebsd-hackers
mailing list