atomic v_usecount and v_holdcnt

Konstantin Belousov kostikbel at gmail.com
Wed Mar 18 10:44:48 UTC 2015


On Tue, Mar 17, 2015 at 02:44:12AM +0100, Mateusz Guzik wrote:
> On Mon, Mar 16, 2015 at 11:46:43AM +0200, Konstantin Belousov wrote:
> > On Sat, Mar 14, 2015 at 11:52:26PM +0100, Mateusz Guzik wrote:
> > > On Mon, Nov 24, 2014 at 11:52:52AM +0200, Konstantin Belousov wrote:
> > > > On Sat, Nov 22, 2014 at 10:11:47PM +0100, Mateusz Guzik wrote:
> > > > > On Sat, Nov 22, 2014 at 11:25:27AM +0200, Konstantin Belousov wrote:
> > > > > > On Sat, Nov 22, 2014 at 01:28:12AM +0100, Mateusz Guzik wrote:
> > > > > > > The idea is that we don't need an interlock as long as we don't
> > > > > > > transition either counter 1->0 or 0->1.
> > > > > > I already said that something along the lines of the patch should work.
> > > > > > In fact, you need vnode lock when hold count changes between 0 and 1,
> > > > > > and probably the same for use count.
> > > > > > 
> > > > > 
> > > > > I don't see why this would be required (not that I'm an VFS expert).
> > > > > vnode recycling seems to be protected with the interlock.
> > > > > 
> > > > > In fact I would argue that if this is really needed, current code is
> > > > > buggy.
> > > > Yes, it is already (somewhat) buggy.
> > > > 
> > > > Most need of the lock is for the case of counts coming from 1 to 0.
> > > > The reason is the handling of the active vnode list, which is used
> > > > for limiting the amount of vnode list walking in syncer.  When hold
> > > > count is decremented to 0, vnode is removed from the active list.
> > > > When use count is decremented to 0, vnode is supposedly inactivated,
> > > > and vinactive() cleans the cached pages belonging to vnode.  In other
> > > > words, VI_OWEINACT for dirty vnode is sort of bug.
> > > > 
> > > 
> > > Modified the patch to no longer have the usecount + interlock dropped +
> > > VI_OWEINACT set window.
> > > 
> > > Extended 0->1 hold count + vnode not locked window remains. I can fix
> > > that if it is really necessary by having _vhold return with interlock
> > > held if it did such transition.
> > 
> > In v_upgrade_usecount(), you call v_incr_devcount() without without interlock
> > held.  What prevents the devfs vnode from being recycled, in particular,
> > from invalidation of v_rdev pointer ?
> > 
> 
> Right, that was buggy. Fixed in the patch below.
Why non-atomicity of updates to several counters is safe ?  This at least
requires an explanation in the comment, I mean holdcnt/usecnt pair.

Assume the thread increased the v_usecount, but did not managed to
acquire dev_mtx. Another thread performs vrele() and progressed to
v_decr_devcount(). It decreases the si_usecount, which might allow yet
another thread to see the si_usecount as too low and start unwanted
action. I think that the tests for VCHR must be done at the very
start of the functions, and devfs vnodes must hold vnode interlock
unconditionally.

> 
> > I think that refcount_acquire_if_greater() KPI is excessive.  You always
> > calls acquire with val == 0, and release with val == 1.
> > 
> 
> Yea i noted in my prevoius e-mail it should be changed (see below).
> 
> I replaced them with refcount_acquire_if_not_zero and
> refcount_release_if_not_last.
I dislike the length of the names.  Can you propose something shorter ?

The type for the local variable old in both functions should be u_int.

> 
> > WRT to _refcount_release_lock, why is lock_object->lc_lock/lc_unlock KPI
> > cannot be used ? This allows to make refcount_release_lock() a function
> > instead of gcc extension macros.  Not to mention that the macro is unused.
> 
> These were supposed to be used by other code, forgot to remove it from
> the patch I sent here.
> 
> We can discuss this in another thread.
> 
> Striclty speaking we could use it here for vnode interlock, but I did
> not want to get around VI_LOCK macro (which right now is just a
> mtx_lock, but this may change).
> 
> Updated patch is below:
Do not introduce ASSERT_VI_LOCK, the name difference between
ASSERT_VI_LOCKED and ASSERT_VI_LOCK is only in the broken grammar.
I do not see anything wrong with explicit if() statements where needed,
in all four places.

In vputx(), wrap the long line (if (refcount_release() || VI_DOINGINACT)).


More information about the freebsd-fs mailing list