rusage breakdown and cpu limits.

Tue May 29 21:10:34 UTC 2007

On Tue, 29 May 2007, Jeff Roberson wrote:

> The problem with using a pool or per-process spinlock is that it keeps the 
> contention in the process domain, rather than thread domain.  For 
> multithreaded processes this will give the same contention as a global 
> scheduler lock, only slightly reduced in scope.  I'd like to solve this in 
> such a way that we don't have to revisit it again.
>
> I think I'm going to make the rusage struct per-thread and aggregate it on 
> demand.  There will be a lot of code churn, but it will be simple. There are

Ugh.

> a few cases where which will be complicated, and cpulimit is one of them.

No, cpulimit is simple because it can be fuzzy, unlike calcru() which require
the rusage to be up to date.

I see how rusage accumulation can help for everything _except_ the
runtime and tick counts (i.e., for stuff updated by statclock()).  For
the runtime and tick counts, the possible savings seem to be small and
negative.  calcru() would have to run the accumulation code and the
accumulation code would have to acquire something like sched_lock to
transfer the per-thread data (since the lock for updating that data
is something like sched_lock).  This is has the same locking overheads
and larger non-locking overheads than accumulating the runtime directly
into the rusage at context switch time -- calcru() needs to acquire
something like sched_lock either way.

Bruce