cpuset and affinity implementation
Jeff Roberson
jroberson at chesapeake.net
Tue Feb 26 01:10:26 UTC 2008
On Mon, 25 Feb 2008, Brooks Davis wrote:
> On Sun, Feb 24, 2008 at 05:38:37PM -1000, Jeff Roberson wrote:
>> Hello,
>>
>> I have implemented a new api similar to processors sets on solaris. This
>> allows you to assign processes to sets of cpus and dynamically change those
>> sets. This is useful for provisioning purposes to add and remove cpu
>> resources for a particular process or group of processes. This new
>> facility also supports binding secific threads to specific cpus which some
>> applications may want to do. At some point in the future this will be
>> integrated with jail so you can restrict the cpus any jail is allowed to
>> use.
>>
>> This api should not be considered final and the 'cpuset' tool is quite
>> rough. This also only works with ULE and is unfortunately intertwined with
>> a big ULE patch I've been working on. The set management code is generic
>> but 4BSD doesn't contain the hooks to actually constrain threads.
>
> I took a look at the patch this morning. The API looks like it's
> capable of doing what I need, at least at a first pass. It looks like I
> should be able to implement the semantics currently employed by the Sun
> Grid Engine scheduler on Irix systems.
>
> The one thing I noticed that I found worrying was the recursion in
> cpuset_(test)update(). It wasn't immediately clear to me there there
> is anything to would prevent an arbitrarily deep hierarchy from being
> created and blowing the kernel stack. I'm I missing something?
Yes, presently it can never be more than 3 levels deep. Once we have
jails the max would be 6 levels, unless you can make jails within jails.
There is presently now way for the user to create a cpuset that is a
subset of another set. So the three cpu sets are:
1) Root set - immutable, all cpus, may be root of jail in which case root
outside of the jail can change the set.
2) cpuset - the set this process is a member of.
3) mask - the anonymous set that is applied to an individual thread.
Did you look at the userland tool at all? I think this needs the most
improvement. I basically just made something that would allow me to pass
every possible parameter to the api. Not exactly engineered for
usability.
>
> -- Brooks
>
More information about the freebsd-current
mailing list