Question about adding flags to mmap system call / NVIDIA amd64 driver implementation

Tue Apr 28 23:58:51 UTC 2009

Robert Noland wrote:
> On Tue, 2009-04-28 at 16:48 -0500, Kevin Day wrote:
>> On Apr 28, 2009, at 3:19 PM, Julian Bangert wrote:
>>
>>> Hello,
>>>
>>> I am currently trying to work a bit on the remaining "missing  
>>> feature" that NVIDIA requires ( http://wiki.freebsd.org/NvidiaFeatureRequests 
>>>   or a back post in this ML) -  the improved mmap system call.

you might check with jhb (john Baldwin) as I think (from his
p4 work) that he may be doing something in this area in p4.

>>> For now, I am trying to extend the current system call and  
>>> implementation to add cache control ( the type of memory caching  
>>> used) . This feature inherently is very architecture specific- but  
>>> it can lead to enormous performance improvements for memmapped  
>>> devices ( useful for drivers, etc). I would do this at the user site  
>>> by adding 3 flags to the mmap system call (MEM_CACHE__ATTR1 to  
>>> MEM_CACHE__ATTR3 ) which are a single octal digit corresponding to  
>>> the various caching options ( like Uncacheable,Write Combining,  
>>> etc... ) with the same numbers as the PAT_* macros from i386/include/ 
>>> specialreg.h except that the value 0 ( PAT_UNCACHEABLE ) is replaced  
>>> with value 2 ( undefined), whereas value 0 ( all 3 flags cleared) is  
>>> assigned the meaning "feature not used, use default cache control".
>>> For each cache behaviour there would of course also be a macro  
>>> expanding to the rigth combination of these flags for enhanced  
>>> useability.
>>>
>>> The mmap system call would, if any of these flags are set, decode  
>>> them and get a corresponding PAT_* value, perform the mapping and  
>>> then call into the pmap module to modify the cache attributes for  
>>> every page.
>> Have you looked at mem(4) yet?
>>
>>       Several architectures allow attributes to be associated with  
>> ranges of
>>       physical memory.  These attributes can be manipulated via  
>> ioctl() calls
>>       performed on /dev/mem.  Declarations and data types are to be  
>> found in
>>       <sys/memrange.h>.
>>
>>       The specific attributes, and number of programmable ranges may  
>> vary
>>       between architectures.  The full set of supported attributes is:
>>
>>       MDF_UNCACHEABLE
>>               The region is not cached.
>>
>>       MDF_WRITECOMBINE
>>               Writes to the region may be combined or performed out of  
>> order.
>>
>>       MDF_WRITETHROUGH
>>               Writes to the region are committed synchronously.
>>
>>       MDF_WRITEBACK
>>               Writes to the region are committed asynchronously.
>>
>>       MDF_WRITEPROTECT
>>               The region cannot be written to.
>>
>> This requires knowledge of the physical addresses, but I believe  
>> that's probably already necessary for what it sounds like you're  
>> trying to accomplish.
>>
>> Back in the FreeBSD-3.0 days, I was writing a custom driver for an AGP  
>> graphics controller, and setting the MTRR flags for the exposed buffer  
>> was a definite improvement (200-1200% faster in most cases).
> 
> This is MTRR, which is what we currently do, when we can.  The issue is
> that often times the BIOS maps ranges in a way that prevents us from
> using MTRR.  This is generally ideal for things like agp and
> framebuffers when it works, since they have a specific physical range
> that you want to work with.
> 
> With PCI(E) cards it isn't as cut and dry... In the ATI and Nouveau
> cases, we map scatter gather pages into the GART, which generally are
> allocated using contigmalloc behind the scenes, so it is also possible
> for it to work in that case. Moving forward, we may actually be mapping
> random pages into and out of the GART (GEM / TTM).  In those cases we
> really don't have a large contiguous range that we could set MTRR on.
> Intel CPUs are limited to 8 MTRR registers for the entire system also,
> so that can become an issue quickly if you are trying to manipulate
> several areas of memory.  With PAT we can manipulate the caching
> properties on a page level.  PAT also allows for some overlap conditions
> that MTRR won't, such as mapping a page write-combining on top on an
> UNCACHEABLE MTRR.
> 
> jhb@ has started some work on this, since I've been badgering him about
> this recently as well.
> 
> robert.
> 
>> -- Kevin
>>
>> _______________________________________________
>> freebsd-hackers at freebsd.org mailing list
>> http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
>> To unsubscribe, send any mail to "freebsd-hackers-unsubscribe at freebsd.org"