kernel usage of fxsave/fxrstor

b. f. bf1783 at googlemail.com
Thu May 20 21:45:19 UTC 2010


On 5/20/10, Kostik Belousov <kostikbel at gmail.com> wrote:
> On Thu, May 20, 2010 at 01:41:22PM -0400, b. f. wrote:
>> I'm wondering why we equate cpu_fxsr and hw_instruction_sse in our
>> kernel, when several families of Intel and AMD processors have
>> fxsave/fxrstor, but not sse, and various documents from both companies
>> suggest that fxsave/fxrstor is faster than fsave/fnsave/frstor, even
>> when only saving the fpu/mmx state, and ought to be used for context
>> switches and calls and returns from interrupt and exception handlers
>> (e.g.. Sections 8.1.11, 10.5, and 11.6.5 of the Intel 64 and IA-32
>> Software Developers' Manual, Volume 1:
>>
>> http://www.intel.com/Assets/PDF/manual/253665.pdf
>>
>> ).
> What are the several families ? I am aware only of Pentium II
> that did have FXSAVE implemented, but not SSE. I am not even sure
> that all Pentium IIs have it, or only the later models.

I think only later models of Pentium II, from Deschutes onward.  As
for the number of families, I guess it depends upon how you classify
them.  I was thinking of the several Athlon variants in addition to
the Pentium II models.

>
> It is funny that I disposed my 2CPU Pentium II machine several weeks
> ago. I do not consider it worth an effort trying to optimize for
> some Pentiums II in 2010.

I have two Pentium II laptops that see occasional use, and an Athlon
desktop, that fall into this category. I'm sure that I'm not alone.
If a few simple changes, using code similar to that we already are
using for later models would yield some performance gains for them,
I"d be interested in testing patches.

Regards,
               b.


>>
>>
>> As far as I can tell from a cursory check, Linux draws a distinction
>> between cpu_has_fxsr, and cpu_has_xmm/xmm2, and uses fxsave/fxrstor on
>> all processors that have the feature, regardless of whether they have
>> sse.  Shouldn't we do the same?  Was this overlooked in the initial
>> sse commits? Or are the Intel assertions that the newer instructions
>> are faster incorrect?  Or was the extra handling needed for the
>> different semantics of the newer instructions, and/or concerns over
>> FreeBSD-SA-06:14.fpu.asc/CVE-2006-1056 responsible for their
>> suppression in pre-sse processors, even though safe methods of using
>> them was suggested:
>>
>> http://security.freebsd.org/advisories/FreeBSD-SA-06:14-amd.txt ?
>>
>> (Note that I'm not asking about setting the CR4.OSFXSR bit when sse
>> isn't needed or present, just using the newer fxsave/fxrstor when they
>> are present.)
>>
>> Regards,
>>                  b.
>> _______________________________________________
>> freebsd-hackers at freebsd.org mailing list
>> http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
>> To unsubscribe, send any mail to "freebsd-hackers-unsubscribe at freebsd.org"
>


More information about the freebsd-hackers mailing list