Use of C99 extra long double math functions after r236148

Stephen Montgomery-Smith stephen at missouri.edu
Sun Aug 12 23:07:48 UTC 2012


On 07/18/2012 03:56 PM, Steve Kargl wrote:
> On Wed, Jul 18, 2012 at 10:07:41AM -0500, Stephen Montgomery-Smith wrote:
>>
>>> The most obvious immediate difficulty in translating the above into C is
>>> that y*y and z*z may overflow when the result shouldn't.
>>
>> This will be a lot easier than I originally expected.  When we are in
>> conditions when overflow might occur, we can simply make the approximations
>> sqrt(y*y-1) = y
>> csqrt(z*z+1) = signum(x)*z
>> because in floating point arithmetic, these will not be approximations,
>> but true exactly.  And I am thinking that the test I will use for when
>> to use these approximations will be (y==y+1) and (z==z+1) respectively.
>>   (I would use (z*z==z*z+1) but that test has the overflow problem.)
>
> I could be mistaken, but I believe that you need to raise the
> inexact flag with these approximations because in fact you
> are doing floating point math.
>


Thanks for this observation.  I am looking through the C99 standard, 
trying to understand the inexact flag.  But I am struggling to interpret it.

Am I to understand that the inexact flag should be set anytime a 
floating point operation produces an answer that is not guaranteed 
exact?  For example, should 1.0/3.0 and sqrt(2.0) raise the inexact flag?



More information about the freebsd-numerics mailing list