RFR: 8145096: Undefined behaviour in HotSpot
Andrew Haley
aph at redhat.com
Thu Dec 17 10:31:21 UTC 2015
On 17/12/15 01:07, Kim Barrett wrote:
> On Dec 15, 2015, at 10:14 AM, Andrew Haley <aph at redhat.com> wrote:
>>
>> On 12/10/2015 09:44 PM, Kim Barrett wrote:
>>> src/os/posix/vm/os_posix.cpp
>>> 756 unsigned int i;
>>>
>>> I'm not sure what the purpose of changing this to unsigned might be.
>>> The sa_flags member of struct sigaction is of type int.
>>
>> There are assignments into this field that overflow: SA_RESETHAND
>> (0x80000000) is implicitly unsigned, so assigning it to an int field
>> is at best implementation-define. I suppose that this is really a bug
>> in the Linux headers. I'll take it out.
>
> Ick, that's really annoying. It does seem like a bug in the Linux
> headers, with a poorly chosen value for SA_RESETHAND. Or in POSIX,
> for using a signed type for a bitmask. I suppose asking you to fix
> this root cause would be a bit much. ;-)
We have to work with old kernels, so it wouldn't help for a while.
> Now that I understand it, and given its very localized scope, I would
> be fine with the original change to unsigned. That would prevent
> future verification/analysis from tripping over the same problem. My
> only concern is that someone reading this who is familiar with POSIX
> might wonder why an unsigned type is being used; maybe a brief comment?
> Up to you.
OK.
>>> 136 return (double)(method->rate() + 1) * ((double)(method->invocation_count() + 1) * (method->backedge_count() + 1));
>>>
>>> I suspect the problem encountered here is that the multiplication of
>>> the two count values can overflow. If that multiplication was not
>>> grouped, but instead the rate term and invocation count were
>>> multiplied, then multiplied by the backedge count (e.g. use the
>>> natural grouping for operator*), the result should be mathematically
>>> the same, and I think should not run into overflow problems. Recall
>>> that method->rate() returns float.
>>
>> I guess this would be okay, but it seems prudent to use a double. Is
>> there some special performance optimization we have to care about?
>> The return type of AdvancedThresholdPolicy::weight() is double, so
>> I really can't think of any good reason not to do the arithmetic as
>> double.
>>>
>>> So I think a simpler change here might be
>>>
>>> return (method->rate() + 1) * (method->invocation_count() + 1) * (method->backedge_count() + 1);
>>>
>>> i.e. remove the parens that group the count multiplication before
>>> multiplying by the rate.
>
> The only use of weight() is about 10 lines later in the same file, in
> compare_methods. It seems unlikely that the extra bits of precision
> from computing in double vs float is going to matter in any
> interesting way there. Certainly it can't be worse than the
> "substantially noise" version we've been using!
I'm trying not to break anything. It's not precision I'm worried
about, it's overflow. I assumed that the double return type was there
for a reason, given that the existing code very carefully casts to a
double. If you tell me that there is no reason for that cast I'll
happily do the whole function as float.
> I've now looked at the opto changes too. They mostly look ok to me.
>
> ------------------------------------------------------------------------------
> src/share/vm/opto/mulnode.cpp
> 805 const jlong bits_mask = java_subtract((jlong)CONST64(1) << (jlong)(BitsPerJavaLong - con), CONST64(1));
>
> Overflow in a left shift of a signed value is UB.
Argh. I am aware of that, but IMVHO it's a rather pedantic rule: it
disallows simple and obvious expressions such as -16 << 2. (What for?
Sign-and-absolute-magnitude machines?) However, them's the rules, I
guess.
I'm sure that it makes no sense ever to construct bitmasks using a
signed type. I am doing my best to be certain that I don't break
anything. I'll have a look to see if there is some way that I can
change this to unsigned without changing behaviour.
Andrew.
More information about the hotspot-dev
mailing list