RFR(S): 8069124 - runtime/NMT/MallocSiteHashOverflow.java failing in nightlies
Coleen Phillimore
coleen.phillimore at oracle.com
Thu Feb 26 04:50:14 UTC 2015
On 2/25/15, 11:04 PM, David Holmes wrote:
> On 26/02/2015 1:55 PM, Coleen Phillimore wrote:
>>
>>
>> In this case, I think 'long' is chosen because we want to turn address
>> into an integral type to create the hashcode and that'll cover a 64 bit
>> address.
>
> Sounds like a job for intptr_t.
Or maybe uintptr_t if they're arithmetic.
>
>> Even if we change the signedness of the value, it's still a
>> useful hash code. I don't see a bug here, only unfortunate casts.
>
> I'm surprised this change doesn't induce further warnings about mixing
> signed and unsigned. But the code screams "type confusion" to me.
>
>> This change looks good to me. It's a really good find, Christian! We've
>> been chasing this bug for months.
>
> The real bug was reading an uninitialized value no? In which case the
> change from signed to unsigned seems incidental, unless also guarding
> against overflow.
There were two bugs. The signed hash value was 0x8000000 and when
negated was 0x8000000, so created a negative index into the hash table.
Coleen
>
> David
> -----
>
>>
>> Thanks,
>> Coleen
>>
>>
>> On 2/24/15, 11:09 PM, David Holmes wrote:
>>> Hi Christian,
>>>
>>> On 25/02/2015 12:57 PM, Christian Tornqvist wrote:
>>>> Hi everyone,
>>>>
>>>>
>>>>
>>>> Please review this small fix for an issue with NMT.
>>>
>>> The use of long in this function looks wrong, particularly as
>>> sizeof(long) might equal sizeof(int) and long is signed but now
>>> _hash_value is unsigned:
>>>
>>> 72 // Hash code. Any better algorithm?
>>> 73 unsigned int NativeCallStack::hash() const {
>>> 74 long hash_val = _hash_value;
>>> 75 if (hash_val == 0) {
>>> 76 long pc;
>>> 77 int index;
>>> 78 for (index = 0; index < NMT_TrackingStackDepth; index ++) {
>>> 79 pc = (long)_stack[index];
>>> 80 if (pc == 0) break;
>>> 81 hash_val += pc;
>>> 82 }
>>> 83
>>> 84 NativeCallStack* p = const_cast<NativeCallStack*>(this);
>>> 85 p->_hash_value = (unsigned int)(hash_val & 0xFFFFFFFF);
>>> 86 }
>>> 87 return _hash_value;
>>> 88 }
>>>
>>> Even with the original code the use of long seems wrong.
>>>
>>> David
>>>
>>>
>>>>
>>>>
>>>> The failure was caused by reading random memory from the uninitialized
>>>> _hash_value, when this value happened to be 0x80000000,
>>>> hash_to_index()
>>>> failed to negate the value and ended up with an index of -16:
>>>>
>>>>
>>>>
>>>> # Internal Error
>>>> (C:\\jprt\\T\\P1\\130630.ctornqvi\\s\\hotspot\\src\\share\\vm\\services\\mal
>>>>
>>>>
>>>> locSiteTable.cpp:139), pid=5680, tid=4928
>>>>
>>>> # assert(index >= 0) failed: Negative index -16
>>>>
>>>>
>>>>
>>>> Reproduced the issue and verified the fix using a debugger. Ran
>>>> vm.quick and
>>>> hotspot/test/:hotspot_jprt tests on Linux i586/x64 and Windows
>>>> i586/x64 with
>>>> -XX:NativeMemoryTracking=detail
>>>>
>>>>
>>>>
>>>> Webrev:
>>>>
>>>> http://cr.openjdk.java.net/~ctornqvi/webrev/8069124/webrev.00/
>>>>
>>>>
>>>>
>>>> Bug:
>>>>
>>>> https://bugs.openjdk.java.net/browse/JDK-8069124
>>>>
>>>>
>>>>
>>>> Thanks,
>>>>
>>>> Christian
>>>>
>>>>
>>>>
>>
More information about the hotspot-runtime-dev
mailing list