[16] RFR: 8247319: Remove on-stack nmethod hotness counter sampling from safepoints

Erik Österlund erik.osterlund at oracle.com
Fri Jun 12 07:37:31 UTC 2020


Hi David,

Thanks! :)

/Erik

On 2020-06-12 09:36, David Holmes wrote:
> Hi Erik,
>
> I can't comment on your rationale as I know nothing about how the 
> hotness counters operate, but I can verify that what you propose 
> indeed removes the processing from the safepoint cleanup logic. :) So 
> in that limited sense LGTM.
>
> Cheers,
> David
> -----
>
> On 12/06/2020 5:29 pm, Erik Österlund wrote:
>> Hi,
>>
>> The sweeper is moving away from using safepoints for its heuristics. 
>> It used to count safepoints to figure out when to sweep, but no 
>> longer does that. At the same time, we have for a while been removing 
>> more and more safepoints. Safepoints are becoming increasingly rare 
>> events, dominated by when we need to GC (GuaranteedSafepointInterval 
>> is going to disappear). The frequency of how often we need to GC does 
>> not have an obvious connection to how often we need to sweep the code 
>> cache... any more.
>>
>> What still remains from the safepoint-based heuristics is the nmethod 
>> hotness counter sampling that is performed in safepoint cleanup. I 
>> would like to get rid of this.
>> The rationale is that the use of hotness counters is kicking in when 
>> the code cache is starting to fill up quite a bit, and there is a 
>> need to kill off nmethods heuristically, rather than because they are 
>> invalid. But when the code cache fills up, we sweep more and more 
>> aggressively. And during these sweeper cycles, we perform nmethod 
>> marking using handshakes. That operation also fills in hotness 
>> counters for all sampled nmethods.
>>
>> In other words, when there is need for acting on the hotness 
>> counters, we are in a state where we may be getting more nmethod 
>> hotness counter sample information from the sweeping cycles than we 
>> are from safepoint sampling. Conversely, when code cache pressure is 
>> high and we need more samples, we might end up getting very few from 
>> safepoint based sampling (because the heap is large). The correlation 
>> between safepoint frequency and code cache pressure is simply not 
>> there any more. And for us to walk all stacks in the system in every 
>> single safepoint (which for ZGC is starting to dominate pauses when I 
>> remove our stack sampling from safepoints), there better be a really 
>> good reason to do this sampling in safepoints. And there simply 
>> isn't. So I propose we delete it, in favour of using the hotness 
>> counter samples we get from the sweeping cycles instead, that are 
>> indeed proportional in frequency, to the code cache pressure.
>>
>> Bug:
>> https://bugs.openjdk.java.net/browse/JDK-8247319
>>
>> Webrev:
>> http://cr.openjdk.java.net/~eosterlund/8247319/webrev.00/
>>
>> Thanks,
>> /Erik



More information about the hotspot-dev mailing list