RFR: 8267532: C2: Profile and prune untaken exception handlers [v9]

Vladimir Ivanov vlivanov at openjdk.org
Thu Nov 23 03:05:14 UTC 2023


On Fri, 17 Nov 2023 13:41:58 GMT, Jorn Vernee <jvernee at openjdk.org> wrote:

>> The issue is essentially that for the Java try-with-resource construct, javac generates multiple calls to `close(`) the resource. One of those calls is inside the hidden exception handler of the try block. The issue for us is that typically the exception handler is never entered (since no exception is thrown), however we don't profile exception handlers at the moment, so the block is not pruned. C2 doesn't inline the `close()` call in the handler due to low call site frequency. As a result, the receiver of that call escapes and can not be scalar replaced, which then leads to a loss in performance.
>> 
>> There has been some discussion on the JBS issue that this could be fixed by profiling catch blocks. And another suggestion that partial escape analysis could help here to prevent the object from escaping. But, I think there are other benefits to being able to prune dead catch blocks, such as general reduction in code size, and other optimizations being possible by dead code being eliminated. So, I've implemented catch block profiling + pruning in this patch.
>> 
>> The implementation is essentially very straightforward: we allocate an extra bit of profiling data for each
>> exception handler of a method in the `MethodData` for that method (which holds all the profiling
>> data). Then when looking up the exception handler after an exception is thrown, we mark the
>> exception handler as entered. When C2 parses the exception handler block, and it sees that it has
>> never been entered, we emit an uncommon trap instead.
>> 
>> I've also cleaned up the handling of profiling data sections a bit. After adding the extra section of data to MethodData, I was seeing several crashes when ciMethodData was used. The underlying issue seemed to be that the offset of the parameter data was computed based on the total data size - parameter data size (which doesn't work if we add an additional section for exception handler data). I've re-written the code around this a bit to try and prevent issues in the future. Both MethodData and ciMethodData now track offsets of parameter data and exception handler data, and the size of the each data section is derived from the offsets.
>> 
>> Finally, there was an assert firing in `freeze_internal` in `continuationFreezeThaw.cpp`:
>> 
>>     assert(monitors_on_stack(current) == ((current->held_monitor_count() - current->jni_monitor_count()) > 0),
>>          "Held monitor count and locks on stack invariant: " INT64_FORMAT " JNI: " INT64_FORMAT, (int...
>
> Jorn Vernee has updated the pull request incrementally with two additional commits since the last revision:
> 
>  - fix linux compile
>  - Revert "add too_many_traps check"
>    
>    This reverts commit bee05534777dc2caf10362f66fea90a06705a144.

Overall, looks very good.

I have been thinking about the following choices made in this PR:
* amount of profiling data: binary (seen vs not seen) vs integral (branch count)
* deoptimization action: `reinterpret` vs `made_not_entrant`
* place where uncommon trap is inserted (`Parse` vs `ciTypeFlow`)

I haven't come with strong arguments to change any of these choices, so I'm the patch as it is now. We can adjust them later as follow-up enhancements if we decide to do so.

On naming: `ex_handler` is used only once - `GraphKit::has_ex_handler()`. Everywhere else in the code base `exception_handler` is used. Please, align the naming. Feel free to adjust `GraphKit::has_ex_handler()`.

The tests are very nice! Can you, please, point me to the test case which covers profiling in interpreter?

-------------

Marked as reviewed by vlivanov (Reviewer).

PR Review: https://git.openjdk.org/jdk/pull/16416#pullrequestreview-1745639528


More information about the hotspot-dev mailing list