RFR: 8324969: assert(false) failed: Non-balanced monitor enter/exit!
Vladimir Kozlov
kvn at openjdk.org
Sat Feb 3 22:00:15 UTC 2024
The issue is that when we do nested locks elimination we don't check that synchronized region has both `Lock` and `Unlock` nodes.
Which can happen if locks coarsening optimization eliminated pair of Unlock/Lock nodes from adjacent locking regions before we check for nested locks.
Consider this code (all locks/unlocks use the same object):
lock1(obj) { // outer synchronized region
lock2(obj) {
// nested synchronized region 1
} unlock2(obj);
lock3(obj) {
// nested synchronized region 2
} unlock3(obj);
} unlock1(obj);
If `lock3` directly follows `unlock2` (no branches or safepoints) locks coarsening optimization will remove them:
lock1(obj) { // outer synchronized region
lock2(obj) {
// nested synchronized region 1
};
{
// nested synchronized region 2
} unlock3(obj);
} unlock1(obj);
Nested locks elimination code checks only `Lock` node in one region to find if it is nested (inside other lock region which use the same object) and then eliminate it. So we end up with not eliminated `Unlock` node in second nested region.
Why we don't hit this issue before? Normally nested locks elimination executed first and only then we do locks coarsening elimination. In the example above we eliminate all nested `Lock` and `Unlock` nodes, leaving only outer `Lock` and `Unlock`.
The additional factors which leads to the failure is fully unrolled loop around nested sync regions and some allocation to trigger Escape Analysis:
lock1(obj) { // outer synchronized region
Test var = new Test(); // Triggers EA
for (I = 0; I < 3; i++) { // small iteration number to fully unroll
lock2(obj) {
// nested synchronized region 1
} unlock2(obj);
lock3(obj) {
// nested synchronized region 2
} unlock3(obj);
}
} unlock1(obj);
Before executing Escape Analysis we do loops optimization to simplify graph: [compile.cpp#L2332](https://github.com/openjdk/jdk/blob/master/src/hotspot/share/opto/compile.cpp#L2332)
We also allow to fully unroll short loops (LoopOptsMaxUnroll) to remove merges from graph. It helps EA eliminate allocations.
Such unrolling creates several `Lock` and `Unlock` nodes per synchronized region. But nested locks elimination look for region with only one unique `Lock` node: [callnode.cpp#L2117](https://github.com/openjdk/jdk/blob/master/src/hotspot/share/opto/callnode.cpp#L2117). So we skip first iteration of nested locks elimination and execute locks coarsening optimization which eliminates all inners `Lock` and `Unlock` nodes leaving only first `Lock` and last `Unlock`. On next round of macro nodes expansion we execute nested locks elimination and remove first `Lock` node leaving `Unlock`.
The fix is to add check for unbalanced synchronized region (locks or unlocks) are missing.
New regression test was added. It has two `test` methods. One is based on original test from bug report. The other I wrote to show the issue clearly. Both triggers the assert.
Tested tier1-5, xcomp, stress
-------------
Commit messages:
- 8324969: assert(false) failed: Non-balanced monitor enter/exit!
Changes: https://git.openjdk.org/jdk/pull/17697/files
Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=17697&range=00
Issue: https://bugs.openjdk.org/browse/JDK-8324969
Stats: 95 lines in 2 files changed: 95 ins; 0 del; 0 mod
Patch: https://git.openjdk.org/jdk/pull/17697.diff
Fetch: git fetch https://git.openjdk.org/jdk.git pull/17697/head:pull/17697
PR: https://git.openjdk.org/jdk/pull/17697
More information about the hotspot-compiler-dev
mailing list