From stefank at openjdk.org Tue Apr 1 07:04:54 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Tue, 1 Apr 2025 07:04:54 GMT Subject: RFR: 8352994: ZGC: Fix regression introduced in JDK-8350572 Message-ID: We have seen a bunch of timeouts that all point towards the introduction of a check against VMError::is_error_reported_in_current_thread() in the ZGC verification code. I propose this workaround: first check whether an error-reporting event is actually in progress, by checking VMError::is_error_reported(). The underlying performance issue (or hang(?)) when calling os::current_thread_id() is being investigated as a separate bug. This fix just tries to clean up issues we see when running ZGC testing. Thanks to @plummercj for digging into this and proposing the same workaround. Testing: GHA is clean, I'll run this through a few tiers of our CI pipeline ------------- Commit messages: - 8352994: ZGC: Fix regression introduced in JDK-8350572 Changes: https://git.openjdk.org/jdk/pull/24349/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24349&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8352994 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/24349.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24349/head:pull/24349 PR: https://git.openjdk.org/jdk/pull/24349 From cjplummer at openjdk.org Tue Apr 1 07:34:10 2025 From: cjplummer at openjdk.org (Chris Plummer) Date: Tue, 1 Apr 2025 07:34:10 GMT Subject: RFR: 8352994: ZGC: Fix regression introduced in JDK-8350572 In-Reply-To: References: Message-ID: <1S8NSOeUGbiCGZVwqiX0WGoHBguDWHvwwsxziFaFdtk=.3f5d4b1a-8d4e-47c3-a72c-9b8fc00e529d@github.com> On Tue, 1 Apr 2025 06:58:56 GMT, Stefan Karlsson wrote: > We have seen a bunch of timeouts that all point towards the introduction of a check against VMError::is_error_reported_in_current_thread() in the ZGC verification code. I propose this workaround: first check whether an error-reporting event is actually in progress, by checking VMError::is_error_reported(). > > The underlying performance issue (or hang(?)) when calling os::current_thread_id() is being investigated as a separate bug. This fix just tries to clean up issues we see when running ZGC testing. > > Thanks to @plummercj for digging into this and proposing the same workaround. > > Testing: GHA is clean, I'll run this through a few tiers of our CI pipeline I think you should also remove com/sun/jdi/JdbStopInNotificationThreadTest.java from the ZGC problem list. ------------- PR Review: https://git.openjdk.org/jdk/pull/24349#pullrequestreview-2731743846 From manc at openjdk.org Tue Apr 1 08:33:52 2025 From: manc at openjdk.org (Man Cao) Date: Tue, 1 Apr 2025 08:33:52 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v4] In-Reply-To: References: Message-ID: <_pxXWVlRMa_NcaIQWm6RS_CCrMuHpKZiKIXzxJuer6g=.ba7c6007-cc1f-44a4-b7cd-dd55f3322c65@github.com> > Hi all, > > I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. 
I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: > > - does not respect `MinHeapSize`; > - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; > - does not affect heuristcs to trigger a concurrent cycle; > > [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. Man Cao has updated the pull request incrementally with one additional commit since the last revision: Add two tests ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24211/files - new: https://git.openjdk.org/jdk/pull/24211/files/6f201fac..fc22cbfe Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24211&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24211&range=02-03 Stats: 162 lines in 2 files changed: 162 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/24211.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24211/head:pull/24211 PR: https://git.openjdk.org/jdk/pull/24211 From tschatzl at openjdk.org Tue Apr 1 08:43:01 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 1 Apr 2025 08:43:01 GMT Subject: RFR: 8271870: G1: Add objArray splitting when scanning object with evacuation failure Message-ID: <7hH3ohZ65_msEVaZ0qAI1D3pNI1iyZbKM9sYgfEMbwg=.1d21c70e-788b-43a0-8720-ca0231a70a45@github.com> Hi all, please review this change that makes the object iteration path for evacuation failed objects the same as the one for regular objects (and indeed make both use the same code). This has been made possible with the refactoring of object array task queues. At the same time this also covers [JDK-8271871](https://bugs.openjdk.org/browse/JDK-8271871). Testing: tier1-5, some perf testing with no differences Thanks, Thomas ------------- Commit messages: - 8271870 Changes: https://git.openjdk.org/jdk/pull/24222/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24222&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8271870 Stats: 101 lines in 3 files changed: 46 ins; 32 del; 23 mod Patch: https://git.openjdk.org/jdk/pull/24222.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24222/head:pull/24222 PR: https://git.openjdk.org/jdk/pull/24222 From manc at openjdk.org Tue Apr 1 08:44:55 2025 From: manc at openjdk.org (Man Cao) Date: Tue, 1 Apr 2025 08:44:55 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v4] In-Reply-To: <_pxXWVlRMa_NcaIQWm6RS_CCrMuHpKZiKIXzxJuer6g=.ba7c6007-cc1f-44a4-b7cd-dd55f3322c65@github.com> References: <_pxXWVlRMa_NcaIQWm6RS_CCrMuHpKZiKIXzxJuer6g=.ba7c6007-cc1f-44a4-b7cd-dd55f3322c65@github.com> Message-ID: <0rUbRHQuIv6bhZEiaalc5Qcfq5E7FJb51TtEf9qeYTk=.b084a316-7352-4c1b-8bea-5485740704e9@github.com> On Tue, 1 Apr 2025 08:33:52 GMT, Man Cao wrote: >> Hi all, >> >> I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. 
I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: >> >> - does not respect `MinHeapSize`; >> - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; >> - does not affect heuristcs to trigger a concurrent cycle; >> >> [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. > > Man Cao has updated the pull request incrementally with one additional commit since the last revision: > > Add two tests This PR is ready for review. Included tests cover important functionality of `SoftMaxHeapSize`. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2768618593 From manc at openjdk.org Tue Apr 1 08:44:55 2025 From: manc at openjdk.org (Man Cao) Date: Tue, 1 Apr 2025 08:44:55 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v5] In-Reply-To: References: Message-ID: > Hi all, > > I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: > > - does not respect `MinHeapSize`; > - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; > - does not affect heuristcs to trigger a concurrent cycle; > > [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. Man Cao has updated the pull request incrementally with one additional commit since the last revision: Revise test summary ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24211/files - new: https://git.openjdk.org/jdk/pull/24211/files/fc22cbfe..68f03cad Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24211&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24211&range=03-04 Stats: 5 lines in 2 files changed: 0 ins; 0 del; 5 mod Patch: https://git.openjdk.org/jdk/pull/24211.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24211/head:pull/24211 PR: https://git.openjdk.org/jdk/pull/24211 From tschatzl at openjdk.org Tue Apr 1 08:57:58 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 1 Apr 2025 08:57:58 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v5] In-Reply-To: References: Message-ID: On Tue, 1 Apr 2025 08:44:55 GMT, Man Cao wrote: >> Hi all, >> >> I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: >> >> - does not respect `MinHeapSize`; >> - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; >> - does not affect heuristcs to trigger a concurrent cycle; >> >> [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. 
> > Man Cao has updated the pull request incrementally with one additional commit since the last revision: > > Revise test summary Initial comments. src/hotspot/share/gc/g1/g1CollectedHeap.cpp line 2066: > 2064: size_t G1CollectedHeap::soft_max_capacity() const { > 2065: return clamp(align_up(SoftMaxHeapSize, HeapAlignment), MinHeapSize, > 2066: max_capacity()); Maybe this clamping of `SoftMaxHeapSize` should be part of argument processing. src/hotspot/share/gc/g1/g1CollectedHeap.hpp line 1203: > 1201: size_t max_capacity() const override; > 1202: > 1203: // Print the soft maximum heap capacity. Suggestion: // Returns the soft maximum heap capacity. src/hotspot/share/gc/g1/g1IHOPControl.cpp line 119: > 117: return (size_t)MIN2( > 118: G1CollectedHeap::heap()->soft_max_capacity() * (100.0 - safe_total_heap_percentage) / 100.0, > 119: _target_occupancy * (100.0 - _heap_waste_percent) / 100.0 This looks wrong. G1ReservePercent is in some way similar to soft max heap size, intended to keep the target below the real maximum capacity. I.e. it is not intended that G1 keeps another reserve of G1ReservePercent size below soft max capacity (which is below maximum capacity). There has been some internal discussion about whether the functionality of G1ReservePercent and SoftMaxHeapSize is too similar to warrant the former, but removing it is another issue. Imo, SoftMaxHeapSize should be an separate, actual target for this calculation. (`default_conc_mark_start_threshold()` also does not subtract `G1ReservePercent` from `SoftMaxHeapSize`). test/hotspot/jtreg/gc/g1/TestSoftMaxHeapSize.java line 29: > 27: * @test > 28: * @bug 8236073 > 29: * @requires vm.gc.G1 & vm.opt.ExplicitGCInvokesConcurrent != true It's nicer to put and-ed conditions in separate lines. test/hotspot/jtreg/gc/g1/TestSoftMaxHeapSize.java line 46: > 44: private static final long ALLOCATED_BYTES = 20_000_000; // About 20M > 45: private static final long MAX_HEAP_SIZE = > 46: 200 * 1024 * 1024; // 200MiB, must match -Xmx on command line. Is it possible to get that value from the `MemoryMXBean` instead of relying on manual update? I.e. `getMax()`? ------------- Changes requested by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24211#pullrequestreview-2731934928 PR Review Comment: https://git.openjdk.org/jdk/pull/24211#discussion_r2022415626 PR Review Comment: https://git.openjdk.org/jdk/pull/24211#discussion_r2022415016 PR Review Comment: https://git.openjdk.org/jdk/pull/24211#discussion_r2022430412 PR Review Comment: https://git.openjdk.org/jdk/pull/24211#discussion_r2022434814 PR Review Comment: https://git.openjdk.org/jdk/pull/24211#discussion_r2022438436 From tschatzl at openjdk.org Tue Apr 1 09:24:12 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 1 Apr 2025 09:24:12 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v29] In-Reply-To: References: Message-ID: > Hi all, > > please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. > > The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. > > ### Current situation > > With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. 
> > The main reason for the current barrier is how g1 implements concurrent refinement:
> * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations.
> * For correctness, dirty card updates require fine-grained synchronization between mutator and refinement threads,
> * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible.
>
> These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code:
>
>
> // Filtering
> if (region(@x.a) == region(y)) goto done; // same region check
> if (y == null) goto done; // null value check
> if (card(@x.a) == young_card) goto done; // write to young gen check
> StoreLoad; // synchronize
> if (card(@x.a) == dirty_card) goto done;
>
> *card(@x.a) = dirty
>
> // Card tracking
> enqueue(card-address(@x.a)) into thread-local-dcq;
> if (thread-local-dcq is not full) goto done;
>
> call runtime to move thread-local-dcq into dcqs
>
> done:
>
>
> Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc.
>
> The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining.
>
> There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links).
>
> The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse-grained synchronization based on atomically switching card tables. Mutators only work on the "primary" card table, refinement threads on a se... Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 37 commits: - Merge branch 'master' into 8342382-card-table-instead-of-dcq - Merge branch 'master' into 8342382-card-table-instead-of-dcq - Merge branch 'master' into submit/8342382-card-table-instead-of-dcq - * make young gen length revising independent of refinement thread * use a service task * both refinement control thread and young gen length revising use the same infrastructure to get the number of available bytes and determine the time to the next update - * fix IR code generation tests that change due to barrier cost changes - * factor out card table and refinement table merging into a single method - Merge branch 'master' into 8342382-card-table-instead-of-dcq3 - * obsolete G1UpdateBufferSize G1UpdateBufferSize has previously been used to size the refinement buffers and impose a minimum limit on the number of cards per thread that need to be pending before refinement starts. The former function is now obsolete with the removal of the dirty card queues, the latter functionality has been taken over by the new diagnostic option `G1PerThreadPendingCardThreshold`. I prefer to make this a diagnostic option rather than a product option because it is something that is only necessary for some test cases to produce some otherwise unwanted behavior (continuous refinement). CSR is pending. - * more documentation on why we need to rendezvous the gc threads - Merge branch 'master' into 8342381-card-table-instead-of-dcq - ... 
and 27 more: https://git.openjdk.org/jdk/compare/aff5aa72...51fb6e63 ------------- Changes: https://git.openjdk.org/jdk/pull/23739/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23739&range=28 Stats: 7089 lines in 110 files changed: 2610 ins; 3555 del; 924 mod Patch: https://git.openjdk.org/jdk/pull/23739.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23739/head:pull/23739 PR: https://git.openjdk.org/jdk/pull/23739 From iwalulya at openjdk.org Tue Apr 1 10:55:27 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Tue, 1 Apr 2025 10:55:27 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v5] In-Reply-To: References: Message-ID: On Tue, 1 Apr 2025 08:44:55 GMT, Man Cao wrote: >> Hi all, >> >> I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: >> >> - does not respect `MinHeapSize`; >> - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; >> - does not affect heuristics to trigger a concurrent cycle; >> >> [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. > > Man Cao has updated the pull request incrementally with one additional commit since the last revision: > > Revise test summary With the changes to `young_collection_expansion_amount()`, once we reach the `SoftMaxHeapSize`, we cannot expand the heap except during GC where expansion can happen without regard for `SoftMaxHeapSize`. Thus, after exceeding `SoftMaxHeapSize` we go into a phase of repeated GCs where we expand the heap almost one region at a time. Is this the expected effect of the `SoftMaxHeapSize` as implemented by this patch? ------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2768966455 From stefan.johansson at oracle.com Tue Apr 1 12:49:10 2025 From: stefan.johansson at oracle.com (Stefan Johansson) Date: Tue, 1 Apr 2025 14:49:10 +0200 Subject: [EXTERNAL] Re: RFC: G1 as default collector (for real this time) In-Reply-To: References: <74d05686-9c57-4262-881d-31c269f34bc5@oracle.com> <61CEE33A-6718-479D-A498-697C1063B5AA@oracle.com> Message-ID: <792ad340-5160-413b-b766-c49b4ff6d4c5@oracle.com> Thanks for sharing these results Monica, As Thomas mentioned we have done some testing comparing Serial to G1 in small environments as well. Our conclusions are similar to yours, G1 nowadays handles small environments pretty well. I used SPECjbb2005, and my focus was to compare throughput given a fixed memory usage. The reason for this is that the low native memory overhead of Serial (no marking bitmap etc) is often used as an argument to use it in small environments. On the other hand, the region-based heap layout of G1 can in many cases offer a better out of the box heap utilization compared to Serial. To test this and to make a fair comparison I configured Serial to have a slightly larger heap to get an overall equal memory consumption (using the peak PSS usage in Linux as the measure). SpecJBB2005 by default runs 1 to 8 warehouses, where warehouses correspond to worker threads. I did run this in a cgroup environment with 1CPU and 1G memory. 
By default this will give G1 a 256m max heap, which I fixed using Xmx and Xms. To let Serial use as much memory in total as G1 I configured it with a 288MB heap. With this setup Serial and G1 get a very similar score with a recent JDK 25 build. The calculated score only takes warehouse 1 and 2 into account and looking at the result/score for 8 warehouses G1 is ~10% better. So it looks like G1 is able to handle high pressure better compared to Serial. These results are without the new improved barriers for G1; when using a build with the new barrier, the G1 results are improved by roughly 3%. This is a use-case not at all caring about latency, and the fact that G1 is still performing this well also points towards it being a suitable default even for small environments. I've also played around a bit with restricting the amount of concurrent work done with G1, to see how a G1 STW-only mode would perform, and on a single CPU system this looks beneficial when we start to run with more worker threads. But I don't suspect it's that common to run small cloud services at 100% load, so having a default that can do concurrent work seems reasonable. Thanks, Stefan On 2025-03-18 00:59, Monica Beckwith wrote: > Hi Thomas, Erik, and all, > > This is an important and timely discussion, and I appreciate the > insights on how the gap between SerialGC and G1GC has diminished over > time. Based on recent comparative tests of out-of-the-box GC > configurations (-Xmx only), I wanted to share some data-backed > observations that might help validate this shift. > > I tested G1GC and SerialGC under 1-core/2GB and 2-core/2GB > containerized environments (512MB < -Xmx <1.5GB), running SPECJBB2015 > with and without stress tests. The key findings: > > *Throughput (max_jOPS & critical_jOPS):* > > * > G1GC consistently outperforms SerialGC. > * > 1 core: G1GC shows a 1.78x increase in max_jOPS. > * > 2 cores: G1GC shows a 2.84x improvement over SerialGC. > > > *Latency and Stop-the-World (STW) Impact:* > > * > SerialGC struggles under stress, with frequent full GCs leading to > long pauses. > * > G1GC's incremental collections keep pause times lower, especially > under stress load. > * > critical_jOPS, a key SLA metric, is 4.5x higher for G1GC on 2 cores. > > > *Memory Behavior & Stability:* > > * > In 512MB heap configurations, SerialGC encountered OOM failures > due to heap exhaustion. > > > Given these results, it seems reasonable to reconsider why SerialGC > remains the default in small environments when G1GC offers clear > performance and stability advantages. > > Looking forward to thoughts on this. > > Best, > Monica > > P.S.: I haven't tested for <512MB heaps yet, as that requires a > different test config I'm still working on. I'd also love to hear from > anyone running single-threaded, CPU-bound workloads if they have > observations to share. > > > ------------------------------------------------------------------------ > *From:* hotspot-gc-dev on behalf of > Thomas Schatzl > *Sent:* Monday, February 24, 2025 2:33 AM > *To:* Erik Osterlund > *Cc:* hotspot-gc-dev at openjdk.org > *Subject:* [EXTERNAL] Re: RFC: G1 as default collector (for real this > time) > Hi, > > On 21.02.25 15:02, Erik Osterlund wrote: > > Hi Thomas, > > > [...]> There is however a flip side for that argument on the other side > of the scaling spectrum, where ZGC is probably a better fit on the even > larger scale. 
So while it's true that the effect of a Serial -> G1 > default change is a static default GC, I just think we should mind the > fact that there is more uncertainty on the larger end of the scale. I'm > not proposing any changes, just saying that maybe we should be careful > about stressing the importance of having a static default GC, if we > don't know if that is the better strategy on the larger end of the scale > or not, going forward. > > +1 > > Thomas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From tschatzl at openjdk.org Tue Apr 1 16:09:20 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 1 Apr 2025 16:09:20 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v5] In-Reply-To: References: Message-ID: On Tue, 1 Apr 2025 08:40:09 GMT, Thomas Schatzl wrote: >> Man Cao has updated the pull request incrementally with one additional commit since the last revision: >> >> Revise test summary > > src/hotspot/share/gc/g1/g1CollectedHeap.cpp line 2066: > >> 2064: size_t G1CollectedHeap::soft_max_capacity() const { >> 2065: return clamp(align_up(SoftMaxHeapSize, HeapAlignment), MinHeapSize, >> 2066: max_capacity()); > > Maybe this clamping of `SoftMaxHeapSize` should be part of argument processing. Ignore this - `SoftMaxHeapSize` is manageable after all. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24211#discussion_r2023162750 From wkemper at openjdk.org Tue Apr 1 18:21:12 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 1 Apr 2025 18:21:12 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data In-Reply-To: References: Message-ID: On Mon, 31 Mar 2025 03:17:51 GMT, Kelvin Nilsen wrote: > The existing implementation of get_live_data_bytes() and get_live_data_words() does not always behave as might be expected. In particular, the value returned ignores any allocations that occur subsequent to the most recent mark effort that identified live data within the region. This is typically ok for young regions that are going to be added or not to the collection set during the final-mark safepoint. > > However, old-gen regions that are placed into the set of candidates for mixed evacuation are more complicated. In particular, by the time the old-gen region is added to a mixed evacuation, its live data may be much larger than at the time concurrent old marking ended. > > This PR provides comments to clarify the shortcomings of the existing functions, and adds new functions that provide more accurate accountings of live data for mixed-evacuation candidate regions. Changes requested by wkemper (Reviewer). src/hotspot/share/gc/shenandoah/shenandoahHeapRegion.cpp line 78: > 76: _live_data(0), > 77: _critical_pins(0), > 78: _mixed_candidate_garbage_words(0), Do we need a new field to track this? During `final_mark`, we call `increase_live_data_alloc_words` to add `TAMS + top` to `_live_data` to account for objects allocated during mark. Could we "fix" `get_live_data` so that it always returned marked objects (counted by `increase_live_data_gc_words`) _plus_ `top - TAMS`? This way, the live data would not become stale after `final_mark` and we wouldn't have another field to manage. What do you think? 
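A minimal sketch of the combined accessor suggested in the comment above: live data computed as the words marked live by the most recent mark plus everything allocated in the region since the mark started (top - TAMS). The helper names (`marking_context()`, `top_at_mark_start()`) and the free-standing form are assumptions for illustration, not the exact Shenandoah API or the patch under review.

// Sketch only, assuming the marking context exposes TAMS for a region.
static size_t combined_live_data_words(const ShenandoahHeapRegion* r,
                                       const ShenandoahMarkingContext* ctx) {
  const HeapWord* tams = ctx->top_at_mark_start(r);
  // Words accumulated by increase_live_data_gc_words() while marking.
  const size_t marked_words = r->get_live_data_words();
  // Objects allocated above TAMS were never marked but are implicitly live,
  // so counting them keeps the value accurate after final_mark.
  const size_t allocated_since_mark = pointer_delta(r->top(), tams);
  return marked_words + allocated_since_mark;
}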
src/hotspot/share/gc/shenandoah/shenandoahHeapRegion.inline.hpp line 159: > 157: > 158: inline size_t ShenandoahHeapRegion::get_mixed_candidate_live_data_bytes() const { > 159: assert(SafepointSynchronize::is_at_safepoint(), "Should be at Shenandoah safepoint"); Could we use `shenandoah_assert_safepoint` here (and other places) instead? ------------- PR Review: https://git.openjdk.org/jdk/pull/24319#pullrequestreview-2733584314 PR Review Comment: https://git.openjdk.org/jdk/pull/24319#discussion_r2023461623 PR Review Comment: https://git.openjdk.org/jdk/pull/24319#discussion_r2023396124 From manc at openjdk.org Tue Apr 1 20:54:36 2025 From: manc at openjdk.org (Man Cao) Date: Tue, 1 Apr 2025 20:54:36 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v6] In-Reply-To: References: Message-ID: <3tPGLO7tcSAMgLFlLTlQCXWZ1Dvlk4xInkqdxoYTxwM=.5b8740c2-8ed3-4387-8a50-325007ed027e@github.com> > Hi all, > > I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: > > - does not respect `MinHeapSize`; > - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; > - does not affect heuristcs to trigger a concurrent cycle; > > [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. Man Cao has updated the pull request incrementally with one additional commit since the last revision: Address comments and try fixing test failure on macos-aarch64 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24211/files - new: https://git.openjdk.org/jdk/pull/24211/files/68f03cad..0bc55654 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24211&range=05 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24211&range=04-05 Stats: 12 lines in 3 files changed: 2 ins; 3 del; 7 mod Patch: https://git.openjdk.org/jdk/pull/24211.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24211/head:pull/24211 PR: https://git.openjdk.org/jdk/pull/24211 From manc at openjdk.org Tue Apr 1 20:54:37 2025 From: manc at openjdk.org (Man Cao) Date: Tue, 1 Apr 2025 20:54:37 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v5] In-Reply-To: References: Message-ID: On Tue, 1 Apr 2025 08:48:53 GMT, Thomas Schatzl wrote: >> Man Cao has updated the pull request incrementally with one additional commit since the last revision: >> >> Revise test summary > > src/hotspot/share/gc/g1/g1IHOPControl.cpp line 119: > >> 117: return (size_t)MIN2( >> 118: G1CollectedHeap::heap()->soft_max_capacity() * (100.0 - safe_total_heap_percentage) / 100.0, >> 119: _target_occupancy * (100.0 - _heap_waste_percent) / 100.0 > > This looks wrong. G1ReservePercent is in some way similar to soft max heap size, intended to keep the target below the real maximum capacity. > I.e. it is not intended that G1 keeps another reserve of G1ReservePercent size below soft max capacity (which is below maximum capacity). > > There has been some internal discussion about whether the functionality of G1ReservePercent and SoftMaxHeapSize is too similar to warrant the former, but removing it is another issue. 
> > Imo, SoftMaxHeapSize should be an separate, actual target for this calculation. (`default_conc_mark_start_threshold()` also does not subtract `G1ReservePercent` from `SoftMaxHeapSize`). Thanks. Yes, that makes sense. Now it uses `MIN3` to take `soft_max_capacity()` as a separate constraint. > test/hotspot/jtreg/gc/g1/TestSoftMaxHeapSize.java line 46: > >> 44: private static final long ALLOCATED_BYTES = 20_000_000; // About 20M >> 45: private static final long MAX_HEAP_SIZE = >> 46: 200 * 1024 * 1024; // 200MiB, must match -Xmx on command line. > > Is it possible to get that value from the `MemoryMXBean` instead of relying on manual update? I.e. `getMax()`? Yes, it is a good idea. Done. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24211#discussion_r2023659889 PR Review Comment: https://git.openjdk.org/jdk/pull/24211#discussion_r2023660401 From wkemper at openjdk.org Tue Apr 1 22:27:07 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 1 Apr 2025 22:27:07 GMT Subject: RFR: 8351892: GenShen: Remove enforcement of generation sizes [v2] In-Reply-To: References: <-BEi4FpPLjKx07-J7ix9fHkKVhkcYylA0ojI-a1zrJs=.a3c073d3-7e52-46fd-8e2a-1ea601bd2074@github.com> Message-ID: On Sat, 29 Mar 2025 00:08:06 GMT, Kelvin Nilsen wrote: >> William Kemper has updated the pull request incrementally with one additional commit since the last revision: >> >> Don't let old have the entire heap > > src/hotspot/share/gc/shenandoah/shenandoahGenerationalFullGC.cpp line 120: > >> 118: if (old_capacity > old_usage) { >> 119: size_t excess_old_regions = (old_capacity - old_usage) / ShenandoahHeapRegion::region_size_bytes(); >> 120: gen_heap->transfer_to_young(excess_old_regions); > > should we assert result is successful? Or replace with force_transfer? (just seems bad practice to ignore a status result) Yes, will try an assert here. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24268#discussion_r2023754542 From wkemper at openjdk.org Tue Apr 1 22:44:35 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 1 Apr 2025 22:44:35 GMT Subject: RFR: 8351892: GenShen: Remove enforcement of generation sizes [v3] In-Reply-To: References: Message-ID: > * The option to configure minimum and maximum sizes for the young generation have been combined into `ShenandoahInitYoungPercentage`. > * The remaining functionality in `shGenerationSizer` wasn't enough to warrant being its own class, so the functionality was rolled into `shGenerationalHeap`. William Kemper has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains five additional commits since the last revision: - Simplify confusing (and confused) comment - Assert that region transfers succeed when expected - Merge remote-tracking branch 'jdk/master' into stop-enforcing-gen-size-limits - Don't let old have the entire heap - Stop enforcing young/old generation sizes. Move what's left of generation sizing logic into shGenerationalHeap. 
------------- Changes: - all: https://git.openjdk.org/jdk/pull/24268/files - new: https://git.openjdk.org/jdk/pull/24268/files/bc171089..33a2f19d Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24268&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24268&range=01-02 Stats: 18299 lines in 378 files changed: 10486 ins; 6499 del; 1314 mod Patch: https://git.openjdk.org/jdk/pull/24268.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24268/head:pull/24268 PR: https://git.openjdk.org/jdk/pull/24268 From wkemper at openjdk.org Tue Apr 1 22:44:36 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 1 Apr 2025 22:44:36 GMT Subject: RFR: 8351892: GenShen: Remove enforcement of generation sizes [v2] In-Reply-To: References: <-BEi4FpPLjKx07-J7ix9fHkKVhkcYylA0ojI-a1zrJs=.a3c073d3-7e52-46fd-8e2a-1ea601bd2074@github.com> Message-ID: On Sat, 29 Mar 2025 00:10:28 GMT, Kelvin Nilsen wrote: >> William Kemper has updated the pull request incrementally with one additional commit since the last revision: >> >> Don't let old have the entire heap > > src/hotspot/share/gc/shenandoah/shenandoahGenerationalHeap.cpp line 134: > >> 132: ShenandoahHeap::initialize_heuristics(); >> 133: >> 134: // Max capacity is the maximum _allowed_ capacity. This means the sum of the maximum > > I don't understand the relevance of this comment. Is there still a maximum allowed for old and a maximum allowed for young? This comment stemmed from my own confusion over fields and variables called _max_ `capacity`. I would like to rename the `_max_capacity` field to just `_capacity`. In my mind, the _max_ should be immutable, but that isn't how Shenandoah uses this field. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24268#discussion_r2023766431 From jsikstro at openjdk.org Wed Apr 2 06:57:22 2025 From: jsikstro at openjdk.org (Joel Sikström) Date: Wed, 2 Apr 2025 06:57:22 GMT Subject: RFR: 8353471: ZGC: Redundant generation id in ZGeneration Message-ID: The ZGeneration class (and in turn ZGenerationOld and ZGenerationYoung) keeps track of its own ZGenerationId, which means that the generation id does not need to be passed along as an argument when calling internal functions. I've removed the id parameter from `ZGeneration::select_relocation_set` in favor of using the member variable `_id`. ------------- Commit messages: - 8353471: ZGC: Redundant generation id in ZGeneration Changes: https://git.openjdk.org/jdk/pull/24374/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24374&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8353471 Stats: 6 lines in 2 files changed: 0 ins; 0 del; 6 mod Patch: https://git.openjdk.org/jdk/pull/24374.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24374/head:pull/24374 PR: https://git.openjdk.org/jdk/pull/24374 From stefank at openjdk.org Wed Apr 2 07:06:13 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Wed, 2 Apr 2025 07:06:13 GMT Subject: RFR: 8353471: ZGC: Redundant generation id in ZGeneration In-Reply-To: References: Message-ID: On Wed, 2 Apr 2025 06:52:49 GMT, Joel Sikström wrote: > The ZGeneration class (and in turn ZGenerationOld and ZGenerationYoung) keeps track of its own ZGenerationId, which means that the generation id does not need to be passed along as an argument when calling internal functions. > > I've removed the id parameter from `ZGeneration::select_relocation_set` in favor of using the member variable `_id`. Marked as reviewed by stefank (Reviewer). 
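For readers skimming the digest, the shape of the 8353471 cleanup described above is roughly the following. The signatures and the selector line are illustrative assumptions only, not taken from the actual patch.

// Before (sketch): callers passed the generation id even though the
// ZGeneration object already knows which generation it represents.
void ZGeneration::select_relocation_set(ZGenerationId id);

// After (sketch): the member variable _id is used instead, so the redundant
// parameter disappears from the internal call chain.
void ZGeneration::select_relocation_set() {
  // Anything that previously consumed the 'id' argument now reads _id.
  ZRelocationSetSelector selector(_id);
  // ... select forwardings for this generation ...
}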
------------- PR Review: https://git.openjdk.org/jdk/pull/24374#pullrequestreview-2734854851 From eosterlund at openjdk.org Wed Apr 2 10:01:34 2025 From: eosterlund at openjdk.org (Erik Österlund) Date: Wed, 2 Apr 2025 10:01:34 GMT Subject: RFR: 8353471: ZGC: Redundant generation id in ZGeneration In-Reply-To: References: Message-ID: On Wed, 2 Apr 2025 06:52:49 GMT, Joel Sikström wrote: > The ZGeneration class (and in turn ZGenerationOld and ZGenerationYoung) keeps track of its own ZGenerationId, which means that the generation id does not need to be passed along as an argument when calling internal functions. > > I've removed the id parameter from `ZGeneration::select_relocation_set` in favor of using the member variable `_id`. Marked as reviewed by eosterlund (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/24374#pullrequestreview-2735717264 From ayang at openjdk.org Wed Apr 2 10:15:48 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 2 Apr 2025 10:15:48 GMT Subject: RFR: 8271870: G1: Add objArray splitting when scanning object with evacuation failure In-Reply-To: <7hH3ohZ65_msEVaZ0qAI1D3pNI1iyZbKM9sYgfEMbwg=.1d21c70e-788b-43a0-8720-ca0231a70a45@github.com> References: <7hH3ohZ65_msEVaZ0qAI1D3pNI1iyZbKM9sYgfEMbwg=.1d21c70e-788b-43a0-8720-ca0231a70a45@github.com> Message-ID: On Tue, 25 Mar 2025 10:35:58 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that makes the object iteration path for evacuation failed objects the same as the one for regular objects (and indeed make both use the same code). > > This has been made possible with the refactoring of object array task queues. > > At the same time this also covers [JDK-8271871](https://bugs.openjdk.org/browse/JDK-8271871). > > Testing: tier1-5, some perf testing with no differences > > Thanks, > Thomas Marked as reviewed by ayang (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/24222#pullrequestreview-2735758122 From stefank at openjdk.org Wed Apr 2 11:15:01 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Wed, 2 Apr 2025 11:15:01 GMT Subject: RFR: 8352994: ZGC: Fix regression introduced in JDK-8350572 [v2] In-Reply-To: References: Message-ID: > We have seen a bunch of timeouts that all point towards the introduction of a check against VMError::is_error_reported_in_current_thread() in the ZGC verification code. I propose this workaround: first check whether an error-reporting event is actually in progress, by checking VMError::is_error_reported(). > > The underlying performance issue (or hang(?)) when calling os::current_thread_id() is being investigated as a separate bug. This fix just tries to clean up issues we see when running ZGC testing. > > Thanks to @plummercj for digging into this and proposing the same workaround. 
> > Testing: GHA is clean, I'll run this through a few tiers of our CI pipeline Stefan Karlsson has updated the pull request incrementally with one additional commit since the last revision: Remove test from ProblemList ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24349/files - new: https://git.openjdk.org/jdk/pull/24349/files/8db3f6d0..fe07a340 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24349&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24349&range=00-01 Stats: 1 line in 1 file changed: 0 ins; 1 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/24349.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24349/head:pull/24349 PR: https://git.openjdk.org/jdk/pull/24349 From stefank at openjdk.org Wed Apr 2 11:15:02 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Wed, 2 Apr 2025 11:15:02 GMT Subject: RFR: 8352994: ZGC: Fix regression introduced in JDK-8350572 In-Reply-To: References: Message-ID: On Tue, 1 Apr 2025 06:58:56 GMT, Stefan Karlsson wrote: > We have seen a bunch of timeouts that all point towards the introduction of a check against VMError::is_error_reported_in_current_thread() in the ZGC verification code. I propose this workaround: first check whether an error-reporting event is actually in progress, by checking VMError::is_error_reported(). > > The underlying performance issue (or hang(?)) when calling os::current_thread_id() is being investigated as a separate bug. This fix just tries to clean up issues we see when running ZGC testing. > > Thanks to @plummercj for digging into this and proposing the same workaround. > > Testing: GHA is clean, I'll run this through a few tiers of our CI pipeline I've removed the test and will run tier1-tier3. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24349#issuecomment-2772225278 From stefank at openjdk.org Wed Apr 2 11:47:53 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Wed, 2 Apr 2025 11:47:53 GMT Subject: RFR: 8353264: ZGC: Windows heap unreserving is broken In-Reply-To: References: Message-ID: <4Uyw00r7p9C-1BSfQRNEQ0p5td8RylD7YVLOHj6HODM=.47100abf-8467-4b47-9edb-c30877152c56@github.com> On Wed, 2 Apr 2025 11:35:36 GMT, Stefan Karlsson wrote: > During the development of [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) we found that the functionality to release reserved memory for the heap is broken. The current implementation passes in the size of the reserved memory area, but according to the documentation the call should be done with `0` as the dwSize argument: > > "If the dwFreeType parameter is MEM_RELEASE, dwSize must be 0 (zero)" > > > Generational ZGC isn't affected by this because we never release any reserved memory for the heap. However, the changes in JDK-8350441 are going to change that and we will start to release memory in certain corner-cases. > > In Single-gen ZGC, which exists in older releases, we have paths that do release memory for "views" into the heap. This only happens if something blocks the memory areas where we want to set up our "views" of the heap. We should probably backport this fix to the affected releases. > > I've added a unit test that provokes the problem and I have run this fix together with the changes for JDK-8350441. I moved this PR from hotspot to hotspot-gc. 
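For reference, the Windows contract quoted in the 8353264 description above comes down to the following. This is a standalone illustration of the VirtualFree rule, not the ZGC code itself; the helper name is made up.

#include <windows.h>

// Releasing a reservation made with VirtualAlloc(MEM_RESERVE): the documented
// contract is that dwSize must be 0 and lpAddress must be the base address of
// the original reservation. Passing the reservation size instead (as the
// broken path effectively did) makes the MEM_RELEASE call fail.
static bool release_reservation(void* base) {
  return VirtualFree(base, 0 /* dwSize must be 0 for MEM_RELEASE */, MEM_RELEASE) != 0;
}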
------------- PR Comment: https://git.openjdk.org/jdk/pull/24377#issuecomment-2772303530 From eosterlund at openjdk.org Wed Apr 2 11:58:57 2025 From: eosterlund at openjdk.org (Erik Österlund) Date: Wed, 2 Apr 2025 11:58:57 GMT Subject: RFR: 8353264: ZGC: Windows heap unreserving is broken In-Reply-To: References: Message-ID: On Wed, 2 Apr 2025 11:35:36 GMT, Stefan Karlsson wrote: > During the development of [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) we found that the functionality to release reserved memory for the heap is broken. The current implementation passes in the size of the reserved memory area, but according to the documentation the call should be done with `0` as the dwSize argument: > > "If the dwFreeType parameter is MEM_RELEASE, dwSize must be 0 (zero)" > > > Generational ZGC isn't affected by this because we never release any reserved memory for the heap. However, the changes in JDK-8350441 are going to change that and we will start to release memory in certain corner-cases. > > In Single-gen ZGC, which exists in older releases, we have paths that do release memory for "views" into the heap. This only happens if something blocks the memory areas where we want to set up our "views" of the heap. We should probably backport this fix to the affected releases. > > I've added a unit test that provokes the problem and I have run this fix together with the changes for JDK-8350441. Looks good. ------------- Marked as reviewed by eosterlund (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24377#pullrequestreview-2736002080 From tschatzl at openjdk.org Wed Apr 2 13:04:08 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 2 Apr 2025 13:04:08 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v5] In-Reply-To: References: Message-ID: <0xr7VMlEH9EAc8XB9HQKPdxOHUcLfwtZkNAkGrTPu_k=.72d5e5be-373f-4db2-bbfb-9026c82e3c94@github.com> On Tue, 1 Apr 2025 20:57:36 GMT, Man Cao wrote: > > With the changes to `young_collection_expansion_amount()`, once we reach the `SoftMaxHeapSize`, we cannot expand the heap except during GC where expansion can happen without regard for `SoftMaxHeapSize`. Thus, after exceeding `SoftMaxHeapSize` we go into a phase of repeated GCs where we expand the heap almost one region at a time. Is this the expected effect of the `SoftMaxHeapSize` as implemented by this patch? > > Yes. This is the expected behavior if the user sets `SoftMaxHeapSize` too small. G1 will try its best to respect `SoftMaxHeapSize`, which could cause GC thrashing. However, it won't cause `OutOfMemoryError`. This problem is due to the user's misconfiguration of `SoftMaxHeapSize`, which is similar to a user misconfiguring `Xmx` to be too small. The original patch on the CR only set the guidance for the marking. It did not interact with heap sizing directly at all like the change does. What is the reason for this change? (Iirc, in tests a long time ago, that original patch, together with adapting `Min/MaxHeapFreeRatio`, did result in the desired effect of G1/`SoftMaxHeapSize` decreasing the heap appropriately. Without it, the heap will almost never change, but that is expected given how `Min/MaxHeapFreeRatio` works). So similar to @walulyai I would strongly prefer for `SoftMaxHeapSize` not to interfere that much with the application's performance. To me, this behavior is not "soft", and there seems to be general consensus internally about allowing unbounded cpu usage for GC. 
Afaiu in ZGC, if the heap grows beyond `SoftMaxHeapSize`, GC activity can grow up to 25% of cpu usage (basically maxing out concurrent threads). That could be a reasonable guidance as well here. GC thrashing will also prevent progress with marking, and actually cause more marking because of objects not having enough time to die. This just makes the situation worse until the heap gets scaled back to `SoftMaxHeapSize`. However at the moment, changing the GC activity threshold internally will not automatically shrink the heap as you would expect, since currently shrinking is controlled by marking using the `Min/MaxHeapFreeRatio` flags. That gets us back to [JDK-8238687](https://bugs.openjdk.org/browse/JDK-8238687) and [JDK-8248324](https://bugs.openjdk.org/browse/JDK-8248324)... @walulyai is currently working on the former issue again, testing it, maybe you two could work together on that to see whether basing this work on what @walulyai is cooking up is a better way forward, if needed modifying `gctimeratio` if we are above `SoftMaxHeapSize`? Otherwise, if there really is a need to get this functionality asap, even only making it a guide for the marking should at least give some effect (but I think without changing `Min/MaxHeapFreeRatio` at the same time there is not much effect anyway). But that is a fairly coarse and indirect way of getting the necessary effect to shrink the heap. We should not limit ourselves to what mainline provides at the moment. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2772493942 From tschatzl at openjdk.org Wed Apr 2 13:04:11 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 2 Apr 2025 13:04:11 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v6] In-Reply-To: <3tPGLO7tcSAMgLFlLTlQCXWZ1Dvlk4xInkqdxoYTxwM=.5b8740c2-8ed3-4387-8a50-325007ed027e@github.com> References: <3tPGLO7tcSAMgLFlLTlQCXWZ1Dvlk4xInkqdxoYTxwM=.5b8740c2-8ed3-4387-8a50-325007ed027e@github.com> Message-ID: On Tue, 1 Apr 2025 20:54:36 GMT, Man Cao wrote: >> Hi all, >> >> I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: >> >> - does not respect `MinHeapSize`; >> - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; >> - does not affect heuristics to trigger a concurrent cycle; >> >> [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. > > Man Cao has updated the pull request incrementally with one additional commit since the last revision: > > Address comments and try fixing test failure on macos-aarch64 There also seems to be a concurrency issue with reading the `SoftMaxHeapSize` variable: Since the flag is manageable, at least outside of safepoints (afaict `jcmd` is blocked by safepoints, but I'll ask), the variable can be written at any time. So e.g. the assignment of `G1IHOPControl::get_conc_mark_start_threshold` to `marking_initiating_used_threshold` in that call can be inlined in `G1Policy::need_to_start_conc_mark` (called by the mutator in `G1CollectedHeap::attempt_allocation_humongous`) in multiple places, and so `SoftMaxHeapSize` is re-read with multiple different values in that method. 
Probably an `Atomic::load(&SoftMaxHeapSize)` in the getter is sufficient for that. The other multiple re-readings of the `soft_max_capacity()` in the safepoint seem okay - I do not think there is a way to update the value within a safepoint externally. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2772496003 From zgu at openjdk.org Wed Apr 2 13:24:56 2025 From: zgu at openjdk.org (Zhengyu Gu) Date: Wed, 2 Apr 2025 13:24:56 GMT Subject: RFR: 8353263: Parallel: Remove locking in PSOldGen::resize In-Reply-To: <4QpvbYEywkzocWXFBkda0ymp3cdpp6PNNTylVqUFXig=.7ee05cda-222a-421c-b09c-1519dfea7bf1@github.com> References: <4QpvbYEywkzocWXFBkda0ymp3cdpp6PNNTylVqUFXig=.7ee05cda-222a-421c-b09c-1519dfea7bf1@github.com> Message-ID: On Mon, 31 Mar 2025 09:45:23 GMT, Albert Mingkun Yang wrote: > Simple removing the use of `PSOldGenExpand_lock` in resizing logic after full-gc, because the calling context is inside a safepoint. > > Test: tier1-5 LGTM ------------- Marked as reviewed by zgu (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24322#pullrequestreview-2736263356 From stuefe at openjdk.org Wed Apr 2 13:30:51 2025 From: stuefe at openjdk.org (Thomas Stuefe) Date: Wed, 2 Apr 2025 13:30:51 GMT Subject: RFR: 8353264: ZGC: Windows heap unreserving is broken In-Reply-To: References: Message-ID: On Wed, 2 Apr 2025 11:35:36 GMT, Stefan Karlsson wrote: > During the development of [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) we found that the functionality to release reserved memory for the heap is broken. The current implementation passes in the size of the reserved memory area, but according to the documentation the call should be done with `0` as the dwSize argument: > > If the dwFreeType parameter is MEM_RELEASE, dwSize must be 0 (zero) > > > Generational ZGC isn't affected by this because we never release any reserved memory for the heap. However, the changes in JDK-8350441 is going to change that and we will start to release memory in certain corner-cases. > > In Single-gen ZGC, which exists in older releases, we have paths that do release memory for "views" into the heap. This only happens if something blocks the memory areas were we want to set up our "views" of the heap. We should probably backport this fix to the affected releases. > > I've added a unit test that provokes the problem and I have run this fix together with the changes for JDK-8350441. Okay. Curious, was this a day zero problem? Incidentally, I remember that we had a problem with NUMA on windows where we only released the first NUMA stripe, leaving the other stripes around for future commits to trip over. But ZGC is probably not affected by that, since it does not use os::reserve/release_memory, right? ------------- Marked as reviewed by stuefe (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24377#pullrequestreview-2736284463 From stefank at openjdk.org Wed Apr 2 14:06:06 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Wed, 2 Apr 2025 14:06:06 GMT Subject: RFR: 8353264: ZGC: Windows heap unreserving is broken In-Reply-To: References: Message-ID: On Wed, 2 Apr 2025 13:28:37 GMT, Thomas Stuefe wrote: > Okay. > > Curious, was this a day zero problem? I think it was. 
For completeness, these are the unreserve paths you need to hit to hit this bug:

bool XVirtualMemoryManager::reserve_contiguous(uintptr_t start, size_t size) {
  assert(is_aligned(size, XGranuleSize), "Must be granule aligned");

  // Reserve address views
  const uintptr_t marked0  = XAddress::marked0(start);
  const uintptr_t marked1  = XAddress::marked1(start);
  const uintptr_t remapped = XAddress::remapped(start);

  // Reserve address space
  if (!pd_reserve(marked0, size)) {
    return false;
  }

  if (!pd_reserve(marked1, size)) {
    pd_unreserve(marked0, size);
    return false;
  }

  if (!pd_reserve(remapped, size)) {
    pd_unreserve(marked0, size);
    pd_unreserve(marked1, size);
    return false;
  }

  // Register address views with native memory tracker
  nmt_reserve(marked0, size);
  nmt_reserve(marked1, size);
  nmt_reserve(remapped, size);

  // Make the address range free
  _manager.free(start, size);

  return true;
}

> > Incidentally, I remember that we had a problem with NUMA on windows where we only released the first NUMA stripe, leaving the other stripes around for future commits to trip over. But ZGC is probably not affected by that, since it does not use os::reserve/release_memory, right? It doesn't sound like ZGC would be affected by that. At least not via those APIs. FWIW, I've identified another corner-case bug on Windows that only happens if we end up allocating discontiguous heaps, which only ever happens if all our attempts to allocate a contiguous heap fail. I'm in the process of trying to write a test showing this issue. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24377#issuecomment-2772671014 From ayang at openjdk.org Wed Apr 2 14:22:55 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 2 Apr 2025 14:22:55 GMT Subject: RFR: 8353263: Parallel: Remove locking in PSOldGen::resize In-Reply-To: <4QpvbYEywkzocWXFBkda0ymp3cdpp6PNNTylVqUFXig=.7ee05cda-222a-421c-b09c-1519dfea7bf1@github.com> References: <4QpvbYEywkzocWXFBkda0ymp3cdpp6PNNTylVqUFXig=.7ee05cda-222a-421c-b09c-1519dfea7bf1@github.com> Message-ID: On Mon, 31 Mar 2025 09:45:23 GMT, Albert Mingkun Yang wrote: > Simple removing the use of `PSOldGenExpand_lock` in resizing logic after full-gc, because the calling context is inside a safepoint. > > Test: tier1-5 Thanks for review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24322#issuecomment-2772714981 From ayang at openjdk.org Wed Apr 2 14:22:56 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 2 Apr 2025 14:22:56 GMT Subject: Integrated: 8353263: Parallel: Remove locking in PSOldGen::resize In-Reply-To: <4QpvbYEywkzocWXFBkda0ymp3cdpp6PNNTylVqUFXig=.7ee05cda-222a-421c-b09c-1519dfea7bf1@github.com> References: <4QpvbYEywkzocWXFBkda0ymp3cdpp6PNNTylVqUFXig=.7ee05cda-222a-421c-b09c-1519dfea7bf1@github.com> Message-ID: On Mon, 31 Mar 2025 09:45:23 GMT, Albert Mingkun Yang wrote: > Simple removing the use of `PSOldGenExpand_lock` in resizing logic after full-gc, because the calling context is inside a safepoint. > > Test: tier1-5 This pull request has now been integrated. 
Changeset: a0677d94 Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/a0677d94d8c83a75cee054700e098faa97edca3c Stats: 5 lines in 1 file changed: 1 ins; 2 del; 2 mod 8353263: Parallel: Remove locking in PSOldGen::resize Reviewed-by: tschatzl, zgu ------------- PR: https://git.openjdk.org/jdk/pull/24322 From iwalulya at openjdk.org Wed Apr 2 15:12:02 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Wed, 2 Apr 2025 15:12:02 GMT Subject: RFR: 8271870: G1: Add objArray splitting when scanning object with evacuation failure In-Reply-To: <7hH3ohZ65_msEVaZ0qAI1D3pNI1iyZbKM9sYgfEMbwg=.1d21c70e-788b-43a0-8720-ca0231a70a45@github.com> References: <7hH3ohZ65_msEVaZ0qAI1D3pNI1iyZbKM9sYgfEMbwg=.1d21c70e-788b-43a0-8720-ca0231a70a45@github.com> Message-ID: On Tue, 25 Mar 2025 10:35:58 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that makes the object iteration path for evacuation failed objects the same as the one for regular objects (and indeed make both use the same code). > > This has been made possible with the refactoring of object array task queues. > > At the same time this also covers [JDK-8271871](https://bugs.openjdk.org/browse/JDK-8271871). > > Testing: tier1-5, some perf testing with no differences > > Thanks, > Thomas Marked as reviewed by iwalulya (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/24222#pullrequestreview-2736649318 From manc at openjdk.org Wed Apr 2 16:00:33 2025 From: manc at openjdk.org (Man Cao) Date: Wed, 2 Apr 2025 16:00:33 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v7] In-Reply-To: References: Message-ID: > Hi all, > > I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: > > - does not respect `MinHeapSize`; > - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; > - does not affect heuristcs to trigger a concurrent cycle; > > [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. Man Cao has updated the pull request incrementally with one additional commit since the last revision: Fix test failure on macos-aarch64 by using power-of-two sizes. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24211/files - new: https://git.openjdk.org/jdk/pull/24211/files/0bc55654..4435e89f Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24211&range=06 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24211&range=05-06 Stats: 4 lines in 1 file changed: 2 ins; 0 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/24211.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24211/head:pull/24211 PR: https://git.openjdk.org/jdk/pull/24211 From stuefe at openjdk.org Wed Apr 2 16:16:06 2025 From: stuefe at openjdk.org (Thomas Stuefe) Date: Wed, 2 Apr 2025 16:16:06 GMT Subject: RFR: 8353264: ZGC: Windows heap unreserving is broken In-Reply-To: References: Message-ID: <639NoIyfKt-nwS-Pn2ia-83bQUjAykMzL0YKd8rSO7I=.8973dd8d-686c-42a5-95b5-443ca005ad4f@github.com> On Wed, 2 Apr 2025 14:03:36 GMT, Stefan Karlsson wrote: >> Okay. >> Curious, was this a day zero problem? > I think it was. 
For completeness, this is the unreserve paths you need to hit to hit this bug: Ah okay, this is probably rare. I wondered whether it affects the unmapper path. Because AFAIU, that would have led to out-of-address space at some point with high probability. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24377#issuecomment-2773087416 From kdnilsen at openjdk.org Wed Apr 2 17:49:49 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 2 Apr 2025 17:49:49 GMT Subject: RFR: 8352181: Shenandoah: Evacuate thread roots after early cleanup In-Reply-To: <99wc8_4LoODnc8E0fwS3VV3NTfdPJ3soau-_jaiLrGU=.ef48e18a-03f2-4863-b610-513b52e539a5@github.com> References: <99wc8_4LoODnc8E0fwS3VV3NTfdPJ3soau-_jaiLrGU=.ef48e18a-03f2-4863-b610-513b52e539a5@github.com> Message-ID: On Mon, 17 Mar 2025 21:37:14 GMT, William Kemper wrote: > Moving the evacuation of thread roots after early cleanup allows Shenandoah to recycle immediate garbage a bit sooner in the cycle. Marked as reviewed by kdnilsen (Committer). ------------- PR Review: https://git.openjdk.org/jdk/pull/24090#pullrequestreview-2737095478 From kdnilsen at openjdk.org Wed Apr 2 17:55:48 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 2 Apr 2025 17:55:48 GMT Subject: RFR: 8352181: Shenandoah: Evacuate thread roots after early cleanup In-Reply-To: <99wc8_4LoODnc8E0fwS3VV3NTfdPJ3soau-_jaiLrGU=.ef48e18a-03f2-4863-b610-513b52e539a5@github.com> References: <99wc8_4LoODnc8E0fwS3VV3NTfdPJ3soau-_jaiLrGU=.ef48e18a-03f2-4863-b610-513b52e539a5@github.com> Message-ID: On Mon, 17 Mar 2025 21:37:14 GMT, William Kemper wrote: > Moving the evacuation of thread roots after early cleanup allows Shenandoah to recycle immediate garbage a bit sooner in the cycle. Maybe the "best" tradeoff is "adaptive behavior". If allocatable memory is in "short supply", we should evacuate thread roots early. Otherwise, we should preserve existing behavior. Defining "short supply" might be a bit tricky. There's a related PR that is still in development, to surge GC worker threads when we are at risk of experiencing allocation failures. A lot of heuristic predictions feed into the decision of when and whether to surge. We could use that same feedback mechanism here. If we are under "worker surge" conditions, that suggests memory is in short supply, an this is the ideal time to shift some of the GC work onto the mutators, so this is when we should evacuate thread roots early. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24090#issuecomment-2773302505 From jsikstro at openjdk.org Wed Apr 2 18:20:00 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Wed, 2 Apr 2025 18:20:00 GMT Subject: RFR: 8353559: Restructure CollectedHeap error printing Message-ID: Calling Universe::heap()->print_on_error() gets dispatched to the most specific implementation, which for some GCs is their own implementation instead of the default in CollectedHeap. Each GC-specific implementation calls back to CollectedHeap::print_on_error(), which then dispatches back into the specific implementation of print_on(). This is kind of awkward and creates a call-chain that's not straightforward to wrap your head around, jumping back and forth via CollectedHeap and the specific implementation. To make the call-chain cleaner, I have made print_on_error() a pure virtual method in CollectedHeap, and implemented print_on_error() in each GC's implementation of CollectedHeap. 
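(For illustration only: a minimal, self-contained sketch of the dispatch shape described above. The `outputStream` stub, `ExampleHeap` class and the printed strings are stand-ins; the actual classes and signatures in the PR may differ.)

```
#include <cstdio>

// Stand-in for HotSpot's outputStream, only for this sketch.
struct outputStream {
  void print_cr(const char* s) { std::printf("%s\n", s); }
};

// CollectedHeap only declares the hook; each collector implements it
// directly, so there is no bouncing back through a shared default.
class CollectedHeap {
public:
  virtual ~CollectedHeap() {}
  virtual void print_on(outputStream* st) const = 0;
  virtual void print_on_error(outputStream* st) const = 0;
};

class ExampleHeap : public CollectedHeap {
public:
  void print_on(outputStream* st) const override {
    st->print_cr("example heap: regular printing");
  }
  void print_on_error(outputStream* st) const override {
    print_on(st);                                      // reuse regular printing
    st->print_cr("example heap: error-only details");  // GC-specific extras
  }
};

int main() {
  outputStream st;
  ExampleHeap heap;
  heap.print_on_error(&st);  // one virtual dispatch, straight into the GC
  return 0;
}
```

With this shape, the caller (the error reporter) is also the natural place to emit the leading "Heap:" label, as the description goes on to explain.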
In addition, I have removed print_extended_on() from CollectedHeap and implemented that for the GCs that actually need/use it. Removing the usage of the common print_on_error() also means that GCs that do not print anything interesting for their barrier set can omit this. So, I've removed it from ZGC and Shenandoah. To make print_on_error() consistent with print_on(), I have moved the printing of "Heap:" to the caller(s) of print_on_error() (only inside vmError.cpp). This is a trivial change for all GCs except ZGC, which requires some restructuring in its error printing. The old and new printing orders are shown below for ZGC: # Old # New Testing: * GHA * Tiers 1 & 2 * Manually verified that printing still works and outputs the intended information via running the following commands and comparing the output. ../fastdebug-old/jdk/bin/java -XX:ErrorHandlerTest=14 -XX:+ErrorFileToStdout -XX:+Use${gc}GC --version > ${gc}_old.txt ../fastdebug-new/jdk/bin/java -XX:ErrorHandlerTest=14 -XX:+ErrorFileToStdout -XX:+Use${gc}GC --version > ${gc}_new.txt ------------- Commit messages: - Copyright years - 8353559: Restructure CollectedHeap error printing Changes: https://git.openjdk.org/jdk/pull/24387/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24387&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8353559 Stats: 141 lines in 16 files changed: 75 ins; 52 del; 14 mod Patch: https://git.openjdk.org/jdk/pull/24387.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24387/head:pull/24387 PR: https://git.openjdk.org/jdk/pull/24387 From jsikstro at openjdk.org Wed Apr 2 18:40:57 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Wed, 2 Apr 2025 18:40:57 GMT Subject: RFR: 8353264: ZGC: Windows heap unreserving is broken In-Reply-To: References: Message-ID: On Wed, 2 Apr 2025 11:35:36 GMT, Stefan Karlsson wrote: > During the development of [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) we found that the functionality to release reserved memory for the heap is broken. The current implementation passes in the size of the reserved memory area, but according to the documentation the call should be done with `0` as the dwSize argument: > > If the dwFreeType parameter is MEM_RELEASE, dwSize must be 0 (zero) > > > Generational ZGC isn't affected by this because we never release any reserved memory for the heap. However, the changes in JDK-8350441 is going to change that and we will start to release memory in certain corner-cases. > > In Single-gen ZGC, which exists in older releases, we have paths that do release memory for "views" into the heap. This only happens if something blocks the memory areas were we want to set up our "views" of the heap. We should probably backport this fix to the affected releases. > > I've added a unit test that provokes the problem and I have run this fix together with the changes for JDK-8350441. Should `_has_unreserved` and `test_unreserve` become be static like the other member variables and test methods? ------------- PR Review: https://git.openjdk.org/jdk/pull/24377#pullrequestreview-2737227639 From stefank at openjdk.org Wed Apr 2 20:16:59 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Wed, 2 Apr 2025 20:16:59 GMT Subject: RFR: 8353264: ZGC: Windows heap unreserving is broken In-Reply-To: References: Message-ID: On Wed, 2 Apr 2025 18:38:34 GMT, Joel Sikstr?m wrote: > Should `_has_unreserved` and `test_unreserve` become be static like the other member variables and test methods? I'll look into that tomorrow. 
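(Aside, for readers unfamiliar with the pattern being asked about: a hedged sketch of what "making the member variables and test methods static" looks like in a gtest fixture. The fixture and field names below are illustrative only, not the actual ZGC test; it assumes the usual gtest_main runner.)

```
#include <gtest/gtest.h>
#include <cstddef>

class ZVirtualMemoryReserveTest : public ::testing::Test {
protected:
  // Shared state only touched from static helpers, so it is static too.
  static bool        _has_unreserved;
  static std::size_t _unreserved_bytes;

  static void test_unreserve(std::size_t size) {
    // ... call the code under test here ...
    _has_unreserved = true;
    _unreserved_bytes = size;
  }
};

bool        ZVirtualMemoryReserveTest::_has_unreserved = false;
std::size_t ZVirtualMemoryReserveTest::_unreserved_bytes = 0;

TEST_F(ZVirtualMemoryReserveTest, unreserve) {
  test_unreserve(2 * 1024 * 1024);
  EXPECT_TRUE(_has_unreserved);
  EXPECT_EQ(_unreserved_bytes, 2u * 1024 * 1024);
}
```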
------------- PR Comment: https://git.openjdk.org/jdk/pull/24377#issuecomment-2773620954 From stefank at openjdk.org Wed Apr 2 20:16:58 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Wed, 2 Apr 2025 20:16:58 GMT Subject: RFR: 8353264: ZGC: Windows heap unreserving is broken In-Reply-To: <639NoIyfKt-nwS-Pn2ia-83bQUjAykMzL0YKd8rSO7I=.8973dd8d-686c-42a5-95b5-443ca005ad4f@github.com> References: <639NoIyfKt-nwS-Pn2ia-83bQUjAykMzL0YKd8rSO7I=.8973dd8d-686c-42a5-95b5-443ca005ad4f@github.com> Message-ID: On Wed, 2 Apr 2025 16:13:30 GMT, Thomas Stuefe wrote: > > > Okay. > > > > Curious, was this a day zero problem? > > > I think it was. For completeness, this is the unreserve paths you need to hit to hit this bug: > > Ah okay, this is probably rare. I wondered whether it affects the unmapper path. The unmapper converts the mapped memory (virtual to the physical memory) to be just reserved memory (but using Window's placeholder mechanism). So, the memory is not unreserved by the unmapper. I hope this makes sense. > Because AFAIU, that would have led to out-of-address space at some point with high probability. If you try to call this faulty unreserve implementation then the JVM will immediately shut down. So, I don't think this bug will cause and address-space leak. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24377#issuecomment-2773620289 From stefank at openjdk.org Wed Apr 2 20:53:48 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Wed, 2 Apr 2025 20:53:48 GMT Subject: RFR: 8353559: Restructure CollectedHeap error printing In-Reply-To: References: Message-ID: On Wed, 2 Apr 2025 18:09:12 GMT, Joel Sikstr?m wrote: > Calling Universe::heap()->print_on_error() gets dispatched to the most specific implementation, which for some GCs is their own implementation instead of the default in CollectedHeap. Each GC-specific implementation calls back to CollectedHeap::print_on_error(), which then dispatches back into the specific implementation of print_on(). This is kind of awkward and creates a call-chain that's not straightforward to wrap your head around, jumping back and forth via CollectedHeap and the specific implementation. > > To make the call-chain cleaner, I have made print_on_error() a pure virtual method in CollectedHeap, and implemented print_on_error() in each GC's implementation of CollectedHeap. In addition, I have removed print_extended_on() from CollectedHeap and implemented that for the GCs that actually need/use it. > > Removing the usage of the common print_on_error() also means that GCs that do not print anything interesting for their barrier set can omit this. So, I've removed it from ZGC and Shenandoah. > > To make print_on_error() consistent with print_on(), I have moved the printing of "Heap:" to the caller(s) of print_on_error() (only inside vmError.cpp). This is a trivial change for all GCs except ZGC, which requires some restructuring in its error printing. > > The old and new printing orders are shown below for ZGC: > > # Old > > > > > > > > > > # New > > > > > > > > Testing: > * GHA > * Tiers 1 & 2 > * Manually verified that printing still works and outputs the intended information via running the following commands and comparing the output. > > ../fastdebug-old/jdk/bin/java -XX:ErrorHandlerTest=14 -XX:+ErrorFileToStdout -XX:+Use${gc}GC --version > ${gc}_old.txt > ../fastdebug-new/jdk/bin/java -XX:ErrorHandlerTest=14 -XX:+ErrorFileToStdout -XX:+Use${gc}GC --version > ${gc}_new.txt Marked as reviewed by stefank (Reviewer). 
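(For illustration: the VirtualFree rule quoted in the PR description implies an unreserve call of the following shape on Windows. This is a minimal sketch, not the actual ZGC pd_unreserve code; the helper name and the reservation in main are assumptions.)

```
#include <windows.h>

// Release previously reserved (uncommitted) address space. Per the
// VirtualFree documentation, dwSize must be 0 when MEM_RELEASE is used,
// and the address must be the base address of the original reservation.
static bool unreserve(void* addr) {
  return VirtualFree(addr, 0 /* must be 0 with MEM_RELEASE */, MEM_RELEASE) != 0;
}

int main() {
  void* p = VirtualAlloc(nullptr, 1 << 20, MEM_RESERVE, PAGE_NOACCESS);
  return (p != nullptr && unreserve(p)) ? 0 : 1;
}
```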
------------- PR Review: https://git.openjdk.org/jdk/pull/24387#pullrequestreview-2737551377 From manc at openjdk.org Thu Apr 3 06:29:49 2025 From: manc at openjdk.org (Man Cao) Date: Thu, 3 Apr 2025 06:29:49 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v7] In-Reply-To: References: Message-ID: On Wed, 2 Apr 2025 16:00:33 GMT, Man Cao wrote: >> Hi all, >> >> I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: >> >> - does not respect `MinHeapSize`; >> - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; >> - does not affect heuristcs to trigger a concurrent cycle; >> >> [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. > > Man Cao has updated the pull request incrementally with one additional commit since the last revision: > > Fix test failure on macos-aarch64 by using power-of-two sizes. Re [Thomas' comment](#issuecomment-2772493942): > The original patch on the CR only set the guidance for the marking. It did not interact with heap sizing directly at all like the change does. What is the reason for this change? Because without changing heap sizing directly, setting `SoftMaxHeapSize` alone is ineffective to shrink the heap in most cases. E.g., the included test `test/hotspot/jtreg/gc/g1/TestSoftMaxHeapSize.java` will fail. For other concerns, I think one fundamental issue is the precedence of heap sizing flags: should the JVM respect `SoftMaxHeapSize` over `GCTimeRatio`/`MinHeapFreeRatio`/`MaxHeapFreeRatio`? My preference is yes, that `SoftMaxHeapSize` should have higher precedence, for the following reasons: 1. Users that set `SoftMaxHeapSize` expect it to be effective to limit heap size. The JVM should do its best to respect user's request. As [JDK-8222181](https://bugs.openjdk.org/browse/JDK-8222181) mentions: "When -XX:SoftMaxHeapSize is set, the GC should strive to not grow heap size beyond the specified size, unless the GC decides it's necessary to do so." We might interpret "GC decides it's necessary" differently. I think the real necessary case is "the JVM will throw OutOfMemoryError if it does not grow the heap", instead of "the JVM will violate `MinHeapFreeRatio`/`MaxHeapFreeRatio`/`GCTimeRatio` if it does not grow the heap". 1. Having a single flag that makes G1 shrink heap more aggressively, is much more user-friendly than requiring users to tune 3 or more flags to achieve the same effect. As you mentioned, if `SoftMaxHeapSize` only guides marking, user has to also tune `MinHeapFreeRatio`/`MaxHeapFreeRatio` to make G1 shrink more aggressively. It is difficult to figure out a proper value for each flag. Moreover, if user wants to make G1 shrink to a specific heap size, it is a lot harder to achieve that through tuning `MinHeapFreeRatio`/`MaxHeapFreeRatio`. 1. Issues with expansion after young collections from `GCTimeRatio`. `MinHeapFreeRatio`/`MaxHeapFreeRatio` have no effect on how much G1 expands the heap after young collections. Users need to tune `GCTimeRatio` if they want to make G1 expand less aggressively, otherwise aggressive expansion would defeat the purpose of `SoftMaxHeapSize`. 
However, `GCTimeRatio` is not a manageable flag, so it cannot be changed at run time. If `SoftMaxHeapSize` has precedence, we don't need to bother making `GCTimeRatio` manageable and asking users to tune it at run time. (This is somewhat related to [JDK-8349978](https://bugs.openjdk.org/browse/JDK-8349978) and [email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-February/051004.html). ) > So similar to @walulyai I would strongly prefer for SoftMaxHeapSize not interfere that much with the application's performance. If user sets a too small `SoftMaxHeapSize` and causes performance regression or GC thrashing, it is really user's misconfiguration, and they should take measures to adjust `SoftMaxHeapSize` based on workload. Also misconfiguring `GCTimeRatio`/`MinHeapFreeRatio`/`MaxHeapFreeRatio` could cause similar regressions (think of `-XX:GCTimeRatio=1 -XX:MinHeapFreeRatio=1 -XX:MaxHeapFreeRatio=1`). However, I can see that `SoftMaxHeapSize` may be easier to misconfigure than the other 3 flags, because it does not adapt to changing live size by itself. I wonder if we could try reaching a middle ground (perhaps this is also what you suggests with ZGC's example of growing up to 25% of cpu usage?): - `SoftMaxHeapSize` still takes higher precedence over `GCTimeRatio`/`MinHeapFreeRatio`/`MaxHeapFreeRatio`. - G1 could have an internal mechanism to detect GC thrashing, and expands heap above `SoftMaxHeapSize` if thrashing happens. > That gets us back to [JDK-8238687](https://bugs.openjdk.org/browse/JDK-8238687) and [JDK-8248324](https://bugs.openjdk.org/browse/JDK-8248324)... Yes, fixing these two issues would be great regardless of `SoftMaxHeapSize`. However, they do not address the 3 issues above about flag precedence. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2774619383 From manc at openjdk.org Thu Apr 3 07:08:19 2025 From: manc at openjdk.org (Man Cao) Date: Thu, 3 Apr 2025 07:08:19 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v8] In-Reply-To: References: Message-ID: > Hi all, > > I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: > > - does not respect `MinHeapSize`; > - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; > - does not affect heuristcs to trigger a concurrent cycle; > > [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. 
Man Cao has updated the pull request incrementally with one additional commit since the last revision: Use Atomic::load for flag ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24211/files - new: https://git.openjdk.org/jdk/pull/24211/files/4435e89f..c60ade41 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24211&range=07 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24211&range=06-07 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/24211.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24211/head:pull/24211 PR: https://git.openjdk.org/jdk/pull/24211 From manc at openjdk.org Thu Apr 3 07:30:51 2025 From: manc at openjdk.org (Man Cao) Date: Thu, 3 Apr 2025 07:30:51 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v8] In-Reply-To: References: Message-ID: On Thu, 3 Apr 2025 07:08:19 GMT, Man Cao wrote: >> Hi all, >> >> I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: >> >> - does not respect `MinHeapSize`; >> - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; >> - does not affect heuristcs to trigger a concurrent cycle; >> >> [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. > > Man Cao has updated the pull request incrementally with one additional commit since the last revision: > > Use Atomic::load for flag Re: concurrency issue with reading `SoftMaxHeapSize` I updated to `Atomic::load()`, but not sure if I understand the concern correctly. > So e.g. the assignment of `G1IHOPControl::get_conc_mark_start_threshold` to `marking_initiating_used_threshold` in that call can be inlined in `G1Policy::need_to_start_conc_mark` (called by the mutator in `G1CollectedHeap::attempt_allocation_humongous`) in multiple places, and so `SoftMaxHeapSize` re-read with multiple different values in that method. I don't see where the re-read is. I think in any code path from `G1IHOPControl::get_conc_mark_start_threshold`, `G1CollectedHeap::heap()->soft_max_capacity()` is called only once. `G1CollectedHeap::attempt_allocation_humongous` also appears to call `G1Policy::need_to_start_conc_mark` only once, which calls `G1IHOPControl::get_conc_mark_start_threshold` only once. I agree it is a data race if `soft_max_capacity()` runs outside of a safepoint, so `Atomic::load()` makes sense regardless. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2774731515 From iwalulya at openjdk.org Thu Apr 3 08:11:00 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Thu, 3 Apr 2025 08:11:00 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v7] In-Reply-To: References: Message-ID: On Thu, 3 Apr 2025 06:27:22 GMT, Man Cao wrote: > 1. Users that set `SoftMaxHeapSize` expect it to be effective to limit heap size. The JVM should do its best to respect user's request. As [JDK-8222181](https://bugs.openjdk.org/browse/JDK-8222181) mentions: "When -XX:SoftMaxHeapSize is set, the GC should strive to not grow heap size beyond the specified size, unless the GC decides it's necessary to do so." 
We might interpret "GC decides it's necessary" differently. I think the real necessary case is "the JVM will throw OutOfMemoryError if it does not grow the heap", instead of "the JVM will violate `MinHeapFreeRatio`/`MaxHeapFreeRatio`/`GCTimeRatio` if it does not grow the heap". In the current approach, it is not that we are respecting the user's request, we are violating the request just that we do this only during GCs. So eventually you have back to back GCs that will expand the heap to whatever heapsize the application requires. My interpretation of `SoftMaxHeapSize` is that we can meet this limit where possible, but also exceed the limit if required. So I propose we take the same approach as used in other GCs where `SoftMaxHeapSize` is used as a parameter for setting GC pressure but not as a limit to allocations. > > 3. Issues with expansion after young collections from `GCTimeRatio`. `MinHeapFreeRatio`/`MaxHeapFreeRatio` have no effect on how much G1 expands the heap after young collections. Users need to tune `GCTimeRatio` if they want to make G1 expand less aggressively, otherwise aggressive expansion would defeat the purpose of `SoftMaxHeapSize`. However, `GCTimeRatio` is not a manageable flag, so it cannot be changed at run time. If `SoftMaxHeapSize` has precedence, we don't need to bother making `GCTimeRatio` manageable and asking users to tune it at run time. (This is somewhat related to [JDK-8349978](https://bugs.openjdk.org/browse/JDK-8349978) and [email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-February/051004.html). ) Agreed, these ratios are problematic, and we should find a solution that removes them. We also need to agree on the purpose of `SoftMaxHeapSize`, my understanding is that `SoftMaxHeapSize` is meant for the application to be handle spikes in allocations and and quickly release the memory if no longer required. If `SoftMaxHeapSize` has precedence over`GCTimeRatio`, then G1 is changing the objective from balancing latency and throughput to optimizing for memory usage. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2774824745 From tschatzl at openjdk.org Thu Apr 3 08:34:00 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 3 Apr 2025 08:34:00 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v8] In-Reply-To: References: Message-ID: On Thu, 3 Apr 2025 07:28:13 GMT, Man Cao wrote: > Re: concurrency issue with reading `SoftMaxHeapSize` > > I updated to `Atomic::load()`, but not sure if I understand the concern correctly. > > > So e.g. the assignment of `G1IHOPControl::get_conc_mark_start_threshold` to `marking_initiating_used_threshold` in that call can be inlined in `G1Policy::need_to_start_conc_mark` (called by the mutator in `G1CollectedHeap::attempt_allocation_humongous`) in multiple places, and so `SoftMaxHeapSize` re-read with multiple different values in that method. > > I don't see where the re-read is. I think in any code path from `G1IHOPControl::get_conc_mark_start_threshold`, `G1CollectedHeap::heap()->soft_max_capacity()` is called only once. `G1CollectedHeap::attempt_allocation_humongous` also appears to call `G1Policy::need_to_start_conc_mark` only once, which calls `G1IHOPControl::get_conc_mark_start_threshold` only once. > > I agree it is a data race if `soft_max_capacity()` runs outside of a safepoint, so `Atomic::load()` makes sense regardless. 
The compiler could be(*) free to call `get_conc_mark_start_threshold()` again in any of the uses of the local variable without telling it that one of its components may change between re-reads. (*) Probably not after looking again, given that it's not marked as `const` (not sure why), and a virtual method, and fairly large. The situation would be much worse if somehow `SoftMaxHeapsize` could be changed within a safepoint. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2774885501 From stefank at openjdk.org Thu Apr 3 09:32:12 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Thu, 3 Apr 2025 09:32:12 GMT Subject: RFR: 8353264: ZGC: Windows heap unreserving is broken [v2] In-Reply-To: References: Message-ID: > During the development of [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) we found that the functionality to release reserved memory for the heap is broken. The current implementation passes in the size of the reserved memory area, but according to the documentation the call should be done with `0` as the dwSize argument: > > If the dwFreeType parameter is MEM_RELEASE, dwSize must be 0 (zero) > > > Generational ZGC isn't affected by this because we never release any reserved memory for the heap. However, the changes in JDK-8350441 is going to change that and we will start to release memory in certain corner-cases. > > In Single-gen ZGC, which exists in older releases, we have paths that do release memory for "views" into the heap. This only happens if something blocks the memory areas were we want to set up our "views" of the heap. We should probably backport this fix to the affected releases. > > I've added a unit test that provokes the problem and I have run this fix together with the changes for JDK-8350441. Stefan Karlsson has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision: - Merge remote-tracking branch 'upstream/master' into 8353264_zgc_unreserve - Make addtions static - 8353264: ZGC: Windows heap unreserving is broken ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24377/files - new: https://git.openjdk.org/jdk/pull/24377/files/7e2861b2..bbf83831 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24377&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24377&range=00-01 Stats: 11266 lines in 447 files changed: 7600 ins; 2558 del; 1108 mod Patch: https://git.openjdk.org/jdk/pull/24377.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24377/head:pull/24377 PR: https://git.openjdk.org/jdk/pull/24377 From jsikstro at openjdk.org Thu Apr 3 09:32:12 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Thu, 3 Apr 2025 09:32:12 GMT Subject: RFR: 8353264: ZGC: Windows heap unreserving is broken [v2] In-Reply-To: References: Message-ID: <-jYFzlEXm9kiqtULRVQFRP1UcAfb_Yscb8s7AelLI98=.b68fb9ed-1a28-4437-8658-40087c134800@github.com> On Thu, 3 Apr 2025 09:29:08 GMT, Stefan Karlsson wrote: >> During the development of [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) we found that the functionality to release reserved memory for the heap is broken. 
The current implementation passes in the size of the reserved memory area, but according to the documentation the call should be done with `0` as the dwSize argument: >> >> If the dwFreeType parameter is MEM_RELEASE, dwSize must be 0 (zero) >> >> >> Generational ZGC isn't affected by this because we never release any reserved memory for the heap. However, the changes in JDK-8350441 is going to change that and we will start to release memory in certain corner-cases. >> >> In Single-gen ZGC, which exists in older releases, we have paths that do release memory for "views" into the heap. This only happens if something blocks the memory areas were we want to set up our "views" of the heap. We should probably backport this fix to the affected releases. >> >> I've added a unit test that provokes the problem and I have run this fix together with the changes for JDK-8350441. > > Stefan Karlsson has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision: > > - Merge remote-tracking branch 'upstream/master' into 8353264_zgc_unreserve > - Make addtions static > - 8353264: ZGC: Windows heap unreserving is broken Marked as reviewed by jsikstro (Committer). ------------- PR Review: https://git.openjdk.org/jdk/pull/24377#pullrequestreview-2739122546 From eosterlund at openjdk.org Thu Apr 3 09:53:53 2025 From: eosterlund at openjdk.org (Erik =?UTF-8?B?w5ZzdGVybHVuZA==?=) Date: Thu, 3 Apr 2025 09:53:53 GMT Subject: RFR: 8353559: Restructure CollectedHeap error printing In-Reply-To: References: Message-ID: On Wed, 2 Apr 2025 18:09:12 GMT, Joel Sikstr?m wrote: > Calling Universe::heap()->print_on_error() gets dispatched to the most specific implementation, which for some GCs is their own implementation instead of the default in CollectedHeap. Each GC-specific implementation calls back to CollectedHeap::print_on_error(), which then dispatches back into the specific implementation of print_on(). This is kind of awkward and creates a call-chain that's not straightforward to wrap your head around, jumping back and forth via CollectedHeap and the specific implementation. > > To make the call-chain cleaner, I have made print_on_error() a pure virtual method in CollectedHeap, and implemented print_on_error() in each GC's implementation of CollectedHeap. In addition, I have removed print_extended_on() from CollectedHeap and implemented that for the GCs that actually need/use it. > > Removing the usage of the common print_on_error() also means that GCs that do not print anything interesting for their barrier set can omit this. So, I've removed it from ZGC and Shenandoah. > > To make print_on_error() consistent with print_on(), I have moved the printing of "Heap:" to the caller(s) of print_on_error() (only inside vmError.cpp). This is a trivial change for all GCs except ZGC, which requires some restructuring in its error printing. > > The old and new printing orders are shown below for ZGC: > > # Old > > > > > > > > > > # New > > > > > > > > Testing: > * GHA > * Tiers 1 & 2 > * Manually verified that printing still works and outputs the intended information via running the following commands and comparing the output. 
> > ../fastdebug-old/jdk/bin/java -XX:ErrorHandlerTest=14 -XX:+ErrorFileToStdout -XX:+Use${gc}GC --version > ${gc}_old.txt > ../fastdebug-new/jdk/bin/java -XX:ErrorHandlerTest=14 -XX:+ErrorFileToStdout -XX:+Use${gc}GC --version > ${gc}_new.txt Marked as reviewed by eosterlund (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/24387#pullrequestreview-2739190388 From tschatzl at openjdk.org Thu Apr 3 10:01:54 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 3 Apr 2025 10:01:54 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v8] In-Reply-To: References: Message-ID: On Thu, 3 Apr 2025 07:08:19 GMT, Man Cao wrote: >> Hi all, >> >> I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: >> >> - does not respect `MinHeapSize`; >> - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; >> - does not affect heuristcs to trigger a concurrent cycle; >> >> [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. > > Man Cao has updated the pull request incrementally with one additional commit since the last revision: > > Use Atomic::load for flag > Re [Thomas' comment](#issuecomment-2772493942): > > > The original patch on the CR only set the guidance for the marking. It did not interact with heap sizing directly at all like the change does. What is the reason for this change? > > Because without changing heap sizing directly, setting `SoftMaxHeapSize` alone is ineffective to shrink the heap in most cases. E.g., the included test `test/hotspot/jtreg/gc/g1/TestSoftMaxHeapSize.java` will fail. > > For other concerns, I think one fundamental issue is the precedence of heap sizing flags: should the JVM respect `SoftMaxHeapSize` over `GCTimeRatio`/`MinHeapFreeRatio`/`MaxHeapFreeRatio`? My preference is yes, that `SoftMaxHeapSize` should have higher precedence, for the following reasons: > > 1. Users that set `SoftMaxHeapSize` expect it to be effective to limit heap size. The JVM should do its best to respect user's request. As [JDK-8222181](https://bugs.openjdk.org/browse/JDK-8222181) mentions: "When -XX:SoftMaxHeapSize is set, the GC should strive to not grow heap size beyond the specified size, unless the GC decides it's necessary to do so." We might interpret "GC decides it's necessary" differently. I think the real necessary case is "the JVM will throw OutOfMemoryError if it does not grow the heap", instead of "the JVM will violate `MinHeapFreeRatio`/`MaxHeapFreeRatio`/`GCTimeRatio` if it does not grow the heap". > > 2. Having a single flag that makes G1 shrink heap more aggressively, is much more user-friendly than requiring users to tune 3 or more flags to achieve the same effect. As you mentioned, if `SoftMaxHeapSize` only guides marking, user has to also tune `MinHeapFreeRatio`/`MaxHeapFreeRatio` to make G1 shrink more aggressively. It is difficult to figure out a proper value for each flag. Moreover, if user wants to make G1 shrink to a specific heap size, it is a lot harder to achieve that through tuning `MinHeapFreeRatio`/`MaxHeapFreeRatio`. > > 3. 
Issues with expansion after young collections from `GCTimeRatio`. `MinHeapFreeRatio`/`MaxHeapFreeRatio` have no effect on how much G1 expands the heap after young collections. Users need to tune `GCTimeRatio` if they want to make G1 expand less aggressively, otherwise aggressive expansion would defeat the purpose of `SoftMaxHeapSize`. However, `GCTimeRatio` is not a manageable flag, so it cannot be changed at run time. If `SoftMaxHeapSize` has precedence, we don't need to bother making `GCTimeRatio` manageable and asking users to tune it at run time. (This is somewhat related to [JDK-8349978](https://bugs.openjdk.org/browse/JDK-8349978) and [email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-February/051004.html). ) > > > > So similar to @walulyai I would strongly prefer for SoftMaxHeapSize not interfere that much with the application's performance. > > If user sets a too small `SoftMaxHeapSize` and causes performance regression or GC thrashing, it is really user's misconfiguration, and they should take measures to adjust `SoftMaxHeapSize` based on workload. Also misconfiguring `GCTimeRatio`/`MinHeapFreeRatio`/`MaxHeapFreeRatio` could cause similar regressions (think of `-XX:GCTimeRatio=1 -XX:MinHeapFreeRatio=1 -XX:MaxHeapFreeRatio=1`). > > However, I can see that `SoftMaxHeapSize` may be easier to misconfigure than the other 3 flags, because it does not adapt to changing live size by itself. I wonder if we could try reaching a middle ground (perhaps this is also what you suggests with ZGC's example of growing up to 25% of cpu usage?): Exactly. > > * `SoftMaxHeapSize` still takes higher precedence over `GCTimeRatio`/`MinHeapFreeRatio`/`MaxHeapFreeRatio`. > > * G1 could have an internal mechanism to detect GC thrashing, and expands heap above `SoftMaxHeapSize` if thrashing happens. > > > > That gets us back to [JDK-8238687](https://bugs.openjdk.org/browse/JDK-8238687) and [JDK-8248324](https://bugs.openjdk.org/browse/JDK-8248324)... > > Yes, fixing these two issues would be great regardless of `SoftMaxHeapSize`. However, they do not address the 3 issues above about flag precedence. * JDK-8248324 effectively removes the use of `Min/MaxHeapFreeRatio` (apart of full gc, which obviously they also need to be handled in some way that fits into the system). * JDK-8238687 makes `GCTimeRatio` shrink the heap too, obviating the need for `Min/MaxHeapFreeRatio`, which are currently the knobs that limit excessive memory usage. With no flag to interfere (no `Min/MaxHeapFreeRatio`) with each other, there is no need for considering their precedence. As you mention, there is need for some strategy to reconcile divergent goals - ultimately G1 needs a single value that tells it to resize the heap in which direction in which degree. Incidentally, the way `GCTimeRatio` (or actually the internal gc cpu usage target as an intermediate) is already in use fits these requirements. From that guiding value you can calculate a difference to desired, with some smoothing applied, which gives you both direction and degree of the change in heap size (applying some magic factors/constants). So it seems fairly straightforward to have any outside "memory pressure" effect this intermediate control value instead of everyone overriding each other in multiple places in the code. Now there is some question about the weights of these factors: we (in the gc team) prefer to keep G1's balancing between throughput and latency, particularly if the input this time is some value explicitly containing "soft" in its name. 
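(As a rough illustration of the control idea described above -- one guiding value derived from the gap between measured and desired GC CPU usage, smoothed, and turned into a resize direction and magnitude -- here is a toy model. It is deliberately not G1 code; all constants, names and the gain factor are placeholders.)

```
#include <algorithm>
#include <cstddef>

// Toy heap-sizing controller: compares measured GC CPU usage against the
// target and converts the smoothed error into a bounded capacity change.
struct HeapSizingControl {
  double target_gc_cpu_usage;   // e.g. lowered when SoftMaxHeapSize pressure is applied
  double smoothed_error = 0.0;  // exponentially smoothed (actual - target)

  std::size_t next_capacity(double actual_gc_cpu_usage, std::size_t current_capacity) {
    const double alpha = 0.3;   // smoothing factor (placeholder)
    smoothed_error = alpha * (actual_gc_cpu_usage - target_gc_cpu_usage)
                   + (1.0 - alpha) * smoothed_error;

    // Over budget => grow the heap to reduce GC work; under budget => shrink.
    const double gain = 2.0;      // magic factor (placeholder)
    const double max_step = 0.2;  // cap the change at 20% per decision (placeholder)
    const double step = std::clamp(gain * smoothed_error, -max_step, max_step);
    return static_cast<std::size_t>(current_capacity * (1.0 + step));
  }
};

int main() {
  HeapSizingControl ctrl{0.05};                  // target: 5% of CPU time in GC
  std::size_t cap = std::size_t(1) << 30;        // 1 GiB current capacity
  cap = ctrl.next_capacity(0.12, cap);           // measured 12% -> grow
  cap = ctrl.next_capacity(0.02, cap);           // measured 2%  -> shrink
  return cap > 0 ? 0 : 1;
}
```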
Using the 25% from ZGC as a max limit for gc cpu usage if we are (way) beyond what the user desires seems good enough for an initial guess. Not too high, guaranteeing some application progress in the worst case (for this factor!), not too low, guaranteeing that the intent of the user setting this value is respected. (One can see `Min/MaxHeapFreeRatio` as an old attempt to limit heap size growth without affecting performance too much, changing memory pressure. However they are hard to use. And they are completely dis-associated with the rest of the heap sizing mechanism. `SoftMaxHeapSize` is easier to handle) ------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2775155378 From stefank at openjdk.org Thu Apr 3 10:38:59 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Thu, 3 Apr 2025 10:38:59 GMT Subject: RFR: 8353264: ZGC: Windows heap unreserving is broken [v2] In-Reply-To: References: Message-ID: On Thu, 3 Apr 2025 09:32:12 GMT, Stefan Karlsson wrote: >> During the development of [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) we found that the functionality to release reserved memory for the heap is broken. The current implementation passes in the size of the reserved memory area, but according to the documentation the call should be done with `0` as the dwSize argument: >> >> If the dwFreeType parameter is MEM_RELEASE, dwSize must be 0 (zero) >> >> >> Generational ZGC isn't affected by this because we never release any reserved memory for the heap. However, the changes in JDK-8350441 is going to change that and we will start to release memory in certain corner-cases. >> >> In Single-gen ZGC, which exists in older releases, we have paths that do release memory for "views" into the heap. This only happens if something blocks the memory areas were we want to set up our "views" of the heap. We should probably backport this fix to the affected releases. >> >> I've added a unit test that provokes the problem and I have run this fix together with the changes for JDK-8350441. > > Stefan Karlsson has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision: > > - Merge remote-tracking branch 'upstream/master' into 8353264_zgc_unreserve > - Make addtions static > - 8353264: ZGC: Windows heap unreserving is broken Thanks for the reviews! ------------- PR Comment: https://git.openjdk.org/jdk/pull/24377#issuecomment-2775290032 From stefank at openjdk.org Thu Apr 3 10:48:01 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Thu, 3 Apr 2025 10:48:01 GMT Subject: Integrated: 8353264: ZGC: Windows heap unreserving is broken In-Reply-To: References: Message-ID: On Wed, 2 Apr 2025 11:35:36 GMT, Stefan Karlsson wrote: > During the development of [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) we found that the functionality to release reserved memory for the heap is broken. The current implementation passes in the size of the reserved memory area, but according to the documentation the call should be done with `0` as the dwSize argument: > > If the dwFreeType parameter is MEM_RELEASE, dwSize must be 0 (zero) > > > Generational ZGC isn't affected by this because we never release any reserved memory for the heap. However, the changes in JDK-8350441 is going to change that and we will start to release memory in certain corner-cases. 
> > In Single-gen ZGC, which exists in older releases, we have paths that do release memory for "views" into the heap. This only happens if something blocks the memory areas were we want to set up our "views" of the heap. We should probably backport this fix to the affected releases. > > I've added a unit test that provokes the problem and I have run this fix together with the changes for JDK-8350441. This pull request has now been integrated. Changeset: ffca4f2d Author: Stefan Karlsson URL: https://git.openjdk.org/jdk/commit/ffca4f2da84cb8711794d8e692d176a7e785e7b1 Stats: 27 lines in 2 files changed: 24 ins; 0 del; 3 mod 8353264: ZGC: Windows heap unreserving is broken Reviewed-by: jsikstro, aboldtch, eosterlund, stuefe ------------- PR: https://git.openjdk.org/jdk/pull/24377 From aboldtch at openjdk.org Thu Apr 3 10:48:01 2025 From: aboldtch at openjdk.org (Axel Boldt-Christmas) Date: Thu, 3 Apr 2025 10:48:01 GMT Subject: RFR: 8353264: ZGC: Windows heap unreserving is broken [v2] In-Reply-To: References: Message-ID: On Thu, 3 Apr 2025 09:32:12 GMT, Stefan Karlsson wrote: >> During the development of [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) we found that the functionality to release reserved memory for the heap is broken. The current implementation passes in the size of the reserved memory area, but according to the documentation the call should be done with `0` as the dwSize argument: >> >> If the dwFreeType parameter is MEM_RELEASE, dwSize must be 0 (zero) >> >> >> Generational ZGC isn't affected by this because we never release any reserved memory for the heap. However, the changes in JDK-8350441 is going to change that and we will start to release memory in certain corner-cases. >> >> In Single-gen ZGC, which exists in older releases, we have paths that do release memory for "views" into the heap. This only happens if something blocks the memory areas were we want to set up our "views" of the heap. We should probably backport this fix to the affected releases. >> >> I've added a unit test that provokes the problem and I have run this fix together with the changes for JDK-8350441. > > Stefan Karlsson has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision: > > - Merge remote-tracking branch 'upstream/master' into 8353264_zgc_unreserve > - Make addtions static > - 8353264: ZGC: Windows heap unreserving is broken lgtm. ------------- Marked as reviewed by aboldtch (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24377#pullrequestreview-2739381258 From aboldtch at openjdk.org Thu Apr 3 11:15:53 2025 From: aboldtch at openjdk.org (Axel Boldt-Christmas) Date: Thu, 3 Apr 2025 11:15:53 GMT Subject: RFR: 8352994: ZGC: Fix regression introduced in JDK-8350572 [v2] In-Reply-To: References: Message-ID: On Wed, 2 Apr 2025 11:15:01 GMT, Stefan Karlsson wrote: >> We have seen a bunch of timeouts that all points towards the introduction of a check against VMError::is_error_reported_in_current_thread() in the ZGC verification code. I propose this workaround to first check if there's really an error reporting event that is going on by checking VMError::is_error_reported(). >> >> The underlying performance issue (or hang(?)) when calling os::current_thread_id() is being investigated as a separate bug. This fix just tries to clean up issues we see when running ZGC testing. 
>> >> Thanks to @plummercj for digging into this and proposing the same workaround. >> >> Testing: GHA is clean, I'll run this through a few tiers of our CI pipeline > > Stefan Karlsson has updated the pull request incrementally with one additional commit since the last revision: > > Remove test from ProblemList A good local fix. But I also think `VMError::is_error_reported_in_current_thread()` should do `return is_error_reported() && _first_error_tid == os::current_thread_id();` Given that `current_thread_id` has a non trivial cost. ------------- Marked as reviewed by aboldtch (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24349#pullrequestreview-2739468102 From ayang at openjdk.org Thu Apr 3 11:32:55 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 3 Apr 2025 11:32:55 GMT Subject: RFR: 8353559: Restructure CollectedHeap error printing In-Reply-To: References: Message-ID: On Wed, 2 Apr 2025 18:09:12 GMT, Joel Sikstr?m wrote: > Calling Universe::heap()->print_on_error() gets dispatched to the most specific implementation, which for some GCs is their own implementation instead of the default in CollectedHeap. Each GC-specific implementation calls back to CollectedHeap::print_on_error(), which then dispatches back into the specific implementation of print_on(). This is kind of awkward and creates a call-chain that's not straightforward to wrap your head around, jumping back and forth via CollectedHeap and the specific implementation. > > To make the call-chain cleaner, I have made print_on_error() a pure virtual method in CollectedHeap, and implemented print_on_error() in each GC's implementation of CollectedHeap. In addition, I have removed print_extended_on() from CollectedHeap and implemented that for the GCs that actually need/use it. > > Removing the usage of the common print_on_error() also means that GCs that do not print anything interesting for their barrier set can omit this. So, I've removed it from ZGC and Shenandoah. > > To make print_on_error() consistent with print_on(), I have moved the printing of "Heap:" to the caller(s) of print_on_error() (only inside vmError.cpp). This is a trivial change for all GCs except ZGC, which requires some restructuring in its error printing. > > The old and new printing orders are shown below for ZGC: > > # Old > > > > > > > > > > # New > > > > > > > > Testing: > * GHA > * Tiers 1 & 2 > * Manually verified that printing still works and outputs the intended information via running the following commands and comparing the output. > > ../fastdebug-old/jdk/bin/java -XX:ErrorHandlerTest=14 -XX:+ErrorFileToStdout -XX:+Use${gc}GC --version > ${gc}_old.txt > ../fastdebug-new/jdk/bin/java -XX:ErrorHandlerTest=14 -XX:+ErrorFileToStdout -XX:+Use${gc}GC --version > ${gc}_new.txt Marked as reviewed by ayang (Reviewer). 
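(Circling back to the `VMError::is_error_reported_in_current_thread()` suggestion quoted earlier in this digest: a self-contained sketch of the proposed ordering, checking the cheap global flag before paying for the thread-id lookup. Names mirror the discussion but this is not the JDK implementation; `current_thread_id()` is a stand-in for `os::current_thread_id()`.)

```
#include <atomic>

namespace sketch {

std::atomic<bool>      error_reported{false};
std::atomic<long long> first_error_tid{-1};

// Stand-in for os::current_thread_id(), assumed to be comparatively expensive.
long long current_thread_id() { return 42; }

bool is_error_reported() {
  return error_reported.load(std::memory_order_acquire);
}

bool is_error_reported_in_current_thread() {
  // Short-circuit: on the common, error-free path current_thread_id()
  // is never called at all.
  return is_error_reported() &&
         first_error_tid.load(std::memory_order_acquire) == current_thread_id();
}

} // namespace sketch

int main() {
  return sketch::is_error_reported_in_current_thread() ? 1 : 0;
}
```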
------------- PR Review: https://git.openjdk.org/jdk/pull/24387#pullrequestreview-2739525769 From tschatzl at openjdk.org Thu Apr 3 11:33:45 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 3 Apr 2025 11:33:45 GMT Subject: RFR: 8271870: G1: Add objArray splitting when scanning object with evacuation failure [v2] In-Reply-To: <7hH3ohZ65_msEVaZ0qAI1D3pNI1iyZbKM9sYgfEMbwg=.1d21c70e-788b-43a0-8720-ca0231a70a45@github.com> References: <7hH3ohZ65_msEVaZ0qAI1D3pNI1iyZbKM9sYgfEMbwg=.1d21c70e-788b-43a0-8720-ca0231a70a45@github.com> Message-ID: > Hi all, > > please review this change that makes the object iteration path for evacuation failed objects the same as the one for regular objects (and indeed make both use the same code). > > This has been made possible with the refactoring of object array task queues. > > At the same time this also covers [JDK-8271871](https://bugs.openjdk.org/browse/JDK-8271871). > > Testing: tier1-5, some perf testing with no differences > > Thanks, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: * some additional assert to make sure the scanner is initialized correctly. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24222/files - new: https://git.openjdk.org/jdk/pull/24222/files/e5ce3984..21cc754a Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24222&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24222&range=00-01 Stats: 7 lines in 2 files changed: 6 ins; 1 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/24222.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24222/head:pull/24222 PR: https://git.openjdk.org/jdk/pull/24222 From iwalulya at openjdk.org Thu Apr 3 13:31:55 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Thu, 3 Apr 2025 13:31:55 GMT Subject: RFR: 8271870: G1: Add objArray splitting when scanning object with evacuation failure [v2] In-Reply-To: References: <7hH3ohZ65_msEVaZ0qAI1D3pNI1iyZbKM9sYgfEMbwg=.1d21c70e-788b-43a0-8720-ca0231a70a45@github.com> Message-ID: On Thu, 3 Apr 2025 11:33:45 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this change that makes the object iteration path for evacuation failed objects the same as the one for regular objects (and indeed make both use the same code). >> >> This has been made possible with the refactoring of object array task queues. >> >> At the same time this also covers [JDK-8271871](https://bugs.openjdk.org/browse/JDK-8271871). >> >> Testing: tier1-5, some perf testing with no differences >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > * some additional assert to make sure the scanner is initialized correctly. LGTM! ------------- Marked as reviewed by iwalulya (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24222#pullrequestreview-2739853788 From tschatzl at openjdk.org Thu Apr 3 15:09:18 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 3 Apr 2025 15:09:18 GMT Subject: RFR: 8271870: G1: Add objArray splitting when scanning object with evacuation failure [v2] In-Reply-To: References: <7hH3ohZ65_msEVaZ0qAI1D3pNI1iyZbKM9sYgfEMbwg=.1d21c70e-788b-43a0-8720-ca0231a70a45@github.com> Message-ID: On Thu, 3 Apr 2025 13:29:18 GMT, Ivan Walulya wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> * some additional assert to make sure the scanner is initialized correctly. > > LGTM! 
Thanks @walulyai @albertnetymk for your reviews ------------- PR Comment: https://git.openjdk.org/jdk/pull/24222#issuecomment-2776099031 From tschatzl at openjdk.org Thu Apr 3 15:09:19 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 3 Apr 2025 15:09:19 GMT Subject: Integrated: 8271870: G1: Add objArray splitting when scanning object with evacuation failure In-Reply-To: <7hH3ohZ65_msEVaZ0qAI1D3pNI1iyZbKM9sYgfEMbwg=.1d21c70e-788b-43a0-8720-ca0231a70a45@github.com> References: <7hH3ohZ65_msEVaZ0qAI1D3pNI1iyZbKM9sYgfEMbwg=.1d21c70e-788b-43a0-8720-ca0231a70a45@github.com> Message-ID: <3pkPiCQ3xl43uo_Y6hbpUa8qCjgvId2B6tcL23TZTbI=.69ecc66d-a462-41cc-8914-85dc38308b64@github.com> On Tue, 25 Mar 2025 10:35:58 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that makes the object iteration path for evacuation failed objects the same as the one for regular objects (and indeed make both use the same code). > > This has been made possible with the refactoring of object array task queues. > > At the same time this also covers [JDK-8271871](https://bugs.openjdk.org/browse/JDK-8271871). > > Testing: tier1-5, some perf testing with no differences > > Thanks, > Thomas This pull request has now been integrated. Changeset: 64b691ab Author: Thomas Schatzl URL: https://git.openjdk.org/jdk/commit/64b691ab619d2d99a9c6492341074d2794563c16 Stats: 106 lines in 4 files changed: 51 ins; 32 del; 23 mod 8271870: G1: Add objArray splitting when scanning object with evacuation failure 8271871: G1 does not try to deduplicate objects that failed evacuation Reviewed-by: iwalulya, ayang ------------- PR: https://git.openjdk.org/jdk/pull/24222 From ysr at openjdk.org Thu Apr 3 21:45:52 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Thu, 3 Apr 2025 21:45:52 GMT Subject: RFR: 8351091: Shenandoah: global marking context completeness is not accurately maintained [v7] In-Reply-To: <5Yxk8oBN69i5Ty_jRCtXoLeNjyet6DEySoFqnzxrblk=.9a1ad401-9da2-4d06-8e22-c51d810dd2f8@github.com> References: <5Yxk8oBN69i5Ty_jRCtXoLeNjyet6DEySoFqnzxrblk=.9a1ad401-9da2-4d06-8e22-c51d810dd2f8@github.com> Message-ID: <6sjBSQODcXKXzjvshAJiHq96N4Ler-TEBaSuN4nNr6w=.a6ee8ec7-9a3e-49ae-9718-8d1a027e6420@github.com> On Mon, 31 Mar 2025 23:09:53 GMT, Xiaolong Peng wrote: >> With the JEP 404: Generational Shenandoah implementation, there are generation specific marking completeness flags introduced, and the global marking context completeness flag is not updated at all after initialization, hence the global marking context completeness is not accurate anymore. This may cause expected behavior: [ShenandoahHeap::complete_marking_context()](https://github.com/openjdk/jdk/pull/23886/files#diff-d5ddf298c36b1c91bf33f9bff7bedcc063074edd68c298817f1fdf39d2ed970fL642) should throw assert error if the global marking context completeness flag is false, but now it always return the marking context even it marking is not complete, this may hide bugs where we expect the global/generational marking to be completed. >> >> This change PR fix the bug in global marking context completeness flag, and update all the places using `ShenandoahHeap::complete_marking_context()` to use proper API. 
>> >> ### Test >> - [x] hotspot_gc_shenandoah >> - [x] Tier 1 >> - [x] Tier 2 > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Can't verify marked object with complete marking after full GC I looked at the files that changed since the last review only, but can look over all of it once again if necessary (just let me know). This looks good; just a few small comments, and in particular a somewhat formalistic and pedantic distinction between the use of `gc_generation()` and `active_generation()` to fetch the marking context (and the use of `global_generation()`). Otherwise looks good to me. src/hotspot/share/gc/shenandoah/shenandoahFullGC.cpp line 352: > 350: assert(_from_region != nullptr, "must set before work"); > 351: assert(_heap->active_generation()->complete_marking_context()->is_marked(p), "must be marked"); > 352: assert(!_heap->active_generation()->complete_marking_context()->allocated_after_mark_start(p), "must be truly marked"); I am probably being a bit pedantic here... I would use `gc_generation()` in all code that is executed strictly by GC threads, and `active_generation()` in all code that may possibly be executed by a mutator thread. It seems as if today this code is only executed by GC threads. In general, there is no real distinction between these field at times like these (STW pauses) when heap verification is taking place, but from a syntactic hygiene perspective. We can otherwise file a ticket to separately clean up any confusion in the use of these fields (and add a dynamic check to prevent creeping confusion). The names aren't super well-chosen, but generally think of `_gc_generation` as the generation that is being GC'd, `_active_generation` as one that mutator threads are aware is being the subject of GC. Any assertions by mutator threads should use the latter and by GC threads the former. The fields are reconciled at STW pauses. src/hotspot/share/gc/shenandoah/shenandoahFullGC.cpp line 778: > 776: ShenandoahAdjustPointersClosure() : > 777: _heap(ShenandoahHeap::heap()), > 778: _ctx(ShenandoahHeap::heap()->global_generation()->complete_marking_context()) {} I liked the changes in this file that everywhere use the heap's `_gc_generation` (see comment about the distinction between `gc_generation()` and `active_generation()` above) field to fetch the marking context. While I understand that it might be the case that whenever we are here, the `_gc_generation` must necessarily be the `global_generation()`, I am wondering about: 1. using `_gc_generation` here as well to fetch the context, and 2. secondly, asserting also that the `_gc_generation` is in fact the `global_generation()`. I assume (2) must be the case here? If not, it would be good to see if this can be fixed. src/hotspot/share/gc/shenandoah/shenandoahFullGC.cpp line 1094: > 1092: ShenandoahHeapRegion* region = _regions.next(); > 1093: ShenandoahHeap* heap = ShenandoahHeap::heap(); > 1094: ShenandoahMarkingContext* const ctx = heap->global_generation()->complete_marking_context(); Same comment as at line 778. src/hotspot/share/gc/shenandoah/shenandoahVerifier.cpp line 1191: > 1189: _verify_remembered_after_full_gc, // verify read-write remembered set > 1190: _verify_forwarded_none, // all objects are non-forwarded > 1191: _verify_marked_incomplete, // all objects are marked in incomplete bitmap Is the marking bitmap updated as objects are moved to their new locations? Is that done just to satisfy the verifier? 
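(To illustrate the gc_generation()/active_generation() distinction and suggestion (2) above in a self-contained way, here is a toy model. It is deliberately not Shenandoah code; all types and names are stand-ins.)

```
#include <cassert>

// Toy model: the "gc generation" is what GC worker threads operate on, the
// "active generation" is what mutator threads observe; the two are
// reconciled at safepoints.
enum class GenerationKind { Young, Old, Global };

struct Generation {
  GenerationKind kind;
  bool mark_complete = false;
  bool is_global() const { return kind == GenerationKind::Global; }
};

struct Heap {
  Generation* gc_generation;      // used by GC threads
  Generation* active_generation;  // used by mutator threads
};

void full_gc_adjust_pointers(Heap& heap) {
  Generation* gen = heap.gc_generation;
  assert(gen->is_global() && "full GC must operate on the global generation");
  assert(gen->mark_complete && "marking must be complete before adjusting pointers");
  (void)gen;
  // ... walk the heap using gen's marking context ...
}

int main() {
  Generation global{GenerationKind::Global, true};
  Heap heap{&global, &global};
  full_gc_adjust_pointers(heap);
  return 0;
}
```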
------------- PR Review: https://git.openjdk.org/jdk/pull/23886#pullrequestreview-2741111545 PR Review Comment: https://git.openjdk.org/jdk/pull/23886#discussion_r2027772698 PR Review Comment: https://git.openjdk.org/jdk/pull/23886#discussion_r2027710108 PR Review Comment: https://git.openjdk.org/jdk/pull/23886#discussion_r2027713065 PR Review Comment: https://git.openjdk.org/jdk/pull/23886#discussion_r2027777968 From ysr at openjdk.org Thu Apr 3 21:55:07 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Thu, 3 Apr 2025 21:55:07 GMT Subject: RFR: 8351091: Shenandoah: global marking context completeness is not accurately maintained [v7] In-Reply-To: References: <7yfWKXewUM1XqWtlnyuPV3nu9bGr5VNJXuXi1aNQGvQ=.4c53d85b-13f3-4bfc-87c3-634d547bb440@github.com> Message-ID: On Thu, 6 Mar 2025 23:09:47 GMT, Xiaolong Peng wrote: >> OK, yes, that makes sense. Why not then use both `ShenandoahHeap::[complete_]marking_context()` as synonyms for `ShehandoahHeap::active_generation()->[complete_]marking_context()`. See other related comments in this review round. > > I feel using `henandoahHeap::complete_marking_context()` as synonyms for `ShehandoahHeap::active_generation()->[complete_]marking_context()` may cause more confusion, just read from the name it seems that it indicates the marking is complete for the whole heap, not just the active generation. ok, that makes sense. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23886#discussion_r2027790148 From ysr at openjdk.org Thu Apr 3 22:10:50 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Thu, 3 Apr 2025 22:10:50 GMT Subject: RFR: 8351091: Shenandoah: global marking context completeness is not accurately maintained [v7] In-Reply-To: References: <8w22oUPhZEx0iEIeNQ-GUUjx8jNkjXrTHjfjN_sX4HE=.2c391dd5-227e-4755-ba4d-528a7dcefca3@github.com> Message-ID: On Fri, 7 Mar 2025 19:25:33 GMT, William Kemper wrote: >> You proposal will make the impl of the set_mark_complete/is_mark_complete of ShenandoahGeneration cleaner, but the thing is it will change current design and behavior, we may have to update the code where there methods is called, e.g. when we call `set_mark_complete` of gc_generation/active_generation, if it is global generation, we may have to explicitly call the same methods of ShenandoahYoungGeneration and ShenandoahOldGeneration to fan out the status. >> >> How about I follow up it in a separate task and update the implementation if necessary? I want to limit the changes involved in this PR, and only fix the bug. > > The young and old generations are only instantiated in the generational mode, so using them without checking the mode will result in SEGV in non-generational modes. > > Global collections have a lot of overlap with old collections. I think what Ramki is saying, is that if we change all the code that makes assertions about the completion status of young/old marking to use the `active_generation` field instead, then we wouldn't need to update the completion status of young/old during a global collection. The difficulty here is that we need assurances that the old generation mark bitmap is valid in collections subsequent to a global collection. So, I don't think we can rely on completion status of `active_generation` when it was global, in following collections where it may now refer to young or old. I see. Yes, that makes sense to me, thanks William. 
It would then be the case for the global generation that if is_mark_complete() holds, then in the generational case it also holds for both of its constituent generations. Maybe we can assert that when we fetch that at line 204 (and find it's true)? Maybe I am being paranoid, but the assert would make me feel confident that the state maintenance isn't going awry. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23886#discussion_r2027812176 From xpeng at openjdk.org Thu Apr 3 22:33:56 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Thu, 3 Apr 2025 22:33:56 GMT Subject: RFR: 8351091: Shenandoah: global marking context completeness is not accurately maintained [v7] In-Reply-To: <6sjBSQODcXKXzjvshAJiHq96N4Ler-TEBaSuN4nNr6w=.a6ee8ec7-9a3e-49ae-9718-8d1a027e6420@github.com> References: <5Yxk8oBN69i5Ty_jRCtXoLeNjyet6DEySoFqnzxrblk=.9a1ad401-9da2-4d06-8e22-c51d810dd2f8@github.com> <6sjBSQODcXKXzjvshAJiHq96N4Ler-TEBaSuN4nNr6w=.a6ee8ec7-9a3e-49ae-9718-8d1a027e6420@github.com> Message-ID: On Thu, 3 Apr 2025 21:39:33 GMT, Y. Srinivas Ramakrishna wrote: >> Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: >> >> Can't verify marked object with complete marking after full GC > > src/hotspot/share/gc/shenandoah/shenandoahVerifier.cpp line 1191: > >> 1189: _verify_remembered_after_full_gc, // verify read-write remembered set >> 1190: _verify_forwarded_none, // all objects are non-forwarded >> 1191: _verify_marked_incomplete, // all objects are marked in incomplete bitmap > > Is the marking bitmap updated as objects are moved to their new locations? Is that done just to satisfy the verifier? Yes, the marking bitmaps have been reset after full GC, except for the regions with pinned objects. _verify_marked_complete requires a complete marking context, so it might make more sense to change it to _verify_marked_disable after full GC. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23886#discussion_r2027835236 From xpeng at openjdk.org Thu Apr 3 22:37:50 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Thu, 3 Apr 2025 22:37:50 GMT Subject: RFR: 8351091: Shenandoah: global marking context completeness is not accurately maintained [v7] In-Reply-To: <6sjBSQODcXKXzjvshAJiHq96N4Ler-TEBaSuN4nNr6w=.a6ee8ec7-9a3e-49ae-9718-8d1a027e6420@github.com> References: <5Yxk8oBN69i5Ty_jRCtXoLeNjyet6DEySoFqnzxrblk=.9a1ad401-9da2-4d06-8e22-c51d810dd2f8@github.com> <6sjBSQODcXKXzjvshAJiHq96N4Ler-TEBaSuN4nNr6w=.a6ee8ec7-9a3e-49ae-9718-8d1a027e6420@github.com> Message-ID: On Thu, 3 Apr 2025 21:34:06 GMT, Y. Srinivas Ramakrishna wrote: >> Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: >> >> Can't verify marked object with complete marking after full GC > > src/hotspot/share/gc/shenandoah/shenandoahFullGC.cpp line 352: > >> 350: assert(_from_region != nullptr, "must set before work"); >> 351: assert(_heap->active_generation()->complete_marking_context()->is_marked(p), "must be marked"); >> 352: assert(!_heap->active_generation()->complete_marking_context()->allocated_after_mark_start(p), "must be truly marked"); > > I am probably being a bit pedantic here... > > I would use `gc_generation()` in all code that is executed strictly by GC threads, and `active_generation()` in all code that may possibly be executed by a mutator thread. It seems as if today this code is only executed by GC threads.
> > In general, there is no real distinction between these field at times like these (STW pauses) when heap verification is taking place, but from a syntactic hygiene perspective. > > We can otherwise file a ticket to separately clean up any confusion in the use of these fields (and add a dynamic check to prevent creeping confusion). The names aren't super well-chosen, but generally think of `_gc_generation` as the generation that is being GC'd, `_active_generation` as one that mutator threads are aware is being the subject of GC. Any assertions by mutator threads should use the latter and by GC threads the former. The fields are reconciled at STW pauses. Make sense, I did notice that there is assert `assert(!Thread::current()->is_Java_thread(), "Not allowed");` in `gc_generation()` suggesting that non-Java thread should call `gc_generation()`. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23886#discussion_r2027837825 From ysr at openjdk.org Thu Apr 3 22:57:50 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Thu, 3 Apr 2025 22:57:50 GMT Subject: RFR: 8351091: Shenandoah: global marking context completeness is not accurately maintained [v7] In-Reply-To: References: <5Yxk8oBN69i5Ty_jRCtXoLeNjyet6DEySoFqnzxrblk=.9a1ad401-9da2-4d06-8e22-c51d810dd2f8@github.com> <6sjBSQODcXKXzjvshAJiHq96N4Ler-TEBaSuN4nNr6w=.a6ee8ec7-9a3e-49ae-9718-8d1a027e6420@github.com> Message-ID: <6dN8IY3rHlVn2aiHJwWdB-OKbbx8GABuvau9-Bdw6vU=.a74101a0-845d-4174-a87a-b41674e90579@github.com> On Thu, 3 Apr 2025 22:31:27 GMT, Xiaolong Peng wrote: >> src/hotspot/share/gc/shenandoah/shenandoahVerifier.cpp line 1191: >> >>> 1189: _verify_remembered_after_full_gc, // verify read-write remembered set >>> 1190: _verify_forwarded_none, // all objects are non-forwarded >>> 1191: _verify_marked_incomplete, // all objects are marked in incomplete bitmap >> >> Is the marking bitmap updated as objects are moved to their new locations? Is that done just to satisfy the verifier? > > Yes, making bitmaps has been reset after full GC, except for the for regions with pined objects. > _verify_marked_complete requires complete marking context, it might make more sense to change it to _verify_marked_disable after full GC. Curious; in that case should it not have failed in your testing because the objects not pinned may not have been marked as the verifier would have insisted they were? Why do we leave the regions with pinned objects marked? I am guessing once we have filled in the dead objects, the marks do not serve any purpose? May be I am missing some corner case? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23886#discussion_r2027852832 From manc at openjdk.org Fri Apr 4 07:26:54 2025 From: manc at openjdk.org (Man Cao) Date: Fri, 4 Apr 2025 07:26:54 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v8] In-Reply-To: References: Message-ID: On Thu, 3 Apr 2025 07:08:19 GMT, Man Cao wrote: >> Hi all, >> >> I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. 
I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: >> >> - does not respect `MinHeapSize`; >> - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; >> - does not affect heuristcs to trigger a concurrent cycle; >> >> [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. > > Man Cao has updated the pull request incrementally with one additional commit since the last revision: > > Use Atomic::load for flag Thank you both for the quick and detailed responses! > * JDK-8248324 effectively removes the use of `Min/MaxHeapFreeRatio` (apart of full gc, which obviously they also need to be handled in some way that fits into the system). > * JDK-8238687 makes `GCTimeRatio` shrink the heap too, obviating the need for `Min/MaxHeapFreeRatio`, which are currently the knobs that limit excessive memory usage. > > With no flag to interfere (no `Min/MaxHeapFreeRatio`) with each other, there is no need for considering their precedence. > > As you mention, there is need for some strategy to reconcile divergent goals - ultimately G1 needs a single value that tells it to resize the heap in which direction in which degree. > > Incidentally, the way `GCTimeRatio` (or actually the internal gc cpu usage target as an intermediate) is already in use fits these requirements. From some actual value you can calculate a difference to desired, with some smoothing applied, which gives you both direction and degree of the change in heap size (applying some magic factors/constants). I was unaware that G1 plans to stop using `Min/MaxHeapFreeRatio` until now. Looks like [JDK-8238686](https://bugs.openjdk.org/browse/JDK-8238686) has more relevant description. It sounds good to solve all above-mentioned issues and converge on a single flag such as `GCTimeRatio`, and ensure both incremental and full GCs respect this flag. (We should also fix [JDK-8349978](https://bugs.openjdk.org/browse/JDK-8349978) for converging on `GCTimeRatio`. ) It would be nicer if we have a doc or a master bug that describes the overall plan. In comparison, this PR's approach for a high-precedence, "harder" `SoftMaxHeapSize` is an easier and more expedient approach to improve heap resizing, without solving all other issues. However, it requires users to carefully maintain and dynamically adjust `SoftMaxHeapSize` to prevent GC thrashing. I think if all other issues are resolved, our existing internal use cases that use a separate algorithm to dynamically calculate and set the high-precedence `SoftMaxHeapSize` (or `ProposedHeapSize`) could probably migrate to the `GCTimeRatio` approach, and stop using `SoftMaxHeapSize`. I'll need some discussion with my team about what we would do next. Meanwhile, @mo-beck do you guys have preference on how `SoftMaxHeapSize` should work? > > Now there is some question about the weights of these factors: we (in the gc team) prefer to keep G1's balancing between throughput and latency, particularly if the input this time is some value explicitly containing "soft" in its name. Using the 25% from ZGC as a max limit for gc cpu usage if we are (way) beyond what the user desires seems good enough for an initial guess. 
Not too high, guaranteeing some application progress in the worst case (for this factor!), not too low, guaranteeing that the intent of the user setting this value is respected. Somewhat related to above, our experience with our internal algorithm that adjusts `SoftMaxHeapSize` based on GC CPU overhead, encountered cases that it behaves poorly. The problem is that some workload have large variance in mutator's CPU usage (e.g. peak hours vs off-peak hours), but smaller variance in GC CPU usage. Then it does not make much sense to maintain a constant % for GC CPU overhead, which could cause excessive heap expansion when mutator CPU usage is low. The workaround is to take live size into consideration when calculating `SoftMaxHeapSize`, which is similar to how `Min/MaxHeapFreeRatio` works. I'm not sure if `GCTimeRatio` using wall time and pause time could run into similar issues. I'm happy to experiment when we make progress on JDK-8238687/JDK-8248324/JDK-8349978. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2777769994 From tschatzl at openjdk.org Fri Apr 4 08:10:34 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 4 Apr 2025 08:10:34 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v30] In-Reply-To: References: Message-ID: > Hi all, > > please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. > > The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. > > ### Current situation > > With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. > > The main reason for the current barrier is how g1 implements concurrent refinement: > * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. > * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, > * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. > > These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: > > > // Filtering > if (region(@x.a) == region(y)) goto done; // same region check > if (y == null) goto done; // null value check > if (card(@x.a) == young_card) goto done; // write to young gen check > StoreLoad; // synchronize > if (card(@x.a) == dirty_card) goto done; > > *card(@x.a) = dirty > > // Card tracking > enqueue(card-address(@x.a)) into thread-local-dcq; > if (thread-local-dcq is not full) goto done; > > call runtime to move thread-local-dcq into dcqs > > done: > > > Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. > > The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. 
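For comparison, the three-or-four-instruction barrier referred to above is essentially an unconditional card mark. A rough self-contained sketch of that shape (a simplified model under assumed names, not the actual HotSpot code):

    // Serial/Parallel-style post-write barrier: unconditionally dirty the card
    // covering the updated field; no filtering, no StoreLoad, no enqueuing.
    #include <cstdint>
    #include <cstddef>

    static const int          kCardShift = 9;        // 512-byte cards, as in HotSpot
    static std::uint8_t*      card_table = nullptr;  // (biased) card table base, set up at startup
    static const std::uint8_t kDirty     = 0;

    inline void post_write_barrier(void* field_addr) {
      std::size_t card = reinterpret_cast<std::uintptr_t>(field_addr) >> kCardShift;
      card_table[card] = kDirty;                      // a shift, an add and a byte store
    }

Keeping the G1 barrier close to this shape is what removes most of the filtering and card-tracking cost listed in the pseudo code above.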
> > There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). > > The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching card tables. Mutators only work on the "primary" card table, refinement threads on a se... Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 39 commits: - * missing file from merge - Merge branch 'master' into 8342382-card-table-instead-of-dcq - Merge branch 'master' into 8342382-card-table-instead-of-dcq - Merge branch 'master' into 8342382-card-table-instead-of-dcq - Merge branch 'master' into submit/8342382-card-table-instead-of-dcq - * make young gen length revising independent of refinement thread * use a service task * both refinement control thread and young gen length revising use the same infrastructure to get the number of available bytes and determine the time to the next update - * fix IR code generation tests that change due to barrier cost changes - * factor out card table and refinement table merging into a single method - Merge branch 'master' into 8342382-card-table-instead-of-dcq3 - * obsolete G1UpdateBufferSize G1UpdateBufferSize has previously been used to size the refinement buffers and impose a minimum limit on the number of cards per thread that need to be pending before refinement starts. The former function is now obsolete with the removal of the dirty card queues, the latter functionality has been taken over by the new diagnostic option `G1PerThreadPendingCardThreshold`. I prefer to make this a diagnostic option is better than a product option because it is something that is only necessary for some test cases to produce some otherwise unwanted behavior (continuous refinement). CSR is pending. - ... and 29 more: https://git.openjdk.org/jdk/compare/41d4a0d7...1c5a669f ------------- Changes: https://git.openjdk.org/jdk/pull/23739/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23739&range=29 Stats: 7089 lines in 110 files changed: 2610 ins; 3555 del; 924 mod Patch: https://git.openjdk.org/jdk/pull/23739.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23739/head:pull/23739 PR: https://git.openjdk.org/jdk/pull/23739 From tschatzl at openjdk.org Fri Apr 4 09:03:50 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 4 Apr 2025 09:03:50 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v8] In-Reply-To: References: Message-ID: On Fri, 4 Apr 2025 07:23:45 GMT, Man Cao wrote: > Thank you both for the quick and detailed responses! > > > * JDK-8248324 effectively removes the use of `Min/MaxHeapFreeRatio` (apart of full gc, which obviously they also need to be handled in some way that fits into the system). > > * JDK-8238687 makes `GCTimeRatio` shrink the heap too, obviating the need for `Min/MaxHeapFreeRatio`, which are currently the knobs that limit excessive memory usage. > > > > With no flag to interfere (no `Min/MaxHeapFreeRatio`) with each other, there is no need for considering their precedence. > > As you mention, there is need for some strategy to reconcile divergent goals - ultimately G1 needs a single value that tells it to resize the heap in which direction in which degree. 
> > Incidentally, the way `GCTimeRatio` (or actually the internal gc cpu usage target as an intermediate) is already in use fits these requirements. From some actual value you can calculate a difference to desired, with some smoothing applied, which gives you both direction and degree of the change in heap size (applying some magic factors/constants). > > I was unaware that G1 plans to stop using `Min/MaxHeapFreeRatio` until now. Looks like [JDK-8238686](https://bugs.openjdk.org/browse/JDK-8238686) has more relevant description. It sounds good to solve all above-mentioned issues and converge on a single flag such as `GCTimeRatio`, and ensure both incremental and full GCs respect this flag. (We should also fix [JDK-8349978](https://bugs.openjdk.org/browse/JDK-8349978) for converging on `GCTimeRatio`. ) It would be nicer if we have a doc or a master bug that describes the overall plan. Last time this has been mentioned in the hotspot-gc-dev list has been [here](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-February/051079.html). I remember giving multiple outlines to everyone involved earlier, each mentioning that `Min/MaxHeapFreeRatio` need to go away because it's in the way, so I was/am a bit surprised on this response. I will look through the existing bugs and see if I there is a need for a(nother) master bug. > > In comparison, this PR's approach for a high-precedence, "harder" `SoftMaxHeapSize` is an easier and more expedient approach to improve heap resizing, without solving all other issues. However, it requires users to carefully maintain and dynamically adjust `SoftMaxHeapSize` to prevent GC thrashing. I think if all other issues are resolved, our existing internal use cases that use a separate algorithm to dynamically calculate and set the high-precedence `SoftMaxHeapSize` (or `ProposedHeapSize`) could probably migrate to the `GCTimeRatio` approach, and stop using `SoftMaxHeapSize`. > > I'll need some discussion with my team about what we would do next. Meanwhile, @mo-beck do you guys have preference on how `SoftMaxHeapSize` should work? > > > Now there is some question about the weights of these factors: we (in the gc team) prefer to keep G1's balancing between throughput and latency, particularly if the input this time is some value explicitly containing "soft" in its name. Using the 25% from ZGC as a max limit for gc cpu usage if we are (way) beyond what the user desires seems good enough for an initial guess. Not too high, guaranteeing some application progress in the worst case (for this factor!), not too low, guaranteeing that the intent of the user setting this value is respected. > > Somewhat related to above, our experience with our internal algorithm that adjusts `SoftMaxHeapSize` based on GC CPU overhead, encountered cases that it behaves poorly. The problem is that some workload have large variance in mutator's CPU usage (e.g. peak hours vs off-peak hours), but smaller variance in GC CPU usage. Then it does not make much sense to maintain a constant % for GC CPU overhead, which could cause excessive heap expansion when mutator CPU usage is low. The workaround is to take live size into consideration when calculating `SoftMaxHeapSize`, which is similar to how `Min/MaxHeapFreeRatio` works. > > I'm not sure if `GCTimeRatio` using wall time and pause time could run into similar issues. I'm happy to experiment when we make progress on JDK-8238687/JDK-8248324/JDK-8349978. Obviously there are issues to sort out. 
:) ------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2778005801 From ayang at openjdk.org Fri Apr 4 09:12:23 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Fri, 4 Apr 2025 09:12:23 GMT Subject: RFR: 8352116: Deadlock with GCLocker and JVMTI after JDK-8192647 Message-ID: Using a new lock (`JNICritical_lock`) in `GCLocker::block` to resolve a deadlock issue. The root cause of the deadlock is that holding `Heap_lock` while waiting in `GCLocker::block` is unsafe. The new lock is held from the start of `GCLocker::block` to the end of `GCLocker::unblock`. This requires adjusting `Heap_lock`'s rank to allow acquiring `Heap_lock` while holding `JNICritical_lock`. The most important changes are in `gcVMOperations.cpp` and `mutexLocker.cpp`. Test: tier1-8; verified failure can be observed 2/2000 and pass 8000 iterations. ------------- Commit messages: - tmp - gclocker-nested Changes: https://git.openjdk.org/jdk/pull/24407/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24407&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8352116 Stats: 31 lines in 4 files changed: 20 ins; 7 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/24407.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24407/head:pull/24407 PR: https://git.openjdk.org/jdk/pull/24407 From eosterlund at openjdk.org Fri Apr 4 09:12:23 2025 From: eosterlund at openjdk.org (Erik =?UTF-8?B?w5ZzdGVybHVuZA==?=) Date: Fri, 4 Apr 2025 09:12:23 GMT Subject: RFR: 8352116: Deadlock with GCLocker and JVMTI after JDK-8192647 In-Reply-To: References: Message-ID: On Thu, 3 Apr 2025 09:40:19 GMT, Albert Mingkun Yang wrote: > Using a new lock (`JNICritical_lock`) in `GCLocker::block` to resolve a deadlock issue. The root cause of the deadlock is that holding `Heap_lock` while waiting in `GCLocker::block` is unsafe. > > The new lock is held from the start of `GCLocker::block` to the end of `GCLocker::unblock`. This requires adjusting `Heap_lock`'s rank to allow acquiring `Heap_lock` while holding `JNICritical_lock`. The most important changes are in `gcVMOperations.cpp` and `mutexLocker.cpp`. > > Test: tier1-8; verified failure can be observed 2/2000 and pass 8000 iterations. Looks good. Would be nice to refactor the if (UseSerialGC || UseParallelGC) code to something that explains why it's there (those are the GCs that use the new improved GC locker). But that's pre existing so I don't mind if it's split to a separate RFE. ------------- Marked as reviewed by eosterlund (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24407#pullrequestreview-2739864515 From jsikstro at openjdk.org Fri Apr 4 11:56:07 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Fri, 4 Apr 2025 11:56:07 GMT Subject: RFR: 8353471: ZGC: Redundant generation id in ZGeneration In-Reply-To: References: Message-ID: On Wed, 2 Apr 2025 06:52:49 GMT, Joel Sikstr?m wrote: > The ZGeneration class (and in turn ZGenerationOld and ZGenerationYoung) keeps track of its own ZGenerationId, which means that the generation id does not need to be passed along as an argument when calling internal functions. > > I've removed the id parameter from `ZGeneration::select_relocation_set` in favor of using the member variable `_id`. Thank you for the reviews! 
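The shape of that cleanup, as a small illustrative model (apart from `_id` and `select_relocation_set`, the names below are made up and other parameters, if any, are omitted; this is not ZGC's real API):

    // Toy model: the generation id is a member, so internal calls no longer
    // need to pass it as an argument.
    enum class ZGenerationId { young, old };

    class ZGeneration {
      const ZGenerationId _id;
    public:
      explicit ZGeneration(ZGenerationId id) : _id(id) {}

      // Previously sketched as select_relocation_set(ZGenerationId id, ...);
      // now the member _id is used directly.
      void select_relocation_set() {
        bool young = (_id == ZGenerationId::young);
        (void)young;  // ... choose relocation candidates for this generation ...
      }
    };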
------------- PR Comment: https://git.openjdk.org/jdk/pull/24374#issuecomment-2778471557 From jsikstro at openjdk.org Fri Apr 4 11:56:07 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Fri, 4 Apr 2025 11:56:07 GMT Subject: Integrated: 8353471: ZGC: Redundant generation id in ZGeneration In-Reply-To: References: Message-ID: <8QZgCh8R7ZycqowtfLbPwmbJz59ni6HckX2dwRW-U7w=.1db6ca63-5edd-4086-be8a-2d55ae6ac0de@github.com> On Wed, 2 Apr 2025 06:52:49 GMT, Joel Sikstr?m wrote: > The ZGeneration class (and in turn ZGenerationOld and ZGenerationYoung) keeps track of its own ZGenerationId, which means that the generation id does not need to be passed along as an argument when calling internal functions. > > I've removed the id parameter from `ZGeneration::select_relocation_set` in favor of using the member variable `_id`. This pull request has now been integrated. Changeset: b92a4436 Author: Joel Sikstr?m URL: https://git.openjdk.org/jdk/commit/b92a44364d3a2267f5bc9aef3077805bebdf9fba Stats: 6 lines in 2 files changed: 0 ins; 0 del; 6 mod 8353471: ZGC: Redundant generation id in ZGeneration Reviewed-by: stefank, eosterlund ------------- PR: https://git.openjdk.org/jdk/pull/24374 From xpeng at openjdk.org Fri Apr 4 18:11:50 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Fri, 4 Apr 2025 18:11:50 GMT Subject: RFR: 8351091: Shenandoah: global marking context completeness is not accurately maintained [v7] In-Reply-To: <6dN8IY3rHlVn2aiHJwWdB-OKbbx8GABuvau9-Bdw6vU=.a74101a0-845d-4174-a87a-b41674e90579@github.com> References: <5Yxk8oBN69i5Ty_jRCtXoLeNjyet6DEySoFqnzxrblk=.9a1ad401-9da2-4d06-8e22-c51d810dd2f8@github.com> <6sjBSQODcXKXzjvshAJiHq96N4Ler-TEBaSuN4nNr6w=.a6ee8ec7-9a3e-49ae-9718-8d1a027e6420@github.com> <6dN8IY3rHlVn2aiHJwWdB-OKbbx8GABuvau9-Bdw6vU=.a74101a0-845d-4174-a87a-b41674e90579@github.com> Message-ID: On Thu, 3 Apr 2025 22:55:18 GMT, Y. Srinivas Ramakrishna wrote: >> Yes, making bitmaps has been reset after full GC, except for the for regions with pined objects. >> _verify_marked_complete requires complete marking context, it might make more sense to change it to _verify_marked_disable after full GC. > > Curious; in that case should it not have failed in your testing because the objects not pinned may not have been marked as the verifier would have insisted they were? Why do we leave the regions with pinned objects marked? I am guessing once we have filled in the dead objects, the marks do not serve any purpose? > > May be I am missing some corner case? It does, one of the changes https://github.com/openjdk/jdk/pull/24092 is to set the marking completeness flag to false after Full GC because the bitmaps have been reset, `_verify_marked_complete` requires complete marking marking context so there is assert error. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23886#discussion_r2029244689 From xpeng at openjdk.org Fri Apr 4 18:18:30 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Fri, 4 Apr 2025 18:18:30 GMT Subject: RFR: 8351091: Shenandoah: global marking context completeness is not accurately maintained [v8] In-Reply-To: References: Message-ID: > With the JEP 404: Generational Shenandoah implementation, there are generation specific marking completeness flags introduced, and the global marking context completeness flag is not updated at all after initialization, hence the global marking context completeness is not accurate anymore. 
This may cause expected behavior: [ShenandoahHeap::complete_marking_context()](https://github.com/openjdk/jdk/pull/23886/files#diff-d5ddf298c36b1c91bf33f9bff7bedcc063074edd68c298817f1fdf39d2ed970fL642) should throw assert error if the global marking context completeness flag is false, but now it always return the marking context even it marking is not complete, this may hide bugs where we expect the global/generational marking to be completed. > > This change PR fix the bug in global marking context completeness flag, and update all the places using `ShenandoahHeap::complete_marking_context()` to use proper API. > > ### Test > - [x] hotspot_gc_shenandoah > - [x] Tier 1 > - [x] Tier 2 Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: Address PR comments ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23886/files - new: https://git.openjdk.org/jdk/pull/23886/files/7c73e121..d4af962a Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23886&range=07 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23886&range=06-07 Stats: 8 lines in 2 files changed: 0 ins; 0 del; 8 mod Patch: https://git.openjdk.org/jdk/pull/23886.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23886/head:pull/23886 PR: https://git.openjdk.org/jdk/pull/23886 From sangheki at openjdk.org Fri Apr 4 21:21:22 2025 From: sangheki at openjdk.org (Sangheon Kim) Date: Fri, 4 Apr 2025 21:21:22 GMT Subject: RFR: 8346568: G1: Other time can be negative Message-ID: <0A-BDKTohMv3ziXO4LrtniptKNCWYvZZfVKMWAUK6iA=.7fbd372c-f2ed-417c-8517-073e0a9a5276@github.com> Other time described in this bug is displayed at G1GCPhaseTimes::print_other(total_measured_time - sum_of_sub_phases). And the value can be negative for 3 reasons. 1. Different scope of measurement - 3 variables is out of scope from total_measured_time. Those used for wait-root-region-scan, verify-before/after. (_root_region_scan_wait_time_ms, _cur_verify_before_time_ms and _cur_verify_after_time_ms) - Changed not to be included in sum_of_sub_phases. - One may want to include them in total_measured_time but I think it is better to be addressed in a separate ticket. 2. Duplicated measurement - Initial and optional evacuation time include nmethod-cleanup-time, so separated them as we are already measuring them. As there is no public getter, just added cleanup time when those evacuation time are used internally. 3. Concurrent task execution time - Sometimes just triggering concurrent work takes 2 digit milliseconds. Changed to add only initiating time on sum_of_sub_phases and keep displaying concurrent tasks' average execution time. Testing: tier 1 ~ 5 ------------- Commit messages: - Separate measurement for cleanup Changes: https://git.openjdk.org/jdk/pull/24454/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24454&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8346568 Stats: 61 lines in 4 files changed: 35 ins; 17 del; 9 mod Patch: https://git.openjdk.org/jdk/pull/24454.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24454/head:pull/24454 PR: https://git.openjdk.org/jdk/pull/24454 From kbarrett at openjdk.org Sat Apr 5 06:29:47 2025 From: kbarrett at openjdk.org (Kim Barrett) Date: Sat, 5 Apr 2025 06:29:47 GMT Subject: RFR: 8352116: Deadlock with GCLocker and JVMTI after JDK-8192647 In-Reply-To: References: Message-ID: On Thu, 3 Apr 2025 09:40:19 GMT, Albert Mingkun Yang wrote: > Using a new lock (`JNICritical_lock`) in `GCLocker::block` to resolve a deadlock issue. 
The root cause of the deadlock is that holding `Heap_lock` while waiting in `GCLocker::block` is unsafe. > > The new lock is held from the start of `GCLocker::block` to the end of `GCLocker::unblock`. This requires adjusting `Heap_lock`'s rank to allow acquiring `Heap_lock` while holding `JNICritical_lock`. The most important changes are in `gcVMOperations.cpp` and `mutexLocker.cpp`. > > Test: tier1-8; verified failure can be observed 2/2000 and pass 8000 iterations. Looks good. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24407#pullrequestreview-2744728350 From duke at openjdk.org Mon Apr 7 05:45:54 2025 From: duke at openjdk.org (Saint Wesonga) Date: Mon, 7 Apr 2025 05:45:54 GMT Subject: RFR: 8350722: Serial GC: Remove duplicate logic for detecting pointers in young gen In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 06:54:19 GMT, Saint Wesonga wrote: > Checking whether a pointer is in the young generation is currently done by comparing the pointer to the end of the young generation reserved space. The duplication of these checks in various places complicates any changes the layout of the young generation since all these locations need to be updated. This PR replaces the duplicated logic with the DefNewGeneration::is_in_reserved method. @tschatzl , I'm closing this PR now that I have an updated approach in https://github.com/openjdk/jdk/pull/23853 ------------- PR Comment: https://git.openjdk.org/jdk/pull/23792#issuecomment-2782077611 From duke at openjdk.org Mon Apr 7 05:45:54 2025 From: duke at openjdk.org (Saint Wesonga) Date: Mon, 7 Apr 2025 05:45:54 GMT Subject: Withdrawn: 8350722: Serial GC: Remove duplicate logic for detecting pointers in young gen In-Reply-To: References: Message-ID: <_hkx74X6j9YnTj9Z_dUXjLPXSMY4IeRk3W4Vo5Ti_KI=.0b979267-53cc-4cc4-8f03-c33d726bedc7@github.com> On Wed, 26 Feb 2025 06:54:19 GMT, Saint Wesonga wrote: > Checking whether a pointer is in the young generation is currently done by comparing the pointer to the end of the young generation reserved space. The duplication of these checks in various places complicates any changes the layout of the young generation since all these locations need to be updated. This PR replaces the duplicated logic with the DefNewGeneration::is_in_reserved method. This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.org/jdk/pull/23792 From tschatzl at openjdk.org Mon Apr 7 07:55:52 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 7 Apr 2025 07:55:52 GMT Subject: RFR: 8352116: Deadlock with GCLocker and JVMTI after JDK-8192647 In-Reply-To: References: Message-ID: On Thu, 3 Apr 2025 09:40:19 GMT, Albert Mingkun Yang wrote: > Using a new lock (`JNICritical_lock`) in `GCLocker::block` to resolve a deadlock issue. The root cause of the deadlock is that holding `Heap_lock` while waiting in `GCLocker::block` is unsafe. > > The new lock is held from the start of `GCLocker::block` to the end of `GCLocker::unblock`. This requires adjusting `Heap_lock`'s rank to allow acquiring `Heap_lock` while holding `JNICritical_lock`. The most important changes are in `gcVMOperations.cpp` and `mutexLocker.cpp`. > > Test: tier1-8; verified failure can be observed 2/2000 and pass 8000 iterations. Marked as reviewed by tschatzl (Reviewer). 
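A minimal model of the locking structure described in the quoted text (an assumed shape only, not the actual patch): the new lock is taken at the start of block() and only dropped at the end of unblock(), and the lock ranks are arranged so that Heap_lock can be acquired while it is held, which avoids waiting in block() with Heap_lock already taken, the root cause named above.

    #include <mutex>

    // Toy model: one mutex stands in for JNICritical_lock.
    class GCLockerModel {
      std::mutex _jni_critical_lock;   // ranked so that Heap_lock may be acquired while this is held
      std::unique_lock<std::mutex> _held{_jni_critical_lock, std::defer_lock};
    public:
      void block()   { _held.lock();   }  // taken at the start of block() ...
      void unblock() { _held.unlock(); }  // ... and only released at the end of unblock()
    };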
------------- PR Review: https://git.openjdk.org/jdk/pull/24407#pullrequestreview-2745840662 From tschatzl at openjdk.org Mon Apr 7 07:57:51 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 7 Apr 2025 07:57:51 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v8] In-Reply-To: References: Message-ID: <9nwg79xCItPNaMsHRK6VQFl-dkWPP385vHqhvTYK_k0=.a830743a-5fd6-46a3-87c3-fd2a164ddf6a@github.com> On Thu, 3 Apr 2025 07:08:19 GMT, Man Cao wrote: >> Hi all, >> >> I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: >> >> - does not respect `MinHeapSize`; >> - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; >> - does not affect heuristcs to trigger a concurrent cycle; >> >> [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. > > Man Cao has updated the pull request incrementally with one additional commit since the last revision: > > Use Atomic::load for flag Filed [JDK-8353716](https://bugs.openjdk.org/browse/JDK-8353716). ------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2782349959 From thomas.schatzl at oracle.com Mon Apr 7 09:07:08 2025 From: thomas.schatzl at oracle.com (Thomas Schatzl) Date: Mon, 7 Apr 2025 11:07:08 +0200 Subject: Moving Forward with AHS for G1 In-Reply-To: References: Message-ID: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> Hi all, On 26.03.25 03:33, Monica Beckwith wrote: > Hi Ivan, > Thanks for the note ? and nice to meet you! > > The refinements you're working on around |GCTimeRatio|?and memory > uncommit are valuable contributions to the broader AHS direction we've > been shaping. They align closely with the multi-input heap sizing model > Thomas and I outlined ? especially the emphasis on GC cost (via | > GCTimeRatio|) and memory responsiveness as primary drivers. > > These kinds of enhancements are central to making G1?s heap sizing more > adaptive and responsive, particularly in environments with shifting > workload patterns. I?m especially interested in your work around > improving the GC time-base ? it seems like a crucial piece for > coordinating GC-triggered adjustments more precisely. > > Given the growing collaboration across contributors, I?ve been thinking > of opening an umbrella issue to track these efforts and possibly > drafting a JEP to help clarify and unify the overall scope. With Oracle, > Google, and others actively contributing, it?s exciting to see a shared > vision taking shape ? and your work is clearly part of it. > I created an umbrella CR at https://bugs.openjdk.org/browse/JDK-8353716 supposed to contain latest info on the effort. Feel free to add to it. If possible, I would like to keep the more free-form discussion here in the mailing list though. My bad for not following up on this much much earlier. > I?m genuinely excited to see this come together. Looking forward to > continuing the discussion and shaping the future of G1 ergonomics together. 
> Hth, Thomas From ayang at openjdk.org Mon Apr 7 09:19:03 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 7 Apr 2025 09:19:03 GMT Subject: RFR: 8352116: Deadlock with GCLocker and JVMTI after JDK-8192647 In-Reply-To: References: Message-ID: On Thu, 3 Apr 2025 09:40:19 GMT, Albert Mingkun Yang wrote: > Using a new lock (`JNICritical_lock`) in `GCLocker::block` to resolve a deadlock issue. The root cause of the deadlock is that holding `Heap_lock` while waiting in `GCLocker::block` is unsafe. > > The new lock is held from the start of `GCLocker::block` to the end of `GCLocker::unblock`. This requires adjusting `Heap_lock`'s rank to allow acquiring `Heap_lock` while holding `JNICritical_lock`. The most important changes are in `gcVMOperations.cpp` and `mutexLocker.cpp`. > > Test: tier1-8; verified failure can be observed 2/2000 and pass 8000 iterations. Thanks for review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24407#issuecomment-2782605636 From ayang at openjdk.org Mon Apr 7 09:19:03 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 7 Apr 2025 09:19:03 GMT Subject: Integrated: 8352116: Deadlock with GCLocker and JVMTI after JDK-8192647 In-Reply-To: References: Message-ID: On Thu, 3 Apr 2025 09:40:19 GMT, Albert Mingkun Yang wrote: > Using a new lock (`JNICritical_lock`) in `GCLocker::block` to resolve a deadlock issue. The root cause of the deadlock is that holding `Heap_lock` while waiting in `GCLocker::block` is unsafe. > > The new lock is held from the start of `GCLocker::block` to the end of `GCLocker::unblock`. This requires adjusting `Heap_lock`'s rank to allow acquiring `Heap_lock` while holding `JNICritical_lock`. The most important changes are in `gcVMOperations.cpp` and `mutexLocker.cpp`. > > Test: tier1-8; verified failure can be observed 2/2000 and pass 8000 iterations. This pull request has now been integrated. Changeset: 39549f89 Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/39549f89905019fa90dd20ff8b6822c1351cbaa6 Stats: 31 lines in 4 files changed: 20 ins; 7 del; 4 mod 8352116: Deadlock with GCLocker and JVMTI after JDK-8192647 Reviewed-by: kbarrett, tschatzl, eosterlund ------------- PR: https://git.openjdk.org/jdk/pull/24407 From tschatzl at openjdk.org Mon Apr 7 09:22:50 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 7 Apr 2025 09:22:50 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v8] In-Reply-To: References: Message-ID: On Thu, 3 Apr 2025 07:08:19 GMT, Man Cao wrote: >> Hi all, >> >> I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: >> >> - does not respect `MinHeapSize`; >> - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; >> - does not affect heuristcs to trigger a concurrent cycle; >> >> [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. > > Man Cao has updated the pull request incrementally with one additional commit since the last revision: > > Use Atomic::load for flag Also collected thoughts and existing documents with some additional rough explanations. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2782661911 From shade at openjdk.org Mon Apr 7 10:33:35 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 7 Apr 2025 10:33:35 GMT Subject: RFR: 8348278: Trim InitialRAMPercentage to improve startup in default modes [v3] In-Reply-To: References: Message-ID: > See bug for discussion. This is the code change, which is simple. What is not simple is deciding what the new value should be. The change would probably require CSR, which I can file after we agree on the value. > > I think cutting to 0.2% of RAM size gets us into good sweet spot: > - On huge 1024G machine, this yields 2G initial heap > - On reasonably sized 128G machine, this gives 256M initial heap > - On smaller 1G container, this gives 2M initial heap > > Additional testing: > - [x] Linux AArch64 server fastdebug, `all` Aleksey Shipilev has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: - Merge branch 'master' into JDK-8348278-trim-iramp - Also man page - Merge branch 'master' into JDK-8348278-trim-iramp - Fix ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23262/files - new: https://git.openjdk.org/jdk/pull/23262/files/d3a327ae..6a6c3ab8 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23262&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23262&range=01-02 Stats: 152480 lines in 3423 files changed: 68119 ins; 65042 del; 19319 mod Patch: https://git.openjdk.org/jdk/pull/23262.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23262/head:pull/23262 PR: https://git.openjdk.org/jdk/pull/23262 From shade at openjdk.org Mon Apr 7 10:48:51 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 7 Apr 2025 10:48:51 GMT Subject: RFR: 8348278: Trim InitialRAMPercentage to improve startup in default modes [v3] In-Reply-To: References: Message-ID: <_J82bhnQOjixO9UDu2Mm0CsGVNe9gXXBxayIyv2TFz8=.2deea0ff-c51b-499d-a8fd-1ebc253a9e2d@github.com> On Mon, 7 Apr 2025 10:33:35 GMT, Aleksey Shipilev wrote: >> See bug for discussion. This is the code change, which is simple. What is not simple is deciding what the new value should be. The change would probably require CSR, which I can file after we agree on the value. >> >> I think cutting to 0.2% of RAM size gets us into good sweet spot: >> - On huge 1024G machine, this yields 2G initial heap >> - On reasonably sized 128G machine, this gives 256M initial heap >> - On smaller 1G container, this gives 2M initial heap >> >> Additional testing: >> - [x] Linux AArch64 server fastdebug, `all` > > Aleksey Shipilev has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. 
The pull request contains four additional commits since the last revision: > > - Merge branch 'master' into JDK-8348278-trim-iramp > - Also man page > - Merge branch 'master' into JDK-8348278-trim-iramp > - Fix CSR filed: [JDK-8353837](https://bugs.openjdk.org/browse/JDK-8353837) ------------- PR Comment: https://git.openjdk.org/jdk/pull/23262#issuecomment-2782893464 From jsikstro at openjdk.org Mon Apr 7 11:33:57 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Mon, 7 Apr 2025 11:33:57 GMT Subject: RFR: 8353559: Restructure CollectedHeap error printing In-Reply-To: References: Message-ID: <9tbw7_56t4aDDTVE-KI9b84ccG_Iky2LRhsMmL0gXF0=.f03a1ac0-099f-465d-977d-751f7b5cf7ff@github.com> On Wed, 2 Apr 2025 18:09:12 GMT, Joel Sikstr?m wrote: > Calling Universe::heap()->print_on_error() gets dispatched to the most specific implementation, which for some GCs is their own implementation instead of the default in CollectedHeap. Each GC-specific implementation calls back to CollectedHeap::print_on_error(), which then dispatches back into the specific implementation of print_on(). This is kind of awkward and creates a call-chain that's not straightforward to wrap your head around, jumping back and forth via CollectedHeap and the specific implementation. > > To make the call-chain cleaner, I have made print_on_error() a pure virtual method in CollectedHeap, and implemented print_on_error() in each GC's implementation of CollectedHeap. In addition, I have removed print_extended_on() from CollectedHeap and implemented that for the GCs that actually need/use it. > > Removing the usage of the common print_on_error() also means that GCs that do not print anything interesting for their barrier set can omit this. So, I've removed it from ZGC and Shenandoah. > > To make print_on_error() consistent with print_on(), I have moved the printing of "Heap:" to the caller(s) of print_on_error() (only inside vmError.cpp). This is a trivial change for all GCs except ZGC, which requires some restructuring in its error printing. > > The old and new printing orders are shown below for ZGC: > > # Old > > > > > > > > > > # New > > > > > > > > Testing: > * GHA > * Tiers 1 & 2 > * Manually verified that printing still works and outputs the intended information via running the following commands and comparing the output. > > ../fastdebug-old/jdk/bin/java -XX:ErrorHandlerTest=14 -XX:+ErrorFileToStdout -XX:+Use${gc}GC --version > ${gc}_old.txt > ../fastdebug-new/jdk/bin/java -XX:ErrorHandlerTest=14 -XX:+ErrorFileToStdout -XX:+Use${gc}GC --version > ${gc}_new.txt Since this is a relatively small change, I'm hoping that the Shenandoah devs are on board. I am going to integrate this now so that we can continue working in this area in ZGC. I am happy to follow up on this if there are any more opinions in the future. Thanks for the reviews! ------------- PR Comment: https://git.openjdk.org/jdk/pull/24387#issuecomment-2783006637 From jsikstro at openjdk.org Mon Apr 7 11:33:58 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Mon, 7 Apr 2025 11:33:58 GMT Subject: Integrated: 8353559: Restructure CollectedHeap error printing In-Reply-To: References: Message-ID: On Wed, 2 Apr 2025 18:09:12 GMT, Joel Sikstr?m wrote: > Calling Universe::heap()->print_on_error() gets dispatched to the most specific implementation, which for some GCs is their own implementation instead of the default in CollectedHeap. 
Each GC-specific implementation calls back to CollectedHeap::print_on_error(), which then dispatches back into the specific implementation of print_on(). This is kind of awkward and creates a call-chain that's not straightforward to wrap your head around, jumping back and forth via CollectedHeap and the specific implementation. > > To make the call-chain cleaner, I have made print_on_error() a pure virtual method in CollectedHeap, and implemented print_on_error() in each GC's implementation of CollectedHeap. In addition, I have removed print_extended_on() from CollectedHeap and implemented that for the GCs that actually need/use it. > > Removing the usage of the common print_on_error() also means that GCs that do not print anything interesting for their barrier set can omit this. So, I've removed it from ZGC and Shenandoah. > > To make print_on_error() consistent with print_on(), I have moved the printing of "Heap:" to the caller(s) of print_on_error() (only inside vmError.cpp). This is a trivial change for all GCs except ZGC, which requires some restructuring in its error printing. > > The old and new printing orders are shown below for ZGC: > > # Old > > > > > > > > > > # New > > > > > > > > Testing: > * GHA > * Tiers 1 & 2 > * Manually verified that printing still works and outputs the intended information via running the following commands and comparing the output. > > ../fastdebug-old/jdk/bin/java -XX:ErrorHandlerTest=14 -XX:+ErrorFileToStdout -XX:+Use${gc}GC --version > ${gc}_old.txt > ../fastdebug-new/jdk/bin/java -XX:ErrorHandlerTest=14 -XX:+ErrorFileToStdout -XX:+Use${gc}GC --version > ${gc}_new.txt This pull request has now been integrated. Changeset: c494a00a Author: Joel Sikstr?m URL: https://git.openjdk.org/jdk/commit/c494a00a66d21d2e403fd9ce253eb132c34e455d Stats: 141 lines in 16 files changed: 75 ins; 52 del; 14 mod 8353559: Restructure CollectedHeap error printing Reviewed-by: stefank, eosterlund, ayang ------------- PR: https://git.openjdk.org/jdk/pull/24387 From ysr at openjdk.org Tue Apr 8 01:29:09 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Tue, 8 Apr 2025 01:29:09 GMT Subject: RFR: 8353218: Shenandoah: Out of date comment references Brooks pointers In-Reply-To: References: Message-ID: <-zSlCWIHyxeR9-mjP1si49UGzRl9qMSSWscVELQxYAQ=.8f6e1108-bd47-49f8-918b-c2f6c9eb640b@github.com> On Fri, 28 Mar 2025 23:33:48 GMT, William Kemper wrote: > Trivial change, comment only. Thanks for fixing this! ------------- Marked as reviewed by ysr (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24304#pullrequestreview-2748445404 From tschatzl at openjdk.org Tue Apr 8 11:57:21 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 8 Apr 2025 11:57:21 GMT Subject: RFR: 8352994: ZGC: Fix regression introduced in JDK-8350572 [v2] In-Reply-To: References: Message-ID: <_1K7Q1L9cPr-wd5jefhS6rBjR0sJvbBWsjf71YbR6k4=.0c0c89ae-15a8-4e9b-a3fb-7c028740b15c@github.com> On Wed, 2 Apr 2025 11:15:01 GMT, Stefan Karlsson wrote: >> We have seen a bunch of timeouts that all points towards the introduction of a check against VMError::is_error_reported_in_current_thread() in the ZGC verification code. I propose this workaround to first check if there's really an error reporting event that is going on by checking VMError::is_error_reported(). >> >> The underlying performance issue (or hang(?)) when calling os::current_thread_id() is being investigated as a separate bug. This fix just tries to clean up issues we see when running ZGC testing. 
>> >> Thanks to @plummercj for digging into this and proposing the same workaround. >> >> Testing: GHA is clean, I'll run this through a few tiers of our CI pipeline > > Stefan Karlsson has updated the pull request incrementally with one additional commit since the last revision: > > Remove test from ProblemList Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/24349#pullrequestreview-2749704531 From stefank at openjdk.org Tue Apr 8 15:22:49 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Tue, 8 Apr 2025 15:22:49 GMT Subject: RFR: 8352994: ZGC: Fix regression introduced in JDK-8350572 [v3] In-Reply-To: References: Message-ID: > We have seen a bunch of timeouts that all points towards the introduction of a check against VMError::is_error_reported_in_current_thread() in the ZGC verification code. I propose this workaround to first check if there's really an error reporting event that is going on by checking VMError::is_error_reported(). > > The underlying performance issue (or hang(?)) when calling os::current_thread_id() is being investigated as a separate bug. This fix just tries to clean up issues we see when running ZGC testing. > > Thanks to @plummercj for digging into this and proposing the same workaround. > > Testing: GHA is clean, I'll run this through a few tiers of our CI pipeline Stefan Karlsson has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision: - Merge remote-tracking branch 'upstream/master' into 8352994_is_error_reported - Remove test from ProblemList - 8352994: ZGC: Fix regression introduced in JDK-8350572 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24349/files - new: https://git.openjdk.org/jdk/pull/24349/files/fe07a340..4720444d Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24349&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24349&range=01-02 Stats: 26029 lines in 781 files changed: 18783 ins; 5145 del; 2101 mod Patch: https://git.openjdk.org/jdk/pull/24349.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24349/head:pull/24349 PR: https://git.openjdk.org/jdk/pull/24349 From ysr at openjdk.org Tue Apr 8 21:54:25 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Tue, 8 Apr 2025 21:54:25 GMT Subject: RFR: 8351091: Shenandoah: global marking context completeness is not accurately maintained [v4] In-Reply-To: References: Message-ID: On Fri, 7 Mar 2025 00:33:58 GMT, Xiaolong Peng wrote: >> Right, active_generation should be used instead of global_generation to get the complete marking context, with the context of full GC, even we know it active_generation is the global gen, but it's better not to use global_generation directly for better maintainable code. > > Updated it to use active_generation. Thanks for the fixes; this looks good! ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23886#discussion_r2034084571 From wkemper at openjdk.org Tue Apr 8 22:04:34 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 8 Apr 2025 22:04:34 GMT Subject: Integrated: 8353218: Shenandoah: Out of date comment references Brooks pointers In-Reply-To: References: Message-ID: On Fri, 28 Mar 2025 23:33:48 GMT, William Kemper wrote: > Trivial change, comment only. This pull request has now been integrated. 
Changeset: b4ab964b Author: William Kemper URL: https://git.openjdk.org/jdk/commit/b4ab964b72c631632511e6f01cdd5a47fb2e31fa Stats: 3 lines in 1 file changed: 0 ins; 0 del; 3 mod 8353218: Shenandoah: Out of date comment references Brooks pointers Reviewed-by: ysr, kdnilsen ------------- PR: https://git.openjdk.org/jdk/pull/24304 From ysr at openjdk.org Tue Apr 8 23:30:27 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Tue, 8 Apr 2025 23:30:27 GMT Subject: RFR: 8351091: Shenandoah: global marking context completeness is not accurately maintained [v8] In-Reply-To: References: Message-ID: On Fri, 4 Apr 2025 18:18:30 GMT, Xiaolong Peng wrote: >> With the JEP 404: Generational Shenandoah implementation, there are generation specific marking completeness flags introduced, and the global marking context completeness flag is not updated at all after initialization, hence the global marking context completeness is not accurate anymore. This may cause expected behavior: [ShenandoahHeap::complete_marking_context()](https://github.com/openjdk/jdk/pull/23886/files#diff-d5ddf298c36b1c91bf33f9bff7bedcc063074edd68c298817f1fdf39d2ed970fL642) should throw assert error if the global marking context completeness flag is false, but now it always return the marking context even it marking is not complete, this may hide bugs where we expect the global/generational marking to be completed. >> >> This change PR fix the bug in global marking context completeness flag, and update all the places using `ShenandoahHeap::complete_marking_context()` to use proper API. >> >> ### Test >> - [x] hotspot_gc_shenandoah >> - [x] Tier 1 >> - [x] Tier 2 > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Address PR comments LGTM! Thanks for your patience! ? ------------- Marked as reviewed by ysr (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23886#pullrequestreview-2751608049 From ysr at openjdk.org Tue Apr 8 23:30:28 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Tue, 8 Apr 2025 23:30:28 GMT Subject: RFR: 8351091: Shenandoah: global marking context completeness is not accurately maintained [v7] In-Reply-To: References: <5Yxk8oBN69i5Ty_jRCtXoLeNjyet6DEySoFqnzxrblk=.9a1ad401-9da2-4d06-8e22-c51d810dd2f8@github.com> <6sjBSQODcXKXzjvshAJiHq96N4Ler-TEBaSuN4nNr6w=.a6ee8ec7-9a3e-49ae-9718-8d1a027e6420@github.com> <6dN8IY3rHlVn2aiHJwWdB-OKbbx8GABuvau9-Bdw6vU=.a74101a0-845d-4174-a87a-b41674e90579@github.com> Message-ID: On Fri, 4 Apr 2025 18:09:36 GMT, Xiaolong Peng wrote: >> Curious; in that case should it not have failed in your testing because the objects not pinned may not have been marked as the verifier would have insisted they were? Why do we leave the regions with pinned objects marked? I am guessing once we have filled in the dead objects, the marks do not serve any purpose? >> >> May be I am missing some corner case? > > It does, one of the changes in https://github.com/openjdk/jdk/pull/24092 is to set the marking completeness flag to false after Full GC because the bitmaps have been reset, `_verify_marked_complete` requires complete marking marking context so there is assert error. Thanks; I looked through the code and see where I had confused myself above. This looks good to me. 
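For reference, the invariant discussed in this thread can be pictured roughly as follows; the struct and field names below are made up for illustration and are not the actual Shenandoah accessors:

```c++
#include <cassert>

// Illustrative stand-ins for the real Shenandoah types (names are made up).
struct MarkingContext { bool complete; };

struct HeapSketch {
  MarkingContext ctx;

  // Accessor for callers that require a finished mark: asserts completeness
  // instead of silently handing out a possibly stale context.
  MarkingContext* complete_marking_context() {
    assert(ctx.complete && "marking context must be complete");
    return &ctx;
  }

  // Accessor without the guarantee; callers check completeness themselves.
  MarkingContext* marking_context() { return &ctx; }
};
```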
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23886#discussion_r2034163269 From xpeng at openjdk.org Tue Apr 8 23:45:33 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Tue, 8 Apr 2025 23:45:33 GMT Subject: RFR: 8351091: Shenandoah: global marking context completeness is not accurately maintained [v8] In-Reply-To: References: Message-ID: On Fri, 4 Apr 2025 18:18:30 GMT, Xiaolong Peng wrote: >> With the JEP 404: Generational Shenandoah implementation, there are generation specific marking completeness flags introduced, and the global marking context completeness flag is not updated at all after initialization, hence the global marking context completeness is not accurate anymore. This may cause expected behavior: [ShenandoahHeap::complete_marking_context()](https://github.com/openjdk/jdk/pull/23886/files#diff-d5ddf298c36b1c91bf33f9bff7bedcc063074edd68c298817f1fdf39d2ed970fL642) should throw assert error if the global marking context completeness flag is false, but now it always return the marking context even it marking is not complete, this may hide bugs where we expect the global/generational marking to be completed. >> >> This change PR fix the bug in global marking context completeness flag, and update all the places using `ShenandoahHeap::complete_marking_context()` to use proper API. >> >> ### Test >> - [x] hotspot_gc_shenandoah >> - [x] Tier 1 >> - [x] Tier 2 > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Address PR comments thanks all for the reviews and suggestions! ------------- PR Comment: https://git.openjdk.org/jdk/pull/23886#issuecomment-2787875051 From duke at openjdk.org Tue Apr 8 23:45:33 2025 From: duke at openjdk.org (duke) Date: Tue, 8 Apr 2025 23:45:33 GMT Subject: RFR: 8351091: Shenandoah: global marking context completeness is not accurately maintained [v8] In-Reply-To: References: Message-ID: <3CxQWRmVEeYX_O3D2Lh5-1GiTRLSZRkaNKDc3ztM2ZE=.68ecc2fe-b5e1-4e62-a58e-0de858d9dc5f@github.com> On Fri, 4 Apr 2025 18:18:30 GMT, Xiaolong Peng wrote: >> With the JEP 404: Generational Shenandoah implementation, there are generation specific marking completeness flags introduced, and the global marking context completeness flag is not updated at all after initialization, hence the global marking context completeness is not accurate anymore. This may cause expected behavior: [ShenandoahHeap::complete_marking_context()](https://github.com/openjdk/jdk/pull/23886/files#diff-d5ddf298c36b1c91bf33f9bff7bedcc063074edd68c298817f1fdf39d2ed970fL642) should throw assert error if the global marking context completeness flag is false, but now it always return the marking context even it marking is not complete, this may hide bugs where we expect the global/generational marking to be completed. >> >> This change PR fix the bug in global marking context completeness flag, and update all the places using `ShenandoahHeap::complete_marking_context()` to use proper API. >> >> ### Test >> - [x] hotspot_gc_shenandoah >> - [x] Tier 1 >> - [x] Tier 2 > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Address PR comments @pengxiaolong Your change (at version d4af962adb11c03281af80ecfc12344dac01b11a) is now ready to be sponsored by a Committer. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23886#issuecomment-2787877229 From xpeng at openjdk.org Tue Apr 8 23:45:34 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Tue, 8 Apr 2025 23:45:34 GMT Subject: RFR: 8351091: Shenandoah: global marking context completeness is not accurately maintained [v7] In-Reply-To: References: <5Yxk8oBN69i5Ty_jRCtXoLeNjyet6DEySoFqnzxrblk=.9a1ad401-9da2-4d06-8e22-c51d810dd2f8@github.com> <6sjBSQODcXKXzjvshAJiHq96N4Ler-TEBaSuN4nNr6w=.a6ee8ec7-9a3e-49ae-9718-8d1a027e6420@github.com> <6dN8IY3rHlVn2aiHJwWdB-OKbbx8GABuvau9-Bdw6vU=.a74101a0-845d-4174-a87a-b41674e90579@github.com> Message-ID: <6D387djX5BAacoBeaJCLj1HGYsNoRm3lTWVipWp6vn0=.ed5303a1-0107-405f-a0d0-e1360315fc46@github.com> On Tue, 8 Apr 2025 23:27:33 GMT, Y. Srinivas Ramakrishna wrote: >> It does, one of the changes in https://github.com/openjdk/jdk/pull/24092 is to set the marking completeness flag to false after Full GC because the bitmaps have been reset, `_verify_marked_complete` requires complete marking marking context so there is assert error. > > Thanks; I looked through the code and see where I had confused myself above. This looks good to me. thank you! ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23886#discussion_r2034171337 From kdnilsen at openjdk.org Wed Apr 9 00:20:29 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 9 Apr 2025 00:20:29 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data In-Reply-To: References: Message-ID: On Tue, 1 Apr 2025 17:49:38 GMT, William Kemper wrote: >> The existing implementation of get_live_data_bytes() and git_live_data_words() does not always behave as might be expected. In particular, the value returned ignores any allocations that occur subsequent to the most recent mark effort that identified live data within the region. This is typically ok for young regions that are going to be added or not to the collection set during final-mark safepoint. >> >> However, old-gen regions that are placed into the set of candidates for mixed evacuation are more complicated. In particular, by the time the old-gen region is added to a mixed evacuation, its live data may be much larger than at the time concurrent old marking ended. >> >> This PR provides comments to clarify the shortcomings of the existing functions, and adds new functions that provide more accurate accountings of live data for mixed-evacuation candidate regions. > > src/hotspot/share/gc/shenandoah/shenandoahHeapRegion.inline.hpp line 159: > >> 157: >> 158: inline size_t ShenandoahHeapRegion::get_mixed_candidate_live_data_bytes() const { >> 159: assert(SafepointSynchronize::is_at_safepoint(), "Should be at Shenandoah safepoint"); > > Could we use `shenandoah_assert_safepoint` here (and other places) instead? Good call. I'll make this change. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24319#discussion_r2034198164 From kdnilsen at openjdk.org Wed Apr 9 00:29:25 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 9 Apr 2025 00:29:25 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data In-Reply-To: References: Message-ID: On Tue, 1 Apr 2025 18:16:43 GMT, William Kemper wrote: >> The existing implementation of get_live_data_bytes() and git_live_data_words() does not always behave as might be expected. 
In particular, the value returned ignores any allocations that occur subsequent to the most recent mark effort that identified live data within the region. This is typically ok for young regions that are going to be added or not to the collection set during final-mark safepoint. >> >> However, old-gen regions that are placed into the set of candidates for mixed evacuation are more complicated. In particular, by the time the old-gen region is added to a mixed evacuation, its live data may be much larger than at the time concurrent old marking ended. >> >> This PR provides comments to clarify the shortcomings of the existing functions, and adds new functions that provide more accurate accountings of live data for mixed-evacuation candidate regions. > > src/hotspot/share/gc/shenandoah/shenandoahHeapRegion.cpp line 78: > >> 76: _live_data(0), >> 77: _critical_pins(0), >> 78: _mixed_candidate_garbage_words(0), > > Do we need a new field to track this? During `final_mark`, we call `increase_live_data_alloc_words` to add `TAMS + top` to `_live_data` to account for objects allocated during mark. Could we "fix" `get_live_data` so that it always returned marked objects (counted by `increase_live_data_gc_words`) _plus_ `top - TAMS`. This way, the live data would not become stale after `final_mark` and we wouldn't have another field to manage. What do you think? This is a good idea. Let me experiment with this. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24319#discussion_r2034208988 From xpeng at openjdk.org Wed Apr 9 01:02:41 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 9 Apr 2025 01:02:41 GMT Subject: Integrated: 8351091: Shenandoah: global marking context completeness is not accurately maintained In-Reply-To: References: Message-ID: On Tue, 4 Mar 2025 08:34:16 GMT, Xiaolong Peng wrote: > With the JEP 404: Generational Shenandoah implementation, there are generation specific marking completeness flags introduced, and the global marking context completeness flag is not updated at all after initialization, hence the global marking context completeness is not accurate anymore. This may cause expected behavior: [ShenandoahHeap::complete_marking_context()](https://github.com/openjdk/jdk/pull/23886/files#diff-d5ddf298c36b1c91bf33f9bff7bedcc063074edd68c298817f1fdf39d2ed970fL642) should throw assert error if the global marking context completeness flag is false, but now it always return the marking context even it marking is not complete, this may hide bugs where we expect the global/generational marking to be completed. > > This change PR fix the bug in global marking context completeness flag, and update all the places using `ShenandoahHeap::complete_marking_context()` to use proper API. > > ### Test > - [x] hotspot_gc_shenandoah > - [x] Tier 1 > - [x] Tier 2 This pull request has now been integrated. Changeset: aec1fe0a Author: Xiaolong Peng Committer: Y. 
Srinivas Ramakrishna URL: https://git.openjdk.org/jdk/commit/aec1fe0a17fa6801e26a517d4d21656353409f7c Stats: 71 lines in 17 files changed: 7 ins; 34 del; 30 mod 8351091: Shenandoah: global marking context completeness is not accurately maintained Reviewed-by: ysr, wkemper ------------- PR: https://git.openjdk.org/jdk/pull/23886 From kdnilsen at openjdk.org Wed Apr 9 01:55:48 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 9 Apr 2025 01:55:48 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data [v2] In-Reply-To: References: Message-ID: <8UF5sC8lbb-hBUpkbzDarvFxOlbQU0nDPbTqWhAedM0=.e078bb2a-2331-47f7-aa67-807d09c4ca11@github.com> > The existing implementation of get_live_data_bytes() and git_live_data_words() does not always behave as might be expected. In particular, the value returned ignores any allocations that occur subsequent to the most recent mark effort that identified live data within the region. This is typically ok for young regions that are going to be added or not to the collection set during final-mark safepoint. > > However, old-gen regions that are placed into the set of candidates for mixed evacuation are more complicated. In particular, by the time the old-gen region is added to a mixed evacuation, its live data may be much larger than at the time concurrent old marking ended. > > This PR provides comments to clarify the shortcomings of the existing functions, and adds new functions that provide more accurate accountings of live data for mixed-evacuation candidate regions. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Experiment with reviewer suggestion Redefine the way ShenandoahHeapRegion::get_live_data_ works to simplify changes. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24319/files - new: https://git.openjdk.org/jdk/pull/24319/files/70613882..3c1f788a Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24319&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24319&range=00-01 Stats: 28 lines in 5 files changed: 15 ins; 2 del; 11 mod Patch: https://git.openjdk.org/jdk/pull/24319.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24319/head:pull/24319 PR: https://git.openjdk.org/jdk/pull/24319 From stefank at openjdk.org Wed Apr 9 06:22:35 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Wed, 9 Apr 2025 06:22:35 GMT Subject: RFR: 8352994: ZGC: Fix regression introduced in JDK-8350572 [v3] In-Reply-To: References: Message-ID: On Tue, 8 Apr 2025 15:22:49 GMT, Stefan Karlsson wrote: >> We have seen a bunch of timeouts that all points towards the introduction of a check against VMError::is_error_reported_in_current_thread() in the ZGC verification code. I propose this workaround to first check if there's really an error reporting event that is going on by checking VMError::is_error_reported(). >> >> The underlying performance issue (or hang(?)) when calling os::current_thread_id() is being investigated as a separate bug. This fix just tries to clean up issues we see when running ZGC testing. >> >> Thanks to @plummercj for digging into this and proposing the same workaround. >> >> Testing: GHA is clean, I'll run this through a few tiers of our CI pipeline > > Stefan Karlsson has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. 
The pull request contains three additional commits since the last revision: > > - Merge remote-tracking branch 'upstream/master' into 8352994_is_error_reported > - Remove test from ProblemList > - 8352994: ZGC: Fix regression introduced in JDK-8350572 Thanks for the reviews! ------------- PR Comment: https://git.openjdk.org/jdk/pull/24349#issuecomment-2788390540 From stefank at openjdk.org Wed Apr 9 06:22:35 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Wed, 9 Apr 2025 06:22:35 GMT Subject: Integrated: 8352994: ZGC: Fix regression introduced in JDK-8350572 In-Reply-To: References: Message-ID: On Tue, 1 Apr 2025 06:58:56 GMT, Stefan Karlsson wrote: > We have seen a bunch of timeouts that all points towards the introduction of a check against VMError::is_error_reported_in_current_thread() in the ZGC verification code. I propose this workaround to first check if there's really an error reporting event that is going on by checking VMError::is_error_reported(). > > The underlying performance issue (or hang(?)) when calling os::current_thread_id() is being investigated as a separate bug. This fix just tries to clean up issues we see when running ZGC testing. > > Thanks to @plummercj for digging into this and proposing the same workaround. > > Testing: GHA is clean, I'll run this through a few tiers of our CI pipeline This pull request has now been integrated. Changeset: 3340e13f Author: Stefan Karlsson URL: https://git.openjdk.org/jdk/commit/3340e13fd0a8d25212003e8371a135471b2f44b3 Stats: 2 lines in 2 files changed: 0 ins; 1 del; 1 mod 8352994: ZGC: Fix regression introduced in JDK-8350572 Reviewed-by: aboldtch, tschatzl ------------- PR: https://git.openjdk.org/jdk/pull/24349 From manc at openjdk.org Wed Apr 9 07:27:33 2025 From: manc at openjdk.org (Man Cao) Date: Wed, 9 Apr 2025 07:27:33 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v8] In-Reply-To: References: Message-ID: On Thu, 3 Apr 2025 07:08:19 GMT, Man Cao wrote: >> Hi all, >> >> I have implemented SoftMaxHeapSize for G1 as attached. It is completely reworked compared to [previous PR](https://github.com/openjdk/jdk/pull/20783), and excludes code for `CurrentMaxHeapSize`. I believe I have addressed all direct concerns from [previous email thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050214.html), such as: >> >> - does not respect `MinHeapSize`; >> - being too "blunt" and does not respect other G1 heuristics and flags for resizing, such as `MinHeapFreeRatio`, `MaxHeapFreeRatio`; >> - does not affect heuristcs to trigger a concurrent cycle; >> >> [This recent thread](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html) also has some context. > > Man Cao has updated the pull request incrementally with one additional commit since the last revision: > > Use Atomic::load for flag Thank you for creating [JDK-8353716](https://bugs.openjdk.org/browse/JDK-8353716)! > Last time this has been mentioned in the hotspot-gc-dev list has been [here](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-February/051079.html). I remember giving multiple outlines to everyone involved earlier, each mentioning that `Min/MaxHeapFreeRatio` need to go away because it's in the way, so I was/am a bit surprised on this response. Apology for overlooking previous mentions about `Min/MaxHeapFreeRatio`. Previous mentions were mostly inside responses to complicated issues, and I have hardly got the time to follow hotspot-gc-dev closely. 
To be honest, we didn't pay much attention to `Min/MaxHeapFreeRatio` before I started working on this PR. I guess this is a good example that a one-pager doc/umbrella bug provides cleaner communication and additional values over email discussion, especially when one party already has a pretty detailed plan for how it should be done. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2788609820 From manc at google.com Wed Apr 9 07:44:08 2025 From: manc at google.com (Man Cao) Date: Wed, 9 Apr 2025 00:44:08 -0700 Subject: Moving Forward with AHS for G1 In-Reply-To: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> Message-ID: Hi all, Thank you Thomas for creating the umbrella CR at https://bugs.openjdk.org/browse/JDK-8353716. While waiting a bit on SoftMaxHeapSize PR ( https://github.com/openjdk/jdk/pull/24211) to see if others have feedback, I could start working on CurrentMaxHeapSize ( https://bugs.openjdk.org/browse/JDK-8204088). I also agree that CurrentMaxHeapSize may not need a JEP due to its small size and low complexity. Should it proceed similarly to how SoftMaxHeapSize was introduced? I.e, https://bugs.openjdk.org/browse/JDK-8222145, and creating a CSR (https://bugs.openjdk.org/browse/JDK-8222181) for it. Separately, for removing support for Min/MaxHeapFreeRatio for G1 (mentioned in https://bugs.openjdk.org/browse/JDK-8353716 and https://bugs.openjdk.org/browse/JDK-8238686), how do we handle existing users that set these two flags? (We have very few internal users setting these two flags. But yesterday I ran into a use case that sets -XX:MinHeapFreeRatio=0 -XX:MaxHeapFreeRatio=0 for G1...) Best, Man -------------- next part -------------- An HTML attachment was scrubbed... URL: From tschatzl at openjdk.org Wed Apr 9 07:56:37 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 9 Apr 2025 07:56:37 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v8] In-Reply-To: References: Message-ID: On Wed, 9 Apr 2025 07:24:43 GMT, Man Cao wrote: > > Last time this has been mentioned in the hotspot-gc-dev list has been [here](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-February/051079.html). I remember giving multiple outlines to everyone involved earlier, each mentioning that `Min/MaxHeapFreeRatio` need to go away because it's in the way, so I was/am a bit surprised on this response. > > Apology for overlooking previous mentions about `Min/MaxHeapFreeRatio`. Previous mentions were mostly inside responses to complicated issues, and I have hardly got the time to follow hotspot-gc-dev closely. To be honest, we didn't pay much attention to `Min/MaxHeapFreeRatio` before I started working on this PR. > > I guess this is a good example that a one-pager doc/umbrella bug provides cleaner communication and additional values over email discussion, especially when one party already has a pretty detailed plan for how it should be done. Don't worry, I should have been better with following up with that summary about thoughts/plans communicated so far somewhere publicly. Let's go forward with that CR summarizing the respective (current) general direction. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2788687511 From ayang at openjdk.org Wed Apr 9 10:36:44 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 9 Apr 2025 10:36:44 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v30] In-Reply-To: References: Message-ID: On Fri, 4 Apr 2025 08:10:34 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. >> >> The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. >> >> ### Current situation >> >> With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. >> >> The main reason for the current barrier is how g1 implements concurrent refinement: >> * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. >> * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, >> * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. >> >> These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: >> >> >> // Filtering >> if (region(@x.a) == region(y)) goto done; // same region check >> if (y == null) goto done; // null value check >> if (card(@x.a) == young_card) goto done; // write to young gen check >> StoreLoad; // synchronize >> if (card(@x.a) == dirty_card) goto done; >> >> *card(@x.a) = dirty >> >> // Card tracking >> enqueue(card-address(@x.a)) into thread-local-dcq; >> if (thread-local-dcq is not full) goto done; >> >> call runtime to move thread-local-dcq into dcqs >> >> done: >> >> >> Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. >> >> The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. >> >> There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). >> >> The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching c... > > Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains 39 commits: > > - * missing file from merge > - Merge branch 'master' into 8342382-card-table-instead-of-dcq > - Merge branch 'master' into 8342382-card-table-instead-of-dcq > - Merge branch 'master' into 8342382-card-table-instead-of-dcq > - Merge branch 'master' into submit/8342382-card-table-instead-of-dcq > - * make young gen length revising independent of refinement thread > * use a service task > * both refinement control thread and young gen length revising use the same infrastructure to get the number of available bytes and determine the time to the next update > - * fix IR code generation tests that change due to barrier cost changes > - * factor out card table and refinement table merging into a single > method > - Merge branch 'master' into 8342382-card-table-instead-of-dcq3 > - * obsolete G1UpdateBufferSize > > G1UpdateBufferSize has previously been used to size the refinement > buffers and impose a minimum limit on the number of cards per thread > that need to be pending before refinement starts. > > The former function is now obsolete with the removal of the dirty > card queues, the latter functionality has been taken over by the new > diagnostic option `G1PerThreadPendingCardThreshold`. > > I prefer to make this a diagnostic option is better than a product option > because it is something that is only necessary for some test cases to > produce some otherwise unwanted behavior (continuous refinement). > > CSR is pending. > - ... and 29 more: https://git.openjdk.org/jdk/compare/41d4a0d7...1c5a669f src/hotspot/share/gc/g1/g1ConcurrentRefine.cpp line 170: > 168: } > 169: return result; > 170: } I see in `G1ConcurrentRefineThread::do_refinement`: // The yielding may have completed the task, check. if (!state.is_in_progress()) { I wonder if it's simpler to use `is_in_progress` consistently to detect whether we should restart sweep, instead of `_sweep_start_epoch`. src/hotspot/share/gc/g1/g1ConcurrentRefine.cpp line 349: > 347: } > 348: > 349: bool has_sweep_rt_work = is_in_progress() && _state == State::SweepRT; Why `is_in_progress()`? src/hotspot/share/gc/g1/g1ConcurrentRefineStats.hpp line 79: > 77: > 78: void inc_cards_scanned(size_t increment = 1) { _cards_scanned += increment; } > 79: void inc_cards_clean(size_t increment = 1) { _cards_clean += increment; } The sole caller always passes in arg, so no need for default-arg-value. src/hotspot/share/gc/g1/g1ConcurrentRefineStats.hpp line 87: > 85: void add_atomic(G1ConcurrentRefineStats* other); > 86: > 87: G1ConcurrentRefineStats& operator+=(const G1ConcurrentRefineStats& other); Seems that these operators are not used after this PR. src/hotspot/share/gc/g1/g1ConcurrentRefineSweepTask.cpp line 83: > 81: break; > 82: } > 83: case G1RemSet::HasRefToOld : break; // Nothing special to do. Why doesn't call `inc_cards_clean_again` in this case? The card is cleared also. (In fact, I don't get why this needs to a separate case from `NoInteresting`.) src/hotspot/share/gc/g1/g1ConcurrentRefineSweepTask.cpp line 156: > 154: > 155: _refine_stats.inc_cards_scanned(claim.size()); > 156: _refine_stats.inc_cards_clean(claim.size() - scanned); I feel these two "scanned" mean sth diff; the local var should probably be sth like `num_dirty_cards`. 
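To make the scanned/clean distinction in these stats concrete, the per-claim accounting could be pictured roughly like this; the struct and method names are illustrative, not taken from the patch:

```c++
#include <cstddef>

// Illustrative only: account for one claimed chunk of the refinement table.
// 'cards_in_claim' is everything the sweep looked at; 'num_dirty_cards' is
// the subset that was actually dirty and had to be scanned for references.
struct RefineStatsSketch {
  size_t cards_scanned = 0;  // all cards visited by the sweep
  size_t cards_clean   = 0;  // visited cards that turned out to be clean

  void account_claim(size_t cards_in_claim, size_t num_dirty_cards) {
    cards_scanned += cards_in_claim;
    cards_clean   += cards_in_claim - num_dirty_cards;
  }
};
```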
src/hotspot/share/gc/g1/g1ConcurrentRefineThread.cpp line 207: > 205: > 206: if (!interrupted_by_gc) { > 207: state.add_yield_duration(G1CollectedHeap::heap()->safepoint_duration() - synchronize_duration_at_sweep_start); I think this is recorded to later calculate actual refine-time, i.e. sweep-time - yield-time. However, why can't yield-duration be recorded in this refine-control-thread directly -- accumulation of `jlong yield_duration = os::elapsed_counter() - yield_start`. I feel that is easier to reason than going through g1heap. src/hotspot/share/gc/g1/g1ReviseYoungListTargetLengthTask.cpp line 75: > 73: { > 74: MutexLocker x(G1ReviseYoungLength_lock, Mutex::_no_safepoint_check_flag); > 75: G1Policy* p = g1h->policy(); Can probably use the existing `policy`. src/hotspot/share/gc/g1/g1ReviseYoungListTargetLengthTask.cpp line 88: > 86: } > 87: > 88: G1ReviseYoungLengthTargetLengthTask::G1ReviseYoungLengthTargetLengthTask(const char* name) : I wonder if the class name can be shortened a bit, sth like `G1ReviseYoungLengthTask`. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2033251162 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2033222407 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2033929489 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2033975054 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2033934399 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2033910496 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2032008908 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2029855278 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2029855435 From duke at openjdk.org Wed Apr 9 10:48:48 2025 From: duke at openjdk.org (Tongbao Zhang) Date: Wed, 9 Apr 2025 10:48:48 GMT Subject: RFR: 8354145: G1GC: keep the CompressedOops same as before when not setting HeapRegionSize explicitly Message-ID: After [JDK-8275056](https://bugs.openjdk.org/browse/JDK-8275056), The max heap region size became 512M, and the calculation of CompressedOops based on the max_heap_size - max_heap_region_size. So before this patch, the CompressedOops will turn on below 32G - 32m, After this patch is 32G -512m. When our Apps migrating from JDK11 to JDK21, the heap size parameters(Xmx32736m) will turn off the CompressedOops. Since the current max ergonomics size is still 32m, We hoped that the original behavior will not be changed if HeapRegionSize is not explicitly set. 
before this patch: ./build/linux-x86_64-server-release/images/jdk/bin/java -Xmx32736m -XX:+PrintFlagsFinal -version | grep CompressedOops bool UseCompressedOops = false {product lp64_product} {default} openjdk version "25-internal" 2025-09-16 OpenJDK Runtime Environment (build 25-internal-adhoc.root.jdk) OpenJDK 64-Bit Server VM (build 25-internal-adhoc.root.jdk, mixed mode, sharing) after this patch: ./build/linux-x86_64-server-release/images/jdk/bin/java -Xmx32736m -XX:+PrintFlagsFinal -version | grep CompressedOops bool UseCompressedOops = true {product lp64_product} {ergonomic} openjdk version "25-internal" 2025-09-16 OpenJDK Runtime Environment (build 25-internal-adhoc.root.jdk) OpenJDK 64-Bit Server VM (build 25-internal-adhoc.root.jdk, mixed mode, sharing) ------------- Commit messages: - 8354145: G1GC: keep the CompressedOops same as before when not setting HeapRegionSize explicitly Changes: https://git.openjdk.org/jdk/pull/24541/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24541&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8354145 Stats: 8 lines in 3 files changed: 8 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/24541.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24541/head:pull/24541 PR: https://git.openjdk.org/jdk/pull/24541 From tschatzl at openjdk.org Wed Apr 9 11:26:24 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 9 Apr 2025 11:26:24 GMT Subject: RFR: 8354145: G1GC: keep the CompressedOops same as before when not setting HeapRegionSize explicitly In-Reply-To: References: Message-ID: On Wed, 9 Apr 2025 10:37:24 GMT, Tongbao Zhang wrote: > After [JDK-8275056](https://bugs.openjdk.org/browse/JDK-8275056), The max heap region size became 512M, and the calculation of CompressedOops based on the max_heap_size - max_heap_region_size. > So before this patch, the CompressedOops will turn on below 32G - 32m, After this patch is 32G -512m. > > When our Apps migrating from JDK11 to JDK21, the heap size parameters(Xmx32736m) will turn off the CompressedOops. > > Since the current max ergonomics size is still 32m, We hoped that the original behavior will not be changed if HeapRegionSize is not explicitly set. > > before this patch: > > ./build/linux-x86_64-server-release/images/jdk/bin/java -Xmx32736m -XX:+PrintFlagsFinal -version | grep CompressedOops > bool UseCompressedOops = false {product lp64_product} {default} > openjdk version "25-internal" 2025-09-16 > OpenJDK Runtime Environment (build 25-internal-adhoc.root.jdk) > OpenJDK 64-Bit Server VM (build 25-internal-adhoc.root.jdk, mixed mode, sharing) > > > after this patch: > > ./build/linux-x86_64-server-release/images/jdk/bin/java -Xmx32736m -XX:+PrintFlagsFinal -version | grep CompressedOops > bool UseCompressedOops = true {product lp64_product} {ergonomic} > openjdk version "25-internal" 2025-09-16 > OpenJDK Runtime Environment (build 25-internal-adhoc.root.jdk) > OpenJDK 64-Bit Server VM (build 25-internal-adhoc.root.jdk, mixed mode, sharing) Would it be possible to add a regression test that checks the value of the `UseCompressedOops` flag after running a VM with these settings? 
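For context, the threshold shift described in the report can be reproduced with a small standalone calculation; this is only an illustration, the real ergonomic lives in the shared argument-processing code and uses the conservative heap alignment, which for G1 is the largest possible region size:

```c++
#include <cstdint>
#include <cstdio>

int main() {
  const uint64_t M = 1024 * 1024, G = 1024 * M;
  const uint64_t max_heap      = 32736 * M;  // -Xmx32736m from the report (32G - 32M)
  const uint64_t old_alignment = 32 * M;     // max ergonomic region size before JDK-8275056
  const uint64_t new_alignment = 512 * M;    // max ergonomic region size after JDK-8275056
  // Compressed oops are only enabled when the heap fits below the 32G
  // encoding limit minus the conservative heap alignment.
  printf("old threshold: %s\n", max_heap <= 32 * G - old_alignment ? "UseCompressedOops" : "plain oops");
  printf("new threshold: %s\n", max_heap <= 32 * G - new_alignment ? "UseCompressedOops" : "plain oops");
  return 0;
}
```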
------------- PR Review: https://git.openjdk.org/jdk/pull/24541#pullrequestreview-2753132517 From duke at openjdk.org Wed Apr 9 11:37:39 2025 From: duke at openjdk.org (Tongbao Zhang) Date: Wed, 9 Apr 2025 11:37:39 GMT Subject: RFR: 8354145: G1GC: keep the CompressedOops same as before when not setting HeapRegionSize explicitly In-Reply-To: References: Message-ID: On Wed, 9 Apr 2025 11:23:56 GMT, Thomas Schatzl wrote: > Would it be possible to add a regression test that checks the value of the `UseCompressedOops` flag after running a VM with these settings? Thanks for your suggestion, I will add a test. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24541#issuecomment-2789382638 From rcastanedalo at openjdk.org Wed Apr 9 12:03:49 2025 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Wed, 9 Apr 2025 12:03:49 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v30] In-Reply-To: References: Message-ID: <8noWoU1cd2y4EjjK3QZGMLacPC9gkrwn5Ns3XbQbppI=.74de0b05-b8da-417f-8096-de98d7a3d815@github.com> On Fri, 4 Apr 2025 08:10:34 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. >> >> The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. >> >> ### Current situation >> >> With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. >> >> The main reason for the current barrier is how g1 implements concurrent refinement: >> * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. >> * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, >> * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. >> >> These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: >> >> >> // Filtering >> if (region(@x.a) == region(y)) goto done; // same region check >> if (y == null) goto done; // null value check >> if (card(@x.a) == young_card) goto done; // write to young gen check >> StoreLoad; // synchronize >> if (card(@x.a) == dirty_card) goto done; >> >> *card(@x.a) = dirty >> >> // Card tracking >> enqueue(card-address(@x.a)) into thread-local-dcq; >> if (thread-local-dcq is not full) goto done; >> >> call runtime to move thread-local-dcq into dcqs >> >> done: >> >> >> Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. >> >> The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. >> >> There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). 
>> >> The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching c... > > Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 39 commits: > > - * missing file from merge > - Merge branch 'master' into 8342382-card-table-instead-of-dcq > - Merge branch 'master' into 8342382-card-table-instead-of-dcq > - Merge branch 'master' into 8342382-card-table-instead-of-dcq > - Merge branch 'master' into submit/8342382-card-table-instead-of-dcq > - * make young gen length revising independent of refinement thread > * use a service task > * both refinement control thread and young gen length revising use the same infrastructure to get the number of available bytes and determine the time to the next update > - * fix IR code generation tests that change due to barrier cost changes > - * factor out card table and refinement table merging into a single > method > - Merge branch 'master' into 8342382-card-table-instead-of-dcq3 > - * obsolete G1UpdateBufferSize > > G1UpdateBufferSize has previously been used to size the refinement > buffers and impose a minimum limit on the number of cards per thread > that need to be pending before refinement starts. > > The former function is now obsolete with the removal of the dirty > card queues, the latter functionality has been taken over by the new > diagnostic option `G1PerThreadPendingCardThreshold`. > > I prefer to make this a diagnostic option is better than a product option > because it is something that is only necessary for some test cases to > produce some otherwise unwanted behavior (continuous refinement). > > CSR is pending. > - ... and 29 more: https://git.openjdk.org/jdk/compare/41d4a0d7...1c5a669f Hi Thomas, great simplification and encouraging results! I reviewed the compiler-related parts of the changeset, including x64 and aarch64 changes. src/hotspot/cpu/aarch64/gc/g1/g1BarrierSetAssembler_aarch64.cpp line 246: > 244: __ cbz(new_val, done); > 245: } > 246: // Storing region crossing non-null, is card young? Suggestion: // Storing region crossing non-null. src/hotspot/cpu/x86/gc/g1/g1BarrierSetAssembler_x86.cpp line 101: > 99: } > 100: > 101: void G1BarrierSetAssembler::gen_write_ref_array_post_barrier(MacroAssembler* masm, DecoratorSet decorators, Have you measured the performance impact of inlining this assembly code instead of resorting to a runtime call as done before? Is it worth the maintenance cost (for every platform), risk of introducing bugs, etc.? src/hotspot/cpu/x86/gc/g1/g1BarrierSetAssembler_x86.cpp line 145: > 143: > 144: __ bind(is_clean_card); > 145: // Card was clean. Dirty card and go to next.. This code seems unreachable if `!UseCondCardMark`, meaning we only dirty cards here if `UseCondCardMark` is enabled. Is that intentional? src/hotspot/cpu/x86/gc/g1/g1BarrierSetAssembler_x86.cpp line 319: > 317: const Register thread, > 318: const Register tmp1, > 319: const Register tmp2, Since `tmp2` is not needed in the x64 post-barrier, I suggest not passing it around for this platform, for simplicity and also to make optimization opportunities more visible in the future. Here is my suggestion: https://github.com/robcasloz/jdk/commit/855ec8df4a641f8c491c5c09acea3ee434b7e230, feel free to merge if you agree. 
src/hotspot/share/gc/g1/c1/g1BarrierSetC1.cpp line 38: > 36: #include "c1/c1_LIRAssembler.hpp" > 37: #include "c1/c1_MacroAssembler.hpp" > 38: #endif // COMPILER1 I suggest removing the conditional compilation directives and grouping these includes together with the above `c1` ones. src/hotspot/share/gc/g1/c1/g1BarrierSetC1.cpp line 147: > 145: state->do_input(_thread); > 146: > 147: // Use temp registers to ensure these they use different registers. Suggestion: // Use temps to enforce different registers. src/hotspot/share/gc/g1/c2/g1BarrierSetC2.cpp line 307: > 305: + 6 // same region check: Uncompress (new_val) oop, xor, shr, (cmp), jmp > 306: + 4 // new_val is null check > 307: + 4; // card not clean check. It probably does not affect the unrolling heuristics too much, but you may want to make the last cost component conditional on `UseCondCardMark`. src/hotspot/share/gc/g1/c2/g1BarrierSetC2.cpp line 396: > 394: bool needs_liveness_data(const MachNode* mach) const { > 395: return G1BarrierStubC2::needs_pre_barrier(mach) || > 396: G1BarrierStubC2::needs_post_barrier(mach); Suggestion: // Liveness data is only required to compute registers that must be // preserved across the runtime call in the pre-barrier stub. return G1BarrierStubC2::needs_pre_barrier(mach); src/hotspot/share/gc/g1/g1BarrierSet.hpp line 56: > 54: // > 55: // The refinement threads mark cards in the current collection set specially on the > 56: // card table - this is fine wrt to synchronization with the mutator, because at Suggestion: // card table - this is fine wrt synchronization with the mutator, because at test/hotspot/jtreg/compiler/gcbarriers/TestG1BarrierGeneration.java line 521: > 519: phase = CompilePhase.FINAL_CODE) > 520: @IR(counts = {IRNode.COUNTED_LOOP, "2"}, > 521: phase = CompilePhase.FINAL_CODE) I suggest to remove this extra IR check to avoid over-specifying the expected loop shape. For example, running this test with loop unrolling disabled (`-XX:LoopUnrollLimit=0`) would now fail because only one counted loop would be found. ------------- Changes requested by rcastanedalo (Reviewer). 
PR Review: https://git.openjdk.org/jdk/pull/23739#pullrequestreview-2753154117 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2035174209 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2035175921 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2035177738 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2035183250 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2035186980 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2035192666 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2035210464 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2035196251 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2035198219 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2035201056 From tschatzl at openjdk.org Wed Apr 9 12:41:40 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 9 Apr 2025 12:41:40 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v30] In-Reply-To: <8noWoU1cd2y4EjjK3QZGMLacPC9gkrwn5Ns3XbQbppI=.74de0b05-b8da-417f-8096-de98d7a3d815@github.com> References: <8noWoU1cd2y4EjjK3QZGMLacPC9gkrwn5Ns3XbQbppI=.74de0b05-b8da-417f-8096-de98d7a3d815@github.com> Message-ID: On Wed, 9 Apr 2025 11:35:26 GMT, Roberto Casta?eda Lozano wrote: >> Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 39 commits: >> >> - * missing file from merge >> - Merge branch 'master' into 8342382-card-table-instead-of-dcq >> - Merge branch 'master' into 8342382-card-table-instead-of-dcq >> - Merge branch 'master' into 8342382-card-table-instead-of-dcq >> - Merge branch 'master' into submit/8342382-card-table-instead-of-dcq >> - * make young gen length revising independent of refinement thread >> * use a service task >> * both refinement control thread and young gen length revising use the same infrastructure to get the number of available bytes and determine the time to the next update >> - * fix IR code generation tests that change due to barrier cost changes >> - * factor out card table and refinement table merging into a single >> method >> - Merge branch 'master' into 8342382-card-table-instead-of-dcq3 >> - * obsolete G1UpdateBufferSize >> >> G1UpdateBufferSize has previously been used to size the refinement >> buffers and impose a minimum limit on the number of cards per thread >> that need to be pending before refinement starts. >> >> The former function is now obsolete with the removal of the dirty >> card queues, the latter functionality has been taken over by the new >> diagnostic option `G1PerThreadPendingCardThreshold`. >> >> I prefer to make this a diagnostic option is better than a product option >> because it is something that is only necessary for some test cases to >> produce some otherwise unwanted behavior (continuous refinement). >> >> CSR is pending. >> - ... and 29 more: https://git.openjdk.org/jdk/compare/41d4a0d7...1c5a669f > > src/hotspot/cpu/x86/gc/g1/g1BarrierSetAssembler_x86.cpp line 145: > >> 143: >> 144: __ bind(is_clean_card); >> 145: // Card was clean. Dirty card and go to next.. > > This code seems unreachable if `!UseCondCardMark`, meaning we only dirty cards here if `UseCondCardMark` is enabled. Is that intentional? Great find! 
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2035280909 From tschatzl at openjdk.org Wed Apr 9 12:50:42 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 9 Apr 2025 12:50:42 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v30] In-Reply-To: <8noWoU1cd2y4EjjK3QZGMLacPC9gkrwn5Ns3XbQbppI=.74de0b05-b8da-417f-8096-de98d7a3d815@github.com> References: <8noWoU1cd2y4EjjK3QZGMLacPC9gkrwn5Ns3XbQbppI=.74de0b05-b8da-417f-8096-de98d7a3d815@github.com> Message-ID: On Wed, 9 Apr 2025 11:34:09 GMT, Roberto Casta?eda Lozano wrote: >> Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 39 commits: >> >> - * missing file from merge >> - Merge branch 'master' into 8342382-card-table-instead-of-dcq >> - Merge branch 'master' into 8342382-card-table-instead-of-dcq >> - Merge branch 'master' into 8342382-card-table-instead-of-dcq >> - Merge branch 'master' into submit/8342382-card-table-instead-of-dcq >> - * make young gen length revising independent of refinement thread >> * use a service task >> * both refinement control thread and young gen length revising use the same infrastructure to get the number of available bytes and determine the time to the next update >> - * fix IR code generation tests that change due to barrier cost changes >> - * factor out card table and refinement table merging into a single >> method >> - Merge branch 'master' into 8342382-card-table-instead-of-dcq3 >> - * obsolete G1UpdateBufferSize >> >> G1UpdateBufferSize has previously been used to size the refinement >> buffers and impose a minimum limit on the number of cards per thread >> that need to be pending before refinement starts. >> >> The former function is now obsolete with the removal of the dirty >> card queues, the latter functionality has been taken over by the new >> diagnostic option `G1PerThreadPendingCardThreshold`. >> >> I prefer to make this a diagnostic option is better than a product option >> because it is something that is only necessary for some test cases to >> produce some otherwise unwanted behavior (continuous refinement). >> >> CSR is pending. >> - ... and 29 more: https://git.openjdk.org/jdk/compare/41d4a0d7...1c5a669f > > src/hotspot/cpu/x86/gc/g1/g1BarrierSetAssembler_x86.cpp line 101: > >> 99: } >> 100: >> 101: void G1BarrierSetAssembler::gen_write_ref_array_post_barrier(MacroAssembler* masm, DecoratorSet decorators, > > Have you measured the performance impact of inlining this assembly code instead of resorting to a runtime call as done before? Is it worth the maintenance cost (for every platform), risk of introducing bugs, etc.? I remember significant impact in some microbenchmark. It's also inlined in Parallel GC. I do not consider it a big issue wrt to maintenance - these things never really change, and the method is small and contained. I will try to redo numbers. 
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2035298557 From thomas.schatzl at oracle.com Wed Apr 9 14:05:56 2025 From: thomas.schatzl at oracle.com (Thomas Schatzl) Date: Wed, 9 Apr 2025 16:05:56 +0200 Subject: Moving Forward with AHS for G1 In-Reply-To: References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> Message-ID: <13c7d913-e61f-47af-a299-6c6b6e2d45f6@oracle.com> Hi Man, On 09.04.25 09:44, Man Cao wrote: > Hi all, > > Thank you Thomas for creating the umbrella CR at https:// > bugs.openjdk.org/browse/JDK-8353716 JDK-8353716>. > While waiting a bit on SoftMaxHeapSize PR (https://github.com/openjdk/ > jdk/pull/24211) to see if others have feedback, I could start working on > CurrentMaxHeapSize (https://bugs.openjdk.org/browse/JDK-8204088). > I also agree that?CurrentMaxHeapSize may not need a JEP due to its small > size and low complexity. Should it proceed similarly to how > SoftMaxHeapSize was introduced? I.e, https://bugs.openjdk.org/browse/ > JDK-8222145, and creating > a CSR (https://bugs.openjdk.org/browse/JDK-8222181) for it. I think this is the best way forward. There is no need for a JEP from me either. Exact behavior in various situations needs to be defined in the CSR. > > Separately, for removing support for?Min/MaxHeapFreeRatio for G1 > (mentioned in https://bugs.openjdk.org/browse/JDK-8353716 and https://bugs.openjdk.org/ > browse/JDK-8238686), how > do we handle existing users that set these two flags? After searching the web a little, it seems that these flags are actually in use, and recommended to be used (e.g. in default settings). So we need some transition strategy to get off them, and can't just remove. One option is to translate these options into other values impacting heap size "similarly". E.g. have Min/MaxHeapFreeRatio translate to internal pressure at the time the changes are noticed, but that is just a potential solution that hand-waves away the effort for that. Then start deprecating and remove; depends a little on how useful (or how much in the way) they are for Serial and Parallel GC (other collectors don't support them). It is unlikely that ZGC and Shenandoah will adopt these. Even already in JDK-8238687 Min/MaxHeapFreeRatio happily work to counter the cpu based sizing, so some solution needs to be found there already. That change will already be quite disruptive in terms of impact on heap sizing, so another option is to remove support in G1. > (We have very few internal users setting these two flags. But yesterday > I ran into a use case that sets -XX:MinHeapFreeRatio=0 - > XX:MaxHeapFreeRatio=0 for G1...) What would be the use case for setting it to these values? There seem to be little upside and lots of downside for this choice, because it likely causes a lot of GC activity since the VM will need GC to expand the heap little by little all the time, and full gc/Remark will immediately reset these expansion efforts. Thomas From tschatzl at openjdk.org Wed Apr 9 14:38:46 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 9 Apr 2025 14:38:46 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v30] In-Reply-To: References: Message-ID: On Tue, 8 Apr 2025 19:59:09 GMT, Albert Mingkun Yang wrote: >> Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains 39 commits: >> >> - * missing file from merge >> - Merge branch 'master' into 8342382-card-table-instead-of-dcq >> - Merge branch 'master' into 8342382-card-table-instead-of-dcq >> - Merge branch 'master' into 8342382-card-table-instead-of-dcq >> - Merge branch 'master' into submit/8342382-card-table-instead-of-dcq >> - * make young gen length revising independent of refinement thread >> * use a service task >> * both refinement control thread and young gen length revising use the same infrastructure to get the number of available bytes and determine the time to the next update >> - * fix IR code generation tests that change due to barrier cost changes >> - * factor out card table and refinement table merging into a single >> method >> - Merge branch 'master' into 8342382-card-table-instead-of-dcq3 >> - * obsolete G1UpdateBufferSize >> >> G1UpdateBufferSize has previously been used to size the refinement >> buffers and impose a minimum limit on the number of cards per thread >> that need to be pending before refinement starts. >> >> The former function is now obsolete with the removal of the dirty >> card queues, the latter functionality has been taken over by the new >> diagnostic option `G1PerThreadPendingCardThreshold`. >> >> I prefer to make this a diagnostic option is better than a product option >> because it is something that is only necessary for some test cases to >> produce some otherwise unwanted behavior (continuous refinement). >> >> CSR is pending. >> - ... and 29 more: https://git.openjdk.org/jdk/compare/41d4a0d7...1c5a669f > > src/hotspot/share/gc/g1/g1ConcurrentRefineSweepTask.cpp line 83: > >> 81: break; >> 82: } >> 83: case G1RemSet::HasRefToOld : break; // Nothing special to do. > > Why doesn't call `inc_cards_clean_again` in this case? The card is cleared also. (In fact, I don't get why this needs to a separate case from `NoInteresting`.) "NoInteresting" means that the card contains no interesting reference at all. "HasRefToOld" means that there has been an interesting reference in the card. The distinction between these groups of cards seems interesting to me. E.g. out of X non-clean cards, there were A with a reference to the collection set, B that were already marked as containing a card to the collection, C not having any interesting card any more (transitioned from clean -> dirty -> clean, and cleared by the mutator), D being non-parsable, and E having references to old (and no other references). I could add a separate counter for these type of cards too - they can be inferred from the total number of scanned minus the others though. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2035512686 From erik.osterlund at oracle.com Wed Apr 9 15:22:12 2025 From: erik.osterlund at oracle.com (Erik Osterlund) Date: Wed, 9 Apr 2025 15:22:12 +0000 Subject: Moving Forward with AHS for G1 In-Reply-To: References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> Message-ID: <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> Hi Man, Sorry to butt in. A high level question about the AHS plan for G1? are we interested in the intermediate functionality (SoftMaxHeapSize and CurrentMaxHeapSize), or is it AHS that we are interested in? The reason I ask is that each incremental feature comes with some baggage due to being a (somewhat) static and manually set limit, which the AHS solution won?t need to deal with. 
For example, it's unclear how a *static* SoftMaxHeapSize should behave when the live set is larger than the limit. While that can maybe be solved in some reasonable way, it's worth noting that AHS won't need the solution, because there it's a dynamic limit that the GC simply won't set lower than the memory usage after GC. It will however get in the way because the user can now also set a SoftMaxHeapSize that conflicts with the AHS soft heap size that the JVM wants to use, and then we gotta deal with that. Similarly, the CurrentMaxHeapSize adds another way for users to control (read: mess up) the JVM behaviour that we need to respect. In the end, AHS will compute this dynamically instead depending on environment circumstances. I suspect the fact that it can also be manually set in a way that conflicts with what the JVM wants to do, will end up being a pain. I'm not against the plan of building these incremental features, especially if we want them in isolation. But if it's AHS we want, then I wonder if it would be easier to go straight for what we need for AHS without the intermediate user exposed steps, because they might introduce unnecessary problems along the way. My 50c, no strong opinion though. /Erik On 9 Apr 2025, at 09:44, Man Cao wrote: Hi all, Thank you Thomas for creating the umbrella CR at https://bugs.openjdk.org/browse/JDK-8353716. While waiting a bit on the SoftMaxHeapSize PR (https://github.com/openjdk/jdk/pull/24211) to see if others have feedback, I could start working on CurrentMaxHeapSize (https://bugs.openjdk.org/browse/JDK-8204088). I also agree that CurrentMaxHeapSize may not need a JEP due to its small size and low complexity. Should it proceed similarly to how SoftMaxHeapSize was introduced? I.e., https://bugs.openjdk.org/browse/JDK-8222145, and creating a CSR (https://bugs.openjdk.org/browse/JDK-8222181) for it. Separately, for removing support for Min/MaxHeapFreeRatio for G1 (mentioned in https://bugs.openjdk.org/browse/JDK-8353716 and https://bugs.openjdk.org/browse/JDK-8238686), how do we handle existing users that set these two flags? (We have very few internal users setting these two flags. But yesterday I ran into a use case that sets -XX:MinHeapFreeRatio=0 -XX:MaxHeapFreeRatio=0 for G1...) Best, Man From kirk at kodewerk.com Wed Apr 9 16:14:18 2025 From: kirk at kodewerk.com (Kirk Pepperdine) Date: Wed, 9 Apr 2025 09:14:18 -0700 Subject: Moving Forward with AHS for G1 In-Reply-To: <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> Message-ID: <5210B365-EB7D-498F-BF21-02B9629B1338@kodewerk.com> > On Apr 9, 2025, at 8:22 AM, Erik Osterlund wrote: > > Hi Man, > > Sorry to butt in. A high-level question about the AHS plan for G1: are we interested in the > intermediate functionality (SoftMaxHeapSize and CurrentMaxHeapSize), or is it AHS that > we are interested in? > > The reason I ask is that each incremental feature comes with some baggage due to being > a (somewhat) static and manually set limit, which the AHS solution won't need to deal with. > > For example, it's unclear how a *static* SoftMaxHeapSize should behave when the live set > is larger than the limit. While that can maybe be solved in some reasonable way, it's worth > noting that AHS won't need the solution, because there it's a dynamic limit that the GC simply > won't set lower than the memory usage after GC.
It will however get in the way because the > user can now also set a SoftMaxHeapSize that conflicts with the AHS soft heap size that > the JVM wants to use, and then we gotta deal with that. > > Similarly, the CurrentMaxHeapSize adds another way for users to control (read: mess up) > the JVM behaviour that we need to respect. In the end, AHS will compute this dynamically > instead depending on environment circumstances. I suspect the fact that it can also be > manually set in a way that conflicts with what the JVM wants to do, will end up being a pain. I would agree and to this point, I've rarely found ratios to be useful. In general, eden, survivor, and old each play a different role in object life cycle and as such each should be tuned separately from each other. Min/Max heap is the sum of the needs of the parts. Being able to meet the needs of eden, survivor and old by simply setting a max heap and relying on ratios is wishful thinking that sometimes comes true. Might I suggest that an entirely new (experimental?) adaptive size policy be introduced that makes use of current flags in a manner that is appropriate to the new policy. That policy would calculate a size of Eden to control GC frequency, a size of survivor to limit promotion of transients, and a tenured large enough to accommodate the live set as well as manage the expected number of humongous allocations. If global heap pressure won't support the ensuing max heap size, then the cost could be smaller eden implying higher GC overhead due to increased frequency. Metrics to support eden sizing would be allocation rate. The age table with premature promotion rates would be used to estimate the size of survivor. Live set size with a recent history of humongous allocations would be used for tenured. There will need to be a dampening strategy in play. My current (dumb) idea for Serial is to set an overhead threshold delta that needs to be exceeded to trigger a resize. > > I'm not against the plan of building these incremental features, especially if we want them > in isolation. But if it's AHS we want, then I wonder if it would be easier to go straight for what > we need for AHS without the intermediate user exposed steps, because they might introduce > unnecessary problems along the way. I would agree with this. And I would suggest that the way to achieve it is to introduce a new experimental ASP. > > My 50c, no strong opinion though. From kdnilsen at openjdk.org Wed Apr 9 17:05:38 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 9 Apr 2025 17:05:38 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data [v2] In-Reply-To: References: Message-ID: On Wed, 9 Apr 2025 00:27:17 GMT, Kelvin Nilsen wrote: >> src/hotspot/share/gc/shenandoah/shenandoahHeapRegion.cpp line 78: >> >>> 76: _live_data(0), >>> 77: _critical_pins(0), >>> 78: _mixed_candidate_garbage_words(0), >> >> Do we need a new field to track this? During `final_mark`, we call `increase_live_data_alloc_words` to add `top - TAMS` to `_live_data` to account for objects allocated during mark. Could we "fix" `get_live_data` so that it always returns marked objects (counted by `increase_live_data_gc_words`) _plus_ `top - TAMS`. This way, the live data would not become stale after `final_mark` and we wouldn't have another field to manage. What do you think? > > This is a good idea. Let me experiment with this. My experiment with an initial attempt at this failed with over 60 failures.
The "problem" is that we often consult get_live_data() in contexts from which it is "not appropriate" to add (top- TAMS) to the atomic volatile ShenandoahHeapRegion::_live_data() . I think most of these are asserts. I have so far confirmed that there are at least two different places that need to be fixed. Not sure how many total scenarios. I'm willing to move forward with changes to the failing asserts to make this change work. I think the code would be cleaner with your suggested refactor. It just makes this PR a little more far-reaching than the original. See the most recent commit on this PR to see the direction this would move us. Let me know if you think I should move forward with more refactoring, or revert this most recent change. Thanks. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24319#discussion_r2035784267 From ayang at openjdk.org Wed Apr 9 17:38:54 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 9 Apr 2025 17:38:54 GMT Subject: RFR: 8354228: Parallel: Set correct minimum of InitialSurvivorRatio Message-ID: Updating the lower bound of InitialSurvivorRatio to match MinSurvivorRatio. The two removed test cases set conflicting Min and Intial SurvivorRatio, which, IMO, is an incorrect configuration, so I removed them. Test: tier1-7 ------------- Commit messages: - pgc-min-initial-fix Changes: https://git.openjdk.org/jdk/pull/24556/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24556&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8354228 Stats: 15 lines in 3 files changed: 12 ins; 2 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/24556.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24556/head:pull/24556 PR: https://git.openjdk.org/jdk/pull/24556 From wkemper at openjdk.org Wed Apr 9 17:53:31 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 9 Apr 2025 17:53:31 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data [v2] In-Reply-To: References: Message-ID: On Wed, 9 Apr 2025 17:02:40 GMT, Kelvin Nilsen wrote: >> This is a good idea. Let me experiment with this. > > My experiment with an initial attempt at this failed with over 60 failures. The "problem" is that we often consult get_live_data() in contexts from which it is "not appropriate" to add (top- TAMS) to the atomic volatile ShenandoahHeapRegion::_live_data() . I think most of these are asserts. I have so far confirmed that there are at least two different places that need to be fixed. Not sure how many total scenarios. > > I'm willing to move forward with changes to the failing asserts to make this change work. I think the code would be cleaner with your suggested refactor. It just makes this PR a little more far-reaching than the original. > > See the most recent commit on this PR to see the direction this would move us. Let me know if you think I should move forward with more refactoring, or revert this most recent change. > > Thanks. It does look simpler. Do you have an example of one of the failing asserts? One thing I hadn't considered is how "hot" `ShenandoahHeapRegion::get_live_data_words` is. Is there going to be a significant performance hit if we make this method do more work? It does look like this method is called frequently. 
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24319#discussion_r2035852703 From kdnilsen at openjdk.org Wed Apr 9 18:03:47 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 9 Apr 2025 18:03:47 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data [v2] In-Reply-To: References: Message-ID: On Wed, 9 Apr 2025 17:51:06 GMT, William Kemper wrote: >> My experiment with an initial attempt at this failed with over 60 failures. The "problem" is that we often consult get_live_data() in contexts from which it is "not appropriate" to add (top- TAMS) to the atomic volatile ShenandoahHeapRegion::_live_data() . I think most of these are asserts. I have so far confirmed that there are at least two different places that need to be fixed. Not sure how many total scenarios. >> >> I'm willing to move forward with changes to the failing asserts to make this change work. I think the code would be cleaner with your suggested refactor. It just makes this PR a little more far-reaching than the original. >> >> See the most recent commit on this PR to see the direction this would move us. Let me know if you think I should move forward with more refactoring, or revert this most recent change. >> >> Thanks. > > It does look simpler. Do you have an example of one of the failing asserts? > > One thing I hadn't considered is how "hot" `ShenandoahHeapRegion::get_live_data_words` is. Is there going to be a significant performance hit if we make this method do more work? It does look like this method is called frequently. Examples: FullGC worker: void ShenandoahMCResestCompleteBitmapTask::work(uint worker_id) { ShenandoahParallelWorkerSession worker_session(worker_id); ShenandoahHeapRegion* region = _regions.next(); ShenandoahHeap* heap = ShenandoahHeap::heap(); ShenandoahMarkingContext* const ctx = heap->complete_marking_context(); while (region != nullptr) { if (heap->is_bitmap_slice_committed(region) && !region->is_pinned() && region->has_marked()) { // kelvin replacing has_live() with new method has_marked() because has_live() calls get_live_data_words() // and pointer_delta() asserts out because TAMS is not less than top(). has_marked() does what has_live() // used to do... ctx->clear_bitmap(region); } region = _regions.next(); } } ShenandoahInitMarkUpdateRegionStateClosure::heap_region_do() { - assert(!r->has_live(), "Region %zu should have no live data", r->index()); + assert(!r->has_marked(), "Region %zu should have no marked data", r->index()); ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24319#discussion_r2035869108 From kdnilsen at openjdk.org Wed Apr 9 18:18:27 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 9 Apr 2025 18:18:27 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data [v2] In-Reply-To: References: Message-ID: On Wed, 9 Apr 2025 18:01:03 GMT, Kelvin Nilsen wrote: >> It does look simpler. Do you have an example of one of the failing asserts? >> >> One thing I hadn't considered is how "hot" `ShenandoahHeapRegion::get_live_data_words` is. Is there going to be a significant performance hit if we make this method do more work? It does look like this method is called frequently. 
> > Examples: > FullGC worker: > void ShenandoahMCResestCompleteBitmapTask::work(uint worker_id) { > ShenandoahParallelWorkerSession worker_session(worker_id); > ShenandoahHeapRegion* region = _regions.next(); > ShenandoahHeap* heap = ShenandoahHeap::heap(); > ShenandoahMarkingContext* const ctx = heap->complete_marking_context(); > while (region != nullptr) { > if (heap->is_bitmap_slice_committed(region) && !region->is_pinned() && region->has_marked()) { > // kelvin replacing has_live() with new method has_marked() because has_live() calls get_live_data_words() > // and pointer_delta() asserts out because TAMS is not less than top(). has_marked() does what has_live() > // used to do... > ctx->clear_bitmap(region); > } > region = _regions.next(); > } > } > > ShenandoahInitMarkUpdateRegionStateClosure::heap_region_do() { > - assert(!r->has_live(), "Region %zu should have no live data", r->index()); > + assert(!r->has_marked(), "Region %zu should have no marked data", r->index()); Not sure about performance impact, other than implementing and testing... ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24319#discussion_r2035888970 From kdnilsen at openjdk.org Wed Apr 9 18:24:36 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 9 Apr 2025 18:24:36 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data [v2] In-Reply-To: References: Message-ID: On Wed, 9 Apr 2025 18:15:38 GMT, Kelvin Nilsen wrote: >> Examples: >> FullGC worker: >> void ShenandoahMCResestCompleteBitmapTask::work(uint worker_id) { >> ShenandoahParallelWorkerSession worker_session(worker_id); >> ShenandoahHeapRegion* region = _regions.next(); >> ShenandoahHeap* heap = ShenandoahHeap::heap(); >> ShenandoahMarkingContext* const ctx = heap->complete_marking_context(); >> while (region != nullptr) { >> if (heap->is_bitmap_slice_committed(region) && !region->is_pinned() && region->has_marked()) { >> // kelvin replacing has_live() with new method has_marked() because has_live() calls get_live_data_words() >> // and pointer_delta() asserts out because TAMS is not less than top(). has_marked() does what has_live() >> // used to do... >> ctx->clear_bitmap(region); >> } >> region = _regions.next(); >> } >> } >> >> ShenandoahInitMarkUpdateRegionStateClosure::heap_region_do() { >> - assert(!r->has_live(), "Region %zu should have no live data", r->index()); >> + assert(!r->has_marked(), "Region %zu should have no marked data", r->index()); > > Not sure about performance impact, other than implementing and testing... i suspect performance impact is minimal. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24319#discussion_r2035896982 From mdoerr at openjdk.org Wed Apr 9 22:26:31 2025 From: mdoerr at openjdk.org (Martin Doerr) Date: Wed, 9 Apr 2025 22:26:31 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v30] In-Reply-To: References: Message-ID: On Fri, 4 Apr 2025 08:10:34 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. >> >> The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. >> >> ### Current situation >> >> With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. 
The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. >> >> The main reason for the current barrier is how g1 implements concurrent refinement: >> * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. >> * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, >> * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. >> >> These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: >> >> >> // Filtering >> if (region(@x.a) == region(y)) goto done; // same region check >> if (y == null) goto done; // null value check >> if (card(@x.a) == young_card) goto done; // write to young gen check >> StoreLoad; // synchronize >> if (card(@x.a) == dirty_card) goto done; >> >> *card(@x.a) = dirty >> >> // Card tracking >> enqueue(card-address(@x.a)) into thread-local-dcq; >> if (thread-local-dcq is not full) goto done; >> >> call runtime to move thread-local-dcq into dcqs >> >> done: >> >> >> Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. >> >> The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. >> >> There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). >> >> The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching c... > > Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 39 commits: > > - * missing file from merge > - Merge branch 'master' into 8342382-card-table-instead-of-dcq > - Merge branch 'master' into 8342382-card-table-instead-of-dcq > - Merge branch 'master' into 8342382-card-table-instead-of-dcq > - Merge branch 'master' into submit/8342382-card-table-instead-of-dcq > - * make young gen length revising independent of refinement thread > * use a service task > * both refinement control thread and young gen length revising use the same infrastructure to get the number of available bytes and determine the time to the next update > - * fix IR code generation tests that change due to barrier cost changes > - * factor out card table and refinement table merging into a single > method > - Merge branch 'master' into 8342382-card-table-instead-of-dcq3 > - * obsolete G1UpdateBufferSize > > G1UpdateBufferSize has previously been used to size the refinement > buffers and impose a minimum limit on the number of cards per thread > that need to be pending before refinement starts. > > The former function is now obsolete with the removal of the dirty > card queues, the latter functionality has been taken over by the new > diagnostic option `G1PerThreadPendingCardThreshold`. 
> > I prefer to make this a diagnostic option is better than a product option > because it is something that is only necessary for some test cases to > produce some otherwise unwanted behavior (continuous refinement). > > CSR is pending. > - ... and 29 more: https://git.openjdk.org/jdk/compare/41d4a0d7...1c5a669f This PR needs an update for x86 platforms when merging: g1BarrierSetAssembler_x86.cpp:117:6: error: 'class MacroAssembler' has no member named 'get_thread' ------------- PR Comment: https://git.openjdk.org/jdk/pull/23739#issuecomment-2791114662 From kdnilsen at openjdk.org Wed Apr 9 22:32:46 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 9 Apr 2025 22:32:46 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data [v3] In-Reply-To: References: Message-ID: > The existing implementation of get_live_data_bytes() and git_live_data_words() does not always behave as might be expected. In particular, the value returned ignores any allocations that occur subsequent to the most recent mark effort that identified live data within the region. This is typically ok for young regions that are going to be added or not to the collection set during final-mark safepoint. > > However, old-gen regions that are placed into the set of candidates for mixed evacuation are more complicated. In particular, by the time the old-gen region is added to a mixed evacuation, its live data may be much larger than at the time concurrent old marking ended. > > This PR provides comments to clarify the shortcomings of the existing functions, and adds new functions that provide more accurate accountings of live data for mixed-evacuation candidate regions. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Experiment 2: refinements to reduce regressions ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24319/files - new: https://git.openjdk.org/jdk/pull/24319/files/3c1f788a..8ff388d1 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24319&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24319&range=01-02 Stats: 30 lines in 4 files changed: 23 ins; 0 del; 7 mod Patch: https://git.openjdk.org/jdk/pull/24319.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24319/head:pull/24319 PR: https://git.openjdk.org/jdk/pull/24319 From kdnilsen at openjdk.org Thu Apr 10 04:36:38 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 10 Apr 2025 04:36:38 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data [v4] In-Reply-To: References: Message-ID: > The existing implementation of get_live_data_bytes() and git_live_data_words() does not always behave as might be expected. In particular, the value returned ignores any allocations that occur subsequent to the most recent mark effort that identified live data within the region. This is typically ok for young regions that are going to be added or not to the collection set during final-mark safepoint. > > However, old-gen regions that are placed into the set of candidates for mixed evacuation are more complicated. In particular, by the time the old-gen region is added to a mixed evacuation, its live data may be much larger than at the time concurrent old marking ended. > > This PR provides comments to clarify the shortcomings of the existing functions, and adds new functions that provide more accurate accountings of live data for mixed-evacuation candidate regions. 
Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Fix garbage_before_padded_for_promote() ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24319/files - new: https://git.openjdk.org/jdk/pull/24319/files/8ff388d1..8e820f29 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24319&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24319&range=02-03 Stats: 6 lines in 1 file changed: 3 ins; 1 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/24319.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24319/head:pull/24319 PR: https://git.openjdk.org/jdk/pull/24319 From tschatzl at openjdk.org Thu Apr 10 07:26:28 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 10 Apr 2025 07:26:28 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v31] In-Reply-To: References: Message-ID: > Hi all, > > please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. > > The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. > > ### Current situation > > With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. > > The main reason for the current barrier is how g1 implements concurrent refinement: > * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. > * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, > * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. > > These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: > > > // Filtering > if (region(@x.a) == region(y)) goto done; // same region check > if (y == null) goto done; // null value check > if (card(@x.a) == young_card) goto done; // write to young gen check > StoreLoad; // synchronize > if (card(@x.a) == dirty_card) goto done; > > *card(@x.a) = dirty > > // Card tracking > enqueue(card-address(@x.a)) into thread-local-dcq; > if (thread-local-dcq is not full) goto done; > > call runtime to move thread-local-dcq into dcqs > > done: > > > Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. > > The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. > > There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). > > The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching card tables. Mutators only work on the "primary" card table, refinement threads on a se... 
Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 45 commits: - * fixes after merge related to 32 bit x86 removal - Merge branch 'master' into 8342382-card-table-instead-of-dcq - * ayang review: revising young gen length * robcasloz review: various minor refactorings - Do not unnecessarily pass around tmp2 in x86 - Refine needs_liveness_data - Reorder includes - * missing file from merge - Merge branch 'master' into 8342382-card-table-instead-of-dcq - Merge branch 'master' into 8342382-card-table-instead-of-dcq - Merge branch 'master' into 8342382-card-table-instead-of-dcq - ... and 35 more: https://git.openjdk.org/jdk/compare/45b7c748...39aa903f ------------- Changes: https://git.openjdk.org/jdk/pull/23739/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23739&range=30 Stats: 7118 lines in 110 files changed: 2586 ins; 3598 del; 934 mod Patch: https://git.openjdk.org/jdk/pull/23739.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23739/head:pull/23739 PR: https://git.openjdk.org/jdk/pull/23739 From tschatzl at openjdk.org Thu Apr 10 07:28:31 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 10 Apr 2025 07:28:31 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v30] In-Reply-To: References: Message-ID: On Wed, 9 Apr 2025 22:24:10 GMT, Martin Doerr wrote: > This PR needs an update for x86 platforms when merging: g1BarrierSetAssembler_x86.cpp:117:6: error: 'class MacroAssembler' has no member named 'get_thread' I fixed this for now, but it will be broken again in just a bit with Aleksey's ongoing removal of x86 32 bit platform efforts. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23739#issuecomment-2791807489 From shade at openjdk.org Thu Apr 10 08:36:33 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Thu, 10 Apr 2025 08:36:33 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v30] In-Reply-To: References: Message-ID: <03K6ui5yP3iy8HS_C4nurnsrbOymrm_962YA0-U92IM=.0f83b0ac-5895-4e1a-bb22-0006bd5dd888@github.com> On Thu, 10 Apr 2025 07:25:47 GMT, Thomas Schatzl wrote: > I fixed this for now, but it will be broken again in just a bit with Aleksey's ongoing removal of x86 32 bit platform efforts. I think all x86 cleanups related to GC and adjacent code have landed in mainline now. So I expect no more major conflicts with this PR :) ------------- PR Comment: https://git.openjdk.org/jdk/pull/23739#issuecomment-2791985351 From manc at google.com Thu Apr 10 08:45:58 2025 From: manc at google.com (Man Cao) Date: Thu, 10 Apr 2025 01:45:58 -0700 Subject: Moving Forward with AHS for G1 In-Reply-To: <5210B365-EB7D-498F-BF21-02B9629B1338@kodewerk.com> References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> <5210B365-EB7D-498F-BF21-02B9629B1338@kodewerk.com> Message-ID: Re Thomas's comments: I think this is the best way forward. There is no need for a JEP from me > either. > Exact behavior in various situations needs to be defined in the CSR. Thanks. Should I edit https://bugs.openjdk.org/browse/JDK-8204088 in place to change it to a CSR, or do you prefer creating a separate issue? One option is to translate these options into other values impacting > heap size "similarly". E.g. 
have Min/MaxHeapFreeRatio translate to > internal pressure at the time the changes are noticed, but that is just > a potential solution that hand-waves away the effort for that. > Then start deprecating and remove; depends a little on how useful (or > how much in the way) they are for Serial and Parallel GC (other > collectors don't support them). It is unlikely that ZGC and Shenandoah > will adopt these. I feel like both approaches have additional problems: For the first approach, even with a translation mechanism, it still has the problem of GCTimeRatio and Min/MaxHeapFreeRatio overriding each other. I think the only solution is to translate Min/MaxHeapFreeRatio directly to a value for GCTimeRatio, as well as making GCTimeRatio a manageable flag. Agree that the effort to implement this approach is nontrivial. For the second approach, Min/MaxHeapFreeRatio are pretty popular flags for Parallel GC, so it could be difficult to remove them for Parallel GC. Even already in JDK-8238687 Min/MaxHeapFreeRatio happily work to counter > the CPU-based sizing, so some solution needs to be found there already. That change will already be quite disruptive in terms of impact on heap > sizing, so another option is to remove support in G1. I think removing support for Min/MaxHeapFreeRatio only for G1 is feasible, as long as we provide a replacement approach. Some high-level guidance like "if you set Min/MaxHeapFreeRatio to small values such as XX, try lowering GCTimeRatio to YY" may be acceptable. The downside is that it requires users of Min/MaxHeapFreeRatio to re-tune JVM parameters. One unresolved use case is dynamically changing Min/MaxHeapFreeRatio due to them being manageable. Perhaps we could make GCTimeRatio manageable? But Parallel GC and Shenandoah also use GCTimeRatio, so it could be difficult. Or if we reconsider the high-precedence SoftMaxHeapSize implementation (https://github.com/openjdk/jdk/pull/24211), perhaps users who dynamically set Min/MaxHeapFreeRatio could move to set SoftMaxHeapSize instead. > (We have very few internal users setting these two flags. But yesterday > > I ran into a use case that sets -XX:MinHeapFreeRatio=0 > > -XX:MaxHeapFreeRatio=0 for G1...) > What would be the use case for setting it to these values? There seems to be little upside and lots of downside for this choice, > because it likely causes a lot of GC activity since the VM will need GC > to expand the heap little by little all the time, and full gc/Remark > will immediately reset these expansion efforts. The use case is to create a process snapshot image via CRIU (checkpoint/restore), like what https://openjdk.org/projects/crac does. The application wants G1 to shrink the heap as much as possible, to reduce the size of the snapshot. It sets both flags to zero, performs several System.gc(), then sets both flags back to previous values, then creates the snapshot. -Man From tschatzl at openjdk.org Thu Apr 10 09:07:39 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 10 Apr 2025 09:07:39 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v32] In-Reply-To: References: Message-ID: > Hi all, > > please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier.
> > The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. > > ### Current situation > > With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. > > The main reason for the current barrier is how g1 implements concurrent refinement: > * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. > * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, > * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. > > These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: > > > // Filtering > if (region(@x.a) == region(y)) goto done; // same region check > if (y == null) goto done; // null value check > if (card(@x.a) == young_card) goto done; // write to young gen check > StoreLoad; // synchronize > if (card(@x.a) == dirty_card) goto done; > > *card(@x.a) = dirty > > // Card tracking > enqueue(card-address(@x.a)) into thread-local-dcq; > if (thread-local-dcq is not full) goto done; > > call runtime to move thread-local-dcq into dcqs > > done: > > > Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. > > The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. > > There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). > > The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching card tables. Mutators only work on the "primary" card table, refinement threads on a se... Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 46 commits: - Merge branch 'master' into 8342382-card-table-instead-of-dcq - * fixes after merge related to 32 bit x86 removal - Merge branch 'master' into 8342382-card-table-instead-of-dcq - * ayang review: revising young gen length * robcasloz review: various minor refactorings - Do not unnecessarily pass around tmp2 in x86 - Refine needs_liveness_data - Reorder includes - * missing file from merge - Merge branch 'master' into 8342382-card-table-instead-of-dcq - Merge branch 'master' into 8342382-card-table-instead-of-dcq - ... 
and 36 more: https://git.openjdk.org/jdk/compare/f94a4f7a...fcf96a2a ------------- Changes: https://git.openjdk.org/jdk/pull/23739/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23739&range=31 Stats: 7112 lines in 110 files changed: 2592 ins; 3594 del; 926 mod Patch: https://git.openjdk.org/jdk/pull/23739.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23739/head:pull/23739 PR: https://git.openjdk.org/jdk/pull/23739 From ayang at openjdk.org Thu Apr 10 09:12:32 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 10 Apr 2025 09:12:32 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v30] In-Reply-To: References: Message-ID: On Wed, 9 Apr 2025 14:32:43 GMT, Thomas Schatzl wrote: >> src/hotspot/share/gc/g1/g1ConcurrentRefineSweepTask.cpp line 83: >> >>> 81: break; >>> 82: } >>> 83: case G1RemSet::HasRefToOld : break; // Nothing special to do. >> >> Why doesn't it call `inc_cards_clean_again` in this case? The card is cleared also. (In fact, I don't get why this needs to be a separate case from `NoInteresting`.) > > "NoInteresting" means that the card contains no interesting reference at all. "HasRefToOld" means that there has been an interesting reference in the card. > > The distinction between these groups of cards seems interesting to me. E.g. out of X non-clean cards, there were A with a reference to the collection set, B that were already marked as containing a reference to the collection set, C not having any interesting reference any more (transitioned from clean -> dirty -> clean, and cleared by the mutator), D being non-parsable, and E having references to old (and no other references). > > I could add a separate counter for these types of cards too - they can be inferred from the total number of scanned minus the others though. I see; "clean again" means the existing interesting pointer was overwritten by the mutator. I misinterpreted the comment as cards transitioned from dirty to clean. ` size_t _cards_clean_again; // Dirtied cards that were cleaned.` To prevent misunderstanding, what do you think of renaming "NoInteresting" to "NoCrossRegion" and "_cards_clean_again" to "_cards_no_cross_region", or something similar so that the 1:1 mapping is clearer? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2036885633 From manc at google.com Thu Apr 10 09:30:54 2025 From: manc at google.com (Man Cao) Date: Thu, 10 Apr 2025 02:30:54 -0700 Subject: Moving Forward with AHS for G1 In-Reply-To: References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> <5210B365-EB7D-498F-BF21-02B9629B1338@kodewerk.com> Message-ID: Re Erik's comments: > Sorry to butt in. A high-level question about the AHS plan for G1: are we > interested in the > intermediate functionality (SoftMaxHeapSize and CurrentMaxHeapSize), or is > it AHS that > we are interested in? No worries, and I appreciate the comment. The high-level rationale is that the JVM should provide at least one of SoftMaxHeapSize or CurrentMaxHeapSize as a high-precedence, manageable flag, so that the JVM could take a customized input signal for heap sizing decisions. Even with a fully-developed AHS algorithm, it cannot satisfy all deployment environments. E.g. a custom container system or custom OS, in which the JVM cannot detect system memory pressure via standard approaches. So these flags are not necessarily intermediate solutions, and they could allow more deployment environments to use AHS.
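(For a concrete sense of what "manageable" buys here: such a flag can be updated in a running JVM from the outside, for example with jcmd <pid> VM.set_flag SoftMaxHeapSize 2147483648. The flag name and value are only an example; a CurrentMaxHeapSize flag would first have to exist and be marked manageable before it could be driven this way.)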
For SoftMaxHeapSize for G1, based on discussion in https://github.com/openjdk/jdk/pull/24211, it will likely become just a hint to trigger concurrent marks, which will be unlikely to interfere with other parts of G1 AHS. For my original proposal of high-precedence SoftMaxHeapSize (as currently implemented in the PR), the guidance for users is that they should either provide a mechanism to adjust SoftMaxHeapSize dynamically to prevent GC thrashing, or only set it temporarily and accept the risk of GC thrashing. It is not intended as a static value that the user "sets and forgets". For CurrentMaxHeapSize, it has similar issues to high-precedence SoftMaxHeapSize, in that it is not "sets and forgets". However, I can see that clearly-specified OutOfMemoryError behavior from CurrentMaxHeapSize could be more favorable than the hard-to-define potential GC thrashing condition that a high-precedence SoftMaxHeapSize could cause. Re Kirk's comments: > Might I suggest that an entirely new (experimental?) adaptive size policy > be introduced that makes use of current flags in a manner that is > appropriate to the new policy. That policy would calculate a size of Eden > to control GC frequency, a size of survivor to limit promotion of > transients, and a tenured large enough to accommodate the live set as well > as manage the expected number of humongous allocations. If global heap > pressure won't support the ensuing max heap size, then the cost could be > smaller eden implying higher GC overhead due to increased frequency. > Metrics to support eden sizing would be allocation rate. The age table > with premature promotion rates would be used to estimate the size of > survivor. Live set size with a recent history of humongous allocations > would be used for tenured. > There will need to be a dampening strategy in play. My current (dumb) idea > for Serial is to set an overhead threshold delta that needs to be exceeded > to trigger a resize. I don't quite understand how this adaptive size policy (ASP) solves the problems AHS tries to solve. AHS tries to solve the problem of reaching an appropriate target *total* heap size, based on multiple inputs (JVM flags, environment circumstances). Once a total heap size is determined, G1 uses existing algorithms to determine young-gen and old-gen sizes. However, the ASP seems to focus on determining young-gen and old-gen sizes using a new algorithm. -Man From tschatzl at openjdk.org Thu Apr 10 10:02:40 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 10 Apr 2025 10:02:40 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v33] In-Reply-To: References: Message-ID: <5FzYDFpFOksmAGM5RV0gGk2eDAdinlDCGo8_37eUeEA=.5f96c37e-7b10-41b4-a607-fc7a665abd67@github.com> > Hi all, > > please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier.
> > The main reason for the current barrier is how g1 implements concurrent refinement: > * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. > * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, > * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. > > These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: > > > // Filtering > if (region(@x.a) == region(y)) goto done; // same region check > if (y == null) goto done; // null value check > if (card(@x.a) == young_card) goto done; // write to young gen check > StoreLoad; // synchronize > if (card(@x.a) == dirty_card) goto done; > > *card(@x.a) = dirty > > // Card tracking > enqueue(card-address(@x.a)) into thread-local-dcq; > if (thread-local-dcq is not full) goto done; > > call runtime to move thread-local-dcq into dcqs > > done: > > > Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. > > The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. > > There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). > > The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching card tables. Mutators only work on the "primary" card table, refinement threads on a se... Thomas Schatzl has updated the pull request incrementally with two additional commits since the last revision: - * indentation fix - * remove support for 32 bit x86 in the barrier generation code, following latest changes from @shade ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23739/files - new: https://git.openjdk.org/jdk/pull/23739/files/fcf96a2a..068d2a37 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23739&range=32 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23739&range=31-32 Stats: 5 lines in 1 file changed: 0 ins; 2 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/23739.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23739/head:pull/23739 PR: https://git.openjdk.org/jdk/pull/23739 From tschatzl at openjdk.org Thu Apr 10 10:02:41 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 10 Apr 2025 10:02:41 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v30] In-Reply-To: <03K6ui5yP3iy8HS_C4nurnsrbOymrm_962YA0-U92IM=.0f83b0ac-5895-4e1a-bb22-0006bd5dd888@github.com> References: <03K6ui5yP3iy8HS_C4nurnsrbOymrm_962YA0-U92IM=.0f83b0ac-5895-4e1a-bb22-0006bd5dd888@github.com> Message-ID: On Thu, 10 Apr 2025 08:34:00 GMT, Aleksey Shipilev wrote: > > I fixed this for now, but it will be broken again in just a bit with Aleksey's ongoing removal of x86 32 bit platform efforts. > > I think all x86 cleanups related to GC and adjacent code have landed in mainline now. So I expect no more major conflicts with this PR :) Thanks. 
:) @TheRealMDoerr: should be fixed now. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23739#issuecomment-2792213039 From tschatzl at openjdk.org Thu Apr 10 11:01:42 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 10 Apr 2025 11:01:42 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v30] In-Reply-To: References: <8noWoU1cd2y4EjjK3QZGMLacPC9gkrwn5Ns3XbQbppI=.74de0b05-b8da-417f-8096-de98d7a3d815@github.com> Message-ID: On Wed, 9 Apr 2025 12:48:10 GMT, Thomas Schatzl wrote: >> src/hotspot/cpu/x86/gc/g1/g1BarrierSetAssembler_x86.cpp line 101: >> >>> 99: } >>> 100: >>> 101: void G1BarrierSetAssembler::gen_write_ref_array_post_barrier(MacroAssembler* masm, DecoratorSet decorators, >> >> Have you measured the performance impact of inlining this assembly code instead of resorting to a runtime call as done before? Is it worth the maintenance cost (for every platform), risk of introducing bugs, etc.? > > I remember significant impact in some microbenchmark. It's also inlined in Parallel GC. I do not consider it a big issue wrt to maintenance - these things never really change, and the method is small and contained. > I will try to redo numbers. >From our microbenchmarks (higher numbers are better): Current code: Benchmark (size) Mode Cnt Score Error Units ArrayCopyObject.conjoint_micro 31 thrpt 15 166136.959 ? 5517.157 ops/ms ArrayCopyObject.conjoint_micro 63 thrpt 15 108880.108 ? 4331.112 ops/ms ArrayCopyObject.conjoint_micro 127 thrpt 15 93159.977 ? 5025.458 ops/ms ArrayCopyObject.conjoint_micro 2047 thrpt 15 17234.842 ? 831.344 ops/ms ArrayCopyObject.conjoint_micro 4095 thrpt 15 9202.216 ? 292.612 ops/ms ArrayCopyObject.conjoint_micro 8191 thrpt 15 3565.705 ? 121.116 ops/ms ArrayCopyObject.disjoint_micro 31 thrpt 15 159106.245 ? 5965.576 ops/ms ArrayCopyObject.disjoint_micro 63 thrpt 15 95475.658 ? 5415.267 ops/ms ArrayCopyObject.disjoint_micro 127 thrpt 15 84249.979 ? 6313.007 ops/ms ArrayCopyObject.disjoint_micro 2047 thrpt 15 10682.650 ? 381.832 ops/ms ArrayCopyObject.disjoint_micro 4095 thrpt 15 4471.940 ? 216.439 ops/ms ArrayCopyObject.disjoint_micro 8191 thrpt 15 1378.296 ? 33.421 ops/ms ArrayCopy.arrayCopyObject N/A avgt 15 13.880 ? 0.517 ns/op ArrayCopy.arrayCopyObjectNonConst N/A avgt 15 14.844 ? 0.751 ns/op ArrayCopy.arrayCopyObjectSameArraysBackward N/A avgt 15 11.080 ? 0.703 ns/op ArrayCopy.arrayCopyObjectSameArraysForward N/A avgt 15 11.003 ? 0.135 ns/op Runtime call: Benchmark (size) Mode Cnt Score Error Units ArrayCopyObject.conjoint_micro 31 thrpt 15 73100.230 ? 11079.381 ops/ms ArrayCopyObject.conjoint_micro 63 thrpt 15 65039.431 ? 1996.832 ops/ms ArrayCopyObject.conjoint_micro 127 thrpt 15 58336.711 ? 2260.660 ops/ms ArrayCopyObject.conjoint_micro 2047 thrpt 15 17035.419 ? 524.445 ops/ms ArrayCopyObject.conjoint_micro 4095 thrpt 15 9207.661 ? 286.526 ops/ms ArrayCopyObject.conjoint_micro 8191 thrpt 15 3264.491 ? 73.848 ops/ms ArrayCopyObject.disjoint_micro 31 thrpt 15 84587.219 ? 3007.310 ops/ms ArrayCopyObject.disjoint_micro 63 thrpt 15 62815.254 ? 1214.310 ops/ms ArrayCopyObject.disjoint_micro 127 thrpt 15 58423.470 ? 285.670 ops/ms ArrayCopyObject.disjoint_micro 2047 thrpt 15 10720.462 ? 617.173 ops/ms ArrayCopyObject.disjoint_micro 4095 thrpt 15 4178.195 ? 178.942 ops/ms ArrayCopyObject.disjoint_micro 8191 thrpt 15 1374.268 ? 44.290 ops/ms ArrayCopy.arrayCopyObject N/A avgt 15 19.667 ? 0.740 ns/op ArrayCopy.arrayCopyObjectNonConst N/A avgt 15 21.243 ? 
1.891 ns/op ArrayCopy.arrayCopyObjectSameArraysBackward N/A avgt 15 16.645 ? 0.504 ns/op ArrayCopy.arrayCopyObjectSameArraysForward N/A avgt 15 17.409 ? 0.705 ns/op Obviously with larger arrays, the impact diminishes, but it's always there. I think the inlined code is worth the effort in this case. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2037086410 From rcastanedalo at openjdk.org Thu Apr 10 11:22:36 2025 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Thu, 10 Apr 2025 11:22:36 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v30] In-Reply-To: References: <8noWoU1cd2y4EjjK3QZGMLacPC9gkrwn5Ns3XbQbppI=.74de0b05-b8da-417f-8096-de98d7a3d815@github.com> Message-ID: On Thu, 10 Apr 2025 10:58:24 GMT, Thomas Schatzl wrote: >> I remember significant impact in some microbenchmark. It's also inlined in Parallel GC. I do not consider it a big issue wrt to maintenance - these things never really change, and the method is small and contained. >> I will try to redo numbers. > > From our microbenchmarks (higher numbers are better): > > Current code: > > Benchmark (size) Mode Cnt Score Error Units > ArrayCopyObject.conjoint_micro 31 thrpt 15 166136.959 ? 5517.157 ops/ms > ArrayCopyObject.conjoint_micro 63 thrpt 15 108880.108 ? 4331.112 ops/ms > ArrayCopyObject.conjoint_micro 127 thrpt 15 93159.977 ? 5025.458 ops/ms > ArrayCopyObject.conjoint_micro 2047 thrpt 15 17234.842 ? 831.344 ops/ms > ArrayCopyObject.conjoint_micro 4095 thrpt 15 9202.216 ? 292.612 ops/ms > ArrayCopyObject.conjoint_micro 8191 thrpt 15 3565.705 ? 121.116 ops/ms > ArrayCopyObject.disjoint_micro 31 thrpt 15 159106.245 ? 5965.576 ops/ms > ArrayCopyObject.disjoint_micro 63 thrpt 15 95475.658 ? 5415.267 ops/ms > ArrayCopyObject.disjoint_micro 127 thrpt 15 84249.979 ? 6313.007 ops/ms > ArrayCopyObject.disjoint_micro 2047 thrpt 15 10682.650 ? 381.832 ops/ms > ArrayCopyObject.disjoint_micro 4095 thrpt 15 4471.940 ? 216.439 ops/ms > ArrayCopyObject.disjoint_micro 8191 thrpt 15 1378.296 ? 33.421 ops/ms > ArrayCopy.arrayCopyObject N/A avgt 15 13.880 ? 0.517 ns/op > ArrayCopy.arrayCopyObjectNonConst N/A avgt 15 14.844 ? 0.751 ns/op > ArrayCopy.arrayCopyObjectSameArraysBackward N/A avgt 15 11.080 ? 0.703 ns/op > ArrayCopy.arrayCopyObjectSameArraysForward N/A avgt 15 11.003 ? 0.135 ns/op > > Runtime call: > > Benchmark (size) Mode Cnt Score Error Units > ArrayCopyObject.conjoint_micro 31 thrpt 15 73100.230 ? 11079.381 ops/ms > ArrayCopyObject.conjoint_micro 63 thrpt 15 65039.431 ? 1996.832 ops/ms > ArrayCopyObject.conjoint_micro 127 thrpt 15 58336.711 ? 2260.660 ops/ms > ArrayCopyObject.conjoint_micro 2047 thrpt 15 17035.419 ? 524.445 ops/ms > ArrayCopyObject.conjoint_micro 4095 thrpt 15 9207.661 ? 286.526 ops/ms > ArrayCopyObject.conjoint_micro 8191 thrpt 15 3264.491 ? 73.848 ops/ms > ArrayCopyObject.disjoint_micro 31 thrpt 15 84587.219 ? 3007.310 ops/ms > ArrayCopyObject.disjoint_micro ... Fair enough, thanks for the measurements! 
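For anyone who wants to poke at this locally, a minimal JMH micro in the spirit of the ArrayCopyObject numbers above could look roughly like the sketch below. This is not the benchmark from the jdk tree; the class name, sizes and setup are assumptions. Comparing a build that inlines the array post-barrier against one that uses the runtime call should reproduce the small-array gap discussed above.

import java.util.concurrent.TimeUnit;
import org.openjdk.jmh.annotations.*;

@BenchmarkMode(Mode.Throughput)
@OutputTimeUnit(TimeUnit.MILLISECONDS)
@State(Scope.Thread)
public class ArrayCopyObjectSketch {
    @Param({"31", "127", "2047"})
    int size;

    Object[] src;
    Object[] dst;

    @Setup
    public void setup() {
        src = new Object[size];
        dst = new Object[size];
        for (int i = 0; i < size; i++) {
            src[i] = new Object();
        }
    }

    @Benchmark
    public void disjoint() {
        // Every copied element goes through the GC's array post-write barrier,
        // so for small sizes the barrier implementation dominates the copy cost.
        System.arraycopy(src, 0, dst, 0, size);
    }
}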
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r2037121277 From tschatzl at openjdk.org Thu Apr 10 11:41:33 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 10 Apr 2025 11:41:33 GMT Subject: RFR: 8354228: Parallel: Set correct minimum of InitialSurvivorRatio In-Reply-To: References: Message-ID: <89h5aK0Oop82whqONpjyoqsYaLnShKKDmPSpxhMpVJQ=.b29ac864-000f-4987-bf6c-27c9299c7730@github.com> On Wed, 9 Apr 2025 17:33:07 GMT, Albert Mingkun Yang wrote: > Updating the lower bound of InitialSurvivorRatio to match MinSurvivorRatio. The two removed test cases set conflicting Min and Initial SurvivorRatio, which, IMO, is an incorrect configuration, so I removed them. > > Test: tier1-7 Changes requested by tschatzl (Reviewer). src/hotspot/share/gc/shared/gc_globals.hpp line 415: > 413: product(uintx, InitialSurvivorRatio, 8, \ > 414: "Initial ratio of young generation/survivor space size") \ > 415: range(0, max_uintx) \ There is code somewhere which sets InitialSurvivorRatio to 3 if it is smaller than that. It should be removed. It is somewhere around `parallelArguments.cpp:108`. There is similar code next to it for `MinSurvivorRatio` which is dead code too (`MinSurvivorRatio` is already bounded by 3 at minimum). Also, previously this value has been overridden silently; bailing out is a behavioral change that requires a CSR. ------------- PR Review: https://git.openjdk.org/jdk/pull/24556#pullrequestreview-2756365732 PR Review Comment: https://git.openjdk.org/jdk/pull/24556#discussion_r2037149128 From ayang at openjdk.org Thu Apr 10 11:59:52 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 10 Apr 2025 11:59:52 GMT Subject: RFR: 8354228: Parallel: Set correct minimum of InitialSurvivorRatio [v2] In-Reply-To: References: Message-ID: > Updating the lower bound of InitialSurvivorRatio to match MinSurvivorRatio. The two removed test cases set conflicting Min and Initial SurvivorRatio, which, IMO, is an incorrect configuration, so I removed them. > > Test: tier1-7 Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: review ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24556/files - new: https://git.openjdk.org/jdk/pull/24556/files/6dfd92bf..1cd03d17 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24556&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24556&range=00-01 Stats: 11 lines in 1 file changed: 0 ins; 11 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/24556.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24556/head:pull/24556 PR: https://git.openjdk.org/jdk/pull/24556 From kdnilsen at openjdk.org Thu Apr 10 16:28:21 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 10 Apr 2025 16:28:21 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data [v5] In-Reply-To: References: Message-ID: <5jhoXMiuinw50NFwWr_kQdOudqZTx-3rfX8-4eCr4OY=.565602e3-8dc6-47eb-aa36-ddc5b9f27a08@github.com> > The existing implementation of get_live_data_bytes() and get_live_data_words() does not always behave as might be expected. In particular, the value returned ignores any allocations that occur subsequent to the most recent mark effort that identified live data within the region. This is typically ok for young regions that are going to be added or not to the collection set during final-mark safepoint.
> > However, old-gen regions that are placed into the set of candidates for mixed evacuation are more complicated. In particular, by the time the old-gen region is added to a mixed evacuation, its live data may be much larger than at the time concurrent old marking ended. > > This PR provides comments to clarify the shortcomings of the existing functions, and adds new functions that provide more accurate accountings of live data for mixed-evacuation candidate regions. Kelvin Nilsen has updated the pull request incrementally with two additional commits since the last revision: - Refactor for better abstraction - Fix set_live() after full gc ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24319/files - new: https://git.openjdk.org/jdk/pull/24319/files/8e820f29..eb2679aa Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24319&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24319&range=03-04 Stats: 13 lines in 3 files changed: 3 ins; 6 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/24319.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24319/head:pull/24319 PR: https://git.openjdk.org/jdk/pull/24319 From erik.osterlund at oracle.com Thu Apr 10 17:30:09 2025 From: erik.osterlund at oracle.com (Erik Osterlund) Date: Thu, 10 Apr 2025 17:30:09 +0000 Subject: [External] : Re: Moving Forward with AHS for G1 In-Reply-To: References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> <5210B365-EB7D-498F-BF21-02B9629B1338@kodewerk.com> Message-ID: <4E901C51-BBD6-431A-9282-5432A8AD8B9B@oracle.com> > On 10 Apr 2025, at 11:31, Man Cao wrote: > > Even with fully-developed AHS algorithm, it cannot satisfy all deployment environments. E.g. custom container system or custom OS, in which the JVM cannot detect system memory pressure via standard approaches. So these flags are not necessarily intermediate solutions, and they could allow more deployment environments to use AHS. Could you elaborate the concrete scenario you have in mind? What use case do you have in mind where AHS is not enough, while external heap control is? /Erik From manc at google.com Thu Apr 10 20:18:07 2025 From: manc at google.com (Man Cao) Date: Thu, 10 Apr 2025 13:18:07 -0700 Subject: Moving Forward with AHS for G1 In-Reply-To: <4E901C51-BBD6-431A-9282-5432A8AD8B9B@oracle.com> References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> <5210B365-EB7D-498F-BF21-02B9629B1338@kodewerk.com> <4E901C51-BBD6-431A-9282-5432A8AD8B9B@oracle.com> Message-ID: Re Erik: > Could you elaborate the concrete scenario you have in mind? What use case do you have in mind where AHS is not enough, while external heap control is? One example is a customized container environment that requires non-standard approaches to read container memory usage and container memory limit, i.e., the application cannot use standard cgroup's memory.memsw.usage_in_bytes, memory.memsw.max_usage_in_bytes control files. Instead, the customized container could provide its own library for the application to get container usage and limit. Without CurrentMaxHeapSize or a high-precedence SoftMaxHeapSize, the JVM has no way to use the container-provided library to get signals for memory pressure. With such JVM flags, the application could use the container-provided library to calculate a value for those JVM flags based on memory pressure, and pass that information to the JVM. 
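To make that concrete, a minimal application-side sketch follows. The container library calls are hypothetical stand-ins for whatever the custom environment provides, CurrentMaxHeapSize and a high-precedence SoftMaxHeapSize are the proposed flags, and how the computed value reaches the JVM (for example through a manageable-flag update) is left out of the sketch:

    #include <cstddef>

    // Hypothetical stand-ins for the custom container's own library.
    namespace my_container {
      size_t memory_limit_bytes();
      size_t memory_usage_bytes();
    }

    // Pick the next heap ceiling from current container pressure.
    size_t next_heap_ceiling(size_t current_heap_bytes, size_t headroom_bytes) {
      const size_t limit = my_container::memory_limit_bytes();
      const size_t usage = my_container::memory_usage_bytes();
      // Keep what the heap already occupies plus whatever the container has left,
      // minus headroom for non-heap growth (code cache, thread stacks, metaspace).
      return limit - usage + current_heap_bytes - headroom_bytes;
    }

The application would re-run this whenever container usage changes, raising the value again once usage drops.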
-Man -------------- next part -------------- An HTML attachment was scrubbed... URL: From erik.osterlund at oracle.com Thu Apr 10 21:02:08 2025 From: erik.osterlund at oracle.com (Erik Osterlund) Date: Thu, 10 Apr 2025 21:02:08 +0000 Subject: [External] : Re: Moving Forward with AHS for G1 In-Reply-To: References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> <5210B365-EB7D-498F-BF21-02B9629B1338@kodewerk.com> <4E901C51-BBD6-431A-9282-5432A8AD8B9B@oracle.com> Message-ID: On 10 Apr 2025, at 22:18, Man Cao wrote: One example is a customized container environment that requires non-standard approaches to read container memory usage and container memory limit, i.e., the application cannot use standard cgroup's memory.memsw.usage_in_bytes, memory.memsw.max_usage_in_bytes control files. Instead, the customized container could provide its own library for the application to get container usage and limit. If the custom container app allocates 300 GB native memory with, for example, panama APIs or JNI, what will happen? Is it allowed, or limited? /Erik -------------- next part -------------- An HTML attachment was scrubbed... URL: From kdnilsen at openjdk.org Thu Apr 10 21:55:45 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 10 Apr 2025 21:55:45 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data [v6] In-Reply-To: References: Message-ID: > The existing implementation of get_live_data_bytes() and git_live_data_words() does not always behave as might be expected. In particular, the value returned ignores any allocations that occur subsequent to the most recent mark effort that identified live data within the region. This is typically ok for young regions that are going to be added or not to the collection set during final-mark safepoint. > > However, old-gen regions that are placed into the set of candidates for mixed evacuation are more complicated. In particular, by the time the old-gen region is added to a mixed evacuation, its live data may be much larger than at the time concurrent old marking ended. > > This PR provides comments to clarify the shortcomings of the existing functions, and adds new functions that provide more accurate accountings of live data for mixed-evacuation candidate regions. 
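As a rough illustration of what a more accurate accounting amounts to (the helper below is hypothetical; this is not the patch itself): a mixed-evacuation candidate's live data is what the old mark found plus whatever has been allocated in the region since that mark completed.

    // Hypothetical sketch only, written against in-tree types for flavor.
    size_t mixed_evac_live_data_words(const ShenandoahHeapRegion* r) {
      const size_t live_at_old_mark_end = r->get_live_data_words();            // existing accessor
      const size_t allocated_since_mark = r->words_allocated_since_old_mark(); // hypothetical helper
      return live_at_old_mark_end + allocated_since_mark;
    }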
Kelvin Nilsen has updated the pull request incrementally with two additional commits since the last revision: - Remove deprecation conditional compiles - Adjust candidate live memory for each mixed evac ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24319/files - new: https://git.openjdk.org/jdk/pull/24319/files/eb2679aa..ef783d48 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24319&range=05 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24319&range=04-05 Stats: 85 lines in 6 files changed: 24 ins; 61 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/24319.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24319/head:pull/24319 PR: https://git.openjdk.org/jdk/pull/24319 From manc at google.com Thu Apr 10 22:15:03 2025 From: manc at google.com (Man Cao) Date: Thu, 10 Apr 2025 15:15:03 -0700 Subject: Moving Forward with AHS for G1 In-Reply-To: References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> <5210B365-EB7D-498F-BF21-02B9629B1338@kodewerk.com> <4E901C51-BBD6-431A-9282-5432A8AD8B9B@oracle.com> Message-ID: > If the custom container app allocates 300 GB native memory with, for example, panama APIs or JNI, what will happen? Is it allowed, or limited? I suppose the more accurate way to put it is "if an app inside the custom container environment allocates 300 GB native memory ...". The custom container environment itself is not a Java app. If the container memory limit is 310GiB, container usage is 305GiB, and the app's current Java heap size is 3GiB, and Xmx is 20GiB, then the app could set CurrentMaxHeapSize=8G (310 - 305 + 3), or CurrentMaxHeapSize=7G (to give 1GiB head room for growth from other non-heap memory: code cache, thread stack, metaspace, etc.), to prevent running out of container memory limit. Note that the app should actively monitor container usage to adjust CurrentMaxHeapSize, e.g. increasing CurrentMaxHeapSize when container usage drops. If the app keeps allocating more native memory, CurrentMaxHeapSize will further drop, and it will eventually die with Java OutOfMemoryError. In the above case, the JVM is unaware of the 310G container limit or the 305G container usage. -Man -------------- next part -------------- An HTML attachment was scrubbed... URL: From ysr at openjdk.org Thu Apr 10 22:36:25 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Thu, 10 Apr 2025 22:36:25 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data [v6] In-Reply-To: References: Message-ID: On Thu, 10 Apr 2025 21:55:45 GMT, Kelvin Nilsen wrote: >> The existing implementation of get_live_data_bytes() and git_live_data_words() does not always behave as might be expected. In particular, the value returned ignores any allocations that occur subsequent to the most recent mark effort that identified live data within the region. This is typically ok for young regions that are going to be added or not to the collection set during final-mark safepoint. >> >> However, old-gen regions that are placed into the set of candidates for mixed evacuation are more complicated. In particular, by the time the old-gen region is added to a mixed evacuation, its live data may be much larger than at the time concurrent old marking ended. >> >> This PR provides comments to clarify the shortcomings of the existing functions, and adds new functions that provide more accurate accountings of live data for mixed-evacuation candidate regions. 
> > Kelvin Nilsen has updated the pull request incrementally with two additional commits since the last revision: > > - Remove deprecation conditional compiles > - Adjust candidate live memory for each mixed evac Haven't started looking at these changes, but I do wonder if it might be worthwhile to also consider (and implement under a tunable flag) the alternative policy of never adding to the collection set any regions that are still "active" at the point when the collection set for a marking cycle is first assembled at the end of the final marking. That way we don't have to do any re-computing, and the criterion for evacuation is garbage-first (or liveness-least) both of which remain invariant (and complements of each other) throughout the duration of evacuation and obviating entirely the need for recomputing the goodness/choice metric afresh. The downside is that we may leave some garbage on the table in the active regions, but this is probably a minor price for most workloads and heap configurations, and doesn't unnecessarily complicate or overengineer the solution. One question to consider is how G1 does this. May be regions placed in the collection set are retired (i.e. made inactive?) -- I prefer not to forcibly retire active regions as this wastes space that may have been usable. Thoughts? (Can add this comment and discuss on the ticket if that is logistically preferable.) ------------- PR Comment: https://git.openjdk.org/jdk/pull/24319#issuecomment-2795315167 From mbeckwit at openjdk.org Fri Apr 11 02:19:33 2025 From: mbeckwit at openjdk.org (Monica Beckwith) Date: Fri, 11 Apr 2025 02:19:33 GMT Subject: RFR: 8236073: G1: Use SoftMaxHeapSize to guide GC heuristics [v8] In-Reply-To: References: Message-ID: <3x_0x1y1pPb4CI4eSx1FUDNoqPCbWhv-Se1FwbC5mlE=.a0ccd4e9-8ea1-4540-8e55-4b992c58b8b1@github.com> On Fri, 4 Apr 2025 09:01:30 GMT, Thomas Schatzl wrote: > Meanwhile, @mo-beck do you guys have preference on how SoftMaxHeapSize should work? Thanks for the thoughtful work here ? this PR is a solid step toward strengthening G1?s memory footprint management, and I support it. This patch adds support for `SoftMaxHeapSize` in both expansion and shrinkage paths, as well as IHOP calculation, ensuring it's part of the regular heap policy logic. As I outlined in my [original note](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-November/050191.html) and follow-up on [AHS integration](https://mail.openjdk.org/pipermail/hotspot-gc-dev/2025-March/051619.html), my intent has been to use `SoftMaxHeapSize` as a guiding input ? a soft signal ? within a broader dynamic heap sizing controller that considers GC overhead, mutator behavior, and memory availability. This patch lays the groundwork for that direction. The behavior when the live set exceeds the soft target has come up in the discussion. My view remains that the heap should be influenced by the value, not strictly bound to it. That?s the balance I?ve been aiming for in describing how it integrates into the control loop ? SoftMax helps inform decisions, but doesn?t unconditionally restrict them. I agree that we?ll want to follow up with logic that can respond to GC pressure and workload needs, to avoid any unintended performance issues. I?ll update [JDK-8353716](https://bugs.openjdk.org/browse/JDK-8353716) to reflect this, and I?ll continue the thread on the mailing list to coordinate the next phase. 
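One way to make "influenced by the value, not strictly bound to it" concrete (an illustration only, not the policy implemented in this PR): let the soft target guide sizing, but let the size needed to meet the GC overhead goal override it, clamped to the hard bounds.

    #include <algorithm>
    #include <cstddef>

    // Illustrative controller step, not G1 code.
    size_t choose_heap_target(size_t soft_max, size_t needed_for_overhead_goal,
                              size_t min_heap, size_t max_heap) {
      const size_t guided = std::max(soft_max, needed_for_overhead_goal);
      return std::clamp(guided, min_heap, max_heap);
    }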
------------- PR Comment: https://git.openjdk.org/jdk/pull/24211#issuecomment-2795676870 From erik.osterlund at oracle.com Fri Apr 11 05:52:27 2025 From: erik.osterlund at oracle.com (Erik Osterlund) Date: Fri, 11 Apr 2025 05:52:27 +0000 Subject: [External] : Re: Moving Forward with AHS for G1 In-Reply-To: References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> <5210B365-EB7D-498F-BF21-02B9629B1338@kodewerk.com> <4E901C51-BBD6-431A-9282-5432A8AD8B9B@oracle.com> Message-ID: Okay it seems to me that the use case you are describing is wanting a container with an enforced memory limit. It should quack like a cgroup and walk like a cgroup but must not actually use cgroups for some reason. Cgroups were seemingly built for this use case and has a complete view of the memory usage in the container due to being an OS feature. Conversely, if the custom ad-hoc container environment does not have OS support for the memory limit, then the app can temporarily exceed the memory limit, and hence won?t be as effective of a limit. But if you want to actually enforce a memory limit such that the app dies if it exceeds the limit I can?t help but wonder? why not use a cgroup to declare that limit though? Regardless, I wonder if what you actually want for your use case is a way to tell AHS what the max memory of the entire JVM should be, similar to the -XX:RssLimit Thomas Stuefe proposed: https://bugs.openjdk.org/browse/JDK-8321266 In other words, letting the JVM know that it has a bound on memory, and have AHS know about and try to adapt the heap such that the JVM memory usage is below the limit when native memory goes up and down. In other words, let the heap heuristics live in the JVM. Perhaps then the limit would also be static, or do the containers themselves actually grow and shrink at runtime, or was the dynamic nature of CurrentMaxHeapSize mostly an artifact of out sourcing the heap heuristics of an otherwise static custom container limit? /Erik On 11 Apr 2025, at 00:15, Man Cao wrote: ? > If the custom container app allocates 300 GB native memory with, for example, panama APIs or JNI, what will happen? Is it allowed, or limited? I suppose the more accurate way to put it is "if an app inside the custom container environment allocates 300 GB native memory ...". The custom container environment itself is not a Java app. If the container memory limit is 310GiB, container usage is 305GiB, and the app's current Java heap size is 3GiB, and Xmx is 20GiB, then the app could set CurrentMaxHeapSize=8G (310 - 305 + 3), or CurrentMaxHeapSize=7G (to give 1GiB head room for growth from other non-heap memory: code cache, thread stack, metaspace, etc.), to prevent running out of container memory limit. Note that the app should actively monitor container usage to adjust CurrentMaxHeapSize, e.g. increasing CurrentMaxHeapSize when container usage drops. If the app keeps allocating more native memory, CurrentMaxHeapSize will further drop, and it will eventually die with Java OutOfMemoryError. In the above case, the JVM is unaware of the 310G container limit or the 305G container usage. -Man -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From aboldtch at openjdk.org Fri Apr 11 06:20:11 2025 From: aboldtch at openjdk.org (Axel Boldt-Christmas) Date: Fri, 11 Apr 2025 06:20:11 GMT Subject: RFR: 8354358: ZGC: ZPartition::prime handle discontiguous reservations correctly Message-ID: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com> Prior to [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) the VM would not have started if we received a discontiguous heap reservation with all reservations smaller than the inital heap capacity. Now we crash because `ZPartition::prime` does not take this into account. However in contrast to the page cache, the mapped cache makes it trivial to support this scenario. So I propose fixing `ZPartition::prime` to handle any discontiguous heap reservation. Can be provoked in a debug build by using ZForceDiscontiguousHeapReservations > 16 `java -XX:+UseZGC -XX:ZForceDiscontiguousHeapReservations=17 -Xmx128m -Xms128m --version` Currently running this through testing. ------------- Commit messages: - 8354358: ZGC: ZPartition::prime handle discontiguous reservations correctly Changes: https://git.openjdk.org/jdk/pull/24589/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24589&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8354358 Stats: 31 lines in 2 files changed: 7 ins; 2 del; 22 mod Patch: https://git.openjdk.org/jdk/pull/24589.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24589/head:pull/24589 PR: https://git.openjdk.org/jdk/pull/24589 From stefank at openjdk.org Fri Apr 11 07:02:39 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Fri, 11 Apr 2025 07:02:39 GMT Subject: RFR: 8354358: ZGC: ZPartition::prime handle discontiguous reservations correctly In-Reply-To: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com> References: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com> Message-ID: On Fri, 11 Apr 2025 06:14:42 GMT, Axel Boldt-Christmas wrote: > Prior to [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) the VM would not have started if we received a discontiguous heap reservation with all reservations smaller than the inital heap capacity. Now we crash because `ZPartition::prime` does not take this into account. > > However in contrast to the page cache, the mapped cache makes it trivial to support this scenario. So I propose fixing `ZPartition::prime` to handle any discontiguous heap reservation. > > Can be provoked in a debug build by using ZForceDiscontiguousHeapReservations > 16 > `java -XX:+UseZGC -XX:ZForceDiscontiguousHeapReservations=17 -Xmx128m -Xms128m --version` > > Currently running this through testing. Looks good. As a follow-up, we might want to move the pre-touching so that we don't start and stop threads multiple times. ------------- Marked as reviewed by stefank (Reviewer). 
PR Review: https://git.openjdk.org/jdk/pull/24589#pullrequestreview-2759342921 From jsikstro at openjdk.org Fri Apr 11 07:05:40 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Fri, 11 Apr 2025 07:05:40 GMT Subject: RFR: 8354358: ZGC: ZPartition::prime handle discontiguous reservations correctly In-Reply-To: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com> References: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com> Message-ID: On Fri, 11 Apr 2025 06:14:42 GMT, Axel Boldt-Christmas wrote: > Prior to [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) the VM would not have started if we received a discontiguous heap reservation with all reservations smaller than the inital heap capacity. Now we crash because `ZPartition::prime` does not take this into account. > > However in contrast to the page cache, the mapped cache makes it trivial to support this scenario. So I propose fixing `ZPartition::prime` to handle any discontiguous heap reservation. > > Can be provoked in a debug build by using ZForceDiscontiguousHeapReservations > 16 > `java -XX:+UseZGC -XX:ZForceDiscontiguousHeapReservations=17 -Xmx128m -Xms128m --version` > > Currently running this through testing. src/hotspot/share/gc/z/zPageAllocator.cpp line 1011: > 1009: const size_t claimed_size = claim_virtual(size, &vmems); > 1010: > 1011: // Each partition must have at least size total vmems available when priming. Maybe something like "The partition must have size available in virtual memory when priming"? I'm reading this as the number of vmems, not the size of them combined. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24589#discussion_r2038925034 From aboldtch at openjdk.org Fri Apr 11 07:45:03 2025 From: aboldtch at openjdk.org (Axel Boldt-Christmas) Date: Fri, 11 Apr 2025 07:45:03 GMT Subject: RFR: 8354358: ZGC: ZPartition::prime handle discontiguous reservations correctly [v2] In-Reply-To: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com> References: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com> Message-ID: > Prior to [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) the VM would not have started if we received a discontiguous heap reservation with all reservations smaller than the inital heap capacity. Now we crash because `ZPartition::prime` does not take this into account. > > However in contrast to the page cache, the mapped cache makes it trivial to support this scenario. So I propose fixing `ZPartition::prime` to handle any discontiguous heap reservation. > > Can be provoked in a debug build by using ZForceDiscontiguousHeapReservations > 16 > `java -XX:+UseZGC -XX:ZForceDiscontiguousHeapReservations=17 -Xmx128m -Xms128m --version` > > Currently running this through testing. 
Axel Boldt-Christmas has updated the pull request incrementally with one additional commit since the last revision: Update Comment ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24589/files - new: https://git.openjdk.org/jdk/pull/24589/files/0abce51a..70b0e923 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24589&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24589&range=00-01 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/24589.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24589/head:pull/24589 PR: https://git.openjdk.org/jdk/pull/24589 From jsikstro at openjdk.org Fri Apr 11 07:47:31 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Fri, 11 Apr 2025 07:47:31 GMT Subject: RFR: 8354358: ZGC: ZPartition::prime handle discontiguous reservations correctly [v2] In-Reply-To: References: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com> Message-ID: <-P89Vbi7uncmcA5LSlyADETTuDB5EJWG3NaarpyAouk=.7364df7e-5e0d-484a-b53e-44614f2eabe6@github.com> On Fri, 11 Apr 2025 07:45:03 GMT, Axel Boldt-Christmas wrote: >> Prior to [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) the VM would not have started if we received a discontiguous heap reservation with all reservations smaller than the inital heap capacity. Now we crash because `ZPartition::prime` does not take this into account. >> >> However in contrast to the page cache, the mapped cache makes it trivial to support this scenario. So I propose fixing `ZPartition::prime` to handle any discontiguous heap reservation. >> >> Can be provoked in a debug build by using ZForceDiscontiguousHeapReservations > 16 >> `java -XX:+UseZGC -XX:ZForceDiscontiguousHeapReservations=17 -Xmx128m -Xms128m --version` >> >> Currently running this through testing. > > Axel Boldt-Christmas has updated the pull request incrementally with one additional commit since the last revision: > > Update Comment Looks good. As you say, this is nicely implemented with features from the Mapped Cache. ------------- Marked as reviewed by jsikstro (Committer). PR Review: https://git.openjdk.org/jdk/pull/24589#pullrequestreview-2759442646 From stefank at openjdk.org Fri Apr 11 07:52:25 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Fri, 11 Apr 2025 07:52:25 GMT Subject: RFR: 8354358: ZGC: ZPartition::prime handle discontiguous reservations correctly [v2] In-Reply-To: References: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com> Message-ID: On Fri, 11 Apr 2025 07:45:03 GMT, Axel Boldt-Christmas wrote: >> Prior to [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) the VM would not have started if we received a discontiguous heap reservation with all reservations smaller than the inital heap capacity. Now we crash because `ZPartition::prime` does not take this into account. >> >> However in contrast to the page cache, the mapped cache makes it trivial to support this scenario. So I propose fixing `ZPartition::prime` to handle any discontiguous heap reservation. >> >> Can be provoked in a debug build by using ZForceDiscontiguousHeapReservations > 16 >> `java -XX:+UseZGC -XX:ZForceDiscontiguousHeapReservations=17 -Xmx128m -Xms128m --version` >> >> Currently running this through testing. > > Axel Boldt-Christmas has updated the pull request incrementally with one additional commit since the last revision: > > Update Comment Marked as reviewed by stefank (Reviewer). 
------------- PR Review: https://git.openjdk.org/jdk/pull/24589#pullrequestreview-2759456953 From eosterlund at openjdk.org Fri Apr 11 10:37:41 2025 From: eosterlund at openjdk.org (Erik =?UTF-8?B?w5ZzdGVybHVuZA==?=) Date: Fri, 11 Apr 2025 10:37:41 GMT Subject: RFR: 8354358: ZGC: ZPartition::prime handle discontiguous reservations correctly [v2] In-Reply-To: References: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com> Message-ID: On Fri, 11 Apr 2025 07:45:03 GMT, Axel Boldt-Christmas wrote: >> Prior to [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) the VM would not have started if we received a discontiguous heap reservation with all reservations smaller than the inital heap capacity. Now we crash because `ZPartition::prime` does not take this into account. >> >> However in contrast to the page cache, the mapped cache makes it trivial to support this scenario. So I propose fixing `ZPartition::prime` to handle any discontiguous heap reservation. >> >> Can be provoked in a debug build by using ZForceDiscontiguousHeapReservations > 16 >> `java -XX:+UseZGC -XX:ZForceDiscontiguousHeapReservations=17 -Xmx128m -Xms128m --version` >> >> Currently running this through testing. > > Axel Boldt-Christmas has updated the pull request incrementally with one additional commit since the last revision: > > Update Comment Marked as reviewed by eosterlund (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/24589#pullrequestreview-2759908182 From jsikstro at openjdk.org Fri Apr 11 11:38:08 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Fri, 11 Apr 2025 11:38:08 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing Message-ID: Hello, > This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. 
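To illustrate the two rules in code (a sketch only, not taken from the patch; the class and members are made up, and the indentation argument to StreamAutoIndentor is the optional parameter proposed further down):

    // Sketch of a print_on that follows rules 1 and 2.
    void MyGeneration::print_on(outputStream* st) const {
      // Rule 1: print at whatever indentation the caller set up, no prepended spaces.
      st->print_cr("generation total %zuK, used %zuK", capacity() / K, used() / K);

      // Rule 2: enforce our own indentation for everything printed "below" us.
      StreamAutoIndentor indentor(st, 1);  // proposed optional indentation argument
      _space->print_on(st);                // the callee neither knows nor cares about our level
    }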
The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, this is a trade-off worth making, considering that memory for buffers of strings usually outweigh this extra memory cost. Additionally, when factoring in the improved code understandability and maintainability, I feel like it's a change worth making. Some new changes in the way the printing looks are: * Epsilon has received indentation in its print_on, which was not there before, in an effort to look similar to other GCs and also improve readability. * Shenandoah has also received indentation to behave similar to other GCs. * "the space" in Serial's output was indented by two spaces, now it's one. * With the removal of print_on from print_on_error, I've also removed Epsilon's barrier set printing, making it's print_on_error empty. Before this, Serial printed two spaces between the sections in the hs_err file. Code re-structure: * PSOldGen::print_on had an inlined version of virtual_space()->print_space_boundaries_on(st), which is now called instead. * PSYoungGen::print_on had its name inlined. Now, name() is called instead, which is how PSOldGen::print_on does it. * I've added a common print_space_boundaries_on for the virtual space used in Serial's DefNewGeneration and TenuredGeneration, like how Parallel does it. * I've opted to use fill_to() in Metaspace printing so that it works well with ZGC printing. This does not really affect other GCs since only ZGC aligns with the same column as Metaspace. Testing: * GHA, Oracle's tier 1-3 * Manual inspection of printed content * Exit printing `-Xlog:gc+heap+exit=info` * Periodic printing `-Xlog:gc+heap=debug` * jcmd `jcmd GC.heap_info` * jcmd `jcmd VM.info` * hs_err file, both "Heap:" and "Heap before/after invocations=" printing, `-XX:ErrorHandlerTest=14` ------------- Commit messages: - 8354362: Use automatic indentation in CollectedHeap printing Changes: https://git.openjdk.org/jdk/pull/24593/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24593&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8354362 Stats: 239 lines in 26 files changed: 88 ins; 88 del; 63 mod Patch: https://git.openjdk.org/jdk/pull/24593.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24593/head:pull/24593 PR: https://git.openjdk.org/jdk/pull/24593 From jsikstro at openjdk.org Fri Apr 11 11:38:08 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Fri, 11 Apr 2025 11:38:08 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing In-Reply-To: References: Message-ID: On Fri, 11 Apr 2025 11:28:12 GMT, Joel Sikstr?m wrote: > Hello, > >> This PR only focuses on fixing indentation and re-arranging some callsites. 
It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. > > Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. > > What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: > 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. > 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). > > Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. > > I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. > > Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, this is a trade-off worth makin... Ping @tstuefe regarding changes for `StreamAutoIndentor`. Would be nice to get your opinion since you are the author of it and its uses :) ------------- PR Comment: https://git.openjdk.org/jdk/pull/24593#issuecomment-2796653117 From rcastanedalo at openjdk.org Fri Apr 11 13:01:49 2025 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Fri, 11 Apr 2025 13:01:49 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v33] In-Reply-To: <5FzYDFpFOksmAGM5RV0gGk2eDAdinlDCGo8_37eUeEA=.5f96c37e-7b10-41b4-a607-fc7a665abd67@github.com> References: <5FzYDFpFOksmAGM5RV0gGk2eDAdinlDCGo8_37eUeEA=.5f96c37e-7b10-41b4-a607-fc7a665abd67@github.com> Message-ID: On Thu, 10 Apr 2025 10:02:40 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. 
>> >> The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. >> >> ### Current situation >> >> With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. >> >> The main reason for the current barrier is how g1 implements concurrent refinement: >> * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. >> * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, >> * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. >> >> These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: >> >> >> // Filtering >> if (region(@x.a) == region(y)) goto done; // same region check >> if (y == null) goto done; // null value check >> if (card(@x.a) == young_card) goto done; // write to young gen check >> StoreLoad; // synchronize >> if (card(@x.a) == dirty_card) goto done; >> >> *card(@x.a) = dirty >> >> // Card tracking >> enqueue(card-address(@x.a)) into thread-local-dcq; >> if (thread-local-dcq is not full) goto done; >> >> call runtime to move thread-local-dcq into dcqs >> >> done: >> >> >> Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. >> >> The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. >> >> There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). >> >> The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching c... > > Thomas Schatzl has updated the pull request incrementally with two additional commits since the last revision: > > - * indentation fix > - * remove support for 32 bit x86 in the barrier generation code, following latest changes from @shade Thank you for addressing my comments, Thomas! The new x64 version of `G1BarrierSetAssembler::gen_write_ref_array_post_barrier` looks correct, but I think it could be significantly simplified, here is my suggestion which is more similar to the aarch64 version: https://github.com/robcasloz/jdk/commit/fbedc0ae1ec5fcfa95b00ad354986885c7a56ce0 (note: did not test it thoroughly). 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23739#issuecomment-2796850628 From rcastanedalo at openjdk.org Fri Apr 11 13:10:33 2025 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Fri, 11 Apr 2025 13:10:33 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v33] In-Reply-To: <5FzYDFpFOksmAGM5RV0gGk2eDAdinlDCGo8_37eUeEA=.5f96c37e-7b10-41b4-a607-fc7a665abd67@github.com> References: <5FzYDFpFOksmAGM5RV0gGk2eDAdinlDCGo8_37eUeEA=.5f96c37e-7b10-41b4-a607-fc7a665abd67@github.com> Message-ID: On Thu, 10 Apr 2025 10:02:40 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. >> >> The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. >> >> ### Current situation >> >> With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. >> >> The main reason for the current barrier is how g1 implements concurrent refinement: >> * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. >> * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, >> * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. >> >> These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: >> >> >> // Filtering >> if (region(@x.a) == region(y)) goto done; // same region check >> if (y == null) goto done; // null value check >> if (card(@x.a) == young_card) goto done; // write to young gen check >> StoreLoad; // synchronize >> if (card(@x.a) == dirty_card) goto done; >> >> *card(@x.a) = dirty >> >> // Card tracking >> enqueue(card-address(@x.a)) into thread-local-dcq; >> if (thread-local-dcq is not full) goto done; >> >> call runtime to move thread-local-dcq into dcqs >> >> done: >> >> >> Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. >> >> The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. >> >> There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). >> >> The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching c... > > Thomas Schatzl has updated the pull request incrementally with two additional commits since the last revision: > > - * indentation fix > - * remove support for 32 bit x86 in the barrier generation code, following latest changes from @shade > G1 sets UseCondCardMark to true by default. 
The conditional card mark corresponds to the third filter in the write barrier now, and since I decided to keep all filters for this change, it makes sense to directly use this mechanism. Do you have performance results for `-UseCondCardMark` vs. `+UseCondCardMark`? The benefit of `+UseCondCardMark` is not obvious from looking at the generated barrier code. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23739#issuecomment-2796872496 From rcastanedalo at openjdk.org Fri Apr 11 14:30:32 2025 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Fri, 11 Apr 2025 14:30:32 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v33] In-Reply-To: <5FzYDFpFOksmAGM5RV0gGk2eDAdinlDCGo8_37eUeEA=.5f96c37e-7b10-41b4-a607-fc7a665abd67@github.com> References: <5FzYDFpFOksmAGM5RV0gGk2eDAdinlDCGo8_37eUeEA=.5f96c37e-7b10-41b4-a607-fc7a665abd67@github.com> Message-ID: On Thu, 10 Apr 2025 10:02:40 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. >> >> The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. >> >> ### Current situation >> >> With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. >> >> The main reason for the current barrier is how g1 implements concurrent refinement: >> * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. >> * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, >> * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. >> >> These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: >> >> >> // Filtering >> if (region(@x.a) == region(y)) goto done; // same region check >> if (y == null) goto done; // null value check >> if (card(@x.a) == young_card) goto done; // write to young gen check >> StoreLoad; // synchronize >> if (card(@x.a) == dirty_card) goto done; >> >> *card(@x.a) = dirty >> >> // Card tracking >> enqueue(card-address(@x.a)) into thread-local-dcq; >> if (thread-local-dcq is not full) goto done; >> >> call runtime to move thread-local-dcq into dcqs >> >> done: >> >> >> Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. >> >> The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. >> >> There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). >> >> The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching c... 
> > Thomas Schatzl has updated the pull request incrementally with two additional commits since the last revision: > > - * indentation fix > - * remove support for 32 bit x86 in the barrier generation code, following latest changes from @shade The compiler-related parts of this change (including x64 and aarch64 changes) look good! These are the files I reviewed: - `src/hotspot/share/gc/g1/g1BarrierSet*` - `src/hotspot/share/gc/g1/{c1,c2}` - `src/hotspot/cpu/{x86,aarch64}` - `test/hotspot/jtreg/compiler` - `test/hotspot/jtreg/testlibrary_tests` ------------- Marked as reviewed by rcastanedalo (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23739#pullrequestreview-2760546283 From wkemper at openjdk.org Fri Apr 11 20:46:01 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 11 Apr 2025 20:46:01 GMT Subject: RFR: 8354452: Shenandoah: Enforce range checks on parameters controlling heuristic sleep times Message-ID: Without enforcing limits on `ShenandoahControlIntervalMin` and `ShenandoahControlIntervalMax`, the user may supply values that cause assertions to fail. ------------- Commit messages: - Enforce limits on control thread's minimum and maximum sleep times Changes: https://git.openjdk.org/jdk/pull/24602/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24602&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8354452 Stats: 2 lines in 1 file changed: 2 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/24602.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24602/head:pull/24602 PR: https://git.openjdk.org/jdk/pull/24602 From ysr at openjdk.org Fri Apr 11 20:59:30 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Fri, 11 Apr 2025 20:59:30 GMT Subject: RFR: 8354452: Shenandoah: Enforce range checks on parameters controlling heuristic sleep times In-Reply-To: References: Message-ID: On Fri, 11 Apr 2025 20:41:00 GMT, William Kemper wrote: > Without enforcing limits on `ShenandoahControlIntervalMin` and `ShenandoahControlIntervalMax`, the user may supply values that cause assertions to fail. > > This assertion failure has been observed in Genshen's regulator thread: > > #0 0x000028e8062d021a in ShenandoahRegulatorThread::regulator_sleep (this=0x4ef9701893b0) at src/hotspot/share/gc/shenandoah/shenandoahRegulatorThread.cpp:125 > #1 0x000028e8062d0027 in ShenandoahRegulatorThread::regulate_young_and_old_cycles (this=0x4ef9701893b0) at src/hotspot/share/gc/shenandoah/shenandoahRegulatorThread.cpp:95 > #2 0x000028e8062cfd06 in ShenandoahRegulatorThread::run_service (this=0x4ef9701893b0) at src/hotspot/share/gc/shenandoah/shenandoahRegulatorThread.cpp:51 > > But it could just as easily happen in other modes to the `ShenandoahControlThread` instance. Left a comment for consideration but changes look fine if this changes doesn't interfere with potential tuning space etc. src/hotspot/share/gc/shenandoah/shenandoah_globals.hpp line 1: > 1: /* Change looks fine, but I wonder about using a `naked_sleep()` and allowing longer durations without triggering asserts in those cases? Not sure where this could be used and whether 1-second is the maximum we might like for these numbers regardless. ------------- Marked as reviewed by ysr (Reviewer). 
PR Review: https://git.openjdk.org/jdk/pull/24602#pullrequestreview-2761556102 PR Review Comment: https://git.openjdk.org/jdk/pull/24602#discussion_r2040287010 From wkemper at openjdk.org Fri Apr 11 21:06:30 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 11 Apr 2025 21:06:30 GMT Subject: RFR: 8354452: Shenandoah: Enforce range checks on parameters controlling heuristic sleep times In-Reply-To: References: Message-ID: <3BSSSNcGzbGojKBsi0fMQ9y4CXR3xnGWMlsMVixnbSo=.fcaca705-04f0-45c7-b0c7-ed1355265edb@github.com> On Fri, 11 Apr 2025 20:55:31 GMT, Y. Srinivas Ramakrishna wrote: >> Without enforcing limits on `ShenandoahControlIntervalMin` and `ShenandoahControlIntervalMax`, the user may supply values that cause assertions to fail. >> >> This assertion failure has been observed in Genshen's regulator thread: >> >> #0 0x000028e8062d021a in ShenandoahRegulatorThread::regulator_sleep (this=0x4ef9701893b0) at src/hotspot/share/gc/shenandoah/shenandoahRegulatorThread.cpp:125 >> #1 0x000028e8062d0027 in ShenandoahRegulatorThread::regulate_young_and_old_cycles (this=0x4ef9701893b0) at src/hotspot/share/gc/shenandoah/shenandoahRegulatorThread.cpp:95 >> #2 0x000028e8062cfd06 in ShenandoahRegulatorThread::run_service (this=0x4ef9701893b0) at src/hotspot/share/gc/shenandoah/shenandoahRegulatorThread.cpp:51 >> >> But it could just as easily happen in other modes to the `ShenandoahControlThread` instance. > > src/hotspot/share/gc/shenandoah/shenandoah_globals.hpp line 1: > >> 1: /* > > Change looks fine, but I wonder about using a `naked_sleep()` and allowing longer durations without triggering asserts in those cases? Not sure where this could be used and whether 1-second is the maximum we might like for these numbers regardless. 1 second is enforced by `naked_sleep` itself, so raising it would impact all callers. Not using `naked_sleep` would be possible here, but the default maximum sleep time is 10ms. Even 1 second (well, 999ms) would make the heuristics dangerously slow to respond. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24602#discussion_r2040294574 From ysr at openjdk.org Fri Apr 11 21:12:25 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Fri, 11 Apr 2025 21:12:25 GMT Subject: RFR: 8354452: Shenandoah: Enforce range checks on parameters controlling heuristic sleep times In-Reply-To: <3BSSSNcGzbGojKBsi0fMQ9y4CXR3xnGWMlsMVixnbSo=.fcaca705-04f0-45c7-b0c7-ed1355265edb@github.com> References: <3BSSSNcGzbGojKBsi0fMQ9y4CXR3xnGWMlsMVixnbSo=.fcaca705-04f0-45c7-b0c7-ed1355265edb@github.com> Message-ID: On Fri, 11 Apr 2025 21:03:59 GMT, William Kemper wrote: >> src/hotspot/share/gc/shenandoah/shenandoah_globals.hpp line 1: >> >>> 1: /* >> >> Change looks fine, but I wonder about using a `naked_sleep()` and allowing longer durations without triggering asserts in those cases? Not sure where this could be used and whether 1-second is the maximum we might like for these numbers regardless. > > 1 second is enforced by `naked_sleep` itself, so raising it would impact all callers. Not using `naked_sleep` would be possible here, but the default maximum sleep time is 10ms. Even 1 second (well, 999ms) would make the heuristics dangerously slow to respond. Hmm, curious, I see this: // Convenience wrapper around naked_short_sleep to allow for longer sleep // times. Only for use by non-JavaThreads. 
void os::naked_sleep(jlong millis) { assert(!Thread::current()->is_Java_thread(), "not for use by JavaThreads"); const jlong limit = 999; while (millis > limit) { naked_short_sleep(limit); millis -= limit; } naked_short_sleep(millis); } ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24602#discussion_r2040297668 From ysr at openjdk.org Fri Apr 11 21:12:25 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Fri, 11 Apr 2025 21:12:25 GMT Subject: RFR: 8354452: Shenandoah: Enforce range checks on parameters controlling heuristic sleep times In-Reply-To: References: <3BSSSNcGzbGojKBsi0fMQ9y4CXR3xnGWMlsMVixnbSo=.fcaca705-04f0-45c7-b0c7-ed1355265edb@github.com> Message-ID: On Fri, 11 Apr 2025 21:08:04 GMT, Y. Srinivas Ramakrishna wrote: >> 1 second is enforced by `naked_sleep` itself, so raising it would impact all callers. Not using `naked_sleep` would be possible here, but the default maximum sleep time is 10ms. Even 1 second (well, 999ms) would make the heuristics dangerously slow to respond. > > Hmm, curious, I see this: > > // Convenience wrapper around naked_short_sleep to allow for longer sleep > // times. Only for use by non-JavaThreads. > void os::naked_sleep(jlong millis) { > assert(!Thread::current()->is_Java_thread(), "not for use by JavaThreads"); > const jlong limit = 999; > while (millis > limit) { > naked_short_sleep(limit); > millis -= limit; > } > naked_short_sleep(millis); > } Still if ppl aren't gonna need longer than 1 sec, and longer is a bad idea, then limiting it is a good idea. Reviewed. ? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24602#discussion_r2040298962 From wkemper at openjdk.org Fri Apr 11 21:12:26 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 11 Apr 2025 21:12:26 GMT Subject: RFR: 8354452: Shenandoah: Enforce range checks on parameters controlling heuristic sleep times In-Reply-To: References: <3BSSSNcGzbGojKBsi0fMQ9y4CXR3xnGWMlsMVixnbSo=.fcaca705-04f0-45c7-b0c7-ed1355265edb@github.com> Message-ID: On Fri, 11 Apr 2025 21:09:36 GMT, Y. Srinivas Ramakrishna wrote: >> Hmm, curious, I see this: >> >> // Convenience wrapper around naked_short_sleep to allow for longer sleep >> // times. Only for use by non-JavaThreads. >> void os::naked_sleep(jlong millis) { >> assert(!Thread::current()->is_Java_thread(), "not for use by JavaThreads"); >> const jlong limit = 999; >> while (millis > limit) { >> naked_short_sleep(limit); >> millis -= limit; >> } >> naked_short_sleep(millis); >> } > > Still if ppl aren't gonna need longer than 1 sec, and longer is a bad idea, then limiting it is a good idea. > Reviewed. ? Aye - we _could_ use that, but I don't think we _should_. Having the heuristics sleep longer than this between evaluations wouldn't do anyone any good. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24602#discussion_r2040299379 From kdnilsen at openjdk.org Fri Apr 11 21:28:12 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 11 Apr 2025 21:28:12 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data [v7] In-Reply-To: References: Message-ID: > The existing implementation of get_live_data_bytes() and git_live_data_words() does not always behave as might be expected. In particular, the value returned ignores any allocations that occur subsequent to the most recent mark effort that identified live data within the region. 
This is typically ok for young regions that are going to be added or not to the collection set during final-mark safepoint. > > However, old-gen regions that are placed into the set of candidates for mixed evacuation are more complicated. In particular, by the time the old-gen region is added to a mixed evacuation, its live data may be much larger than at the time concurrent old marking ended. > > This PR provides comments to clarify the shortcomings of the existing functions, and adds new functions that provide more accurate accountings of live data for mixed-evacuation candidate regions. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Fix uninitialized variable ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24319/files - new: https://git.openjdk.org/jdk/pull/24319/files/ef783d48..e6e44b67 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24319&range=06 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24319&range=05-06 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/24319.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24319/head:pull/24319 PR: https://git.openjdk.org/jdk/pull/24319 From wkemper at openjdk.org Fri Apr 11 21:28:31 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 11 Apr 2025 21:28:31 GMT Subject: RFR: 8354452: Shenandoah: Enforce range checks on parameters controlling heuristic sleep times In-Reply-To: References: Message-ID: On Fri, 11 Apr 2025 20:56:26 GMT, Y. Srinivas Ramakrishna wrote: >> Without enforcing limits on `ShenandoahControlIntervalMin` and `ShenandoahControlIntervalMax`, the user may supply values that cause assertions to fail. >> >> This assertion failure has been observed in Genshen's regulator thread: >> >> #0 0x000028e8062d021a in ShenandoahRegulatorThread::regulator_sleep (this=0x4ef9701893b0) at src/hotspot/share/gc/shenandoah/shenandoahRegulatorThread.cpp:125 >> #1 0x000028e8062d0027 in ShenandoahRegulatorThread::regulate_young_and_old_cycles (this=0x4ef9701893b0) at src/hotspot/share/gc/shenandoah/shenandoahRegulatorThread.cpp:95 >> #2 0x000028e8062cfd06 in ShenandoahRegulatorThread::run_service (this=0x4ef9701893b0) at src/hotspot/share/gc/shenandoah/shenandoahRegulatorThread.cpp:51 >> >> But it could just as easily happen in other modes to the `ShenandoahControlThread` instance. > > Left a comment for consideration but changes look fine if this changes doesn't interfere with potential tuning space etc. Appreciate the careful review @ysramakrishna ! ------------- PR Comment: https://git.openjdk.org/jdk/pull/24602#issuecomment-2798035149 From wkemper at openjdk.org Fri Apr 11 21:28:32 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 11 Apr 2025 21:28:32 GMT Subject: Integrated: 8354452: Shenandoah: Enforce range checks on parameters controlling heuristic sleep times In-Reply-To: References: Message-ID: On Fri, 11 Apr 2025 20:41:00 GMT, William Kemper wrote: > Without enforcing limits on `ShenandoahControlIntervalMin` and `ShenandoahControlIntervalMax`, the user may supply values that cause assertions to fail. 
> > This assertion failure has been observed in Genshen's regulator thread: > > #0 0x000028e8062d021a in ShenandoahRegulatorThread::regulator_sleep (this=0x4ef9701893b0) at src/hotspot/share/gc/shenandoah/shenandoahRegulatorThread.cpp:125 > #1 0x000028e8062d0027 in ShenandoahRegulatorThread::regulate_young_and_old_cycles (this=0x4ef9701893b0) at src/hotspot/share/gc/shenandoah/shenandoahRegulatorThread.cpp:95 > #2 0x000028e8062cfd06 in ShenandoahRegulatorThread::run_service (this=0x4ef9701893b0) at src/hotspot/share/gc/shenandoah/shenandoahRegulatorThread.cpp:51 > > But it could just as easily happen in other modes to the `ShenandoahControlThread` instance. This pull request has now been integrated. Changeset: e8bcedb0 Author: William Kemper URL: https://git.openjdk.org/jdk/commit/e8bcedb09b0e5eeb77bf1dc3a87bb61d7a5e8404 Stats: 2 lines in 1 file changed: 2 ins; 0 del; 0 mod 8354452: Shenandoah: Enforce range checks on parameters controlling heuristic sleep times Reviewed-by: ysr ------------- PR: https://git.openjdk.org/jdk/pull/24602 From kdnilsen at openjdk.org Fri Apr 11 21:30:28 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 11 Apr 2025 21:30:28 GMT Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data [v6] In-Reply-To: References: Message-ID: On Thu, 10 Apr 2025 22:33:28 GMT, Y. Srinivas Ramakrishna wrote: >> Kelvin Nilsen has updated the pull request incrementally with two additional commits since the last revision: >> >> - Remove deprecation conditional compiles >> - Adjust candidate live memory for each mixed evac > > Haven't started looking at these changes, but I do wonder if it might be worthwhile to also consider (and implement under a tunable flag) the alternative policy of never adding to the collection set any regions that are still "active" at the point when the collection set for a marking cycle is first assembled at the end of the final marking. That way we don't have to do any re-computing, and the criterion for evacuation is garbage-first (or liveness-least) both of which remain invariant (and complements of each other) throughout the duration of evacuation and obviating entirely the need for recomputing the goodness/choice metric afresh. > > The downside is that we may leave some garbage on the table in the active regions, but this is probably a minor price for most workloads and heap configurations, and doesn't unnecessarily complicate or overengineer the solution. > > One question to consider is how G1 does this. May be regions placed in the collection set are retired (i.e. made inactive?) -- I prefer not to forcibly retire active regions as this wastes space that may have been usable. > > Thoughts? (Can add this comment and discuss on the ticket if that is logistically preferable.) @ysramakrishna : Interesting idea. Definitely worthy of an experiment. On the upside, this can make GC more "efficient" by procrastinating until the GC effort maximizes the returns of allocatable memory. On the downside, this can allow garbage to hide out for arbitrarily long times in regions that are not "fully used". I'd be in favor of proposing these experiments and possible feature enhancements in the context of a separate JBS ticket. 
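A rough sketch of the alternative selection policy discussed just above, purely for illustration; the type and member names are made up and are not Shenandoah's actual region API. The point is that regions still accepting allocations at the end of final mark are never considered, so the liveness recorded for every selected candidate cannot go stale:

```cpp
#include <cstddef>

// Hypothetical stand-in for an old-generation region; not real GenShen code.
struct OldRegion {
  bool   active;         // still accepting allocations after final mark
  size_t garbage_bytes;  // garbage identified by the most recent old mark
};

// Select mixed-evacuation candidates only from regions that are fully retired,
// deferring any garbage in still-active regions to a later cycle.
static size_t select_candidates(OldRegion* regions, size_t count, OldRegion** out) {
  size_t selected = 0;
  for (size_t i = 0; i < count; i++) {
    OldRegion* r = &regions[i];
    if (r->active) {
      continue;  // its live data could still grow; skip it for now
    }
    if (r->garbage_bytes > 0) {
      out[selected++] = r;  // garbage-first ordering of candidates happens elsewhere
    }
  }
  return selected;
}
```

The trade-off both reviewers note falls straight out of the `continue`: garbage sitting in a region that never retires is never reclaimed by a mixed evacuation.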
------------- PR Comment: https://git.openjdk.org/jdk/pull/24319#issuecomment-2798040688 From manc at google.com Sat Apr 12 08:07:27 2025 From: manc at google.com (Man Cao) Date: Sat, 12 Apr 2025 01:07:27 -0700 Subject: Moving Forward with AHS for G1 In-Reply-To: References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> <5210B365-EB7D-498F-BF21-02B9629B1338@kodewerk.com> <4E901C51-BBD6-431A-9282-5432A8AD8B9B@oracle.com> Message-ID: > Okay it seems to me that the use case you are describing is wanting a container with an enforced memory limit. It should quack like a cgroup and walk like a cgroup but must not actually use cgroups for some reason. > Cgroups were seemingly built for this use case and has a complete view of the memory usage in the container due to being an OS feature. > Conversely, if the custom ad-hoc container environment does not have OS support for the memory limit, then the app can temporarily exceed the memory limit, and hence won?t be as effective of a limit. > But if you want to actually enforce a memory limit such that the app dies if it exceeds the limit I can?t help but wonder? why not use a cgroup to declare that limit though? The custom container has additional features that cgroup does not have. Enforcing memory limit is only a basic feature. Other features of the custom container are largely irrelevant to the AHS discussion (and I'm not sure if I could publicly share those features). In fact, the custom container is more of an extension or wrapper on top of cgroup. It is quite likely we have internal patches to the OS kernel to support the custom container. > Regardless, I wonder if what you actually want for your use case is a way to tell AHS what the max memory of the entire JVM should be, similar to the -XX:RssLimit Thomas Stuefe proposed: https://bugs.openjdk.org/browse/JDK-8321266 > In other words, letting the JVM know that it has a bound on memory, and have AHS know about and try to adapt the heap such that the JVM memory usage is below the limit when native memory goes up and down. In other words, let the heap heuristics live in the JVM. Perhaps then the limit would also be static, or do the containers themselves actually grow and shrink at runtime, or was the dynamic nature of CurrentMaxHeapSize mostly an artifact of out sourcing the heap heuristics of an otherwise static custom container limit? The custom container's memory limit could dynamically change at runtime, thus -XX:RssLimit or -XX:CurrentMaxHeapSize must be a manageable flag. In fact, cgroup also supports changing memory limit dynamically: https://unix.stackexchange.com/questions/555080/using-cgroup-to-limit-program-memory-as-its-running . Having a manageable -XX:RssLimit, and making the JVM adjust heap size according to RssLimit, could in theory replace CurrentMaxHeapSize. However, I could think of the following issues with the RssLimit approach: 1. Description of https://bugs.openjdk.org/browse/JDK-8321266 indicates RssLimit is intended for debugging and regression testing, to abort the JVM when it uses more Rss than expected. It does not involve resizing the heap to survive the RssLimit. Adding heap resizing seems a significant change to the original intended use. 2. Calculating an appropriate heap size based on RssLimit seems challenging. Typically only part of the heap memory mapping contributes to Rss. 
The JVM probably has to continuously monitor the total Rss, as well as Rss from heap memory mappings, then apply a heuristic to compute a target heap size.

3. Applications still need a mechanism to dynamically adjust values for RssLimit, just as for CurrentMaxHeapSize. Providing a value for RssLimit is not really easier than for CurrentMaxHeapSize, e.g. when a Java process and several non-Java processes run inside the same container (this is the common case in our deployment).

It seems that RssLimit is not necessarily easier to use than CurrentMaxHeapSize, but definitely more complicated to implement (due to 1 and 2).

-Man

From erik.osterlund at oracle.com  Sat Apr 12 09:48:01 2025
From: erik.osterlund at oracle.com (Erik Osterlund)
Date: Sat, 12 Apr 2025 09:48:01 +0000
Subject: [External] : Re: Moving Forward with AHS for G1
In-Reply-To:
References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> <5210B365-EB7D-498F-BF21-02B9629B1338@kodewerk.com> <4E901C51-BBD6-431A-9282-5432A8AD8B9B@oracle.com>
Message-ID:

> On 12 Apr 2025, at 10:07, Man Cao wrote:
>
> In fact, the custom container is more of an extension or wrapper on top of cgroup. It is quite likely we have internal patches to the OS kernel to support the custom container.

Okay, that makes sense. So you do use cgroups for your containers. And you do want to limit their memory. So why don't you want to use the cgroup memory limits?

> It seems that RssLimit is not necessarily easier to use than CurrentMaxHeapSize, but definitely more complicated to implement (due to 1 and 2).

Okay.

/Erik

From aboldtch at openjdk.org  Mon Apr 14 12:53:18 2025
From: aboldtch at openjdk.org (Axel Boldt-Christmas)
Date: Mon, 14 Apr 2025 12:53:18 GMT
Subject: RFR: 8354358: ZGC: ZPartition::prime handle discontiguous reservations correctly [v3]
In-Reply-To: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com>
References: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com>
Message-ID:

> Prior to [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) the VM would not have started if we received a discontiguous heap reservation with all reservations smaller than the initial heap capacity. Now we crash because `ZPartition::prime` does not take this into account.
>
> However in contrast to the page cache, the mapped cache makes it trivial to support this scenario. So I propose fixing `ZPartition::prime` to handle any discontiguous heap reservation.
>
> Can be provoked in a debug build by using ZForceDiscontiguousHeapReservations > 16
> `java -XX:+UseZGC -XX:ZForceDiscontiguousHeapReservations=17 -Xmx128m -Xms128m --version`
>
> Currently running this through testing.
Axel Boldt-Christmas has updated the pull request incrementally with one additional commit since the last revision: Update outdated TestZNMT.java comment ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24589/files - new: https://git.openjdk.org/jdk/pull/24589/files/70b0e923..a33c7e39 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24589&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24589&range=01-02 Stats: 3 lines in 1 file changed: 0 ins; 2 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/24589.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24589/head:pull/24589 PR: https://git.openjdk.org/jdk/pull/24589 From stefank at openjdk.org Mon Apr 14 13:14:01 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Mon, 14 Apr 2025 13:14:01 GMT Subject: RFR: 8354358: ZGC: ZPartition::prime handle discontiguous reservations correctly [v3] In-Reply-To: References: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com> Message-ID: On Mon, 14 Apr 2025 12:53:18 GMT, Axel Boldt-Christmas wrote: >> Prior to [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) the VM would not have started if we received a discontiguous heap reservation with all reservations smaller than the inital heap capacity. Now we crash because `ZPartition::prime` does not take this into account. >> >> However in contrast to the page cache, the mapped cache makes it trivial to support this scenario. So I propose fixing `ZPartition::prime` to handle any discontiguous heap reservation. >> >> Can be provoked in a debug build by using ZForceDiscontiguousHeapReservations > 16 >> `java -XX:+UseZGC -XX:ZForceDiscontiguousHeapReservations=17 -Xmx128m -Xms128m --version` >> >> Currently running this through testing. > > Axel Boldt-Christmas has updated the pull request incrementally with one additional commit since the last revision: > > Update outdated TestZNMT.java comment Marked as reviewed by stefank (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/24589#pullrequestreview-2764278721 From duke at openjdk.org Mon Apr 14 13:16:24 2025 From: duke at openjdk.org (Tongbao Zhang) Date: Mon, 14 Apr 2025 13:16:24 GMT Subject: RFR: 8354145: G1GC: keep the CompressedOops same as before when not setting HeapRegionSize explicitly [v2] In-Reply-To: References: Message-ID: > After [JDK-8275056](https://bugs.openjdk.org/browse/JDK-8275056), The max heap region size became 512M, and the calculation of CompressedOops based on the max_heap_size - max_heap_region_size. > So before this patch, the CompressedOops will turn on below 32G - 32m, After this patch is 32G -512m. > > When our Apps migrating from JDK11 to JDK21, the heap size parameters(Xmx32736m) will turn off the CompressedOops. > > Since the current max ergonomics size is still 32m, We hoped that the original behavior will not be changed if HeapRegionSize is not explicitly set. 
> > before this patch: > > ./build/linux-x86_64-server-release/images/jdk/bin/java -Xmx32736m -XX:+PrintFlagsFinal -version | grep CompressedOops > bool UseCompressedOops = false {product lp64_product} {default} > openjdk version "25-internal" 2025-09-16 > OpenJDK Runtime Environment (build 25-internal-adhoc.root.jdk) > OpenJDK 64-Bit Server VM (build 25-internal-adhoc.root.jdk, mixed mode, sharing) > > > after this patch: > > ./build/linux-x86_64-server-release/images/jdk/bin/java -Xmx32736m -XX:+PrintFlagsFinal -version | grep CompressedOops > bool UseCompressedOops = true {product lp64_product} {ergonomic} > openjdk version "25-internal" 2025-09-16 > OpenJDK Runtime Environment (build 25-internal-adhoc.root.jdk) > OpenJDK 64-Bit Server VM (build 25-internal-adhoc.root.jdk, mixed mode, sharing) Tongbao Zhang has refreshed the contents of this pull request, and previous commits have been removed. The incremental views will show differences compared to the previous content of the PR. The pull request contains one new commit since the last revision: G1GC: keep the CompressedOops same as before when not setting HeapRegionSize explicitly ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24541/files - new: https://git.openjdk.org/jdk/pull/24541/files/6b139085..c31c7340 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24541&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24541&range=00-01 Stats: 88 lines in 1 file changed: 88 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/24541.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24541/head:pull/24541 PR: https://git.openjdk.org/jdk/pull/24541 From aboldtch at openjdk.org Mon Apr 14 13:31:59 2025 From: aboldtch at openjdk.org (Axel Boldt-Christmas) Date: Mon, 14 Apr 2025 13:31:59 GMT Subject: RFR: 8354358: ZGC: ZPartition::prime handle discontiguous reservations correctly [v3] In-Reply-To: References: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com> Message-ID: <5cRwpuZ7f9ZUWRVpNawcpP9AEOpiT-Uy-RJGdRu8KlY=.670800f5-f2e6-45fd-ac78-66e89f9c5719@github.com> On Mon, 14 Apr 2025 12:53:18 GMT, Axel Boldt-Christmas wrote: >> Prior to [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) the VM would not have started if we received a discontiguous heap reservation with all reservations smaller than the inital heap capacity. Now we crash because `ZPartition::prime` does not take this into account. >> >> However in contrast to the page cache, the mapped cache makes it trivial to support this scenario. So I propose fixing `ZPartition::prime` to handle any discontiguous heap reservation. >> >> Can be provoked in a debug build by using ZForceDiscontiguousHeapReservations > 16 >> `java -XX:+UseZGC -XX:ZForceDiscontiguousHeapReservations=17 -Xmx128m -Xms128m --version` >> >> Currently running this through testing. > > Axel Boldt-Christmas has updated the pull request incrementally with one additional commit since the last revision: > > Update outdated TestZNMT.java comment Thanks for the reviews. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/24589#issuecomment-2801714127

From aboldtch at openjdk.org  Mon Apr 14 13:31:59 2025
From: aboldtch at openjdk.org (Axel Boldt-Christmas)
Date: Mon, 14 Apr 2025 13:31:59 GMT
Subject: Integrated: 8354358: ZGC: ZPartition::prime handle discontiguous reservations correctly
In-Reply-To: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com>
References: <6zPy4G14yw81LVO7jiCYpXTur3-JuwPYv4eH8PYzcuI=.970690bf-2542-4ca1-8578-9b1637f56611@github.com>
Message-ID:

On Fri, 11 Apr 2025 06:14:42 GMT, Axel Boldt-Christmas wrote:

> Prior to [JDK-8350441](https://bugs.openjdk.org/browse/JDK-8350441) the VM would not have started if we received a discontiguous heap reservation with all reservations smaller than the initial heap capacity. Now we crash because `ZPartition::prime` does not take this into account.
>
> However in contrast to the page cache, the mapped cache makes it trivial to support this scenario. So I propose fixing `ZPartition::prime` to handle any discontiguous heap reservation.
>
> Can be provoked in a debug build by using ZForceDiscontiguousHeapReservations > 16
> `java -XX:+UseZGC -XX:ZForceDiscontiguousHeapReservations=17 -Xmx128m -Xms128m --version`
>
> Currently running this through testing.

This pull request has now been integrated.

Changeset: 97e10757
Author: Axel Boldt-Christmas
URL: https://git.openjdk.org/jdk/commit/97e10757392859a46360b4ab379429212fbc34b3
Stats: 34 lines in 3 files changed: 7 ins; 4 del; 23 mod

8354358: ZGC: ZPartition::prime handle discontiguous reservations correctly

Reviewed-by: stefank, jsikstro, eosterlund

------------- PR: https://git.openjdk.org/jdk/pull/24589

From kdnilsen at openjdk.org  Mon Apr 14 16:40:47 2025
From: kdnilsen at openjdk.org (Kelvin Nilsen)
Date: Mon, 14 Apr 2025 16:40:47 GMT
Subject: RFR: 8353115: GenShen: mixed evacuation candidate regions need accurate live_data [v7]
In-Reply-To:
References:
Message-ID:

On Wed, 9 Apr 2025 18:21:43 GMT, Kelvin Nilsen wrote:

>> Not sure about performance impact, other than implementing and testing...
>
> I suspect performance impact is minimal.

I've committed changes that endeavor to implement the suggested refactor. Performance impact does appear to be minimal.

This broader refactoring does change behavior slightly. In particular:

1. We now have a better understanding of live-memory evacuated during mixed evacuations. This allows the selection of old-candidates for mixed evacuations to be more conservative. We'll have fewer old regions in order to honor the intended budget.
2. Potentially, this will result in more mixed evacuations, but each mixed evacuation should take less time.
3. There should be no impact on behavior of traditional Shenandoah.

On one recently completed test run, we observed the following impacts compared to tip:

Shenandoah
-------------------------------------------------------------------------------------------------------
+80.69%  specjbb2015/trigger_failure   p=0.00000
         Control:  58.250 (+/- 13.48)  110
         Test:    105.250 (+/- 33.13)   30

Genshen
-------------------------------------------------------------------------------------------------------
-19.46%  jme/context_switch_count      p=0.00176
         Control: 117.420 (+/- 28.01)  108
         Test:     98.292 (+/- 32.76)   30

Perhaps we need more data to decide whether this is "significant".
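As a side note on the liveness accounting being discussed in this thread: the candidate's estimate has to combine the marking snapshot with post-mark allocations. A minimal sketch, with hypothetical names rather than the actual GenShen accessors:

```cpp
#include <cstddef>

// Hypothetical snapshot of an old region's occupancy; not the real Shenandoah types.
struct OldRegionLiveness {
  size_t marked_live_bytes;      // live bytes recorded by the last completed old mark
  size_t bytes_allocated_since;  // allocated after that mark, e.g. top minus top-at-mark-start
};

// Objects allocated after marking are live by definition, so a mixed-evacuation
// budget based only on marked_live_bytes would underestimate the copying cost.
static size_t estimated_live_bytes(const OldRegionLiveness& r) {
  return r.marked_live_bytes + r.bytes_allocated_since;
}
```

This is also why the selection described in point 1 above becomes more conservative: the larger, more accurate estimate consumes the intended evacuation budget sooner.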
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24319#discussion_r2042510606 From lmesnik at openjdk.org Tue Apr 15 01:35:04 2025 From: lmesnik at openjdk.org (Leonid Mesnik) Date: Tue, 15 Apr 2025 01:35:04 GMT Subject: RFR: 8354559: gc/g1/TestAllocationFailure.java doesn't need WB API Message-ID: <27kfQFBIUrqLa3513GREjhVQp_iNK0pvYu6Wm1yTF7k=.b8c540a4-f3d1-4a17-9dbf-908be0ae6f7c@github.com> Just minor clean up of WB API usage. Also changed othervm to driver. ------------- Commit messages: - use driver - 8354559: gc/g1/TestAllocationFailure.java doesn't need WB API Changes: https://git.openjdk.org/jdk/pull/24642/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24642&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8354559 Stats: 5 lines in 1 file changed: 0 ins; 3 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/24642.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24642/head:pull/24642 PR: https://git.openjdk.org/jdk/pull/24642 From lmesnik at openjdk.org Tue Apr 15 01:57:57 2025 From: lmesnik at openjdk.org (Leonid Mesnik) Date: Tue, 15 Apr 2025 01:57:57 GMT Subject: RFR: 8354145: G1GC: keep the CompressedOops same as before when not setting HeapRegionSize explicitly [v2] In-Reply-To: References: Message-ID: On Mon, 14 Apr 2025 13:16:24 GMT, Tongbao Zhang wrote: >> After [JDK-8275056](https://bugs.openjdk.org/browse/JDK-8275056), The max heap region size became 512M, and the calculation of CompressedOops based on the max_heap_size - max_heap_region_size. >> So before this patch, the CompressedOops will turn on below 32G - 32m, After this patch is 32G -512m. >> >> When our Apps migrating from JDK11 to JDK21, the heap size parameters(Xmx32736m) will turn off the CompressedOops. >> >> Since the current max ergonomics size is still 32m, We hoped that the original behavior will not be changed if HeapRegionSize is not explicitly set. >> >> before this patch: >> >> ./build/linux-x86_64-server-release/images/jdk/bin/java -Xmx32736m -XX:+PrintFlagsFinal -version | grep CompressedOops >> bool UseCompressedOops = false {product lp64_product} {default} >> openjdk version "25-internal" 2025-09-16 >> OpenJDK Runtime Environment (build 25-internal-adhoc.root.jdk) >> OpenJDK 64-Bit Server VM (build 25-internal-adhoc.root.jdk, mixed mode, sharing) >> >> >> after this patch: >> >> ./build/linux-x86_64-server-release/images/jdk/bin/java -Xmx32736m -XX:+PrintFlagsFinal -version | grep CompressedOops >> bool UseCompressedOops = true {product lp64_product} {ergonomic} >> openjdk version "25-internal" 2025-09-16 >> OpenJDK Runtime Environment (build 25-internal-adhoc.root.jdk) >> OpenJDK 64-Bit Server VM (build 25-internal-adhoc.root.jdk, mixed mode, sharing) > > Tongbao Zhang has refreshed the contents of this pull request, and previous commits have been removed. The incremental views will show differences compared to the previous content of the PR. The pull request contains one new commit since the last revision: > > G1GC: keep the CompressedOops same as before when not setting HeapRegionSize explicitly Marked as reviewed by lmesnik (Reviewer). Sorry, I wanted to ask you to change test, not approve it yet. test/hotspot/jtreg/gc/arguments/TestG1CompressedOops.java line 30: > 28: * @test TestG1CompressedOops > 29: * @bug 8354145 > 30: * @requires vm.gc.G1 & vm.opt.G1HeapRegionSize == null The test ignores external VM flags, so vm.opt.G1HeapRegionSize is not needed. 
But it is needed to add `* @requires vm.flagless` test/hotspot/jtreg/gc/arguments/TestG1CompressedOops.java line 32: > 30: * @requires vm.gc.G1 & vm.opt.G1HeapRegionSize == null > 31: * @summary Verify that the flag TestG1CompressedOops is updated properly > 32: * @modules java.base/jdk.internal.misc Is any of those 2 modules is used by tests? I don't see it in the test. test/hotspot/jtreg/gc/arguments/TestG1CompressedOops.java line 35: > 33: * @modules java.management/sun.management > 34: * @library /test/lib > 35: * @library / Why this line is needed? I don't see any dependencies on "/" If you use some test code outside directory, better to build them. ------------- PR Review: https://git.openjdk.org/jdk/pull/24541#pullrequestreview-2766273464 Changes requested by lmesnik (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24541#pullrequestreview-2766313637 PR Review Comment: https://git.openjdk.org/jdk/pull/24541#discussion_r2043328713 PR Review Comment: https://git.openjdk.org/jdk/pull/24541#discussion_r2043315584 PR Review Comment: https://git.openjdk.org/jdk/pull/24541#discussion_r2043311695 From duke at openjdk.org Tue Apr 15 02:47:24 2025 From: duke at openjdk.org (Tongbao Zhang) Date: Tue, 15 Apr 2025 02:47:24 GMT Subject: RFR: 8354145: G1GC: keep the CompressedOops same as before when not setting HeapRegionSize explicitly [v3] In-Reply-To: References: Message-ID: > After [JDK-8275056](https://bugs.openjdk.org/browse/JDK-8275056), The max heap region size became 512M, and the calculation of CompressedOops based on the max_heap_size - max_heap_region_size. > So before this patch, the CompressedOops will turn on below 32G - 32m, After this patch is 32G -512m. > > When our Apps migrating from JDK11 to JDK21, the heap size parameters(Xmx32736m) will turn off the CompressedOops. > > Since the current max ergonomics size is still 32m, We hoped that the original behavior will not be changed if HeapRegionSize is not explicitly set. 
> > before this patch: > > ./build/linux-x86_64-server-release/images/jdk/bin/java -Xmx32736m -XX:+PrintFlagsFinal -version | grep CompressedOops > bool UseCompressedOops = false {product lp64_product} {default} > openjdk version "25-internal" 2025-09-16 > OpenJDK Runtime Environment (build 25-internal-adhoc.root.jdk) > OpenJDK 64-Bit Server VM (build 25-internal-adhoc.root.jdk, mixed mode, sharing) > > > after this patch: > > ./build/linux-x86_64-server-release/images/jdk/bin/java -Xmx32736m -XX:+PrintFlagsFinal -version | grep CompressedOops > bool UseCompressedOops = true {product lp64_product} {ergonomic} > openjdk version "25-internal" 2025-09-16 > OpenJDK Runtime Environment (build 25-internal-adhoc.root.jdk) > OpenJDK 64-Bit Server VM (build 25-internal-adhoc.root.jdk, mixed mode, sharing) Tongbao Zhang has updated the pull request incrementally with one additional commit since the last revision: remove useless jtreg tags ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24541/files - new: https://git.openjdk.org/jdk/pull/24541/files/c31c7340..f08e4177 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24541&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24541&range=01-02 Stats: 3 lines in 1 file changed: 0 ins; 2 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/24541.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24541/head:pull/24541 PR: https://git.openjdk.org/jdk/pull/24541 From duke at openjdk.org Tue Apr 15 02:52:42 2025 From: duke at openjdk.org (Tongbao Zhang) Date: Tue, 15 Apr 2025 02:52:42 GMT Subject: RFR: 8354145: G1GC: keep the CompressedOops same as before when not setting HeapRegionSize explicitly [v2] In-Reply-To: References: Message-ID: On Tue, 15 Apr 2025 01:52:22 GMT, Leonid Mesnik wrote: >> Tongbao Zhang has refreshed the contents of this pull request, and previous commits have been removed. The incremental views will show differences compared to the previous content of the PR. The pull request contains one new commit since the last revision: >> >> G1GC: keep the CompressedOops same as before when not setting HeapRegionSize explicitly > > test/hotspot/jtreg/gc/arguments/TestG1CompressedOops.java line 30: > >> 28: * @test TestG1CompressedOops >> 29: * @bug 8354145 >> 30: * @requires vm.gc.G1 & vm.opt.G1HeapRegionSize == null > > The test ignores external VM flags, so vm.opt.G1HeapRegionSize is not needed. > But it is needed to add > `* @requires vm.flagless` done > test/hotspot/jtreg/gc/arguments/TestG1CompressedOops.java line 32: > >> 30: * @requires vm.gc.G1 & vm.opt.G1HeapRegionSize == null >> 31: * @summary Verify that the flag TestG1CompressedOops is updated properly >> 32: * @modules java.base/jdk.internal.misc > > Is any of those 2 modules is used by tests? I don't see it in the test. removed these two modules > Why this line is needed? I don't see any dependencies on "/" If you use some test code outside directory, better to build them. 
Yes, the GCArguments depends on the ```@library /``` , many tests in ``` test/hotspot/jtreg/gc/arguments``` use this ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24541#discussion_r2043407145 PR Review Comment: https://git.openjdk.org/jdk/pull/24541#discussion_r2043406996 PR Review Comment: https://git.openjdk.org/jdk/pull/24541#discussion_r2043406500 From duke at openjdk.org Tue Apr 15 03:01:40 2025 From: duke at openjdk.org (Tongbao Zhang) Date: Tue, 15 Apr 2025 03:01:40 GMT Subject: RFR: 8354145: G1GC: keep the CompressedOops same as before when not setting HeapRegionSize explicitly [v2] In-Reply-To: References: Message-ID: On Tue, 15 Apr 2025 01:54:35 GMT, Leonid Mesnik wrote: > Sorry, I wanted to ask you to change test, not approve it yet. Got it, thanks for the review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24541#issuecomment-2803626376 From duke at openjdk.org Tue Apr 15 03:31:48 2025 From: duke at openjdk.org (Tongbao Zhang) Date: Tue, 15 Apr 2025 03:31:48 GMT Subject: RFR: 8354145: G1GC: keep the CompressedOops same as before when not setting HeapRegionSize explicitly [v4] In-Reply-To: References: Message-ID: > After [JDK-8275056](https://bugs.openjdk.org/browse/JDK-8275056), The max heap region size became 512M, and the calculation of CompressedOops based on the max_heap_size - max_heap_region_size. > So before this patch, the CompressedOops will turn on below 32G - 32m, After this patch is 32G -512m. > > When our Apps migrating from JDK11 to JDK21, the heap size parameters(Xmx32736m) will turn off the CompressedOops. > > Since the current max ergonomics size is still 32m, We hoped that the original behavior will not be changed if HeapRegionSize is not explicitly set. > > before this patch: > > ./build/linux-x86_64-server-release/images/jdk/bin/java -Xmx32736m -XX:+PrintFlagsFinal -version | grep CompressedOops > bool UseCompressedOops = false {product lp64_product} {default} > openjdk version "25-internal" 2025-09-16 > OpenJDK Runtime Environment (build 25-internal-adhoc.root.jdk) > OpenJDK 64-Bit Server VM (build 25-internal-adhoc.root.jdk, mixed mode, sharing) > > > after this patch: > > ./build/linux-x86_64-server-release/images/jdk/bin/java -Xmx32736m -XX:+PrintFlagsFinal -version | grep CompressedOops > bool UseCompressedOops = true {product lp64_product} {ergonomic} > openjdk version "25-internal" 2025-09-16 > OpenJDK Runtime Environment (build 25-internal-adhoc.root.jdk) > OpenJDK 64-Bit Server VM (build 25-internal-adhoc.root.jdk, mixed mode, sharing) Tongbao Zhang has updated the pull request incrementally with one additional commit since the last revision: typo ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24541/files - new: https://git.openjdk.org/jdk/pull/24541/files/f08e4177..17c0a8a0 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24541&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24541&range=02-03 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/24541.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24541/head:pull/24541 PR: https://git.openjdk.org/jdk/pull/24541 From ayang at openjdk.org Tue Apr 15 07:58:56 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 15 Apr 2025 07:58:56 GMT Subject: RFR: 8354559: gc/g1/TestAllocationFailure.java doesn't need WB API In-Reply-To: <27kfQFBIUrqLa3513GREjhVQp_iNK0pvYu6Wm1yTF7k=.b8c540a4-f3d1-4a17-9dbf-908be0ae6f7c@github.com> References: 
<27kfQFBIUrqLa3513GREjhVQp_iNK0pvYu6Wm1yTF7k=.b8c540a4-f3d1-4a17-9dbf-908be0ae6f7c@github.com> Message-ID: On Tue, 15 Apr 2025 01:29:50 GMT, Leonid Mesnik wrote: > Just minor clean up of WB API usage. > Also changed othervm to driver. Marked as reviewed by ayang (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/24642#pullrequestreview-2767242168 From kbarrett at openjdk.org Tue Apr 15 09:09:46 2025 From: kbarrett at openjdk.org (Kim Barrett) Date: Tue, 15 Apr 2025 09:09:46 GMT Subject: RFR: 8354559: gc/g1/TestAllocationFailure.java doesn't need WB API In-Reply-To: <27kfQFBIUrqLa3513GREjhVQp_iNK0pvYu6Wm1yTF7k=.b8c540a4-f3d1-4a17-9dbf-908be0ae6f7c@github.com> References: <27kfQFBIUrqLa3513GREjhVQp_iNK0pvYu6Wm1yTF7k=.b8c540a4-f3d1-4a17-9dbf-908be0ae6f7c@github.com> Message-ID: On Tue, 15 Apr 2025 01:29:50 GMT, Leonid Mesnik wrote: > Just minor clean up of WB API usage. > Also changed othervm to driver. Looks good. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24642#pullrequestreview-2767475519 From jbhateja at openjdk.org Tue Apr 15 13:57:38 2025 From: jbhateja at openjdk.org (Jatin Bhateja) Date: Tue, 15 Apr 2025 13:57:38 GMT Subject: RFR: 8354668: Missing REX2 prefix accounting in ZGC barriers leads to incorrect encoding Message-ID: ZGC bookkeeps multiple place holders in barrier code snippets through relocations, these are later used to patch appropriate contents (mostly immediate values) in instruction encoding. While most of the relocation records the patching offsets from the end of the instruction, SHL instruction, which is used for pointer coloring, computes the patching offset from the starting address of the instruction. Thus, in case the destination register operand of SHL instruction is an extended GPR register, we miss accounting additional REX2 prefix byte in patch offset, thereby corrupting the encoding since runtime patches the primary opcode byte resulting into ILLEGAL instruction exception. This patch fixes reported failures by computing the relocation offset of SHL instruction from end of instruction, thereby making the patch offset agnostic to REX/REX2 prefix. Please review and share your feedback. Best Regards, Jatin PS: Validation were performed using latest Intel Software Development Emulator after modifying static register allocation order in x86_64.ad file giving preference to EGPRs. ------------- Commit messages: - 8354668: Missing REX2 prefix accounting in ZGC barriers leads to incorrect encoding Changes: https://git.openjdk.org/jdk/pull/24664/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24664&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8354668 Stats: 16 lines in 4 files changed: 5 ins; 5 del; 6 mod Patch: https://git.openjdk.org/jdk/pull/24664.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24664/head:pull/24664 PR: https://git.openjdk.org/jdk/pull/24664 From aboldtch at openjdk.org Tue Apr 15 14:52:58 2025 From: aboldtch at openjdk.org (Axel Boldt-Christmas) Date: Tue, 15 Apr 2025 14:52:58 GMT Subject: RFR: 8354668: Missing REX2 prefix accounting in ZGC barriers leads to incorrect encoding In-Reply-To: References: Message-ID: On Tue, 15 Apr 2025 13:50:40 GMT, Jatin Bhateja wrote: > ZGC bookkeeps multiple place holders in barrier code snippets through relocations, these are later used to patch appropriate contents (mostly immediate values) in instruction encoding. 
While most of the relocation records the patching offsets from the end of the instruction, SHL instruction, which is used for pointer coloring, computes the patching offset from the starting address of the instruction. > > Thus, in case the destination register operand of SHL instruction is an extended GPR register, we miss accounting additional REX2 prefix byte in patch offset, thereby corrupting the encoding since runtime patches the primary opcode byte resulting into ILLEGAL instruction exception. > > This patch fixes reported failures by computing the relocation offset of SHL instruction from end of instruction, thereby making the patch offset agnostic to REX/REX2 prefix. > > Please review and share your feedback. > > Best Regards, > Jatin > > PS: Validation were performed using latest Intel Software Development Emulator after modifying static register allocation order in x86_64.ad file giving preference to EGPRs. Looks good but need to communicate with JVMCI implementors. Also pre-exisiting but maybe `ZBarrierRelocationFormatLoadGoodAfterShl` should be called `ZBarrierRelocationFormatLoadGoodAfterShX` as we use it for both shr and shl. src/hotspot/cpu/x86/gc/z/zBarrierSetAssembler_x86.hpp line 52: > 50: #endif // COMPILER2 > 51: > 52: const int ZBarrierRelocationFormatLoadGoodAfterShl = 0; Suggestion: const int ZBarrierRelocationFormatLoadGoodAfterShl = 0; src/hotspot/cpu/x86/jvmciCodeInstaller_x86.cpp line 223: > 221: return true; > 222: #if INCLUDE_ZGC > 223: case Z_BARRIER_RELOCATION_FORMAT_LOAD_GOOD_BEFORE_SHL: Should probably communicate with the JVMCI / Graal @dougxc so we can both update this exported symbol name to reflect the new behaviour, and give them the opportunity to adapt to the new relocation patching. ------------- Changes requested by aboldtch (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24664#pullrequestreview-2768666320 PR Review Comment: https://git.openjdk.org/jdk/pull/24664#discussion_r2044778342 PR Review Comment: https://git.openjdk.org/jdk/pull/24664#discussion_r2044814373 From manc at google.com Tue Apr 15 19:24:58 2025 From: manc at google.com (Man Cao) Date: Tue, 15 Apr 2025 12:24:58 -0700 Subject: Moving Forward with AHS for G1 In-Reply-To: References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> <5210B365-EB7D-498F-BF21-02B9629B1338@kodewerk.com> <4E901C51-BBD6-431A-9282-5432A8AD8B9B@oracle.com> Message-ID: > Okay, that makes sense. So you do use cgroups for your containers. And you do want to limit their memory. So why don?t you want to use the cgroup memory limits? One example is that the custom container has a custom implementation for soft limit, and still uses cgroup memory limits as hard limit. Apps that are "good citizens" should strive to stay below the soft limit. -Man -------------- next part -------------- An HTML attachment was scrubbed... URL: From erik.osterlund at oracle.com Tue Apr 15 20:38:36 2025 From: erik.osterlund at oracle.com (Erik Osterlund) Date: Tue, 15 Apr 2025 20:38:36 +0000 Subject: [External] : Re: Moving Forward with AHS for G1 In-Reply-To: References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> <5210B365-EB7D-498F-BF21-02B9629B1338@kodewerk.com> <4E901C51-BBD6-431A-9282-5432A8AD8B9B@oracle.com> Message-ID: Hi Man, > On 15 Apr 2025, at 21:25, Man Cao wrote: > > ? > > Okay, that makes sense. So you do use cgroups for your containers. And you do want to limit their memory. 
So why don?t you want to use the cgroup memory limits? > > One example is that the custom container has a custom implementation for soft limit, and still uses cgroup memory limits as hard limit. Apps that are "good citizens" should strive to stay below the soft limit. That?s exactly what the purpose of memory.high is. With cgroups v2, memory.high is a soft limit while memory.max is a hard limit. AHS should respect both really. /Erik From manc at google.com Tue Apr 15 21:27:14 2025 From: manc at google.com (Man Cao) Date: Tue, 15 Apr 2025 14:27:14 -0700 Subject: Moving Forward with AHS for G1 In-Reply-To: References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> <5210B365-EB7D-498F-BF21-02B9629B1338@kodewerk.com> <4E901C51-BBD6-431A-9282-5432A8AD8B9B@oracle.com> Message-ID: > > One example is that the custom container has a custom implementation for soft limit, and still uses cgroup memory limits as hard limit. Apps that are "good citizens" should strive to stay below the soft limit. > That?s exactly what the purpose of memory.high is. With cgroups v2, memory.high is a soft limit while memory.max is a hard limit. AHS should respect both really. Supporting both memory.high and memory.max for AHS sounds great. The soft limit for the custom container is only one example. The custom container also has "strange" use cases where the actual limit is larger than cgroup's hard memory limit. Going back to the high level, the point is that it is impractical for organizations such as us to change deployment environments (e.g. migrating from custom container to standard container) in order to use AHS. A flag such as CurrentMaxHeapSize will definitely help these use cases adopt AHS. -Man -------------- next part -------------- An HTML attachment was scrubbed... URL: From dlong at openjdk.org Wed Apr 16 02:01:49 2025 From: dlong at openjdk.org (Dean Long) Date: Wed, 16 Apr 2025 02:01:49 GMT Subject: RFR: 8354668: Missing REX2 prefix accounting in ZGC barriers leads to incorrect encoding In-Reply-To: References: Message-ID: On Tue, 15 Apr 2025 13:50:40 GMT, Jatin Bhateja wrote: > ZGC bookkeeps multiple place holders in barrier code snippets through relocations, these are later used to patch appropriate contents (mostly immediate values) in instruction encoding. While most of the relocation records the patching offsets from the end of the instruction, SHL instruction, which is used for pointer coloring, computes the patching offset from the starting address of the instruction. > > Thus, in case the destination register operand of SHL instruction is an extended GPR register, we miss accounting additional REX2 prefix byte in patch offset, thereby corrupting the encoding since runtime patches the primary opcode byte resulting into ILLEGAL instruction exception. > > This patch fixes reported failures by computing the relocation offset of SHL instruction from end of instruction, thereby making the patch offset agnostic to REX/REX2 prefix. > > Please review and share your feedback. > > Best Regards, > Jatin > > PS: Validation were performed using latest Intel Software Development Emulator after modifying static register allocation order in x86_64.ad file giving preference to EGPRs. This looks OK, but we could do better. 
Instead of making the relocation point to the end of the instruction and then looking up the offset with patch_barrier_relocation_offset(), why not make the offset always 0 and have the relocation point to the data offset inside the instruction? ------------- PR Comment: https://git.openjdk.org/jdk/pull/24664#issuecomment-2807988702 From stuefe at openjdk.org Wed Apr 16 05:28:47 2025 From: stuefe at openjdk.org (Thomas Stuefe) Date: Wed, 16 Apr 2025 05:28:47 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing In-Reply-To: References: Message-ID: On Fri, 11 Apr 2025 11:28:12 GMT, Joel Sikstr?m wrote: > Hello, > >> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. > > Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. > > What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: > 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. > 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). > > Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. > > I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. > > Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, this is a trade-off worth makin... Hi @jsikstro, good cleanup, some small nits remain. Cheers, Thomas src/hotspot/share/gc/shared/collectedHeap.cpp line 119: > 117: heap->print_on(&st); > 118: MetaspaceUtils::print_on(&st); > 119: } Pre-existing, the other cases of printing in this file have a preceding ResourceMark. It is either needed here or not needed there. 
src/hotspot/share/memory/metaspace.cpp line 221: > 219: MetaspaceCombinedStats stats = get_combined_statistics(); > 220: out->print("Metaspace"); > 221: out->fill_to(17); We rely on absolute position here? Will not work well with different indentation levels. src/hotspot/share/utilities/vmError.cpp line 1399: > 1397: st->cr(); > 1398: } > 1399: Universe::heap()->print_on_error(st); Why is print_on_error called outside the indentation scope? ------------- PR Review: https://git.openjdk.org/jdk/pull/24593#pullrequestreview-2770781675 PR Review Comment: https://git.openjdk.org/jdk/pull/24593#discussion_r2046093409 PR Review Comment: https://git.openjdk.org/jdk/pull/24593#discussion_r2046096635 PR Review Comment: https://git.openjdk.org/jdk/pull/24593#discussion_r2046084544 From jbhateja at openjdk.org Wed Apr 16 07:52:09 2025 From: jbhateja at openjdk.org (Jatin Bhateja) Date: Wed, 16 Apr 2025 07:52:09 GMT Subject: RFR: 8354668: Missing REX2 prefix accounting in ZGC barriers leads to incorrect encoding [v2] In-Reply-To: References: Message-ID: > ZGC bookkeeps multiple place holders in barrier code snippets through relocations, these are later used to patch appropriate contents (mostly immediate values) in instruction encoding. While most of the relocation records the patching offsets from the end of the instruction, SHL instruction, which is used for pointer coloring, computes the patching offset from the starting address of the instruction. > > Thus, in case the destination register operand of SHL instruction is an extended GPR register, we miss accounting additional REX2 prefix byte in patch offset, thereby corrupting the encoding since runtime patches the primary opcode byte resulting into ILLEGAL instruction exception. > > This patch fixes reported failures by computing the relocation offset of SHL instruction from end of instruction, thereby making the patch offset agnostic to REX/REX2 prefix. > > Please review and share your feedback. > > Best Regards, > Jatin > > PS: Validation were performed using latest Intel Software Development Emulator after modifying static register allocation order in x86_64.ad file giving preference to EGPRs. Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision: review comment resolutions ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24664/files - new: https://git.openjdk.org/jdk/pull/24664/files/1a5a73c0..ffd92c37 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24664&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24664&range=00-01 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/24664.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24664/head:pull/24664 PR: https://git.openjdk.org/jdk/pull/24664 From jbhateja at openjdk.org Wed Apr 16 07:52:09 2025 From: jbhateja at openjdk.org (Jatin Bhateja) Date: Wed, 16 Apr 2025 07:52:09 GMT Subject: RFR: 8354668: Missing REX2 prefix accounting in ZGC barriers leads to incorrect encoding In-Reply-To: References: Message-ID: On Wed, 16 Apr 2025 01:58:53 GMT, Dean Long wrote: > This looks OK, but we could do better. Instead of making the relocation point to the end of the instruction and then looking up the offset with patch_barrier_relocation_offset(), why not make the offset always 0 and have the relocation point to the data offset inside the instruction? 
Hi @dean-long , As of now, barrier relocations are placed either before[1] or after[2] the instructions, offset is then added to compute the effective address of the patch site. I think you are suggesting to extend the barrier structure itself to cache the patch site address. For this bug fix PR I intend to make the patch offset agnostic to REX/REX2 prefix without disturbing the existing implimentation. [1] https://github.com/openjdk/jdk/blob/master/src/hotspot/cpu/x86/gc/z/zBarrierSetAssembler_x86.cpp#L394 [2] https://github.com/openjdk/jdk/blob/master/src/hotspot/cpu/x86/gc/z/zBarrierSetAssembler_x86.cpp#L397 ------------- PR Comment: https://git.openjdk.org/jdk/pull/24664#issuecomment-2808697302 From stuefe at openjdk.org Wed Apr 16 08:30:51 2025 From: stuefe at openjdk.org (Thomas Stuefe) Date: Wed, 16 Apr 2025 08:30:51 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing In-Reply-To: References: Message-ID: On Fri, 11 Apr 2025 11:28:12 GMT, Joel Sikstr?m wrote: > Hello, > >> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. > > Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. > > What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: > 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. > 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). > > Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. > > I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. > > Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, this is a trade-off worth makin... 
Notes: - We may want to simplify at some point and merge streamIndentor and streamAutoIndentor. That includes checking which existing call sites use streamIndentor *without* wanting auto indentation. Not sure but I guess there are none. I think the existing cases fall into two categories: where streamIndentor was used on a stream that had already autoindent enabled, and where the code uses "cr_indent()" or "indent" to manually indent. - It would be nice to have a short comment in collectedHeap.hpp about when print_on resp print_on_error is used. From your explanation, I expect print_on_error to be used for information that should only be printed in case of a fatal error, right? - To simplify and prevent mistakes, we should consider making set_autoindent in outputStream private and make the indentor RAII classes friends of outputStream. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24593#issuecomment-2808804192 From duke at openjdk.org Wed Apr 16 09:10:53 2025 From: duke at openjdk.org (duke) Date: Wed, 16 Apr 2025 09:10:53 GMT Subject: Withdrawn: 8340434: Excessive Young GCs Triggered by CodeCache GC Threshold In-Reply-To: References: Message-ID: On Thu, 19 Sep 2024 08:43:50 GMT, sli-x wrote: > The trigger of _codecache_GC_threshold in CodeCache::gc_on_allocation is the key to this problem. > > if (used_ratio > threshold) { > // After threshold is reached, scale it by free_ratio so that more aggressive > // GC is triggered as we approach code cache exhaustion > threshold *= free_ratio; > } > // If code cache has been allocated without any GC at all, let's make sure > // it is eventually invoked to avoid trouble. > if (allocated_since_last_ratio > threshold) { > // In case the GC is concurrent, we make sure only one thread requests the GC. > if (Atomic::cmpxchg(&_unloading_threshold_gc_requested, false, true) == false) { > log_info(codecache)("Triggering threshold (%.3f%%) GC due to allocating %.3f%% since last unloading (%.3f%% used -> %.3f%% used)", > threshold * 100.0, allocated_since_last_ratio * 100.0, last_used_ratio * 100.0, used_ratio * 100.0); > Universe::heap()->collect(GCCause::_codecache_GC_threshold); > } > } > > Here with the limited codecache size, the free_ratio will get lower and lower (so as the threshold) if no methods can be swept and thus leads to a more and more frequent collection behavior. Since the collection happens in stw, the whole performance of gc will also be degraded. > > So a simple solution is to delete the scaling logic here. However, I think here lies some problems worth further exploring. > > There're two options to control a code cache sweeper, StartAggressiveSweepingAt and SweeperThreshold. StartAggressiveSweepingAt is a sweeper triggered for little space in codeCache and does little harm. However, SweeperThreshold, first introduced by [JDK-8244660](https://bugs.openjdk.org/browse/JDK-8244660), was designed for a regular sweep for codecache, when codeCache sweeper and heap collection are actually individual. After [JDK-8290025](https://bugs.openjdk.org/browse/JDK-8290025) and some patches related, the old mechanism of codeCache sweeper is merged into a concurrent heap collection. So the Code cache sweeper heuristics and the unloading behavior will be promised by the concurrent collection. There's no longer any "zombie" methods to be counted. Considering it will introduce lots of useless collection jobs, I think SweeperThreshold should be deleted now. This pull request has been closed without being integrated. 
------------- PR: https://git.openjdk.org/jdk/pull/21084 From erik.osterlund at oracle.com Wed Apr 16 09:45:39 2025 From: erik.osterlund at oracle.com (Erik Osterlund) Date: Wed, 16 Apr 2025 09:45:39 +0000 Subject: [External] : Re: Moving Forward with AHS for G1 In-Reply-To: References: <5dc9c3e2-fe3e-4c53-b8dc-3d55337187e5@oracle.com> <6088CF86-8F42-4800-86BB-952426FA2564@oracle.com> <5210B365-EB7D-498F-BF21-02B9629B1338@kodewerk.com> <4E901C51-BBD6-431A-9282-5432A8AD8B9B@oracle.com> Message-ID: > On 15 Apr 2025, at 23:27, Man Cao wrote: > > ? > > > One example is that the custom container has a custom implementation for soft limit, and still uses cgroup memory limits as hard limit. Apps that are "good citizens" should strive to stay below the soft limit. > > That?s exactly what the purpose of memory.high is. With cgroups v2, memory.high is a soft limit while memory.max is a hard limit. AHS should respect both really. > > Supporting both memory.high and memory.max for AHS sounds great. > The soft limit for the custom container is only one example. The custom container also has "strange" use cases where the actual limit is larger than cgroup's hard memory limit. Okay, great. Sounds like AHS + actually using the standardized cgroups memory limits as the way of limiting memory is a viable path forward then? > Going back to the high level, the point is that it is impractical for organizations such as us to change deployment environments (e.g. migrating from custom container to standard container) in order to use AHS. A flag such as CurrentMaxHeapSize will definitely help these use cases adopt AHS. So the main point for introducing CurrentMaxHeapSize, as opposed to going directly to AHS, would be to support all the people out there that already built their own adaptive container infrastructure that doesn?t use industry standard cgroup technology to limit memory. Instead, this group of users use the very proposed CurrentMaxHeapSize functionality (which obviously does not exist in mainline yet) to limit memory adaptively instead. I have to be honest? this sounds like a niche feature to me with a ticking clock attached to it. Yet if it gets integrated, we will not be able to get rid of it for decades and it will cost maintenance overheads along the way. So I think it would be good to see a prominent use case that might be interesting for a long time going forward as well, and not just a way to help you guys stop using the proposed feature in the transition to AHS, which seems to be where we are going. I think what will reach a much broader audience going forward, is AHS. And if that?s the feature we really want, I can?t help but wonder if exposing this user configurable stuff along the way is helping towards that goal rather than slowing us down by inventing yet another set of manually set handcuffs that the JVM and AHS will have to respect for ages, way past its best before date. /Erik From jsikstro at openjdk.org Wed Apr 16 13:25:47 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Wed, 16 Apr 2025 13:25:47 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing In-Reply-To: References: Message-ID: On Wed, 16 Apr 2025 05:21:41 GMT, Thomas Stuefe wrote: >> Hello, >> >>> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. 
>> >> Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. >> >> What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: >> 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. >> 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). >> >> Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. >> >> I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. >> >> Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, th... > > src/hotspot/share/gc/shared/collectedHeap.cpp line 119: > >> 117: heap->print_on(&st); >> 118: MetaspaceUtils::print_on(&st); >> 119: } > > Pre-existing, the other cases of printing in this file have a preceding ResourceMark. It is either needed here or not needed there. The ResourceMarks that are used in other places in this file are not needed anymore. The reason they are placed where they are is because previously (a long time ago, since before [this](https://github.com/openjdk/jdk/commit/d12604111ccd6a5da38602077f4574adc850d9b8#diff-f9496186f2b54da5514e073a08b00afe2e2f8fbae899b13c182c8fbccc7aa7a6) commit), they were next to creating a debug stream. When the debug stream was replaced with a LogStream, the ResourceMark should have followed the LogStream, but it didn't in the changes for print_heap_{before,after}_gc(), see universe.cpp in [this](https://github.com/openjdk/jdk/commit/d12604111ccd6a5da38602077f4574adc850d9b8#diff-f9496186f2b54da5514e073a08b00afe2e2f8fbae899b13c182c8fbccc7aa7a6) commit, where the printing methods were before being moved to collectedHeap.cpp. The ResourceMarks should be removed, like Casper has done in [JDK-8294954](https://github.com/openjdk/jdk/pull/24162). 
I talked with Casper about the ResourceMarks, as he have looked over why the ResourceMarks are there in his patch and he agrees that they should be removed from print_heap_{before,after}_gc(), as they are likely there only for the LogStream. To summarise, no, ResourceMarks are not needed here, and they should be removed in the other places in this file. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24593#discussion_r2046931370 From jsikstro at openjdk.org Wed Apr 16 13:56:07 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Wed, 16 Apr 2025 13:56:07 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing In-Reply-To: References: Message-ID: On Wed, 16 Apr 2025 05:15:40 GMT, Thomas Stuefe wrote: >> Hello, >> >>> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. >> >> Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. >> >> What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: >> 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. >> 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). >> >> Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. >> >> I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. >> >> Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, th... > > src/hotspot/share/utilities/vmError.cpp line 1399: > >> 1397: st->cr(); >> 1398: } >> 1399: Universe::heap()->print_on_error(st); > > Why is print_on_error called outside the indentation scope? 
This is because print_on() is in its "own" block, inside "Heap:", while print_on_error() prints its own blocks, like "ZGC Globals:" below. Other GCs behave in the same way. Heap: ZHeap used 7740M, capacity 9216M, max capacity 9216M Cache 1476M (2) size classes 128M (1), 1G (1) Metaspace used 18526K, committed 18816K, reserved 1114112K class space used 1603K, committed 1728K, reserved 1048576K ZGC Globals: Young Collection: Mark/51 Old Collection: Mark/18 Offset Max: 144G (0x0000002400000000) Page Size Small: 2M Page Size Medium: 32M ZGC Metadata Bits: LoadGood: 0x000000000000d000 LoadBad: 0x0000000000002000 ... ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24593#discussion_r2046992916 From jsikstro at openjdk.org Wed Apr 16 14:08:47 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Wed, 16 Apr 2025 14:08:47 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing In-Reply-To: References: Message-ID: On Wed, 16 Apr 2025 05:25:31 GMT, Thomas Stuefe wrote: >> Hello, >> >>> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. >> >> Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. >> >> What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: >> 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. >> 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). >> >> Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. >> >> I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. >> >> Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, th... 
> > src/hotspot/share/memory/metaspace.cpp line 221: > >> 219: MetaspaceCombinedStats stats = get_combined_statistics(); >> 220: out->print("Metaspace"); >> 221: out->fill_to(17); > > We rely on absolute position here? Will not work well with different indentation levels. This was intended to align well with how ZGC does it. After some thought I think a better strategy is to add a space at the end of the string before filling, like `out->print("Metaspace "); out->fill_to(17);`. This still aligns to the 17th column, but will not break printing for deeper indentation levels (currently 6 or more). ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24593#discussion_r2047019290 From jsikstro at openjdk.org Wed Apr 16 14:19:03 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Wed, 16 Apr 2025 14:19:03 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing [v2] In-Reply-To: References: Message-ID: > Hello, > >> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. > > Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. > > What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: > 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. > 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). > > Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. > > I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. > > Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, this is a trade-off worth makin...
Joel Sikstr?m has updated the pull request incrementally with four additional commits since the last revision: - Safety padding for deep indentation - Remove superfluous ResourceMarks - Comment for print_on_error() - Merge 'master' into JDK-8354362_autoindent_collectedheap ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24593/files - new: https://git.openjdk.org/jdk/pull/24593/files/2c0c0b2b..9fea46ab Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24593&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24593&range=00-01 Stats: 180592 lines in 408 files changed: 10159 ins; 169115 del; 1318 mod Patch: https://git.openjdk.org/jdk/pull/24593.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24593/head:pull/24593 PR: https://git.openjdk.org/jdk/pull/24593 From jsikstro at openjdk.org Wed Apr 16 14:19:49 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Wed, 16 Apr 2025 14:19:49 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing [v3] In-Reply-To: References: Message-ID: > Hello, > >> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. > > Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. > > What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: > 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. > 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). > > Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. > > I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. > > Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. 
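The "Safety padding for deep indentation" commit above ties back to the fill_to(17) exchange; a standalone model (toy class, not the real outputStream) of why a trailing space keeps a separator once the indentation already passes the fill column:

```c++
#include <cstdio>
#include <string>

// Standalone model (not the real outputStream): when indentation already passes
// the fill column, fill_to() adds nothing, so the trailing space in the label is
// the only thing keeping the label and the value apart.
struct ToyStream {
  std::string line;
  void print(const std::string& s) { line += s; }
  void fill_to(size_t col) { while (line.size() < col) line += ' '; }
};

int main() {
  for (int indent : {1, 20}) {  // shallow vs. very deep indentation
    ToyStream out;
    out.print(std::string(indent, ' '));
    out.print("Metaspace ");    // note the trailing space
    out.fill_to(17);
    out.print("used 18526K");
    printf("[%s]\n", out.line.c_str());
  }
  return 0;
}
```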
A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, this is a trade-off worth makin... Joel Sikstr?m has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains five commits: - Merge branch 'master' into JDK-8354362_autoindent_collectedheap - Safety padding for deep indentation - Remove superfluous ResourceMarks - Comment for print_on_error() - 8354362: Use automatic indentation in CollectedHeap printing ------------- Changes: https://git.openjdk.org/jdk/pull/24593/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24593&range=02 Stats: 246 lines in 27 files changed: 88 ins; 89 del; 69 mod Patch: https://git.openjdk.org/jdk/pull/24593.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24593/head:pull/24593 PR: https://git.openjdk.org/jdk/pull/24593 From jsikstro at openjdk.org Wed Apr 16 14:19:49 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Wed, 16 Apr 2025 14:19:49 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing In-Reply-To: References: Message-ID: On Fri, 11 Apr 2025 11:28:12 GMT, Joel Sikstr?m wrote: > Hello, > >> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. > > Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. > > What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: > 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. > 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). > > Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. > > I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. > > Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. 
My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, this is a trade-off worth makin... Sorry for the force-push, made a mistake when merging with master. No comments should have been removed. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24593#issuecomment-2809736966 From jsikstro at openjdk.org Wed Apr 16 14:28:46 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Wed, 16 Apr 2025 14:28:46 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing In-Reply-To: References: Message-ID: On Wed, 16 Apr 2025 08:28:22 GMT, Thomas Stuefe wrote: >> Hello, >> >>> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. >> >> Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. >> >> What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: >> 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. >> 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). >> >> Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. >> >> I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. >> >> Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, th... > > Notes: > > - We may want to simplify at some point and merge streamIndentor and streamAutoIndentor. 
That includes checking which existing call sites use streamIndentor *without* wanting auto indentation. Not sure but I guess there are none. > I think the existing cases fall into two categories: where streamIndentor was used on a stream that had already autoindent enabled, and where the code uses "cr_indent()" or "indent" to manually indent. > > - It would be nice to have a short comment in collectedHeap.hpp about when print_on resp print_on_error is used. From your explanation, I expect print_on_error to be used for information that should only be printed in case of a fatal error, right? > > - To simplify and prevent mistakes, we should consider making set_autoindent in outputStream private and make the indentor RAII classes friends of outputStream. Thank you for looking at this @tstuefe! I've addressed some of your comments with new commits. I agree that we likely want to merge streamIndentor and StreamAutoIndentor in a follow up RFE, where it also would be good to look at making set_autoindent() private. I haven't looked into it, but it feels weird to have an indentation level on an outputStream and use it only explicitly via indent() and not via a StreamAutoIndentor. I think a good solution would be to only allow indentation via the StreamAutoIndentor API like you're proposing, and look into whether there should be some API for temporarily disabling indentation with a RAII object (or just some parameters to StreamAutoIndentor) if there are cases that require it. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24593#issuecomment-2809764389 From dlong at openjdk.org Wed Apr 16 21:13:52 2025 From: dlong at openjdk.org (Dean Long) Date: Wed, 16 Apr 2025 21:13:52 GMT Subject: RFR: 8354668: Missing REX2 prefix accounting in ZGC barriers leads to incorrect encoding [v2] In-Reply-To: References: Message-ID: On Wed, 16 Apr 2025 07:52:09 GMT, Jatin Bhateja wrote: >> ZGC bookkeeps multiple place holders in barrier code snippets through relocations, these are later used to patch appropriate contents (mostly immediate values) in instruction encoding. While most of the relocation records the patching offsets from the end of the instruction, SHL instruction, which is used for pointer coloring, computes the patching offset from the starting address of the instruction. >> >> Thus, in case the destination register operand of SHL instruction is an extended GPR register, we miss accounting additional REX2 prefix byte in patch offset, thereby corrupting the encoding since runtime patches the primary opcode byte resulting into ILLEGAL instruction exception. >> >> This patch fixes reported failures by computing the relocation offset of SHL instruction from end of instruction, thereby making the patch offset agnostic to REX/REX2 prefix. >> >> Please review and share your feedback. >> >> Best Regards, >> Jatin >> >> PS: Validation were performed using latest Intel Software Development Emulator after modifying static register allocation order in x86_64.ad file giving preference to EGPRs. > > Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision: > > review comment resolutions Yes, I am suggesting doing something like: __ relocate(__ pc() - 4, barrier_Relocation::spec(), ZBarrierRelocationFormatStoreGoodAfterOr); which would be a bigger change to the implementation. 
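To illustrate why the end-relative form suggested above is prefix-agnostic, here is a standalone sketch (not HotSpot code; the instruction layout is simplified to prefix + opcode + ModRM + imm8):

```c++
#include <cstdio>
#include <cstddef>

// Standalone sketch (not HotSpot code). Model a SHL-with-immediate as
// [prefix bytes][opcode][ModRM][imm8]; the imm8 is the byte patched later.
// An offset recorded from the start of the instruction assumes a fixed prefix
// size, while an offset from the end does not care how many prefix bytes exist.
int main() {
  const size_t start_relative = 3;  // assumes a single REX prefix: prefix + opcode + ModRM
  for (size_t prefix_bytes = 1; prefix_bytes <= 2; prefix_bytes++) {  // 1 = REX, 2 = REX2
    const size_t insn_len   = prefix_bytes + 3;  // opcode + ModRM + imm8
    const size_t actual_imm = insn_len - 1;      // where the imm8 really is
    const size_t end_based  = insn_len - 1;      // "last byte", computed from the end
    printf("prefix=%zu: start-relative patches byte %zu (%s), end-relative patches byte %zu (ok)\n",
           prefix_bytes, start_relative,
           start_relative == actual_imm ? "ok" : "WRONG", end_based);
  }
  return 0;
}
```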
------------- PR Comment: https://git.openjdk.org/jdk/pull/24664#issuecomment-2810802951 From lmesnik at openjdk.org Wed Apr 16 23:07:53 2025 From: lmesnik at openjdk.org (Leonid Mesnik) Date: Wed, 16 Apr 2025 23:07:53 GMT Subject: Integrated: 8354559: gc/g1/TestAllocationFailure.java doesn't need WB API In-Reply-To: <27kfQFBIUrqLa3513GREjhVQp_iNK0pvYu6Wm1yTF7k=.b8c540a4-f3d1-4a17-9dbf-908be0ae6f7c@github.com> References: <27kfQFBIUrqLa3513GREjhVQp_iNK0pvYu6Wm1yTF7k=.b8c540a4-f3d1-4a17-9dbf-908be0ae6f7c@github.com> Message-ID: <1ZIcsJTnCri0LVBjSYa15TA8IpyrQxmw0K-SAFtBr5E=.e3f1835a-a3d6-4afa-80d1-fecb9751c859@github.com> On Tue, 15 Apr 2025 01:29:50 GMT, Leonid Mesnik wrote: > Just minor clean up of WB API usage. > Also changed othervm to driver. This pull request has now been integrated. Changeset: db2dffb6 Author: Leonid Mesnik URL: https://git.openjdk.org/jdk/commit/db2dffb6e5fed3773080581350f7f5c0bcff8f35 Stats: 5 lines in 1 file changed: 0 ins; 3 del; 2 mod 8354559: gc/g1/TestAllocationFailure.java doesn't need WB API Reviewed-by: ayang, kbarrett ------------- PR: https://git.openjdk.org/jdk/pull/24642 From jbhateja at openjdk.org Thu Apr 17 02:22:42 2025 From: jbhateja at openjdk.org (Jatin Bhateja) Date: Thu, 17 Apr 2025 02:22:42 GMT Subject: RFR: 8354668: Missing REX2 prefix accounting in ZGC barriers leads to incorrect encoding [v2] In-Reply-To: References: Message-ID: <1iR9_nrbk0iFlgy28u4dO4-7OWjEkO__AoZ9zHqtm8I=.ae8b0a68-0f85-472d-a810-e9c8417097d9@github.com> On Wed, 16 Apr 2025 21:10:38 GMT, Dean Long wrote: > Yes, I am suggesting doing something like: > > ``` > __ relocate(__ pc() - 4, barrier_Relocation::spec(), ZBarrierRelocationFormatStoreGoodAfterOr); > ``` > > which would be a bigger change to the implementation. Yes, this is what I mean by address caching in my above comment. we already have an existing interface for it in place; the intent of this bug fix PR is not to improve upon the infrastructure but to align the fix with the current scheme. Do you suggest doing that in a follow up PR ? ------------- PR Comment: https://git.openjdk.org/jdk/pull/24664#issuecomment-2811561649 From jbhateja at openjdk.org Thu Apr 17 03:21:08 2025 From: jbhateja at openjdk.org (Jatin Bhateja) Date: Thu, 17 Apr 2025 03:21:08 GMT Subject: RFR: 8354668: Missing REX2 prefix accounting in ZGC barriers leads to incorrect encoding [v3] In-Reply-To: References: Message-ID: > ZGC bookkeeps multiple place holders in barrier code snippets through relocations, these are later used to patch appropriate contents (mostly immediate values) in instruction encoding. While most of the relocation records the patching offsets from the end of the instruction, SHL instruction, which is used for pointer coloring, computes the patching offset from the starting address of the instruction. > > Thus, in case the destination register operand of SHL instruction is an extended GPR register, we miss accounting additional REX2 prefix byte in patch offset, thereby corrupting the encoding since runtime patches the primary opcode byte resulting into ILLEGAL instruction exception. > > This patch fixes reported failures by computing the relocation offset of SHL instruction from end of instruction, thereby making the patch offset agnostic to REX/REX2 prefix. > > Please review and share your feedback. > > Best Regards, > Jatin > > PS: Validation were performed using latest Intel Software Development Emulator after modifying static register allocation order in x86_64.ad file giving preference to EGPRs. 
Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision: Review comments resolutions ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24664/files - new: https://git.openjdk.org/jdk/pull/24664/files/ffd92c37..dc2b2b16 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24664&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24664&range=01-02 Stats: 10 lines in 4 files changed: 0 ins; 0 del; 10 mod Patch: https://git.openjdk.org/jdk/pull/24664.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24664/head:pull/24664 PR: https://git.openjdk.org/jdk/pull/24664 From stuefe at openjdk.org Thu Apr 17 05:25:54 2025 From: stuefe at openjdk.org (Thomas Stuefe) Date: Thu, 17 Apr 2025 05:25:54 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing [v3] In-Reply-To: References: Message-ID: On Wed, 16 Apr 2025 14:06:21 GMT, Joel Sikstr?m wrote: >> src/hotspot/share/memory/metaspace.cpp line 221: >> >>> 219: MetaspaceCombinedStats stats = get_combined_statistics(); >>> 220: out->print("Metaspace"); >>> 221: out->fill_to(17); >> >> We rely on absolute position here? Will not work well with different indentation levels. > > This was intended to align good with how ZGC does it. After some thought I think a better strategy is to add a space at the end of the string before filling, like: > > ```c++ > out->print("Metaspace "); > out->fill_to(17); > > This still aligns to the 17th column, but will not break printing for deeper indentation levels (currently 6 or more). Yes that sounds better >> src/hotspot/share/utilities/vmError.cpp line 1399: >> >>> 1397: st->cr(); >>> 1398: } >>> 1399: Universe::heap()->print_on_error(st); >> >> Why is print_on_error called outside the indentation scope? > > This is because print_on() is in its "own" block, inside "Heap:", while print_on_error() prints its own blocks, like "ZGC Globals:" below. Other GCs behave in the same way. > > > Heap: > ZHeap used 7740M, capacity 9216M, max capacity 9216M > Cache 1476M (2) > size classes 128M (1), 1G (1) > Metaspace used 18526K, committed 18816K, reserved 1114112K > class space used 1603K, committed 1728K, reserved 1048576K > > ZGC Globals: > Young Collection: Mark/51 > Old Collection: Mark/18 > Offset Max: 144G (0x0000002400000000) > Page Size Small: 2M > Page Size Medium: 32M > > ZGC Metadata Bits: > LoadGood: 0x000000000000d000 > LoadBad: 0x0000000000002000 > ... Hmm, that may be an indication that this should be in its own error reporting STEP, then? Probably does not matter much, just aesthetics ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24593#discussion_r2048252477 PR Review Comment: https://git.openjdk.org/jdk/pull/24593#discussion_r2048252150 From jsikstro at openjdk.org Thu Apr 17 09:13:34 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Thu, 17 Apr 2025 09:13:34 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing [v4] In-Reply-To: References: Message-ID: > Hello, > >> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. > > Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. 
To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. > > What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: > 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. > 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). > > Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. > > I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. > > Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, this is a trade-off worth makin... Joel Sikstr?m has updated the pull request incrementally with two additional commits since the last revision: - Separate print_heap_on and print_gc_on in VMError printing - Rename print_on and print_on_error to print_heap_on and print_gc_on ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24593/files - new: https://git.openjdk.org/jdk/pull/24593/files/c1140b86..2979316c Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24593&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24593&range=02-03 Stats: 71 lines in 15 files changed: 19 ins; 6 del; 46 mod Patch: https://git.openjdk.org/jdk/pull/24593.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24593/head:pull/24593 PR: https://git.openjdk.org/jdk/pull/24593 From jsikstro at openjdk.org Thu Apr 17 09:13:35 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Thu, 17 Apr 2025 09:13:35 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing [v4] In-Reply-To: References: Message-ID: <4Swh7By1eRJ19p7ULrAryORBm97i0783ErfLDJhdnKw=.1a0e864e-e74f-4491-a153-fc1c049688be@github.com> On Thu, 17 Apr 2025 05:23:10 GMT, Thomas Stuefe wrote: >> This is because print_on() is in its "own" block, inside "Heap:", while print_on_error() prints its own blocks, like "ZGC Globals:" below. Other GCs behave in the same way. 
>> >> >> Heap: >> ZHeap used 7740M, capacity 9216M, max capacity 9216M >> Cache 1476M (2) >> size classes 128M (1), 1G (1) >> Metaspace used 18526K, committed 18816K, reserved 1114112K >> class space used 1603K, committed 1728K, reserved 1048576K >> >> ZGC Globals: >> Young Collection: Mark/51 >> Old Collection: Mark/18 >> Offset Max: 144G (0x0000002400000000) >> Page Size Small: 2M >> Page Size Medium: 32M >> >> ZGC Metadata Bits: >> LoadGood: 0x000000000000d000 >> LoadBad: 0x0000000000002000 >> ... > > Hmm, that may be an indication that this should be in its own error reporting STEP, then? Probably does not matter much, just aesthetics I agree. With some suggestions from @stefank, I've renamed print_on() to print_heap_on() and print_on_error() to print_gc_on() to better reflect their purpose. I've also separated print_heap_on() and print_gc_on() into their own "STEPs" in the printing in vmError.cpp: STEP("printing heap information") ... print_heap_on(); ... STEP("printing GC information") ... print_gc_on() ... With this change it would make better sense to print the precious log in the GC section rather than the heap section. This would change the printing order, which I have not yet done in this patch, so I think it would be better in a follow up RFE. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24593#discussion_r2048556654 From jsikstro at openjdk.org Thu Apr 17 09:47:17 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Thu, 17 Apr 2025 09:47:17 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing [v5] In-Reply-To: References: Message-ID: > Hello, > >> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. > > Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. > > What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: > 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. > 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). > > Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. > > I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. 
> > Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, this is a trade-off worth makin... Joel Sikstr?m has updated the pull request incrementally with one additional commit since the last revision: Shenandoah print rename ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24593/files - new: https://git.openjdk.org/jdk/pull/24593/files/2979316c..33d20641 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24593&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24593&range=03-04 Stats: 4 lines in 2 files changed: 0 ins; 0 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/24593.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24593/head:pull/24593 PR: https://git.openjdk.org/jdk/pull/24593 From stuefe at openjdk.org Thu Apr 17 10:44:53 2025 From: stuefe at openjdk.org (Thomas Stuefe) Date: Thu, 17 Apr 2025 10:44:53 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing [v5] In-Reply-To: References: Message-ID: On Thu, 17 Apr 2025 09:47:17 GMT, Joel Sikstr?m wrote: >> Hello, >> >>> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. >> >> Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. >> >> What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: >> 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. >> 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). >> >> Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. >> >> I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. 
>> >> Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, th... > > Joel Sikstr?m has updated the pull request incrementally with one additional commit since the last revision: > > Shenandoah print rename Looks fine to me, but GC people should look at this too. ------------- Marked as reviewed by stuefe (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24593#pullrequestreview-2775319824 From jsikstro at openjdk.org Thu Apr 17 10:49:39 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Thu, 17 Apr 2025 10:49:39 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing [v6] In-Reply-To: References: Message-ID: > Hello, > >> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. > > Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. > > What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: > 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. > 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). > > Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. > > I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. > > Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. 
My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, this is a trade-off worth makin... Joel Sikstr?m has updated the pull request incrementally with one additional commit since the last revision: Rename ZGC printing to match print_heap_on() and print_gc_on() ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24593/files - new: https://git.openjdk.org/jdk/pull/24593/files/33d20641..0824712c Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24593&range=05 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24593&range=04-05 Stats: 24 lines in 5 files changed: 1 ins; 1 del; 22 mod Patch: https://git.openjdk.org/jdk/pull/24593.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24593/head:pull/24593 PR: https://git.openjdk.org/jdk/pull/24593 From sjohanss at openjdk.org Thu Apr 17 10:53:29 2025 From: sjohanss at openjdk.org (Stefan Johansson) Date: Thu, 17 Apr 2025 10:53:29 GMT Subject: RFR: 8354929: Update collection stats while holding page allocator lock Message-ID: Please review this change to restructure some code in the mark start pause to do updates while holding the lock. **Summary** We currently update the collection high and low used values during the mark start pause without taking the page allocator lock. This is fine since it is read atomically, but consistency verification in this code requires the lock to be held. We later in the pause take the lock to get the current statistics, this change moves the update code to also happen while holding the lock. I've renamed `reset_statistics()` to `update_collection_stats()` to better match what it actually does and made it private. **Testing** Mach5 tier1-5 ------------- Commit messages: - Move collection stat update under lock Changes: https://git.openjdk.org/jdk/pull/24719/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24719&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8354929 Stats: 45 lines in 3 files changed: 17 ins; 15 del; 13 mod Patch: https://git.openjdk.org/jdk/pull/24719.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24719/head:pull/24719 PR: https://git.openjdk.org/jdk/pull/24719 From stefank at openjdk.org Thu Apr 17 11:22:55 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Thu, 17 Apr 2025 11:22:55 GMT Subject: RFR: 8354922: ZGC: Use MAP_FIXED_NOREPLACE when reserving memory Message-ID: We have seen that some versions of the Linux kernel does not honor the address hint when mmapping memory without MAP_FIXED, if there is an adjacent memory area above the requested memory area. If we use MAP_FIXED_NOREPLACE, the reservation succeeds. I propose that we start using MAP_FIXED_NOREPLACE. Tested via GHA, which runs the gtest that performs a discontiguous, but adjacent reservation. I will run this through a bunch of tiers before integrating. 
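A hypothetical helper (not the ZGC reservation code) showing the difference: with MAP_FIXED_NOREPLACE the kernel either maps at the requested address or fails with EEXIST, instead of quietly placing the mapping elsewhere the way a plain hint may:

```c++
#include <sys/mman.h>
#include <cstddef>
#include <cerrno>
#include <cstdio>

// Hypothetical helper, not the ZGC code: reserve 'size' bytes exactly at 'addr'.
static void* reserve_at(void* addr, size_t size) {
  const int base_flags = MAP_PRIVATE | MAP_ANONYMOUS | MAP_NORESERVE;
#ifdef MAP_FIXED_NOREPLACE
  // Linux 4.17+: either we get exactly 'addr', or mmap fails (EEXIST if occupied).
  void* res = mmap(addr, size, PROT_NONE, base_flags | MAP_FIXED_NOREPLACE, -1, 0);
  if (res == MAP_FAILED) {
    fprintf(stderr, "reservation at %p failed, errno %d\n", addr, errno);
    return nullptr;
  }
  return res;
#else
  // Fallback: 'addr' is only a hint, so the result must be checked and unmapped
  // again if the kernel decided to place the mapping somewhere else.
  void* res = mmap(addr, size, PROT_NONE, base_flags, -1, 0);
  if (res == MAP_FAILED) {
    return nullptr;
  }
  if (res != addr) {
    munmap(res, size);
    return nullptr;
  }
  return res;
#endif
}
```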
------------- Commit messages: - 8354922: ZGC: Use MAP_FIXED_NOREPLACE when reserving memory Changes: https://git.openjdk.org/jdk/pull/24716/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24716&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8354922 Stats: 10 lines in 2 files changed: 9 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/24716.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24716/head:pull/24716 PR: https://git.openjdk.org/jdk/pull/24716 From stefank at openjdk.org Thu Apr 17 11:26:51 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Thu, 17 Apr 2025 11:26:51 GMT Subject: RFR: 8354929: Update collection stats while holding page allocator lock In-Reply-To: References: Message-ID: On Thu, 17 Apr 2025 10:48:54 GMT, Stefan Johansson wrote: > Please review this change to restructure some code in the mark start pause to do updates while holding the lock. > > **Summary** > We currently update the collection high and low used values during the mark start pause without taking the page allocator lock. This is fine since it is read atomically, but consistency verification in this code requires the lock to be held. We later in the pause take the lock to get the current statistics, this change moves the update code to also happen while holding the lock. > > I've renamed `reset_statistics()` to `update_collection_stats()` to better match what it actually does and made it private. > > **Testing** > Mach5 tier1-5 Looks good. I've add a couple of suggestions for blank lines. src/hotspot/share/gc/z/zPageAllocator.cpp line 1378: > 1376: void ZPageAllocator::update_collection_stats(ZGenerationId id) { > 1377: assert(SafepointSynchronize::is_at_safepoint(), "Should be at safepoint"); > 1378: #ifdef ASSERT Suggestion: #ifdef ASSERT src/hotspot/share/gc/z/zPageAllocator.cpp line 1388: > 1386: assert(total_used == _used, "Must be consistent at safepoint %zu == %zu", total_used, _used); > 1387: #endif > 1388: _collection_stats[(int)id]._used_high = _used; Suggestion: _collection_stats[(int)id]._used_high = _used; ------------- Marked as reviewed by stefank (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24719#pullrequestreview-2775405882 PR Review Comment: https://git.openjdk.org/jdk/pull/24719#discussion_r2048745637 PR Review Comment: https://git.openjdk.org/jdk/pull/24719#discussion_r2048745371 From stefank at openjdk.org Thu Apr 17 11:28:04 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Thu, 17 Apr 2025 11:28:04 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing [v6] In-Reply-To: References: Message-ID: <_rpazMMpjOnpEthySjhhfKE_Lit3eMNkH27qke5-Syc=.c0a68c73-9fdc-4ba8-950a-9fead760abda@github.com> On Thu, 17 Apr 2025 10:49:39 GMT, Joel Sikstr?m wrote: >> Hello, >> >>> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. >> >> Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. 
>> >> What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: >> 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. >> 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). >> >> Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. >> >> I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. >> >> Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, th... > > Joel Sikstr?m has updated the pull request incrementally with one additional commit since the last revision: > > Rename ZGC printing to match print_heap_on() and print_gc_on() src/hotspot/share/gc/parallel/parallelScavengeHeap.cpp line 680: > 678: } > 679: st->cr(); > 680: Below this line we still have a print_on_error call. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24593#discussion_r2048749718 From stefank at openjdk.org Thu Apr 17 12:05:44 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Thu, 17 Apr 2025 12:05:44 GMT Subject: RFR: 8354938: ZGC: Disable UseNUMA when ZFakeNUMA is used Message-ID: ZFakeNUMA is used to fake a number of NUMA nodes within ZGC. The intention was to make ZFakeNUMA mutually exclusive with UseNUMA, but the current code allows the user to enable UseNUMA and set ZFakeNUMA, which will trigger to the "mutual exclusion" assert in ZNUMA::initialize. Verified on NUMA machine with -XX:+UseNUMA -XX:ZFakeNUMA=. Will run this through our lower tiers. 
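A standalone model (not the actual HotSpot argument processing) of the precedence being proposed above, i.e. a nonzero ZFakeNUMA forces UseNUMA off so the two can never be active at the same time:

```c++
#include <cstdio>

// Standalone model, not the actual HotSpot flag handling: if the developer asks
// for fake NUMA nodes, real NUMA support is switched off up front, so the
// "mutual exclusion" assert in the NUMA initialization can never fire.
struct Flags {
  bool     UseNUMA;
  unsigned ZFakeNUMA;  // 0 means "not faking"
};

static void apply_numa_ergonomics(Flags& f) {
  if (f.ZFakeNUMA > 0 && f.UseNUMA) {
    fprintf(stderr, "warning: ZFakeNUMA=%u set, disabling UseNUMA\n", f.ZFakeNUMA);
    f.UseNUMA = false;
  }
}

int main() {
  Flags f{/*UseNUMA=*/true, /*ZFakeNUMA=*/4};  // e.g. -XX:+UseNUMA -XX:ZFakeNUMA=4
  apply_numa_ergonomics(f);
  printf("UseNUMA=%d ZFakeNUMA=%u\n", f.UseNUMA ? 1 : 0, f.ZFakeNUMA);
  return 0;
}
```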
------------- Commit messages: - 8354938: ZGC: Disable UseNUMA when ZFakeNUMA is used Changes: https://git.openjdk.org/jdk/pull/24721/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24721&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8354938 Stats: 13 lines in 1 file changed: 10 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/24721.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24721/head:pull/24721 PR: https://git.openjdk.org/jdk/pull/24721 From jsikstro at openjdk.org Thu Apr 17 12:15:45 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Thu, 17 Apr 2025 12:15:45 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing [v7] In-Reply-To: References: Message-ID: > Hello, > >> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. > > Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. > > What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: > 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. > 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). > > Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. > > I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. > > Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, this is a trade-off worth makin... 
Joel Sikstr?m has updated the pull request incrementally with one additional commit since the last revision: Rename print_on_error() in the remaining call-paths from print_gc_on() ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24593/files - new: https://git.openjdk.org/jdk/pull/24593/files/0824712c..042c0aee Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24593&range=06 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24593&range=05-06 Stats: 15 lines in 11 files changed: 0 ins; 0 del; 15 mod Patch: https://git.openjdk.org/jdk/pull/24593.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24593/head:pull/24593 PR: https://git.openjdk.org/jdk/pull/24593 From jsikstro at openjdk.org Thu Apr 17 12:15:46 2025 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Thu, 17 Apr 2025 12:15:46 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing [v6] In-Reply-To: <_rpazMMpjOnpEthySjhhfKE_Lit3eMNkH27qke5-Syc=.c0a68c73-9fdc-4ba8-950a-9fead760abda@github.com> References: <_rpazMMpjOnpEthySjhhfKE_Lit3eMNkH27qke5-Syc=.c0a68c73-9fdc-4ba8-950a-9fead760abda@github.com> Message-ID: On Thu, 17 Apr 2025 11:24:56 GMT, Stefan Karlsson wrote: >> Joel Sikstr?m has updated the pull request incrementally with one additional commit since the last revision: >> >> Rename ZGC printing to match print_heap_on() and print_gc_on() > > src/hotspot/share/gc/parallel/parallelScavengeHeap.cpp line 680: > >> 678: } >> 679: st->cr(); >> 680: > > Below this line we still have a print_on_error call. I renamed the remaining instances of print_on_error() in GC code with alternative names, all the way down to BitMap::print_on_error() which I renamed to BitMap::print_range_on(). The only remaining print_on_error() is GCLogPrecious::print_on_error(), which I figured might be left unchanged. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24593#discussion_r2048817231 From stefank at openjdk.org Thu Apr 17 13:01:46 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Thu, 17 Apr 2025 13:01:46 GMT Subject: RFR: 8354362: Use automatic indentation in CollectedHeap printing [v7] In-Reply-To: References: Message-ID: On Thu, 17 Apr 2025 12:15:45 GMT, Joel Sikstr?m wrote: >> Hello, >> >>> This PR only focuses on fixing indentation and re-arranging some callsites. It does *not* change the contents of any output, apart from some (IMO relevant) indentation/whitespace additions. >> >> Currently, the CollectedHeap printing code (print_on and print_on_error, with calls "below") prepends spaces in messages in a way that only makes sense if you write the code and then check the output to see if you've done everything correctly. To make writing and maintaining printing code easy, I propose we move to a system where each printing method, starting at callers of print_on and print_on_error, uses the indentation API in outputStream and does not rely on prepending spaces like is done right now. >> >> What I propose is that any (GC) printing method should not make any assumptions of the indentation level of its caller(s). This means that each function shall: >> 1. Not prepend any spaces to its printing, and instead expect that the caller(s) should handle any indentation before calling this function. >> 2. Enforce its own indentation, by enabling auto indentation in its own context and for its "lower level" calls (which is often the desired outcome). 
>> >> Combining these two rules means that *any* (GC) printing method can be called from anywhere and give sensible output, without (seemingly random) indentation of expectations elsewhere. >> >> I have aggregated calls that print on the same indentation level to the same callsite. This makes it clear where to look in the code and also makes it easier to add/enforce indendation. To this end, I have re-arranged print_on_error so that it never includes print_on. The new system I propose is that print_on and print_on_error can be called separately for different information, which aligns well with having the same callsite for the same indentation. See changes in vmError.cpp for how this is implemented. >> >> Instead of prepending spaces, I use StreamAutoIndentor, defined in ostream.hpp. To make using automatic indentation easier, I've made some changes to StreamAutoIndentor so that it inherits from streamIndentor and also add an *optional* argument to StreamAutoIndentor to apply an indentation. My reasoning for this is that most places that use streamIndentor also want to use StreamAutoIndentor (either immediately or some time before) so that it is automatically applied. A downside of this change is that any previous uses of StreamAutoIndentor now also needs to store an extra int worth of memory. To me, th... > > Joel Sikstr?m has updated the pull request incrementally with one additional commit since the last revision: > > Rename print_on_error() in the remaining call-paths from print_gc_on() I think this looks good. Not yet a full review, thought, but just wanted to send out my +1 on the changes. I've added a couple of more suggestions. src/hotspot/share/gc/serial/tenuredGeneration.cpp line 449: > 447: > 448: StreamAutoIndentor indentor(st, 1); > 449: st->print("the "); _the_space->print_on(st); Suggestion: st->print("the "); _the_space->print_on(st); src/hotspot/share/gc/z/zPageAllocator.cpp line 1171: > 1169: } > 1170: > 1171: void ZPartition::print_extended_cache_on(outputStream* st) const { I would like to suggest that you flip the 'extended' and 'cache' words: Suggestion: void ZPartition::print_cache_extended_on(outputStream* st) const { So, that we have the structure: print__on print__extended_on ------------- PR Review: https://git.openjdk.org/jdk/pull/24593#pullrequestreview-2775538728 PR Review Comment: https://git.openjdk.org/jdk/pull/24593#discussion_r2048825842 PR Review Comment: https://git.openjdk.org/jdk/pull/24593#discussion_r2048839661 From sjohanss at openjdk.org Thu Apr 17 18:15:21 2025 From: sjohanss at openjdk.org (Stefan Johansson) Date: Thu, 17 Apr 2025 18:15:21 GMT Subject: RFR: 8354929: Update collection stats while holding page allocator lock [v2] In-Reply-To: References: Message-ID: > Please review this change to restructure some code in the mark start pause to do updates while holding the lock. > > **Summary** > We currently update the collection high and low used values during the mark start pause without taking the page allocator lock. This is fine since it is read atomically, but consistency verification in this code requires the lock to be held. We later in the pause take the lock to get the current statistics, this change moves the update code to also happen while holding the lock. > > I've renamed `reset_statistics()` to `update_collection_stats()` to better match what it actually does and made it private. 
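To make the restructuring concrete, a simplified sketch of the intended shape (the locker idiom and lock member follow the usual ZGC pattern, but the surrounding function is an assumption for illustration, not code from the change):

    void ZPageAllocator::mark_start_update_example(ZGenerationId id) {
      // Take the page allocator lock once, so that the consistency verification
      // and the used-high/used-low updates in update_collection_stats() both
      // observe the same _used value.
      ZLocker<ZLock> locker(&_lock);
      update_collection_stats(id);
      // ... gather the current statistics for the pause under the same lock ...
    }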
> > **Testing** > Mach5 tier1-5 Stefan Johansson has updated the pull request incrementally with two additional commits since the last revision: - Additional blank line Co-authored-by: Stefan Karlsson - Additional blank line Co-authored-by: Stefan Karlsson ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24719/files - new: https://git.openjdk.org/jdk/pull/24719/files/473bdff2..b8966d36 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24719&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24719&range=00-01 Stats: 2 lines in 1 file changed: 2 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/24719.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24719/head:pull/24719 PR: https://git.openjdk.org/jdk/pull/24719 From dlong at openjdk.org Thu Apr 17 19:40:57 2025 From: dlong at openjdk.org (Dean Long) Date: Thu, 17 Apr 2025 19:40:57 GMT Subject: RFR: 8354668: Missing REX2 prefix accounting in ZGC barriers leads to incorrect encoding [v3] In-Reply-To: References: Message-ID: On Thu, 17 Apr 2025 03:21:08 GMT, Jatin Bhateja wrote: >> ZGC bookkeeps multiple place holders in barrier code snippets through relocations, these are later used to patch appropriate contents (mostly immediate values) in instruction encoding. While most of the relocation records the patching offsets from the end of the instruction, SHL instruction, which is used for pointer coloring, computes the patching offset from the starting address of the instruction. >> >> Thus, in case the destination register operand of SHL instruction is an extended GPR register, we miss accounting additional REX2 prefix byte in patch offset, thereby corrupting the encoding since runtime patches the primary opcode byte resulting into ILLEGAL instruction exception. >> >> This patch fixes reported failures by computing the relocation offset of SHL instruction from end of instruction, thereby making the patch offset agnostic to REX/REX2 prefix. >> >> Please review and share your feedback. >> >> Best Regards, >> Jatin >> >> PS: Validation were performed using latest Intel Software Development Emulator after modifying static register allocation order in x86_64.ad file giving preference to EGPRs. > > Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision: > > Review comments resolutions When I made my suggestions, I didn't realize it would also require changes on the Graal side. So I would suggest a separate PR only if the Graal team agrees. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24664#issuecomment-2813856674 From manc at google.com Fri Apr 18 05:28:21 2025 From: manc at google.com (Man Cao) Date: Thu, 17 Apr 2025 22:28:21 -0700 Subject: Moving Forward with AHS for G1 Message-ID: >> Supporting both memory.high and memory.max for AHS sounds great. >> The soft limit for the custom container is only one example. The custom container also has "strange" use cases where the actual limit is larger than cgroup's hard memory limit. > Okay, great. Sounds like AHS + actually using the standardized cgroups memory limits as the way of limiting memory is a viable path forward then? Not exactly. It is still impractical to migrate the custom container cases to standard cgroups. Thus those custom container cases cannot use AHS. One reason is the "strange" use cases where the actual limit is larger than cgroup's hard memory limit. There are other reasons that the custom container cannot migrate to standard cgroups. 
> So the main point for introducing CurrentMaxHeapSize, as opposed to going directly to AHS, would be to support all the people out there that already built their own adaptive container infrastructure that doesn?t use industry standard cgroup technology to limit memory. Instead, this group of users use the very proposed CurrentMaxHeapSize functionality (which obviously does not exist in mainline yet) to limit memory adaptively instead. > I have to be honest? this sounds like a niche feature to me with a ticking clock attached to it. Yet if it gets integrated, we will not be able to get rid of it for decades and it will cost maintenance overheads along the way. So I think it would be good to see a prominent use case that might be interesting for a long time going forward as well, and not just a way to help you guys stop using the proposed feature in the transition to AHS, which seems to be where we are going. > I think what will reach a much broader audience going forward, is AHS. And if that?s the feature we really want, I can?t help but wonder if exposing this user configurable stuff along the way is helping towards that goal rather than slowing us down by inventing yet another set of manually set handcuffs that the JVM and AHS will have to respect for ages, way past its best before date. I'd say the statements above are "overfitting" CurrentMaxHeapSize to the custom container use case. The main point for the value of CurrentMaxHeapSize (or a high-precedence SoftMaxHeapSize) is as mentioned in the previous response : a fully-developed AHS is unlikely to satisfy all use cases and deployment environments out there. CurrentMaxHeapSize (or a high-precedence SoftMaxHeapSize) provides additional flexibility and control for AHS and for non-AHS use cases. The custom container and JVM-external algorithm for calculating CurrentMaxHeapSize/SoftMaxHeapSize is only one example of such use cases. I could think of other use cases for CurrentMaxHeapSize (or high-precedence SoftMaxHeapSize): 1. CRIU (OpenJDK CRaC) from [~rvansa]'s comment on https://bugs.openjdk.org/browse/JDK-8204088. This case needs to shrink the Java heap as much as possible before creating the process snapshot. CRaC has implemented https://bugs.openjdk.org/browse/JDK-8348650 for G1. This is almost the same as the use case for setting -XX:MinHeapFreeRatio=0 -XX:MaxHeapFreeRatio=0 mentioned previously in this thread . Min/MaxHeapFreeRatio only works for G1 and ParallelGC, and will likely stop working for G1 as https://bugs.openjdk.org/browse/JDK-8353716 says. 2. Multiple Java processes with different priorities. If multiple processes run inside the same container and memory is running low, users could set a smaller CurrentMaxHeapSize for low-priority processes, to make more memory available to high-priority processes. 3. Shrinking container memory limit dynamically. Directly setting container memory limit to below the container memory usage will likely fail. However, if user sets a smaller CurrentMaxHeapSize first, the Java process will shrink the heap, thus reducing container memory usage. Then lowering the memory limit will succeed. In addition, these use cases may not want to adopt AHS for various reasons. Instead, they could use CurrentMaxHeapSize/SoftMaxHeapSize to directly solve the problems. -Man -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From tschatzl at openjdk.org Fri Apr 18 08:32:41 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 18 Apr 2025 08:32:41 GMT Subject: RFR: 8354929: Update collection stats while holding page allocator lock [v2] In-Reply-To: References: Message-ID: On Thu, 17 Apr 2025 18:15:21 GMT, Stefan Johansson wrote: >> Please review this change to restructure some code in the mark start pause to do updates while holding the lock. >> >> **Summary** >> We currently update the collection high and low used values during the mark start pause without taking the page allocator lock. This is fine since it is read atomically, but consistency verification in this code requires the lock to be held. We later in the pause take the lock to get the current statistics, this change moves the update code to also happen while holding the lock. >> >> I've renamed `reset_statistics()` to `update_collection_stats()` to better match what it actually does and made it private. >> >> **Testing** >> Mach5 tier1-5 > > Stefan Johansson has updated the pull request incrementally with two additional commits since the last revision: > > - Additional blank line > > Co-authored-by: Stefan Karlsson > - Additional blank line > > Co-authored-by: Stefan Karlsson Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/24719#pullrequestreview-2778082113 From tschatzl at openjdk.org Fri Apr 18 09:33:48 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 18 Apr 2025 09:33:48 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v34] In-Reply-To: References: Message-ID: <3VD8WHNeCOwh3vgziKpuOctwd7CsOXM6uEVc1P6HSrg=.961011ff-9e7b-456d-bb70-f6ef89cc6735@github.com> > Hi all, > > please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. > > The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. > > ### Current situation > > With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. > > The main reason for the current barrier is how g1 implements concurrent refinement: > * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. > * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, > * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. 
> > These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: > > > // Filtering > if (region(@x.a) == region(y)) goto done; // same region check > if (y == null) goto done; // null value check > if (card(@x.a) == young_card) goto done; // write to young gen check > StoreLoad; // synchronize > if (card(@x.a) == dirty_card) goto done; > > *card(@x.a) = dirty > > // Card tracking > enqueue(card-address(@x.a)) into thread-local-dcq; > if (thread-local-dcq is not full) goto done; > > call runtime to move thread-local-dcq into dcqs > > done: > > > Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. > > The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. > > There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). > > The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching card tables. Mutators only work on the "primary" card table, refinement threads on a se... Thomas Schatzl has updated the pull request incrementally with two additional commits since the last revision: - * ayang review (part 2 - yield duration changes) - * ayang review (part 1) ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23739/files - new: https://git.openjdk.org/jdk/pull/23739/files/068d2a37..a3b2386d Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23739&range=33 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23739&range=32-33 Stats: 41 lines in 11 files changed: 1 ins; 11 del; 29 mod Patch: https://git.openjdk.org/jdk/pull/23739.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23739/head:pull/23739 PR: https://git.openjdk.org/jdk/pull/23739 From tschatzl at openjdk.org Fri Apr 18 09:46:41 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 18 Apr 2025 09:46:41 GMT Subject: RFR: 8346568: G1: Other time can be negative In-Reply-To: <0A-BDKTohMv3ziXO4LrtniptKNCWYvZZfVKMWAUK6iA=.7fbd372c-f2ed-417c-8517-073e0a9a5276@github.com> References: <0A-BDKTohMv3ziXO4LrtniptKNCWYvZZfVKMWAUK6iA=.7fbd372c-f2ed-417c-8517-073e0a9a5276@github.com> Message-ID: On Fri, 4 Apr 2025 18:00:21 GMT, Sangheon Kim wrote: > Other time described in this bug is displayed at G1GCPhaseTimes::print_other(total_measured_time - sum_of_sub_phases). And the value can be negative for 3 reasons. > 1. Different scope of measurement > - 3 variables is out of scope from total_measured_time. Those used for wait-root-region-scan, verify-before/after. > (_root_region_scan_wait_time_ms, _cur_verify_before_time_ms and _cur_verify_after_time_ms) > - Changed not to be included in sum_of_sub_phases. > - One may want to include them in total_measured_time but I think it is better to be addressed in a separate ticket. > 2. Duplicated measurement > - Initial and optional evacuation time include nmethod-cleanup-time, so separated them as we are already measuring them. As there is no public getter, just added cleanup time when those evacuation time are used internally. > 3. Pre Concurrent task execution time > - Sometimes the difference between the existing average time and pre-concurrent work is 2 digit milliseconds. 
Changed to measure exact time rather than accumulating the average value to sum_of_sub_phases and keep displaying concurrent tasks' average execution time. > > Testing: tier 1 ~ 5 Changes requested by tschatzl (Reviewer). src/hotspot/share/gc/g1/g1GCPhaseTimes.cpp line 411: > 409: > 410: double G1GCPhaseTimes::print_pre_evacuate_collection_set() const { > 411: const double pre_concurrent_start_ms = average_time_ms(ResetMarkingState) + Could this assignment be moved down to just before the use? src/hotspot/share/gc/g1/g1GCPhaseTimes.cpp line 425: > 423: // Concurrent tasks of ResetMarkingState and NoteStartOfMark are triggered during > 424: // young collection. However, their execution time are not included in _gc_pause_time_ms. > 425: if (pre_concurrent_start_ms > 0.0) { Since `pre_concurrent_start_ms` is now actually gathered, maybe print an extra line for it, with the `ResetMarkingState` and `NoteStartOfMark` log lines indented? I.e. something like: if (_cur_prepare_concurrent_task_time_ms > 0.0) { debug_time("Prepare Concurrent Start", _cur_prepare_concurrent_task_time_ms); debug_phase(_gc_par_phases[ResetMarkingState], 1); debug_phase(_gc_par_phases[NoteStartOfMark], 1); } ? Then we can also drop the calculation of the local `pre_concurrent_start_ms`. ------------- PR Review: https://git.openjdk.org/jdk/pull/24454#pullrequestreview-2778191624 PR Review Comment: https://git.openjdk.org/jdk/pull/24454#discussion_r2050415309 PR Review Comment: https://git.openjdk.org/jdk/pull/24454#discussion_r2050420949 From tschatzl at openjdk.org Fri Apr 18 10:08:52 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 18 Apr 2025 10:08:52 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v34] In-Reply-To: <3VD8WHNeCOwh3vgziKpuOctwd7CsOXM6uEVc1P6HSrg=.961011ff-9e7b-456d-bb70-f6ef89cc6735@github.com> References: <3VD8WHNeCOwh3vgziKpuOctwd7CsOXM6uEVc1P6HSrg=.961011ff-9e7b-456d-bb70-f6ef89cc6735@github.com> Message-ID: On Fri, 18 Apr 2025 09:33:48 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. >> >> The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. >> >> ### Current situation >> >> With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. >> >> The main reason for the current barrier is how g1 implements concurrent refinement: >> * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. >> * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, >> * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. 
>> >> These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: >> >> >> // Filtering >> if (region(@x.a) == region(y)) goto done; // same region check >> if (y == null) goto done; // null value check >> if (card(@x.a) == young_card) goto done; // write to young gen check >> StoreLoad; // synchronize >> if (card(@x.a) == dirty_card) goto done; >> >> *card(@x.a) = dirty >> >> // Card tracking >> enqueue(card-address(@x.a)) into thread-local-dcq; >> if (thread-local-dcq is not full) goto done; >> >> call runtime to move thread-local-dcq into dcqs >> >> done: >> >> >> Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. >> >> The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. >> >> There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). >> >> The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching c... > > Thomas Schatzl has updated the pull request incrementally with two additional commits since the last revision: > > - * ayang review (part 2 - yield duration changes) > - * ayang review (part 1) The current use of all filters in the barrier is intentional: there is additional work going on investigating that, and I did not want to anticipate it in this change. When implementing the current `gen_write_ref_array_post` code measurements showed that the current version is slightly better than your suggestion for most arrays (everything larger than a few elements). I may still decide to use your version for now and re-measure later. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23739#issuecomment-2815125500 From sangheki at openjdk.org Fri Apr 18 19:09:33 2025 From: sangheki at openjdk.org (Sangheon Kim) Date: Fri, 18 Apr 2025 19:09:33 GMT Subject: RFR: 8346568: G1: Other time can be negative [v2] In-Reply-To: <0A-BDKTohMv3ziXO4LrtniptKNCWYvZZfVKMWAUK6iA=.7fbd372c-f2ed-417c-8517-073e0a9a5276@github.com> References: <0A-BDKTohMv3ziXO4LrtniptKNCWYvZZfVKMWAUK6iA=.7fbd372c-f2ed-417c-8517-073e0a9a5276@github.com> Message-ID: > Other time described in this bug is displayed at G1GCPhaseTimes::print_other(total_measured_time - sum_of_sub_phases). And the value can be negative for 3 reasons. > 1. Different scope of measurement > - 3 variables is out of scope from total_measured_time. Those used for wait-root-region-scan, verify-before/after. > (_root_region_scan_wait_time_ms, _cur_verify_before_time_ms and _cur_verify_after_time_ms) > - Changed not to be included in sum_of_sub_phases. > - One may want to include them in total_measured_time but I think it is better to be addressed in a separate ticket. > 2. Duplicated measurement > - Initial and optional evacuation time include nmethod-cleanup-time, so separated them as we are already measuring them. As there is no public getter, just added cleanup time when those evacuation time are used internally. > 3. Pre Concurrent task execution time > - Sometimes the difference between the existing average time and pre-concurrent work is 2 digit milliseconds. 
Changed to measure exact time rather than accumulating the average value to sum_of_sub_phases and keep displaying concurrent tasks' average execution time. > > Testing: tier 1 ~ 5 Sangheon Kim has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: - Merge branch 'openjdk:master' into JDK-8346568-G1-negative-time - Separate measurement for cleanup ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24454/files - new: https://git.openjdk.org/jdk/pull/24454/files/1c1750fd..d5f6b641 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24454&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24454&range=00-01 Stats: 257042 lines in 1817 files changed: 57470 ins; 193153 del; 6419 mod Patch: https://git.openjdk.org/jdk/pull/24454.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24454/head:pull/24454 PR: https://git.openjdk.org/jdk/pull/24454 From lmesnik at openjdk.org Sat Apr 19 00:44:18 2025 From: lmesnik at openjdk.org (Leonid Mesnik) Date: Sat, 19 Apr 2025 00:44:18 GMT Subject: RFR: 8355069: Allocation::check_out_of_memory() should support CheckUnhandledOops mode Message-ID: The CheckUnhandledOops cause failure if JvmtiExport::post_resource_exhausted(...) is called in MemAllocator::Allocation::check_out_of_memory() The obj is null so it is not a real bug. I am fixing it to reduce noise for CheckUnhandledOops mode for jvmti tests execution. The vmTestbase/nsk/jvmti/ResourceExhausted/resexhausted002/TestDescription.java failed with -XX:+CheckUnhandledOops ------------- Commit messages: - 8355069 Changes: https://git.openjdk.org/jdk/pull/24766/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24766&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8355069 Stats: 4 lines in 1 file changed: 4 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/24766.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24766/head:pull/24766 PR: https://git.openjdk.org/jdk/pull/24766 From lmesnik at openjdk.org Sat Apr 19 02:25:33 2025 From: lmesnik at openjdk.org (Leonid Mesnik) Date: Sat, 19 Apr 2025 02:25:33 GMT Subject: RFR: 8355069: Allocation::check_out_of_memory() should support CheckUnhandledOops mode [v2] In-Reply-To: References: Message-ID: > The > CheckUnhandledOops > cause failure if JvmtiExport::post_resource_exhausted(...) > is called in > MemAllocator::Allocation::check_out_of_memory() > The obj is null so it is not a real bug. > > I am fixing it to reduce noise for CheckUnhandledOops mode for jvmti tests execution. 
> The vmTestbase/nsk/jvmti/ResourceExhausted/resexhausted002/TestDescription.java > failed with -XX:+CheckUnhandledOops Leonid Mesnik has updated the pull request incrementally with one additional commit since the last revision: typo fixes ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24766/files - new: https://git.openjdk.org/jdk/pull/24766/files/aa84af52..cb2904d7 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24766&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24766&range=00-01 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/24766.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24766/head:pull/24766 PR: https://git.openjdk.org/jdk/pull/24766 From sangheki at openjdk.org Sat Apr 19 05:08:26 2025 From: sangheki at openjdk.org (Sangheon Kim) Date: Sat, 19 Apr 2025 05:08:26 GMT Subject: RFR: 8346568: G1: Other time can be negative [v3] In-Reply-To: References: <0A-BDKTohMv3ziXO4LrtniptKNCWYvZZfVKMWAUK6iA=.7fbd372c-f2ed-417c-8517-073e0a9a5276@github.com> Message-ID: <483hE4M8lfm5sv4bpf9YfN0qim6OwODlXgZj9aLReso=.0bbaadf1-221c-4e9d-a16f-f2e86fffe17a@github.com> On Fri, 18 Apr 2025 09:38:33 GMT, Thomas Schatzl wrote: >> Sangheon Kim has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains two commits: >> >> - Review from Thomas >> - Separate measurement for cleanup > > src/hotspot/share/gc/g1/g1GCPhaseTimes.cpp line 425: > >> 423: // Concurrent tasks of ResetMarkingState and NoteStartOfMark are triggered during >> 424: // young collection. However, their execution time are not included in _gc_pause_time_ms. >> 425: if (pre_concurrent_start_ms > 0.0) { > > Since `pre_concurrent_start_ms` is now actually gathered, maybe print an extra line for it, with the `ResetMarkingState` and `NoteStartOfMark` log lines indented? > > I.e. something like: > > > if (_cur_prepare_concurrent_task_time_ms > 0.0) { > debug_time("Prepare Concurrent Start", _cur_prepare_concurrent_task_time_ms); > debug_phase(_gc_par_phases[ResetMarkingState], 1); > debug_phase(_gc_par_phases[NoteStartOfMark], 1); > } > > ? > > Then we can also drop the calculation of the local `pre_concurrent_start_ms`. Okay, this looks better. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24454#discussion_r2051391042 From sangheki at openjdk.org Sat Apr 19 05:08:26 2025 From: sangheki at openjdk.org (Sangheon Kim) Date: Sat, 19 Apr 2025 05:08:26 GMT Subject: RFR: 8346568: G1: Other time can be negative [v3] In-Reply-To: <0A-BDKTohMv3ziXO4LrtniptKNCWYvZZfVKMWAUK6iA=.7fbd372c-f2ed-417c-8517-073e0a9a5276@github.com> References: <0A-BDKTohMv3ziXO4LrtniptKNCWYvZZfVKMWAUK6iA=.7fbd372c-f2ed-417c-8517-073e0a9a5276@github.com> Message-ID: <1iumndO7Tu352QZf_8tPaSTlYqdRBtNVw7N_VHLj52E=.fc6d1856-8f3f-4e04-80ac-5b34dd3dbcb5@github.com> > Other time described in this bug is displayed at G1GCPhaseTimes::print_other(total_measured_time - sum_of_sub_phases). And the value can be negative for 3 reasons. > 1. Different scope of measurement > - 3 variables is out of scope from total_measured_time. Those used for wait-root-region-scan, verify-before/after. > (_root_region_scan_wait_time_ms, _cur_verify_before_time_ms and _cur_verify_after_time_ms) > - Changed not to be included in sum_of_sub_phases. > - One may want to include them in total_measured_time but I think it is better to be addressed in a separate ticket. > 2. 
Duplicated measurement > - Initial and optional evacuation time include nmethod-cleanup-time, so separated them as we are already measuring them. As there is no public getter, just added cleanup time when those evacuation time are used internally. > 3. Pre Concurrent task execution time > - Sometimes the difference between the existing average time and pre-concurrent work is 2 digit milliseconds. Changed to measure exact time rather than accumulating the average value to sum_of_sub_phases and keep displaying concurrent tasks' average execution time. > > Testing: tier 1 ~ 5 Sangheon Kim has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains two commits: - Review from Thomas - Separate measurement for cleanup ------------- Changes: https://git.openjdk.org/jdk/pull/24454/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24454&range=02 Stats: 68 lines in 4 files changed: 36 ins; 20 del; 12 mod Patch: https://git.openjdk.org/jdk/pull/24454.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24454/head:pull/24454 PR: https://git.openjdk.org/jdk/pull/24454 From gli at openjdk.org Sun Apr 20 11:15:42 2025 From: gli at openjdk.org (Guoxiong Li) Date: Sun, 20 Apr 2025 11:15:42 GMT Subject: RFR: 8354228: Parallel: Set correct minimum of InitialSurvivorRatio [v2] In-Reply-To: References: Message-ID: <6lDEcjgVR_AB4ZIAgX7oMHGdXzVGx52RB_EzOqJKqMg=.97d50c03-9113-4309-bd93-35b83d54f470@github.com> On Thu, 10 Apr 2025 11:59:52 GMT, Albert Mingkun Yang wrote: >> Updating the lower bound of InitialSurvivorRatio to match MinSurvivorRatio. The two removed test cases set conflicting Min and Intial SurvivorRatio, which, IMO, is an incorrect configuration, so I removed them. >> >> Test: tier1-7 > > Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: > > review src/hotspot/share/gc/parallel/parallelArguments.cpp line 78: > 76: } else { > 77: FLAG_SET_DEFAULT(InitialSurvivorRatio, MinSurvivorRatio); > 78: } If both `InitialSurvivorRatio` and `MinSurvivorRatio` are not set in command line and the condition `InitialSurvivorRatio < MinSurvivorRatio` is true, it seems the corresponding default/ergonomic values, we set before, are wrong. Should we guard this situation (such as printing an error message) to catch the bug in the previous code? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24556#discussion_r2051691657
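One way the suggested guard could look, as a sketch only (the logging channel and wording are assumptions, not a proposal from the PR):

    if (FLAG_IS_DEFAULT(InitialSurvivorRatio) && FLAG_IS_DEFAULT(MinSurvivorRatio) &&
        InitialSurvivorRatio < MinSurvivorRatio) {
      // Neither flag came from the command line, so a conflict here indicates a bug
      // in the earlier ergonomic defaults rather than bad user input.
      log_warning(gc, ergo)("Default InitialSurvivorRatio (%zu) is below MinSurvivorRatio (%zu)",
                            (size_t)InitialSurvivorRatio, (size_t)MinSurvivorRatio);
    }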