From zgu at openjdk.org Sat Feb 1 16:42:50 2025 From: zgu at openjdk.org (Zhengyu Gu) Date: Sat, 1 Feb 2025 16:42:50 GMT Subject: RFR: 8348171: Refactor GenerationCounters and its subclasses [v4] In-Reply-To: References: Message-ID: On Fri, 31 Jan 2025 12:30:30 GMT, Albert Mingkun Yang wrote: >> Simple refactoring of removing the use of `virtual` method and use concrete subclasses when needed. >> >> Test: tier1-5 > > Albert Mingkun Yang has updated the pull request incrementally with two additional commits since the last revision: > > - review > - * some more refactoring src/hotspot/share/gc/shenandoah/shenandoahMonitoringSupport.cpp line 39: > 37: GenerationCounters("Young", 0, 0, 0, (size_t)0, (size_t)0) {}; > 38: > 39: void update_all() { Shenandoah looks a bit odd now. @kdnilsen @wkemper and @ysramakrishna may want to take a look? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23209#discussion_r1938303987 From zgu at openjdk.org Sat Feb 1 16:47:52 2025 From: zgu at openjdk.org (Zhengyu Gu) Date: Sat, 1 Feb 2025 16:47:52 GMT Subject: RFR: 8348171: Refactor GenerationCounters and its subclasses [v4] In-Reply-To: References: Message-ID: On Fri, 31 Jan 2025 12:30:30 GMT, Albert Mingkun Yang wrote: >> Simple refactoring of removing the use of `virtual` method and use concrete subclasses when needed. >> >> Test: tier1-5 > > Albert Mingkun Yang has updated the pull request incrementally with two additional commits since the last revision: > > - review > - * some more refactoring `Shenandoah` code no longer aligns to others. Other than that, LGTM. ------------- Marked as reviewed by zgu (Reviewer). 
PR Review: https://git.openjdk.org/jdk/pull/23209#pullrequestreview-2588364961 From tschatzl at openjdk.org Mon Feb 3 09:25:50 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 3 Feb 2025 09:25:50 GMT Subject: RFR: 8348171: Refactor GenerationCounters and its subclasses [v4] In-Reply-To: References: Message-ID: <7k73_VjUmBq7-G2reVDOlB7-vSazUekr8Q3Ez3houa0=.61d2baf5-72c6-41cd-aa74-c49a5b5e9ce1@github.com> On Fri, 31 Jan 2025 12:30:30 GMT, Albert Mingkun Yang wrote: >> Simple refactoring of removing the use of `virtual` method and use concrete subclasses when needed. >> >> Test: tier1-5 > > Albert Mingkun Yang has updated the pull request incrementally with two additional commits since the last revision: > > - review > - * some more refactoring Lgtm. Related to @zhengyu123 's comment, not sure right now what is meant with "looking odd" here as the previous code did not update the counters either, but it might be useful to wait on Shenandoah team's input anyway. ------------- Marked as reviewed by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23209#pullrequestreview-2589340090 From tschatzl at openjdk.org Mon Feb 3 15:20:19 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 3 Feb 2025 15:20:19 GMT Subject: RFR: 8349213: G1: Clearing bitmaps during collection set merging not claimed by region Message-ID: Hi all, please review this change that makes (optional) bitmap clearing during merging remembered sets claim regions. Otherwise every thread will do the (currently little) work themselves over and over again. 
Testing: tier1-3 Thanks, Thomas ------------- Commit messages: - * move commenty - 8349213: G1: Clearing bitmaps during collection set merging not claimed by region Changes: https://git.openjdk.org/jdk/pull/23419/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23419&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8349213 Stats: 30 lines in 1 file changed: 8 ins; 20 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/23419.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23419/head:pull/23419 PR: https://git.openjdk.org/jdk/pull/23419 From mdoerr at openjdk.org Mon Feb 3 18:03:26 2025 From: mdoerr at openjdk.org (Martin Doerr) Date: Mon, 3 Feb 2025 18:03:26 GMT Subject: [jdk24] RFR: 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis Message-ID: Clean backport of [JDK-8348562](https://bugs.openjdk.org/browse/JDK-8348562). It only adds a null check + bailout where the current implementation crashes with SIGSEGV. ------------- Commit messages: - Backport afcc2b03afc77f730300e1d92471466d56ed75fb Changes: https://git.openjdk.org/jdk/pull/23422/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23422&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8348562 Stats: 3 lines in 1 file changed: 2 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/23422.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23422/head:pull/23422 PR: https://git.openjdk.org/jdk/pull/23422 From mdoerr at openjdk.org Mon Feb 3 18:18:48 2025 From: mdoerr at openjdk.org (Martin Doerr) Date: Mon, 3 Feb 2025 18:18:48 GMT Subject: [jdk24] RFR: 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis In-Reply-To: References: Message-ID: <4iwi6PbCt7B6GE731aOGDZsEl9KiT2ZERf-r7JUjiq8=.6fc8933b-4718-4a48-9064-ca205bc25630@github.com> On Mon, 3 Feb 2025 17:56:23 GMT, Martin Doerr wrote: > Clean backport of [JDK-8348562](https://bugs.openjdk.org/browse/JDK-8348562). 
It only adds a null check + bailout where the current implementation crashes with SIGSEGV. @TobiHartmann This is the jdk24 backport. Please take a look. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23422#issuecomment-2631726161 From kvn at openjdk.org Mon Feb 3 18:18:47 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Mon, 3 Feb 2025 18:18:47 GMT Subject: [jdk24] RFR: 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 17:56:23 GMT, Martin Doerr wrote: > Clean backport of [JDK-8348562](https://bugs.openjdk.org/browse/JDK-8348562). It only adds a null check + bailout where the current implementation crashes with SIGSEGV. The fix was requested for JDK 24 update (jdk24u repository) not for JDK 24 branch which this change is based on (if I see this correctly). ------------- Changes requested by kvn (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23422#pullrequestreview-2590672470 From mdoerr at openjdk.org Mon Feb 3 18:22:46 2025 From: mdoerr at openjdk.org (Martin Doerr) Date: Mon, 3 Feb 2025 18:22:46 GMT Subject: [jdk24] RFR: 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 18:15:53 GMT, Vladimir Kozlov wrote: >> Clean backport of [JDK-8348562](https://bugs.openjdk.org/browse/JDK-8348562). It only adds a null check + bailout where the current implementation crashes with SIGSEGV. > > The fix was requested for JDK 24 update (jdk24u repository) not for JDK 24 branch which this change is based on (if I see this correctly). @vnkozlov I had originally targeted 24u, but Tobias has reclassified it as P2, so this is the new PR. I will close the other one if this one gets approved. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23422#issuecomment-2631736130 From kvn at openjdk.org Mon Feb 3 18:22:46 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Mon, 3 Feb 2025 18:22:46 GMT Subject: [jdk24] RFR: 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 18:19:25 GMT, Martin Doerr wrote: >> The fix was requested for JDK 24 update (jdk24u repository) not for JDK 24 branch which this change is based on (if I see this correctly). > > @vnkozlov I had originally targeted 24u, but Tobias has reclassified it as P2, so this is the new PR. I will close the other one if this one gets approved. @TheRealMDoerr you need new Fix request and approval for JDK 24 in bug report. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23422#issuecomment-2631739024 From mdoerr at openjdk.org Mon Feb 3 18:37:46 2025 From: mdoerr at openjdk.org (Martin Doerr) Date: Mon, 3 Feb 2025 18:37:46 GMT Subject: [jdk24] RFR: 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 18:19:25 GMT, Martin Doerr wrote: >> The fix was requested for JDK 24 update (jdk24u repository) not for JDK 24 branch which this change is based on (if I see this correctly). > > @vnkozlov I had originally targeted 24u, but Tobias has reclassified it as P2, so this is the new PR. I will close the other one if this one gets approved. > @TheRealMDoerr you need new Fix request and approval for JDK 24 in bug report. JDK24 requires a review instead of a maintainer approval. See Skara messages above. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23422#issuecomment-2631767998 From kvn at openjdk.org Mon Feb 3 18:59:50 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Mon, 3 Feb 2025 18:59:50 GMT Subject: [jdk24] RFR: 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 18:35:22 GMT, Martin Doerr wrote: > > @TheRealMDoerr you need new Fix request and approval for JDK 24 in bug report. > > JDK24 requires a review instead of a maintainer approval. See Skara messages above. We are in RDP 2 phase - you need approval for fixes there: https://openjdk.org/jeps/3#rdp-2 ------------- PR Comment: https://git.openjdk.org/jdk/pull/23422#issuecomment-2631811256 From kvn at openjdk.org Mon Feb 3 19:07:08 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Mon, 3 Feb 2025 19:07:08 GMT Subject: [jdk24] RFR: 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 17:56:23 GMT, Martin Doerr wrote: > Clean backport of [JDK-8348562](https://bugs.openjdk.org/browse/JDK-8348562). It only adds a null check + bailout where the current implementation crashes with SIGSEGV. To clarify. It is different approval from approval for update release. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23422#issuecomment-2631817357 From kvn at openjdk.org Mon Feb 3 19:07:08 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Mon, 3 Feb 2025 19:07:08 GMT Subject: [jdk24] RFR: 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 19:02:25 GMT, Martin Doerr wrote: > Ok. Thanks! I've created the approval request manually. Skara doesn't support it. Yes, it is manual process - you need to add label and comment to main bug report. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23422#issuecomment-2631830611 From mdoerr at openjdk.org Mon Feb 3 19:07:08 2025 From: mdoerr at openjdk.org (Martin Doerr) Date: Mon, 3 Feb 2025 19:07:08 GMT Subject: [jdk24] RFR: 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 17:56:23 GMT, Martin Doerr wrote: > Clean backport of [JDK-8348562](https://bugs.openjdk.org/browse/JDK-8348562). It only adds a null check + bailout where the current implementation crashes with SIGSEGV. Ok. Thanks! I've created the approval request manually. Skara doesn't support it. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23422#issuecomment-2631827041 From kvn at openjdk.org Mon Feb 3 19:09:51 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Mon, 3 Feb 2025 19:09:51 GMT Subject: [jdk24] RFR: 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 19:02:25 GMT, Martin Doerr wrote: >> Clean backport of [JDK-8348562](https://bugs.openjdk.org/browse/JDK-8348562). It only adds a null check + bailout where the current implementation crashes with SIGSEGV. > > Ok. Thanks! I've created the approval request manually. Skara doesn't support it. @TheRealMDoerr Please, add fix request comment too. You can copy jdk24u request. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23422#issuecomment-2631837256 From kvn at openjdk.org Mon Feb 3 19:22:45 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Mon, 3 Feb 2025 19:22:45 GMT Subject: [jdk24] RFR: 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 17:56:23 GMT, Martin Doerr wrote: > Clean backport of [JDK-8348562](https://bugs.openjdk.org/browse/JDK-8348562). 
It only adds a null check + bailout where the current implementation crashes with SIGSEGV. Looks good. Good. I approved request as Area Lead. Formalities are done ;^) Now we can review and integrate this into JDK 24. ------------- Marked as reviewed by kvn (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23422#pullrequestreview-2590805301 PR Comment: https://git.openjdk.org/jdk/pull/23422#issuecomment-2631867578 From wkemper at openjdk.org Mon Feb 3 20:36:09 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 3 Feb 2025 20:36:09 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 Message-ID: Non-java threads were not having their gc-state configured when they attach. If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. ------------- Commit messages: - Set gc state for all attached threads (not just java threads). Changes: https://git.openjdk.org/jdk/pull/23428/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23428&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8348268 Stats: 4 lines in 1 file changed: 3 ins; 1 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23428.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23428/head:pull/23428 PR: https://git.openjdk.org/jdk/pull/23428 From wkemper at openjdk.org Mon Feb 3 22:34:13 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 3 Feb 2025 22:34:13 GMT Subject: [jdk24] RFR: 8349002: GenShen: Deadlock during shutdown Message-ID: <6a_6G1g93RUACyYcHG5B9HtqFBaqdETRRdhvFWwrfi8=.e88c6252-f6c4-4e3f-972a-2c4495d27127@github.com> Clean backport. Fixes bug introduced by [JDK-8345970](https://bugs.openjdk.org/browse/JDK-8345970). 
------------- Commit messages: - Backport 06ebb170bac3879dc1e378b48b1c7ef006070c86 Changes: https://git.openjdk.org/jdk/pull/23429/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23429&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8349002 Stats: 5 lines in 2 files changed: 4 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/23429.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23429/head:pull/23429 PR: https://git.openjdk.org/jdk/pull/23429 From kvn at openjdk.org Mon Feb 3 23:51:17 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Mon, 3 Feb 2025 23:51:17 GMT Subject: [jdk24] RFR: 8349002: GenShen: Deadlock during shutdown In-Reply-To: <6a_6G1g93RUACyYcHG5B9HtqFBaqdETRRdhvFWwrfi8=.e88c6252-f6c4-4e3f-972a-2c4495d27127@github.com> References: <6a_6G1g93RUACyYcHG5B9HtqFBaqdETRRdhvFWwrfi8=.e88c6252-f6c4-4e3f-972a-2c4495d27127@github.com> Message-ID: On Mon, 3 Feb 2025 22:27:44 GMT, William Kemper wrote: > Clean backport. Fixes bug introduced by [JDK-8345970](https://bugs.openjdk.org/browse/JDK-8345970). We are in RDP2 phase of JDK 24 release. Only P1 and P2 are allowed to be pushed with approval: https://openjdk.org/jeps/3#rdp-2 Consider backporting the fix into JDK 24 Update release. ------------- Changes requested by kvn (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23429#pullrequestreview-2591384756 From ayang at openjdk.org Tue Feb 4 09:22:13 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 4 Feb 2025 09:22:13 GMT Subject: RFR: 8349213: G1: Clearing bitmaps during collection set merging not claimed by region In-Reply-To: References: Message-ID: <5YSfBhp40MOFgK-EbKrg1vY-X6ZuKHXmcnFi40hQp54=.2021c2c8-c22a-485c-b987-681e2e032f86@github.com> On Mon, 3 Feb 2025 14:11:20 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that makes (optional) bitmap clearing during merging remembered sets claim regions. Otherwise every thread will do the (currently little) work themselves over and over again. 
> > Testing: tier1-3 > > Thanks, > Thomas src/hotspot/share/gc/g1/g1RemSet.cpp line 1390: > 1388: g1h->collection_set_iterate_increment_from(&merge, worker_id); > 1389: for (uint i = 0; i < G1GCPhaseTimes::MergeRSContainersSentinel; i++) { > 1390: p->record_or_add_thread_work_item(merge_remset_phase, worker_id, merge.stats().merged(i), i); `stats()` has side-effect; should be invoked only once. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23419#discussion_r1940791948 From tschatzl at openjdk.org Tue Feb 4 09:50:16 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 4 Feb 2025 09:50:16 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME In-Reply-To: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Thu, 30 Jan 2025 12:12:29 GMT, Albert Mingkun Yang wrote: > Here is an attempt to simplify GCLocker implementation for Serial and Parallel. > > GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. > > The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. 
The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. > > Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. > > Test: tier1-8 * Idk if GCLocker JFR events need to be available in metadata.xml if the VM does not actually ever send it. I think it does not. Maybe it is used to decode from old recordings, may be worth asking e.g. @egahlin . * the bot shows a failure that this PR's CR number shows up in the problemlist, that line needs to be deleted as well. Further it would be interesting to see how many retries there are in the allocation loop with these jnilock* stress test. * another issue, probably todo is that while Parallel GC has the emergency bailout via GC Overhead limit after excessive retries, Serial does not. Which means that it might retry for a long time, which isn't good (while it did earlier if the number of retries due to gclocker exceed that threshold) src/hotspot/share/gc/parallel/parallelScavengeHeap.cpp line 323: > 321: } > 322: > 323: if (result == nullptr) { pre-existing: is it actually possible that `result` is not `nullptr` here? The code above always returns with a non-null result. Maybe assert this instead. 
src/hotspot/share/gc/shared/gcLocker.cpp line 86: > 84: void GCLocker::block() { > 85: assert(_lock->is_locked(), "precondition"); > 86: assert(Atomic::load(&_is_gc_request_pending) == false, "precondition"); Suggestion: assert(!Atomic::load(&_is_gc_request_pending), "precondition"); src/hotspot/share/gc/shared/gcLocker.cpp line 106: > 104: > 105: #ifdef ASSERT > 106: // Matching the storestore in GCLocker::exit Suggestion: // Matching the storestore in GCLocker::exit. src/hotspot/share/gc/shared/gcLocker.cpp line 114: > 112: void GCLocker::unblock() { > 113: assert(_lock->is_locked(), "precondition"); > 114: assert(Atomic::load(&_is_gc_request_pending) == true, "precondition"); Suggestion: assert(Atomic::load(&_is_gc_request_pending), "precondition"); src/hotspot/share/gc/shared/gcLocker.hpp line 31: > 29: #include "memory/allStatic.hpp" > 30: #include "runtime/mutex.hpp" > 31: Documentation how GCLocker works/is supposed to work is missing here. It's not exactly trivial. src/hotspot/share/gc/shared/gcLocker.hpp line 33: > 31: > 32: class GCLocker: public AllStatic { > 33: static Monitor* _lock; Not sure if having this copy/reference to `Heap_lock` makes the code more clear than referencing `Heap_lock` directly. It needs to be `Heap_lock` anyway. src/hotspot/share/gc/shared/gcLocker.hpp line 37: > 35: > 36: #ifdef ASSERT > 37: static uint64_t _debug_count; Maybe the variable could be named something less generic, indicating what it is counting. Or add a comment. src/hotspot/share/gc/shared/gcLocker.inline.hpp line 40: > 38: if (Atomic::load(&_is_gc_request_pending)) { > 39: thread->exit_critical(); > 40: // slow-path Suggestion: Not sure what this `slow-path` comment helps with. Maybe it is describing the next method (but it is named very similarly), or this is an attempt to describe the true-block of the if. 
In the latter case, it would maybe be better to put this comment at the start of the true-block of the if, and say something more descriptive like `// Another thread is requesting gc, enter slow path.` Not sure, feel free to ignore, it's just that to me the comment should either be removed or put upwards a line. src/hotspot/share/gc/shared/gcLocker.inline.hpp line 56: > 54: if (thread->in_last_critical()) { > 55: Atomic::add(&_debug_count, (uint64_t)-1); > 56: // Matching the loadload in GCLocker::block Suggestion: // Matching the loadload in GCLocker::block. src/hotspot/share/gc/shared/gcTraceSend.cpp line 364: > 362: #if INCLUDE_JFR > 363: > 364: #endif Please remove this empty `#if/#endif` block. src/hotspot/share/gc/shared/gc_globals.hpp line 162: > 160: "blocked by the GC locker") \ > 161: range(0, max_uintx) \ > 162: \ This removal should warrant a release note; while it's a diagnostic option and we can remove at a whim, it is in use to workaround issues. src/hotspot/share/prims/whitebox.cpp line 48: > 46: #include "gc/shared/concurrentGCBreakpoints.hpp" > 47: #include "gc/shared/gcConfig.hpp" > 48: #include "gc/shared/gcLocker.hpp" Suggestion: The file does not seem to use the `GCLocker` class anymore, please remove this line as well. ------------- Changes requested by tschatzl (Reviewer). 
PR Review: https://git.openjdk.org/jdk/pull/23367#pullrequestreview-2592106484 PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1940732531 PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1940775211 PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1940813063 PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1940779840 PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1940770235 PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1940769765 PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1940796501 PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1940793704 PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1940812598 PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1940746077 PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1940748992 PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1940752118 From tschatzl at openjdk.org Tue Feb 4 10:35:44 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 4 Feb 2025 10:35:44 GMT Subject: RFR: 8349213: G1: Clearing bitmaps during collection set merging not claimed by region [v2] In-Reply-To: References: Message-ID: > Hi all, > > please review this change that makes (optional) bitmap clearing during merging remembered sets claim regions. Otherwise every thread will do the (currently little) work themselves over and over again. 
> > Testing: tier1-3 > > Thanks, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: * ayang review ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23419/files - new: https://git.openjdk.org/jdk/pull/23419/files/ba3a9ec7..a73b3b34 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23419&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23419&range=00-01 Stats: 4 lines in 1 file changed: 2 ins; 1 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/23419.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23419/head:pull/23419 PR: https://git.openjdk.org/jdk/pull/23419 From tschatzl at openjdk.org Tue Feb 4 10:35:44 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 4 Feb 2025 10:35:44 GMT Subject: RFR: 8349213: G1: Clearing bitmaps during collection set merging not claimed by region [v2] In-Reply-To: <5YSfBhp40MOFgK-EbKrg1vY-X6ZuKHXmcnFi40hQp54=.2021c2c8-c22a-485c-b987-681e2e032f86@github.com> References: <5YSfBhp40MOFgK-EbKrg1vY-X6ZuKHXmcnFi40hQp54=.2021c2c8-c22a-485c-b987-681e2e032f86@github.com> Message-ID: On Tue, 4 Feb 2025 09:19:04 GMT, Albert Mingkun Yang wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> * ayang review > > src/hotspot/share/gc/g1/g1RemSet.cpp line 1390: > >> 1388: g1h->collection_set_iterate_increment_from(&merge, worker_id); >> 1389: for (uint i = 0; i < G1GCPhaseTimes::MergeRSContainersSentinel; i++) { >> 1390: p->record_or_add_thread_work_item(merge_remset_phase, worker_id, merge.stats().merged(i), i); > > `stats()` has side-effect; should be invoked only once. Nice catch! Fixed. 
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23419#discussion_r1940911005 From ayang at openjdk.org Tue Feb 4 10:58:09 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 4 Feb 2025 10:58:09 GMT Subject: RFR: 8349213: G1: Clearing bitmaps during collection set merging not claimed by region [v2] In-Reply-To: References: Message-ID: On Tue, 4 Feb 2025 10:35:44 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this change that makes (optional) bitmap clearing during merging remembered sets claim regions. Otherwise every thread will do the (currently little) work themselves over and over again. >> >> Testing: tier1-3 >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > * ayang review Marked as reviewed by ayang (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23419#pullrequestreview-2592457802 From thartmann at openjdk.org Tue Feb 4 12:09:10 2025 From: thartmann at openjdk.org (Tobias Hartmann) Date: Tue, 4 Feb 2025 12:09:10 GMT Subject: [jdk24] RFR: 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis In-Reply-To: References: Message-ID: <5LRr8GRO6lq8Y2c8cYr91PMm9XoJ5sq3m_1NJhGeOWE=.384cd75d-810f-47a7-a80f-c09e8f04619b@github.com> On Mon, 3 Feb 2025 17:56:23 GMT, Martin Doerr wrote: > Clean backport of [JDK-8348562](https://bugs.openjdk.org/browse/JDK-8348562). It only adds a null check + bailout where the current implementation crashes with SIGSEGV. Marked as reviewed by thartmann (Reviewer). Looks good to me too. 
------------- PR Review: https://git.openjdk.org/jdk/pull/23422#pullrequestreview-2592626045 PR Comment: https://git.openjdk.org/jdk/pull/23422#issuecomment-2633708939 From mdoerr at openjdk.org Tue Feb 4 13:13:19 2025 From: mdoerr at openjdk.org (Martin Doerr) Date: Tue, 4 Feb 2025 13:13:19 GMT Subject: [jdk24] RFR: 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 17:56:23 GMT, Martin Doerr wrote: > Clean backport of [JDK-8348562](https://bugs.openjdk.org/browse/JDK-8348562). It only adds a null check + bailout where the current implementation crashes with SIGSEGV. Thanks for the reviews and for the assistance! ------------- PR Comment: https://git.openjdk.org/jdk/pull/23422#issuecomment-2633855619 From mdoerr at openjdk.org Tue Feb 4 13:13:19 2025 From: mdoerr at openjdk.org (Martin Doerr) Date: Tue, 4 Feb 2025 13:13:19 GMT Subject: [jdk24] Integrated: 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 17:56:23 GMT, Martin Doerr wrote: > Clean backport of [JDK-8348562](https://bugs.openjdk.org/browse/JDK-8348562). It only adds a null check + bailout where the current implementation crashes with SIGSEGV. This pull request has now been integrated. 
Changeset: b1659e34 Author: Martin Doerr URL: https://git.openjdk.org/jdk/commit/b1659e345afa7d660e832f0d8ce48707ac99e824 Stats: 3 lines in 1 file changed: 2 ins; 0 del; 1 mod 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis Reviewed-by: kvn, thartmann Backport-of: afcc2b03afc77f730300e1d92471466d56ed75fb ------------- PR: https://git.openjdk.org/jdk/pull/23422 From phh at openjdk.org Tue Feb 4 15:53:13 2025 From: phh at openjdk.org (Paul Hohensee) Date: Tue, 4 Feb 2025 15:53:13 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check In-Reply-To: References: Message-ID: On Fri, 24 Jan 2025 18:30:02 GMT, Kelvin Nilsen wrote: > At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. > > For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. > > This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. src/hotspot/share/gc/shenandoah/shenandoahMetrics.cpp line 52: > 50: size_t free_actual = free_set->available(); > 51: // The sum of free_set->capacity() and ->reserved represents capacity of young in generational, heap in non-generational. > 52: size_t free_expected = ((free_set->capacity() + free_set->reserved()) / 100) * ShenandoahCriticalFreeThreshold; As an outsider, the units involved and what exactly is being calculated is pretty opaque. Why would we divide by 100 to compute free_expected and not do the same for free_actual? Do we care about integer division truncation? 
The default value of ShenandoahCriticalFreeThreshold is 1, so multiplying by it is a nop by default, which seems strange. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1941436272 From shade at openjdk.org Tue Feb 4 16:05:10 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Tue, 4 Feb 2025 16:05:10 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 20:28:58 GMT, William Kemper wrote: > Non-java threads were not having their gc-state configured when they attach. If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. It looks generally okay, but I am confused how this fixes a bad state in `C2 CompilerThread1`, since compiler threads are Java threads? https://github.com/openjdk/jdk/blob/beb43e2633900bb9ab3c975376fe5860b6d054e0/src/hotspot/share/compiler/compilerThread.hpp#L42 ------------- PR Comment: https://git.openjdk.org/jdk/pull/23428#issuecomment-2634411610 From phh at openjdk.org Tue Feb 4 16:19:20 2025 From: phh at openjdk.org (Paul Hohensee) Date: Tue, 4 Feb 2025 16:19:20 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v2] In-Reply-To: References: Message-ID: On Mon, 27 Jan 2025 02:05:02 GMT, Kelvin Nilsen wrote: >> Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. >> >> We have observed that it is common for degenerated GC cycles to cascade upon each other. 
The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. >> >> As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catchup. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. >> >> This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. > > Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: > > Respond to reviewer feedback src/hotspot/share/gc/shenandoah/heuristics/shenandoahAdaptiveHeuristics.cpp line 318: > 316: > 317: if (ShenandoahHeuristics::should_start_gc()) { > 318: _start_gc_is_pending = true; I assume there's no race here, i.e., only one thread reads/writes _start_gc_is_pending. If there's a race, make sure it's benign. In either case, _start_gc_is_pending is made "sticky" by this code. 
src/hotspot/share/gc/shenandoah/heuristics/shenandoahHeuristics.cpp line 261: > 259: > 260: void ShenandoahHeuristics::record_success_concurrent() { > 261: _start_gc_is_pending = false; The name _start_gc_is_pending implies that it should be set false as soon as a gc cycle starts, not when it finishes. Maybe _gc_pending? Or maybe setting it false at the end of a gc cycle is a bug? :) src/hotspot/share/gc/shenandoah/heuristics/shenandoahHeuristics.hpp line 87: > 85: size_t _declined_trigger_count; // This counts how many times since previous GC finished that this > 86: // heuristic has answered false to should_start_gc(). > 87: size_t _previous_trigger_declinations; // This represents the value of _declined_trigger_count as captured at the Maybe the name should be _most_recent_declined_trigger_count, which relates it directly to _declined_trigger_count. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23305#discussion_r1941486248 PR Review Comment: https://git.openjdk.org/jdk/pull/23305#discussion_r1941462312 PR Review Comment: https://git.openjdk.org/jdk/pull/23305#discussion_r1941468695 From egahlin at openjdk.org Tue Feb 4 16:56:17 2025 From: egahlin at openjdk.org (Erik Gahlin) Date: Tue, 4 Feb 2025 16:56:17 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Tue, 4 Feb 2025 09:47:20 GMT, Thomas Schatzl wrote: > * Idk if GCLocker JFR events need to be available in metadata.xml if the VM does not actually ever send it. I think it does not. > Maybe it is used to decode from old recordings, may be worth asking e.g. @egahlin . If the event is not used and the metric is not interesting to have anymore, remove it from metadata.xml, default.jfc, profile.jfc, EventNames.java and delete the TestGCLockerEvent.java file. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23367#issuecomment-2634538626 From wkemper at openjdk.org Tue Feb 4 17:25:20 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 4 Feb 2025 17:25:20 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 In-Reply-To: References: Message-ID: On Tue, 4 Feb 2025 16:03:02 GMT, Aleksey Shipilev wrote: >> Non-java threads were not having their gc-state configured when they attach. If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. > > It looks generally okay, but I am confused how this fixes a bad state in `C2 CompilerThread1`, since compiler threads are Java threads? https://github.com/openjdk/jdk/blob/beb43e2633900bb9ab3c975376fe5860b6d054e0/src/hotspot/share/compiler/compilerThread.hpp#L42 @shipilev , that is a good point. Will take a closer look. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23428#issuecomment-2634609075 From rcastanedalo at openjdk.org Wed Feb 5 12:38:06 2025 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Wed, 5 Feb 2025 12:38:06 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v4] In-Reply-To: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> Message-ID: <-aHCYC9iVc4eMZ3pMfiDpqaW-wGM_s3zRMiVBWoadCM=.910336cd-3be2-45b5-9874-63b71abf38f8@github.com> > G1 barriers can be safely elided from writes to newly allocated objects as long as no safepoint is taken between the allocation and the write. 
This changeset complements early G1 barrier elision (performed by the platform-independent phases of C2, and limited to writes immediately following allocations) with a more general elision pass done at a late stage. > > The late elision pass exploits that it runs at a stage where the relative order of memory accesses and safepoints cannot change anymore to elide barriers from initialization writes that do not immediately follow the corresponding allocation, e.g. in conditional initialization writes: > > > o = new MyObject(); > if (...) { > o.myField = ...; // barrier elided only after this changeset > // (assuming no safepoint in the if condition) > } > > > or in initialization writes placed after exception-throwing checks: > > > o = new MyObject(); > if (...) { > throw new Exception(""); > } > o.myField = ...; // barrier elided only after this changeset > // (assuming no safepoint in the above if condition) > > > These patterns are commonly found in Java code, e.g. in the core libraries: > > - [conditional initialization](https://github.com/openjdk/jdk/blob/25fecaaf87400af535c242fe50296f1f89ceeb16/src/java.base/share/classes/java/lang/String.java#L4850), or > > - [initialization after exception-throwing checks (in the superclass constructor)](https://github.com/openjdk/jdk/blob/master/src/java.base/share/classes/java/nio/X-Buffer.java.template#L324). 
> > The optimization also enhances barrier elision for array initialization writes, for example eliding barriers from small array initialization loops (for which safepoints are not inserted): > > > Object[] a = new Object[...]; > for (int i = 0; i < a.length; i++) { > a[i] = ...; // barrier elided only after this changeset > } > > > or eliding barriers from array initialization writes with unknown array index: > > > Object[] a = new Object[...]; > a[index] = ...; // barrier elided only after this changeset > > > The logic used to perform this additional barrier elision is a subset of a pre-existing ZGC-specific optimization. This changeset simply reuses the relevant subset (barrier elision for writes to newly-allocated objects) by extracting the core of the optimization logic from `zBarrierSetC2.cpp` into the GC-shared file `barrierSetC2.cpp`. The functions `block_has_safepoint`, `block_index`, `look_through_node`, `is_{undefined|unknown|concrete}`, `get_base_and_offset`, `is_array... Roberto Casta?eda Lozano has updated the pull request incrementally with two additional commits since the last revision: - Add some more tests to exercise barrier elision for atomic operations - Elide barriers from atomic operations on newly allocated objects as well ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23235/files - new: https://git.openjdk.org/jdk/pull/23235/files/3d154fa8..621a61cf Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23235&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23235&range=02-03 Stats: 174 lines in 2 files changed: 167 ins; 0 del; 7 mod Patch: https://git.openjdk.org/jdk/pull/23235.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23235/head:pull/23235 PR: https://git.openjdk.org/jdk/pull/23235 From rcastanedalo at openjdk.org Wed Feb 5 12:42:15 2025 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Wed, 5 Feb 2025 12:42:15 GMT Subject: RFR: 8346280: C2: implement late 
barrier elision for G1 [v2] In-Reply-To: References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> Message-ID: On Fri, 31 Jan 2025 14:06:16 GMT, Roberto Casta?eda Lozano wrote: > > One question about elision for atomics. > > Otherwise it seems good afaict, although a large part was checking that the code movement is/was correct. > > Thanks for reviewing Thomas! Please let me know whether you want me to extend this changeset to elide barriers on atomic operations (happy to do so). @tschatzl I did extend the changeset now to also elide barriers on atomic operations, as discussed offline. Please have a look again. @offamitkumar @TheRealMDoerr @RealFYang @snazarkin you might want to re-test the changeset on your respective platforms. Thanks! ------------- PR Comment: https://git.openjdk.org/jdk/pull/23235#issuecomment-2636669816 From iwalulya at openjdk.org Wed Feb 5 13:37:52 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Wed, 5 Feb 2025 13:37:52 GMT Subject: RFR: 8343782: G1: Use one G1CardSet instance for multiple old gen regions [v11] In-Reply-To: References: Message-ID: > Hi all, > > Please review this change to assign multiple collection candidate regions to a single instance of a G1CardSet. Currently, we maintain a 1:1 mapping of old-gen regions and G1CardSet instances, assuming these regions are collected independently. However, regions are collected in batches for performance reasons to meet the G1MixedGCCountTarget. > > In this change, at the end of the Remark phase, we batch regions that we anticipate will be collected together into a collection group while selecting remembered set rebuild candidates. Regions in a collection group should be evacuated at the same time because they are assigned to the same G1CardSet instances. This implies that we do not need to maintain cross-region remembered set entries for regions within the same collection group. 
> > The benefit is a reduction in the memory overhead of the remembered set and the remembered set merge time during the collection pause. One disadvantage is that this approach decreases the flexibility during evacuation: you can only evacuate all regions that share a particular G1CardSet at the same time. Another downside is that pinned regions that are part of a collection group have to be partially evacuated when the collection group is selected for evacuation. This removes the optimization in the mainline implementation where the pinned regions are skipped to allow for potential unpinning before evacuation. > > In this change, we make significant changes to the collection set implementation as we switch to group selection instead of region selection. Consequently, many of the changes in the PR are about switching from region-centered collection set selection to a group-centered approach. > > Note: The batching is based on the sort order by reclaimable bytes which may change the evacuation order in which regions would have been evacuated when sorted by gc efficiency. > > We have not observed any regressions on internal performance testing platforms. Memory comparisons for the Cachestress benchmark for different heap sizes are attached below. > > Testing: Mach5 Tier1-6 > > ![16GB](https://github.com/user-attachments/assets/3224c2f1-172d-4d76-ba28-bf483b1b1c95) > ![32G](https://github.com/user-attachments/assets/abd10537-41a9-4cf9-b668-362af12fe949) > ![64GB](https://github.com/user-attachments/assets/fa87eefc-cf8a-4fb5-9fc4-e7151498bf73) > ![128GB](https://github.com/user-attachments/assets/c3a59e32-6bd7-43e3-a3e4-c472f71aa544) Ivan Walulya has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains 31 commits: - Revise Print Rememberedset info - Merge remote-tracking branch 'upstream/master' into OldGenRemsetGroupsV1 - Merge remote-tracking branch 'upstream/master' into OldGenRemsetGroupsV1 - Albert review - Merge remote-tracking branch 'upstream/master' into OldGenRemsetGroupsV1 - Merge remote-tracking branch 'upstream/master' into OldGenRemsetGroupsV1 - fix type - fix space issues - cleanup - assert - ... and 21 more: https://git.openjdk.org/jdk/compare/beae8843...d50457e3 ------------- Changes: https://git.openjdk.org/jdk/pull/22015/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=22015&range=10 Stats: 1441 lines in 32 files changed: 679 ins; 369 del; 393 mod Patch: https://git.openjdk.org/jdk/pull/22015.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22015/head:pull/22015 PR: https://git.openjdk.org/jdk/pull/22015 From iwalulya at openjdk.org Wed Feb 5 13:40:59 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Wed, 5 Feb 2025 13:40:59 GMT Subject: RFR: 8343782: G1: Use one G1CardSet instance for multiple old gen regions [v12] In-Reply-To: References: Message-ID: <3ru13KcIWif1mzPnCckRryxaW6g3AkrIJvTBIaaCRNQ=.6c12262e-7b05-40df-8341-ae8141983237@github.com> > Hi all, > > Please review this change to assign multiple collection candidate regions to a single instance of a G1CardSet. Currently, we maintain a 1:1 mapping of old-gen regions and G1CardSet instances, assuming these regions are collected independently. However, regions are collected in batches for performance reasons to meet the G1MixedGCCountTarget. > > In this change, at the end of the Remark phase, we batch regions that we anticipate will be collected together into a collection group while selecting remembered set rebuild candidates. Regions in a collection group should be evacuated at the same time because they are assigned to the same G1CardSet instances. 
This implies that we do not need to maintain cross-region remembered set entries for regions within the same collection group. > > The benefit is a reduction in the memory overhead of the remembered set and the remembered set merge time during the collection pause. One disadvantage is that this approach decreases the flexibility during evacuation: you can only evacuate all regions that share a particular G1CardSet at the same time. Another downside is that pinned regions that are part of a collection group have to be partially evacuated when the collection group is selected for evacuation. This removes the optimization in the mainline implementation where the pinned regions are skipped to allow for potential unpinning before evacuation. > > In this change, we make significant changes to the collection set implementation as we switch to group selection instead of region selection. Consequently, many of the changes in the PR are about switching from region-centered collection set selection to a group-centered approach. > > Note: The batching is based on the sort order by reclaimable bytes which may change the evacuation order in which regions would have been evacuated when sorted by gc efficiency. > > We have not observed any regressions on internal performance testing platforms. Memory comparisons for the Cachestress benchmark for different heap sizes are attached below. 
> > Testing: Mach5 Tier1-6 > > ![16GB](https://github.com/user-attachments/assets/3224c2f1-172d-4d76-ba28-bf483b1b1c95) > ![32G](https://github.com/user-attachments/assets/abd10537-41a9-4cf9-b668-362af12fe949) > ![64GB](https://github.com/user-attachments/assets/fa87eefc-cf8a-4fb5-9fc4-e7151498bf73) > ![128GB](https://github.com/user-attachments/assets/c3a59e32-6bd7-43e3-a3e4-c472f71aa544) Ivan Walulya has updated the pull request incrementally with one additional commit since the last revision: space ------------- Changes: - all: https://git.openjdk.org/jdk/pull/22015/files - new: https://git.openjdk.org/jdk/pull/22015/files/d50457e3..5b43fdb7 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=22015&range=11 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=22015&range=10-11 Stats: 1 line in 1 file changed: 0 ins; 1 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/22015.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22015/head:pull/22015 PR: https://git.openjdk.org/jdk/pull/22015 From iwalulya at openjdk.org Wed Feb 5 13:48:20 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Wed, 5 Feb 2025 13:48:20 GMT Subject: RFR: 8343782: G1: Use one G1CardSet instance for multiple old gen regions [v10] In-Reply-To: References: <5IANDiv_ZPk3dAPem7OekMx6d1cUDiFGtOVWlcWt52Y=.f5e7ad67-3181-4757-8f61-1bbcc9e62280@github.com> Message-ID: On Mon, 23 Dec 2024 21:03:16 GMT, Albert Mingkun Yang wrote: >> Yes, retained regions are in "single region" groups, so all details should be added to the log when we call `do_heap_region` > > I see; however, this would print the same gc_eff twice if young-gen contains a single region, right? Since this method is about cset-groups, I think it's more natural to visit all groups (regardless their size) here. With this PR, there is no gc_eff associated with individual region, `do_heap_region` can just skip gc_eff. fixed, creating another issue; now we don't print details on humongous regions. 
I ask we fix that in a follow up. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22015#discussion_r1942972818 From ayang at openjdk.org Wed Feb 5 14:41:39 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 5 Feb 2025 14:41:39 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: > Here is an attempt to simplify GCLocker implementation for Serial and Parallel. > > GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. > > The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. 
> > Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. > > Test: tier1-8 Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: - Merge branch 'master' into gclocker - review - Merge branch 'master' into gclocker - gclocker ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23367/files - new: https://git.openjdk.org/jdk/pull/23367/files/6283a19c..1b6f908b Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23367&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23367&range=00-01 Stats: 20456 lines in 569 files changed: 9369 ins; 6708 del; 4379 mod Patch: https://git.openjdk.org/jdk/pull/23367.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23367/head:pull/23367 PR: https://git.openjdk.org/jdk/pull/23367 From ayang at openjdk.org Wed Feb 5 14:41:39 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 5 Feb 2025 14:41:39 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Tue, 4 Feb 2025 09:05:35 GMT, Thomas Schatzl wrote: >> Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. 
The pull request contains four additional commits since the last revision: >> >> - Merge branch 'master' into gclocker >> - review >> - Merge branch 'master' into gclocker >> - gclocker > > src/hotspot/share/gc/shared/gcLocker.hpp line 33: > >> 31: >> 32: class GCLocker: public AllStatic { >> 33: static Monitor* _lock; > > Not sure if having this copy/reference to `Heap_lock` makes the code more clear than referencing `Heap_lock` directly. It needs to be `Heap_lock` anyway. `GCLocker` itself doesn't mandate that the lock must be `Heap_lock`; it's the interaction with the rest of the VM that shows that `Heap_lock` is a good candidate. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1943040719 From ayang at openjdk.org Wed Feb 5 14:41:39 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 5 Feb 2025 14:41:39 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: <82w1_VjrsxtrpA7921QmHsA0kh9_J0kBtOCxp6sL7F4=.0b0d0698-b3d2-43a0-85b4-6b7e530e3a7a@github.com> On Wed, 5 Feb 2025 14:38:45 GMT, Albert Mingkun Yang wrote: >> Here is an attempt to simplify GCLocker implementation for Serial and Parallel. >> >> GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker.
>> >> The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. >> >> Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. >> >> Test: tier1-8 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: > > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - gclocker > Further it would be interesting to see how many retries there are in the allocation loop with these jnilock* stress test. I added `QueuedAllocationWarningCount=1` to `test/hotspot/jtreg/vmTestbase/nsk/stress/jni/gclocker/gcl001.java` and saw retry never exceeds 10 for Serial/Parallel. > Which means that it might retry for a long time That occurs only when another java thread successfully triggers a gc, advancing the gc-counter, i.e. there is some system-wide progress. Per-thread progress is hard to guarantee, IMO. 
------------- PR Review: https://git.openjdk.org/jdk/pull/23367#pullrequestreview-2595944041 From ayang at openjdk.org Wed Feb 5 15:08:13 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 5 Feb 2025 15:08:13 GMT Subject: RFR: 8343782: G1: Use one G1CardSet instance for multiple old gen regions [v12] In-Reply-To: <3ru13KcIWif1mzPnCckRryxaW6g3AkrIJvTBIaaCRNQ=.6c12262e-7b05-40df-8341-ae8141983237@github.com> References: <3ru13KcIWif1mzPnCckRryxaW6g3AkrIJvTBIaaCRNQ=.6c12262e-7b05-40df-8341-ae8141983237@github.com> Message-ID: On Wed, 5 Feb 2025 13:40:59 GMT, Ivan Walulya wrote: >> Hi all, >> >> Please review this change to assign multiple collection candidate regions to a single instance of a G1CardSet. Currently, we maintain a 1:1 mapping of old-gen regions and G1CardSet instances, assuming these regions are collected independently. However, regions are collected in batches for performance reasons to meet the G1MixedGCCountTarget. >> >> In this change, at the end of the Remark phase, we batch regions that we anticipate will be collected together into a collection group while selecting remembered set rebuild candidates. Regions in a collection group should be evacuated at the same time because they are assigned to the same G1CardSet instances. This implies that we do not need to maintain cross-region remembered set entries for regions within the same collection group. >> >> The benefit is a reduction in the memory overhead of the remembered set and the remembered set merge time during the collection pause. One disadvantage is that this approach decreases the flexibility during evacuation: you can only evacuate all regions that share a particular G1CardSet at the same time. Another downside is that pinned regions that are part of a collection group have to be partially evacuated when the collection group is selected for evacuation. 
This removes the optimization in the mainline implementation where the pinned regions are skipped to allow for potential unpinning before evacuation. >> >> In this change, we make significant changes to the collection set implementation as we switch to group selection instead of region selection. Consequently, many of the changes in the PR are about switching from region-centered collection set selection to a group-centered approach. >> >> Note: The batching is based on the sort order by reclaimable bytes which may change the evacuation order in which regions would have been evacuated when sorted by gc efficiency. >> >> We have not observed any regressions on internal performance testing platforms. Memory comparisons for the Cachestress benchmark for different heap sizes are attached below. >> >> Testing: Mach5 Tier1-6 >> >> ![16GB](https://github.com/user-attachments/assets/3224c2f1-172d-4d76-ba28-bf483b1b1c95) >> ![32G](https://github.com/user-attachments/assets/abd10537-41a9-4cf9-b668-362af12fe949) >> ![64GB](https://github.com/user-attachments/assets/fa87eefc-cf8a-4fb5-9fc4-e7151498bf73) >> ![128GB](https://github.com/user-attachments/assets/c3a59e32-6bd7-43e3-a3e4-c472f71aa544) > > Ivan Walulya has updated the pull request incrementally with one additional commit since the last revision: > > space Marked as reviewed by ayang (Reviewer). 
------------- PR Review: https://git.openjdk.org/jdk/pull/22015#pullrequestreview-2596072480 From mdoerr at openjdk.org Wed Feb 5 15:09:17 2025 From: mdoerr at openjdk.org (Martin Doerr) Date: Wed, 5 Feb 2025 15:09:17 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v4] In-Reply-To: <-aHCYC9iVc4eMZ3pMfiDpqaW-wGM_s3zRMiVBWoadCM=.910336cd-3be2-45b5-9874-63b71abf38f8@github.com> References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> <-aHCYC9iVc4eMZ3pMfiDpqaW-wGM_s3zRMiVBWoadCM=.910336cd-3be2-45b5-9874-63b71abf38f8@github.com> Message-ID: On Wed, 5 Feb 2025 12:38:06 GMT, Roberto Casta?eda Lozano wrote: >> G1 barriers can be safely elided from writes to newly allocated objects as long as no safepoint is taken between the allocation and the write. This changeset complements early G1 barrier elision (performed by the platform-independent phases of C2, and limited to writes immediately following allocations) with a more general elision pass done at a late stage. >> >> The late elision pass exploits that it runs at a stage where the relative order of memory accesses and safepoints cannot change anymore to elide barriers from initialization writes that do not immediately follow the corresponding allocation, e.g. in conditional initialization writes: >> >> >> o = new MyObject(); >> if (...) { >> o.myField = ...; // barrier elided only after this changeset >> // (assuming no safepoint in the if condition) >> } >> >> >> or in initialization writes placed after exception-throwing checks: >> >> >> o = new MyObject(); >> if (...) { >> throw new Exception(""); >> } >> o.myField = ...; // barrier elided only after this changeset >> // (assuming no safepoint in the above if condition) >> >> >> These patterns are commonly found in Java code, e.g. 
in the core libraries: >> >> - [conditional initialization](https://github.com/openjdk/jdk/blob/25fecaaf87400af535c242fe50296f1f89ceeb16/src/java.base/share/classes/java/lang/String.java#L4850), or >> >> - [initialization after exception-throwing checks (in the superclass constructor)](https://github.com/openjdk/jdk/blob/master/src/java.base/share/classes/java/nio/X-Buffer.java.template#L324). >> >> The optimization also enhances barrier elision for array initialization writes, for example eliding barriers from small array initialization loops (for which safepoints are not inserted): >> >> >> Object[] a = new Object[...]; >> for (int i = 0; i < a.length; i++) { >> a[i] = ...; // barrier elided only after this changeset >> } >> >> >> or eliding barriers from array initialization writes with unknown array index: >> >> >> Object[] a = new Object[...]; >> a[index] = ...; // barrier elided only after this changeset >> >> >> The logic used to perform this additional barrier elision is a subset of a pre-existing ZGC-specific optimization. This changeset simply reuses the relevant subset (barrier elision for writes to newly-allocated objects) by extracting the core of the optimization logic from `zBarrierSetC2.cpp` into the GC-shared file `barrierSetC2.cpp`. The functions `block_has_safepoint`, `block_inde... > > Roberto Casta?eda Lozano has updated the pull request incrementally with two additional commits since the last revision: > > - Add some more tests to exercise barrier elision for atomic operations > - Elide barriers from atomic operations on newly allocated objects as well LGTM. TestG1BarrierGeneration.java has passed on ppc64le. I'll run more tests. Please remember updating the Copyright headers. 
------------- PR Review: https://git.openjdk.org/jdk/pull/23235#pullrequestreview-2596076238 From amitkumar at openjdk.org Wed Feb 5 17:57:19 2025 From: amitkumar at openjdk.org (Amit Kumar) Date: Wed, 5 Feb 2025 17:57:19 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v4] In-Reply-To: <-aHCYC9iVc4eMZ3pMfiDpqaW-wGM_s3zRMiVBWoadCM=.910336cd-3be2-45b5-9874-63b71abf38f8@github.com> References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> <-aHCYC9iVc4eMZ3pMfiDpqaW-wGM_s3zRMiVBWoadCM=.910336cd-3be2-45b5-9874-63b71abf38f8@github.com> Message-ID: On Wed, 5 Feb 2025 12:38:06 GMT, Roberto Casta?eda Lozano wrote: >> G1 barriers can be safely elided from writes to newly allocated objects as long as no safepoint is taken between the allocation and the write. This changeset complements early G1 barrier elision (performed by the platform-independent phases of C2, and limited to writes immediately following allocations) with a more general elision pass done at a late stage. >> >> The late elision pass exploits that it runs at a stage where the relative order of memory accesses and safepoints cannot change anymore to elide barriers from initialization writes that do not immediately follow the corresponding allocation, e.g. in conditional initialization writes: >> >> >> o = new MyObject(); >> if (...) { >> o.myField = ...; // barrier elided only after this changeset >> // (assuming no safepoint in the if condition) >> } >> >> >> or in initialization writes placed after exception-throwing checks: >> >> >> o = new MyObject(); >> if (...) { >> throw new Exception(""); >> } >> o.myField = ...; // barrier elided only after this changeset >> // (assuming no safepoint in the above if condition) >> >> >> These patterns are commonly found in Java code, e.g. 
in the core libraries: >> >> - [conditional initialization](https://github.com/openjdk/jdk/blob/25fecaaf87400af535c242fe50296f1f89ceeb16/src/java.base/share/classes/java/lang/String.java#L4850), or >> >> - [initialization after exception-throwing checks (in the superclass constructor)](https://github.com/openjdk/jdk/blob/master/src/java.base/share/classes/java/nio/X-Buffer.java.template#L324). >> >> The optimization also enhances barrier elision for array initialization writes, for example eliding barriers from small array initialization loops (for which safepoints are not inserted): >> >> >> Object[] a = new Object[...]; >> for (int i = 0; i < a.length; i++) { >> a[i] = ...; // barrier elided only after this changeset >> } >> >> >> or eliding barriers from array initialization writes with unknown array index: >> >> >> Object[] a = new Object[...]; >> a[index] = ...; // barrier elided only after this changeset >> >> >> The logic used to perform this additional barrier elision is a subset of a pre-existing ZGC-specific optimization. This changeset simply reuses the relevant subset (barrier elision for writes to newly-allocated objects) by extracting the core of the optimization logic from `zBarrierSetC2.cpp` into the GC-shared file `barrierSetC2.cpp`. The functions `block_has_safepoint`, `block_inde... 
> > Roberto Castañeda Lozano has updated the pull request incrementally with two additional commits since the last revision: > > - Add some more tests to exercise barrier elision for atomic operations > - Elide barriers from atomic operations on newly allocated objects as well I see TestG1BarrierGeneration.java failure :( [TestG1BarrierGeneration_jtr.log](https://github.com/user-attachments/files/18676532/TestG1BarrierGeneration_jtr.log) ------------- PR Comment: https://git.openjdk.org/jdk/pull/23235#issuecomment-2637624720 From wkemper at openjdk.org Wed Feb 5 19:06:20 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 5 Feb 2025 19:06:20 GMT Subject: [jdk24] Withdrawn: 8349002: GenShen: Deadlock during shutdown In-Reply-To: <6a_6G1g93RUACyYcHG5B9HtqFBaqdETRRdhvFWwrfi8=.e88c6252-f6c4-4e3f-972a-2c4495d27127@github.com> References: <6a_6G1g93RUACyYcHG5B9HtqFBaqdETRRdhvFWwrfi8=.e88c6252-f6c4-4e3f-972a-2c4495d27127@github.com> Message-ID: On Mon, 3 Feb 2025 22:27:44 GMT, William Kemper wrote: > Clean backport. Fixes bug introduced by [JDK-8345970](https://bugs.openjdk.org/browse/JDK-8345970). This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.org/jdk/pull/23429 From wkemper at openjdk.org Wed Feb 5 19:06:20 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 5 Feb 2025 19:06:20 GMT Subject: [jdk24] RFR: 8349002: GenShen: Deadlock during shutdown In-Reply-To: <6a_6G1g93RUACyYcHG5B9HtqFBaqdETRRdhvFWwrfi8=.e88c6252-f6c4-4e3f-972a-2c4495d27127@github.com> References: <6a_6G1g93RUACyYcHG5B9HtqFBaqdETRRdhvFWwrfi8=.e88c6252-f6c4-4e3f-972a-2c4495d27127@github.com> Message-ID: On Mon, 3 Feb 2025 22:27:44 GMT, William Kemper wrote: > Clean backport. Fixes bug introduced by [JDK-8345970](https://bugs.openjdk.org/browse/JDK-8345970). Understood. Will target JDK24 update release.
------------- PR Comment: https://git.openjdk.org/jdk/pull/23429#issuecomment-2637789977 From dlong at openjdk.org Wed Feb 5 19:51:02 2025 From: dlong at openjdk.org (Dean Long) Date: Wed, 5 Feb 2025 19:51:02 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Wed, 5 Feb 2025 14:41:39 GMT, Albert Mingkun Yang wrote: >> Here is an attempt to simplify GCLocker implementation for Serial and Parallel. >> >> GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. >> >> The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. >> >> Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. 
>> >> Test: tier1-8 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: > > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - gclocker I like the direction this is taking us, but I think we could go even further and eventually fold the JNI critical region into the existing safepoint mechanism. To me, the safepoint mechanism already implements a readers-writer lock, with thread states like _thread_in_Java/_thread_in_vm already being "critical regions". With this change, we have two nested readers-writer locks that a GC needs to acquire. I think if we made entering and exiting a JNI critical region change the thread state (probably by introducing a new thread state), then we don't need a separate readers-writer lock for the JNI critical region. However, maybe we don't want to go that far, as the current solution allows GC-specific implementations and allows each different GC VMOp to decide if it needs to block for JNI critical regions. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23367#issuecomment-2637865869 From egahlin at openjdk.org Wed Feb 5 19:59:13 2025 From: egahlin at openjdk.org (Erik Gahlin) Date: Wed, 5 Feb 2025 19:59:13 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Wed, 5 Feb 2025 14:41:39 GMT, Albert Mingkun Yang wrote: >> Here is an attempt to simplify GCLocker implementation for Serial and Parallel. >> >> GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs).
JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. >> >> The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. >> >> Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. >> >> Test: tier1-8 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: > > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - gclocker JFR changes look good. ------------- Marked as reviewed by egahlin (Reviewer). 
PR Review: https://git.openjdk.org/jdk/pull/23367#pullrequestreview-2596847997 From wkemper at openjdk.org Wed Feb 5 22:35:21 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 5 Feb 2025 22:35:21 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions Message-ID: There are several changes to the operation of Shenandoah's control threads here. * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. * The cancellation handling is driven entirely by the cancellation cause * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed * The shutdown sequence is simpler * The generational control thread uses a lock to coordinate updates to the requested cause and generation * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles * The control thread doesn't loop on its own (unless the pacer is enabled). ------------- Commit messages: - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads - Simplify shControlThread - Revert unnecessary changes - Fix interrupted old cycle handling - Restore reporting allocations to pacer - Better names, better comments - WIP: Simplify shutdown protocol - WIP: Don't need request.mode anymore - WIP: Simplify degenerated cycle handling - WIP: Passes tier1, mostly passes tier2 - ... 
and 4 more: https://git.openjdk.org/jdk/compare/b499c827...f97f257b Changes: https://git.openjdk.org/jdk/pull/23475/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8349094 Stats: 817 lines in 14 files changed: 241 ins; 286 del; 290 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From fyang at openjdk.org Thu Feb 6 02:43:12 2025 From: fyang at openjdk.org (Fei Yang) Date: Thu, 6 Feb 2025 02:43:12 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v4] In-Reply-To: <-aHCYC9iVc4eMZ3pMfiDpqaW-wGM_s3zRMiVBWoadCM=.910336cd-3be2-45b5-9874-63b71abf38f8@github.com> References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> <-aHCYC9iVc4eMZ3pMfiDpqaW-wGM_s3zRMiVBWoadCM=.910336cd-3be2-45b5-9874-63b71abf38f8@github.com> Message-ID: On Wed, 5 Feb 2025 12:38:06 GMT, Roberto Casta?eda Lozano wrote: >> G1 barriers can be safely elided from writes to newly allocated objects as long as no safepoint is taken between the allocation and the write. This changeset complements early G1 barrier elision (performed by the platform-independent phases of C2, and limited to writes immediately following allocations) with a more general elision pass done at a late stage. >> >> The late elision pass exploits that it runs at a stage where the relative order of memory accesses and safepoints cannot change anymore to elide barriers from initialization writes that do not immediately follow the corresponding allocation, e.g. in conditional initialization writes: >> >> >> o = new MyObject(); >> if (...) { >> o.myField = ...; // barrier elided only after this changeset >> // (assuming no safepoint in the if condition) >> } >> >> >> or in initialization writes placed after exception-throwing checks: >> >> >> o = new MyObject(); >> if (...) 
{ >> throw new Exception(""); >> } >> o.myField = ...; // barrier elided only after this changeset >> // (assuming no safepoint in the above if condition) >> >> >> These patterns are commonly found in Java code, e.g. in the core libraries: >> >> - [conditional initialization](https://github.com/openjdk/jdk/blob/25fecaaf87400af535c242fe50296f1f89ceeb16/src/java.base/share/classes/java/lang/String.java#L4850), or >> >> - [initialization after exception-throwing checks (in the superclass constructor)](https://github.com/openjdk/jdk/blob/master/src/java.base/share/classes/java/nio/X-Buffer.java.template#L324). >> >> The optimization also enhances barrier elision for array initialization writes, for example eliding barriers from small array initialization loops (for which safepoints are not inserted): >> >> >> Object[] a = new Object[...]; >> for (int i = 0; i < a.length; i++) { >> a[i] = ...; // barrier elided only after this changeset >> } >> >> >> or eliding barriers from array initialization writes with unknown array index: >> >> >> Object[] a = new Object[...]; >> a[index] = ...; // barrier elided only after this changeset >> >> >> The logic used to perform this additional barrier elision is a subset of a pre-existing ZGC-specific optimization. This changeset simply reuses the relevant subset (barrier elision for writes to newly-allocated objects) by extracting the core of the optimization logic from `zBarrierSetC2.cpp` into the GC-shared file `barrierSetC2.cpp`. The functions `block_has_safepoint`, `block_inde... > > Roberto Casta?eda Lozano has updated the pull request incrementally with two additional commits since the last revision: > > - Add some more tests to exercise barrier elision for atomic operations > - Elide barriers from atomic operations on newly allocated objects as well FYI: hs-tier1 still test good on linux-riscv64 with fastdebug build. 
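The two commits quoted above extend elision to atomic operations on newly allocated objects. A hypothetical source-level shape of such an operation (illustrative only; the class and field names are invented, and whether the barrier is actually elided is decided by C2, not by anything visible here):

```java
import java.lang.invoke.MethodHandles;
import java.lang.invoke.VarHandle;

public class AtomicOnNewObject {
    static class Node {
        volatile Object value;
        static final VarHandle VALUE;
        static {
            try {
                VALUE = MethodHandles.lookup()
                        .findVarHandle(Node.class, "value", Object.class);
            } catch (ReflectiveOperationException e) {
                throw new ExceptionInInitializerError(e);
            }
        }
    }

    static Node create(Object v) {
        Node n = new Node();                  // allocation
        Node.VALUE.compareAndSet(n, null, v); // atomic write to the new object,
        return n;                             // no safepoint in between: candidate
    }                                         // for barrier elision

    public static void main(String[] args) {
        System.out.println(create("hello").value);
    }
}
```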
------------- PR Comment: https://git.openjdk.org/jdk/pull/23235#issuecomment-2638686602 From dholmes at openjdk.org Thu Feb 6 06:28:13 2025 From: dholmes at openjdk.org (David Holmes) Date: Thu, 6 Feb 2025 06:28:13 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Wed, 5 Feb 2025 14:41:39 GMT, Albert Mingkun Yang wrote: >> Here is an attempt to simplify GCLocker implementation for Serial and Parallel. >> >> GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. >> >> The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. >> >> Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. 
>> >> Test: tier1-8 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: > > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - gclocker src/hotspot/share/runtime/javaThread.hpp line 938: > 936: } > 937: > 938: bool in_critical_atomic() { return Atomic::load(&_jni_active_critical) > 0; } If you think you need an atomic load here, then it would be needed for `in_critical()` so just add it there. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1944166423 From dholmes at openjdk.org Thu Feb 6 06:38:13 2025 From: dholmes at openjdk.org (David Holmes) Date: Thu, 6 Feb 2025 06:38:13 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Wed, 5 Feb 2025 14:41:39 GMT, Albert Mingkun Yang wrote: >> Here is an attempt to simplify GCLocker implementation for Serial and Parallel. >> >> GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. >> >> The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. 
This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. >> >> Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. >> >> Test: tier1-8 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: > > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - gclocker > this PR uses an existing thread-local variable with a store-load barrier for synchronization. @albertnetymk can you explain how this protocol is intended to work please. I must be missing some higher-level context that provides additional synchronization because use of the per-thread counters is inherently racy. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23367#issuecomment-2638958442 From dholmes at openjdk.org Thu Feb 6 06:41:21 2025 From: dholmes at openjdk.org (David Holmes) Date: Thu, 6 Feb 2025 06:41:21 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Wed, 5 Feb 2025 19:39:30 GMT, Dean Long wrote: > I think we could go even further and eventually fold the JNI critical region into the existing safepoint mechanism. 
@dean-long you seem to be forgetting why it was folded out in the first place. :) This was performance critical JNI code where the thread-state transitions were too heavyweight and expensive to use. So we keep the thread safepoint-safe (`_thread_in_native`) and have a way to tell the GC to pause whilst we are in these critical regions. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23367#issuecomment-2638962365 From rcastanedalo at openjdk.org Thu Feb 6 08:58:40 2025 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Thu, 6 Feb 2025 08:58:40 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v4] In-Reply-To: References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> <-aHCYC9iVc4eMZ3pMfiDpqaW-wGM_s3zRMiVBWoadCM=.910336cd-3be2-45b5-9874-63b71abf38f8@github.com> Message-ID: On Wed, 5 Feb 2025 15:06:39 GMT, Martin Doerr wrote: > LGTM. TestG1BarrierGeneration.java has passed on ppc64le. I'll run more tests. Please remember updating the Copyright headers. Thanks for the reminder, updated in commit 3671f474. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23235#issuecomment-2639197888 From rcastanedalo at openjdk.org Thu Feb 6 08:49:28 2025 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Thu, 6 Feb 2025 08:49:28 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v5] In-Reply-To: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> Message-ID: > G1 barriers can be safely elided from writes to newly allocated objects as long as no safepoint is taken between the allocation and the write. 
This changeset complements early G1 barrier elision (performed by the platform-independent phases of C2, and limited to writes immediately following allocations) with a more general elision pass done at a late stage. > > The late elision pass exploits that it runs at a stage where the relative order of memory accesses and safepoints cannot change anymore to elide barriers from initialization writes that do not immediately follow the corresponding allocation, e.g. in conditional initialization writes: > > > o = new MyObject(); > if (...) { > o.myField = ...; // barrier elided only after this changeset > // (assuming no safepoint in the if condition) > } > > > or in initialization writes placed after exception-throwing checks: > > > o = new MyObject(); > if (...) { > throw new Exception(""); > } > o.myField = ...; // barrier elided only after this changeset > // (assuming no safepoint in the above if condition) > > > These patterns are commonly found in Java code, e.g. in the core libraries: > > - [conditional initialization](https://github.com/openjdk/jdk/blob/25fecaaf87400af535c242fe50296f1f89ceeb16/src/java.base/share/classes/java/lang/String.java#L4850), or > > - [initialization after exception-throwing checks (in the superclass constructor)](https://github.com/openjdk/jdk/blob/master/src/java.base/share/classes/java/nio/X-Buffer.java.template#L324). 
> > The optimization also enhances barrier elision for array initialization writes, for example eliding barriers from small array initialization loops (for which safepoints are not inserted): > > > Object[] a = new Object[...]; > for (int i = 0; i < a.length; i++) { > a[i] = ...; // barrier elided only after this changeset > } > > > or eliding barriers from array initialization writes with unknown array index: > > > Object[] a = new Object[...]; > a[index] = ...; // barrier elided only after this changeset > > > The logic used to perform this additional barrier elision is a subset of a pre-existing ZGC-specific optimization. This changeset simply reuses the relevant subset (barrier elision for writes to newly-allocated objects) by extracting the core of the optimization logic from `zBarrierSetC2.cpp` into the GC-shared file `barrierSetC2.cpp`. The functions `block_has_safepoint`, `block_index`, `look_through_node`, `is_{undefined|unknown|concrete}`, `get_base_and_offset`, `is_array... Roberto Casta?eda Lozano has updated the pull request incrementally with one additional commit since the last revision: Update copyright headers ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23235/files - new: https://git.openjdk.org/jdk/pull/23235/files/621a61cf..3671f474 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23235&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23235&range=03-04 Stats: 4 lines in 4 files changed: 0 ins; 0 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/23235.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23235/head:pull/23235 PR: https://git.openjdk.org/jdk/pull/23235 From mdoerr at openjdk.org Thu Feb 6 10:13:28 2025 From: mdoerr at openjdk.org (Martin Doerr) Date: Thu, 6 Feb 2025 10:13:28 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v5] In-Reply-To: References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> Message-ID: On Thu, 6 
Feb 2025 08:49:28 GMT, Roberto Castañeda Lozano wrote: >> G1 barriers can be safely elided from writes to newly allocated objects as long as no safepoint is taken between the allocation and the write. This changeset complements early G1 barrier elision (performed by the platform-independent phases of C2, and limited to writes immediately following allocations) with a more general elision pass done at a late stage. >> >> The late elision pass exploits that it runs at a stage where the relative order of memory accesses and safepoints cannot change anymore to elide barriers from initialization writes that do not immediately follow the corresponding allocation, e.g. in conditional initialization writes: >> >> >> o = new MyObject(); >> if (...) { >> o.myField = ...; // barrier elided only after this changeset >> // (assuming no safepoint in the if condition) >> } >> >> >> or in initialization writes placed after exception-throwing checks: >> >> >> o = new MyObject(); >> if (...) { >> throw new Exception(""); >> } >> o.myField = ...; // barrier elided only after this changeset >> // (assuming no safepoint in the above if condition) >> >> >> These patterns are commonly found in Java code, e.g. in the core libraries: >> >> - [conditional initialization](https://github.com/openjdk/jdk/blob/25fecaaf87400af535c242fe50296f1f89ceeb16/src/java.base/share/classes/java/lang/String.java#L4850), or >> >> - [initialization after exception-throwing checks (in the superclass constructor)](https://github.com/openjdk/jdk/blob/master/src/java.base/share/classes/java/nio/X-Buffer.java.template#L324).
>> >> The optimization also enhances barrier elision for array initialization writes, for example eliding barriers from small array initialization loops (for which safepoints are not inserted): >> >> >> Object[] a = new Object[...]; >> for (int i = 0; i < a.length; i++) { >> a[i] = ...; // barrier elided only after this changeset >> } >> >> >> or eliding barriers from array initialization writes with unknown array index: >> >> >> Object[] a = new Object[...]; >> a[index] = ...; // barrier elided only after this changeset >> >> >> The logic used to perform this additional barrier elision is a subset of a pre-existing ZGC-specific optimization. This changeset simply reuses the relevant subset (barrier elision for writes to newly-allocated objects) by extracting the core of the optimization logic from `zBarrierSetC2.cpp` into the GC-shared file `barrierSetC2.cpp`. The functions `block_has_safepoint`, `block_inde... > > Roberto Casta?eda Lozano has updated the pull request incrementally with one additional commit since the last revision: > > Update copyright headers Code and test results look good. ------------- Marked as reviewed by mdoerr (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23235#pullrequestreview-2598222122 From wkemper at openjdk.org Thu Feb 6 17:20:52 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 6 Feb 2025 17:20:52 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v2] In-Reply-To: References: Message-ID: <9vqH905wEy_k3MoOq-wmpzFWuniRKpiDAu6en7bOSr4=.a8fee870-a8fc-4532-acc7-c37975e8a948@github.com> > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. 
> * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). William Kemper has updated the pull request incrementally with one additional commit since the last revision: Remove invalid assert, alloc waiters wait until allocation failure is clear ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/f97f257b..a7a6eea1 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=00-01 Stats: 5 lines in 2 files changed: 2 ins; 1 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From ayang at openjdk.org Thu Feb 6 21:45:13 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 6 Feb 2025 21:45:13 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Thu, 6 Feb 2025 06:35:46 GMT, David Holmes wrote: > can you explain how this protocol is intended to work please. When a GC is requested, the `block()` function sets `_is_gc_request_pending` to `true` and then waits until all threads have exited their critical regions. 
Any thread attempting to enter a critical region during this time will detect the pending GC flag in `enter()` and follow the slow path, effectively waiting until the GC completes. The storeload barrier is critical to ensure that these two variables -- `_is_gc_request_pending` and the thread-local `_jni_active_critical` -- are accessed in the proper order. > If you think you need an atomic load here, then it would be needed for in_critical() so just add it there. `in_critical()` is used only by the owning thread, which has exclusive write access. Therefore, its access does not need to be atomic. However, the reads performed by other threads must be atomic, I believe. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23367#issuecomment-2641116616 From wkemper at openjdk.org Fri Feb 7 02:04:29 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 7 Feb 2025 02:04:29 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v3] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). 
William Kemper has updated the pull request incrementally with three additional commits since the last revision: - Resuming an old cycle should not preempt a young cycle - Use logging tag 'thread' to help control debug volume - Do not stomp on pending requests when running a degenerated cycle ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/a7a6eea1..ae207480 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=01-02 Stats: 64 lines in 8 files changed: 35 ins; 12 del; 17 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From dholmes at openjdk.org Fri Feb 7 06:46:10 2025 From: dholmes at openjdk.org (David Holmes) Date: Fri, 7 Feb 2025 06:46:10 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Thu, 6 Feb 2025 21:42:50 GMT, Albert Mingkun Yang wrote: > in_critical() is used only by the owning thread, I see code using `thr->in_critical()` which is not obviously being executed by the current thread on itself. But in any case adding the atomic load to `in_critical()` is basically a no-op (loads are atomic) so no need to add a new API just to do that. 
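The enter/block protocol debated in this thread can be condensed into a toy model. The sketch below is illustrative only, not the HotSpot implementation: the names mirror the variables under discussion (`_is_gc_request_pending`, `_jni_active_critical`), a single global counter replaces the per-thread critical-region state, and sequentially consistent atomics stand in for the explicit StoreLoad barrier (whether that substitution is sufficient is exactly the open question above).

```cpp
#include <atomic>
#include <cassert>
#include <thread>

// Toy model of the GCLocker fast path discussed in JDK-8192647.
std::atomic<bool> is_gc_request_pending{false}; // set by the GC requester
std::atomic<int>  active_critical{0};           // threads inside a critical region

// Mutator side: try the fast path into a JNI critical region.
// Returns false when a GC is pending, meaning "take the slow path and
// wait for the GC to finish" (the slow path itself is elided here).
bool enter_critical() {
  active_critical.fetch_add(1);        // publish "I am critical" first...
  if (is_gc_request_pending.load()) {  // ...then read the pending flag
    active_critical.fetch_sub(1);      // back out and take the slow path
    return false;
  }
  return true;
}

void exit_critical() {
  active_critical.fetch_sub(1);
}

// GC side: announce the request, then wait for critical regions to drain.
void block_for_gc() {
  is_gc_request_pending.store(true);
  while (active_critical.load() != 0) {
    std::this_thread::yield();
  }
}
```

The ordering is the whole point: the mutator must make its critical-region entry visible before it reads the flag, and the requester must make the flag visible before it reads the counter; that store-then-load on each side is why the messages above argue about a StoreLoad barrier versus full fences.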
------------- PR Comment: https://git.openjdk.org/jdk/pull/23367#issuecomment-2642070840 From dholmes at openjdk.org Fri Feb 7 07:01:39 2025 From: dholmes at openjdk.org (David Holmes) Date: Fri, 7 Feb 2025 07:01:39 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Thu, 6 Feb 2025 21:42:50 GMT, Albert Mingkun Yang wrote: > The storeload barrier is critical ... I'm not sure it is sufficient. I would have expected some full fences to be needed here as this is very similar to the interaction of thread state with safepoints. I will look closer on Monday (sorry). ------------- PR Comment: https://git.openjdk.org/jdk/pull/23367#issuecomment-2642089369 From tschatzl at openjdk.org Fri Feb 7 08:42:16 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 7 Feb 2025 08:42:16 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v5] In-Reply-To: References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> Message-ID: On Thu, 6 Feb 2025 08:49:28 GMT, Roberto Castañeda Lozano wrote: >> G1 barriers can be safely elided from writes to newly allocated objects as long as no safepoint is taken between the allocation and the write. This changeset complements early G1 barrier elision (performed by the platform-independent phases of C2, and limited to writes immediately following allocations) with a more general elision pass done at a late stage. >> >> The late elision pass exploits that it runs at a stage where the relative order of memory accesses and safepoints cannot change anymore to elide barriers from initialization writes that do not immediately follow the corresponding allocation, e.g. in conditional initialization writes: >> >> >> o = new MyObject(); >> if (...)
{ >> o.myField = ...; // barrier elided only after this changeset >> // (assuming no safepoint in the if condition) >> } >> >> >> or in initialization writes placed after exception-throwing checks: >> >> >> o = new MyObject(); >> if (...) { >> throw new Exception(""); >> } >> o.myField = ...; // barrier elided only after this changeset >> // (assuming no safepoint in the above if condition) >> >> >> These patterns are commonly found in Java code, e.g. in the core libraries: >> >> - [conditional initialization](https://github.com/openjdk/jdk/blob/25fecaaf87400af535c242fe50296f1f89ceeb16/src/java.base/share/classes/java/lang/String.java#L4850), or >> >> - [initialization after exception-throwing checks (in the superclass constructor)](https://github.com/openjdk/jdk/blob/master/src/java.base/share/classes/java/nio/X-Buffer.java.template#L324). >> >> The optimization also enhances barrier elision for array initialization writes, for example eliding barriers from small array initialization loops (for which safepoints are not inserted): >> >> >> Object[] a = new Object[...]; >> for (int i = 0; i < a.length; i++) { >> a[i] = ...; // barrier elided only after this changeset >> } >> >> >> or eliding barriers from array initialization writes with unknown array index: >> >> >> Object[] a = new Object[...]; >> a[index] = ...; // barrier elided only after this changeset >> >> >> The logic used to perform this additional barrier elision is a subset of a pre-existing ZGC-specific optimization. This changeset simply reuses the relevant subset (barrier elision for writes to newly-allocated objects) by extracting the core of the optimization logic from `zBarrierSetC2.cpp` into the GC-shared file `barrierSetC2.cpp`. The functions `block_has_safepoint`, `block_inde... > > Roberto Castañeda Lozano has updated the pull request incrementally with one additional commit since the last revision: > > Update copyright headers Afaict this is good.
------------- Marked as reviewed by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23235#pullrequestreview-2601121948 From rcastanedalo at openjdk.org Fri Feb 7 09:19:13 2025 From: rcastanedalo at openjdk.org (Roberto Castañeda Lozano) Date: Fri, 7 Feb 2025 09:19:13 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v5] In-Reply-To: References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> Message-ID: <6WMofkASYawj1iolPRb1_3GIgpjJ_5ggK-nnnMXdYII=.58aff13a-49b3-430f-a37e-c2dea123bd97@github.com> On Fri, 7 Feb 2025 08:40:03 GMT, Thomas Schatzl wrote: > Afaict this is good. Thanks for reviewing, Thomas! ------------- PR Comment: https://git.openjdk.org/jdk/pull/23235#issuecomment-2642378742 From rcastanedalo at openjdk.org Fri Feb 7 09:24:13 2025 From: rcastanedalo at openjdk.org (Roberto Castañeda Lozano) Date: Fri, 7 Feb 2025 09:24:13 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v4] In-Reply-To: References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> <-aHCYC9iVc4eMZ3pMfiDpqaW-wGM_s3zRMiVBWoadCM=.910336cd-3be2-45b5-9874-63b71abf38f8@github.com> Message-ID: On Wed, 5 Feb 2025 17:51:36 GMT, Amit Kumar wrote: > I see TestG1BarrierGeneration.java failure :( > > [TestG1BarrierGeneration_jtr.log](https://github.com/user-attachments/files/18676532/TestG1BarrierGeneration_jtr.log) @offamitkumar thanks for the report! Most likely the test failures are only due to missing optimizations (because of limitations in the barrier elision pattern matching analysis), but if you want me to confirm please send the entire jtreg log, without truncation.
You can disable output truncation running the test like this: `make run-test TEST="compiler/gcbarriers/TestG1BarrierGeneration.java" JTREG="MAX_OUTPUT=999999999"` Please double-check that the output log file does not contain any `Output overflow` message. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23235#issuecomment-2642388571 From iwalulya at openjdk.org Fri Feb 7 10:40:23 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Fri, 7 Feb 2025 10:40:23 GMT Subject: RFR: 8343782: G1: Use one G1CardSet instance for multiple old gen regions [v12] In-Reply-To: References: <3ru13KcIWif1mzPnCckRryxaW6g3AkrIJvTBIaaCRNQ=.6c12262e-7b05-40df-8341-ae8141983237@github.com> Message-ID: On Wed, 5 Feb 2025 15:05:21 GMT, Albert Mingkun Yang wrote: >> Ivan Walulya has updated the pull request incrementally with one additional commit since the last revision: >> >> space > > Marked as reviewed by ayang (Reviewer). Thanks @albertnetymk and @tschatzl for the reviews! ------------- PR Comment: https://git.openjdk.org/jdk/pull/22015#issuecomment-2642514964 From iwalulya at openjdk.org Fri Feb 7 10:40:24 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Fri, 7 Feb 2025 10:40:24 GMT Subject: Integrated: 8343782: G1: Use one G1CardSet instance for multiple old gen regions In-Reply-To: References: Message-ID: On Mon, 11 Nov 2024 13:58:36 GMT, Ivan Walulya wrote: > Hi all, > > Please review this change to assign multiple collection candidate regions to a single instance of a G1CardSet. Currently, we maintain a 1:1 mapping of old-gen regions and G1CardSet instances, assuming these regions are collected independently. However, regions are collected in batches for performance reasons to meet the G1MixedGCCountTarget. > > In this change, at the end of the Remark phase, we batch regions that we anticipate will be collected together into a collection group while selecting remembered set rebuild candidates. 
Regions in a collection group should be evacuated at the same time because they are assigned to the same G1CardSet instances. This implies that we do not need to maintain cross-region remembered set entries for regions within the same collection group. > > The benefit is a reduction in the memory overhead of the remembered set and the remembered set merge time during the collection pause. One disadvantage is that this approach decreases the flexibility during evacuation: you can only evacuate all regions that share a particular G1CardSet at the same time. Another downside is that pinned regions that are part of a collection group have to be partially evacuated when the collection group is selected for evacuation. This removes the optimization in the mainline implementation where the pinned regions are skipped to allow for potential unpinning before evacuation. > > In this change, we make significant changes to the collection set implementation as we switch to group selection instead of region selection. Consequently, many of the changes in the PR are about switching from region-centered collection set selection to a group-centered approach. > > Note: The batching is based on the sort order by reclaimable bytes which may change the evacuation order in which regions would have been evacuated when sorted by gc efficiency. > > We have not observed any regressions on internal performance testing platforms. Memory comparisons for the Cachestress benchmark for different heap sizes are attached below. > > Testing: Mach5 Tier1-6 > > ![16GB](https://github.com/user-attachments/assets/3224c2f1-172d-4d76-ba28-bf483b1b1c95) > ![32G](https://github.com/user-attachments/assets/abd10537-41a9-4cf9-b668-362af12fe949) > ![64GB](https://github.com/user-attachments/assets/fa87eefc-cf8a-4fb5-9fc4-e7151498bf73) > ![128GB](https://github.com/user-attachments/assets/c3a59e32-6bd7-43e3-a3e4-c472f71aa544) This pull request has now been integrated. 
Changeset: 86cec4ea Author: Ivan Walulya URL: https://git.openjdk.org/jdk/commit/86cec4ea2c2c56f03b23be44caade49b922cd3c6 Stats: 1440 lines in 32 files changed: 678 ins; 369 del; 393 mod 8343782: G1: Use one G1CardSet instance for multiple old gen regions Reviewed-by: ayang, tschatzl ------------- PR: https://git.openjdk.org/jdk/pull/22015 From amitkumar at openjdk.org Fri Feb 7 12:02:22 2025 From: amitkumar at openjdk.org (Amit Kumar) Date: Fri, 7 Feb 2025 12:02:22 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v4] In-Reply-To: References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> <-aHCYC9iVc4eMZ3pMfiDpqaW-wGM_s3zRMiVBWoadCM=.910336cd-3be2-45b5-9874-63b71abf38f8@github.com> Message-ID: On Fri, 7 Feb 2025 09:21:39 GMT, Roberto Castañeda Lozano wrote: >> I see TestG1BarrierGeneration.java failure :( >> >> [TestG1BarrierGeneration_jtr.log](https://github.com/user-attachments/files/18676532/TestG1BarrierGeneration_jtr.log) > >> I see TestG1BarrierGeneration.java failure :( >> >> [TestG1BarrierGeneration_jtr.log](https://github.com/user-attachments/files/18676532/TestG1BarrierGeneration_jtr.log) > > @offamitkumar thanks for the report! Most likely the test failures are only due to missing optimizations (because of limitations in the barrier elision pattern matching analysis), but if you want me to confirm please send the entire jtreg log, without truncation. You can disable output truncation running the test like this: > `make run-test TEST="compiler/gcbarriers/TestG1BarrierGeneration.java" JTREG="MAX_OUTPUT=999999999"` > Please double-check that the output log file does not contain any `Output overflow` message. @robcasloz Sure: I can spend time on it, maybe on weekend, for now I am overloaded with some other tasks.
[TestG1BarrierGeneration_jtr_no_overflow.log](https://github.com/user-attachments/files/18706090/TestG1BarrierGeneration_jtr_no_overflow.log) ------------- PR Comment: https://git.openjdk.org/jdk/pull/23235#issuecomment-2642733177 From rcastanedalo at openjdk.org Fri Feb 7 14:48:51 2025 From: rcastanedalo at openjdk.org (Roberto Castañeda Lozano) Date: Fri, 7 Feb 2025 14:48:51 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v6] In-Reply-To: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> Message-ID: <2jrzusvVl-XI8K734YlChq4ObRX75yovTq7mWTf8ZlA=.0e75781a-5d52-4919-ad28-c5e91ec3a47f@github.com> > G1 barriers can be safely elided from writes to newly allocated objects as long as no safepoint is taken between the allocation and the write. This changeset complements early G1 barrier elision (performed by the platform-independent phases of C2, and limited to writes immediately following allocations) with a more general elision pass done at a late stage. > > The late elision pass exploits that it runs at a stage where the relative order of memory accesses and safepoints cannot change anymore to elide barriers from initialization writes that do not immediately follow the corresponding allocation, e.g. in conditional initialization writes: > > > o = new MyObject(); > if (...) { > o.myField = ...; // barrier elided only after this changeset > // (assuming no safepoint in the if condition) > } > > > or in initialization writes placed after exception-throwing checks: > > > o = new MyObject(); > if (...) { > throw new Exception(""); > } > o.myField = ...; // barrier elided only after this changeset > // (assuming no safepoint in the above if condition) > > > These patterns are commonly found in Java code, e.g.
in the core libraries: > > - [conditional initialization](https://github.com/openjdk/jdk/blob/25fecaaf87400af535c242fe50296f1f89ceeb16/src/java.base/share/classes/java/lang/String.java#L4850), or > > - [initialization after exception-throwing checks (in the superclass constructor)](https://github.com/openjdk/jdk/blob/master/src/java.base/share/classes/java/nio/X-Buffer.java.template#L324). > > The optimization also enhances barrier elision for array initialization writes, for example eliding barriers from small array initialization loops (for which safepoints are not inserted): > > > Object[] a = new Object[...]; > for (int i = 0; i < a.length; i++) { > a[i] = ...; // barrier elided only after this changeset > } > > > or eliding barriers from array initialization writes with unknown array index: > > > Object[] a = new Object[...]; > a[index] = ...; // barrier elided only after this changeset > > > The logic used to perform this additional barrier elision is a subset of a pre-existing ZGC-specific optimization. This changeset simply reuses the relevant subset (barrier elision for writes to newly-allocated objects) by extracting the core of the optimization logic from `zBarrierSetC2.cpp` into the GC-shared file `barrierSetC2.cpp`. The functions `block_has_safepoint`, `block_index`, `look_through_node`, `is_{undefined|unknown|concrete}`, `get_base_and_offset`, `is_array... 
Roberto Castañeda Lozano has updated the pull request incrementally with one additional commit since the last revision: Disable test IR checks for cases where barrier elision analysis fails to elide on s390 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23235/files - new: https://git.openjdk.org/jdk/pull/23235/files/3671f474..956e0ac5 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23235&range=05 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23235&range=04-05 Stats: 9 lines in 1 file changed: 9 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23235.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23235/head:pull/23235 PR: https://git.openjdk.org/jdk/pull/23235 From rcastanedalo at openjdk.org Fri Feb 7 14:55:13 2025 From: rcastanedalo at openjdk.org (Roberto Castañeda Lozano) Date: Fri, 7 Feb 2025 14:55:13 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v4] In-Reply-To: References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> <-aHCYC9iVc4eMZ3pMfiDpqaW-wGM_s3zRMiVBWoadCM=.910336cd-3be2-45b5-9874-63b71abf38f8@github.com> Message-ID: On Fri, 7 Feb 2025 09:21:39 GMT, Roberto Castañeda Lozano wrote: >> I see TestG1BarrierGeneration.java failure :( >> >> [TestG1BarrierGeneration_jtr.log](https://github.com/user-attachments/files/18676532/TestG1BarrierGeneration_jtr.log) > >> I see TestG1BarrierGeneration.java failure :( >> >> [TestG1BarrierGeneration_jtr.log](https://github.com/user-attachments/files/18676532/TestG1BarrierGeneration_jtr.log) > > @offamitkumar thanks for the report! Most likely the test failures are only due to missing optimizations (because of limitations in the barrier elision pattern matching analysis), but if you want me to confirm please send the entire jtreg log, without truncation.
You can disable output truncation running the test like this: > `make run-test TEST="compiler/gcbarriers/TestG1BarrierGeneration.java" JTREG="MAX_OUTPUT=999999999"` > Please double-check that the output log file does not contain any `Output overflow` message. > @robcasloz Sure: > > I can spend time on it, maybe on weekend, for now I am overloaded with some other tasks. > > [TestG1BarrierGeneration_jtr_no_overflow.log](https://github.com/user-attachments/files/18706090/TestG1BarrierGeneration_jtr_no_overflow.log) Thanks Amit, I had a look and the failures are indeed due to missing barrier elisions for atomic operations on newly created objects, which is suboptimal but safe (and in practice unlikely to make a noticeable performance difference). I just disabled IR checks for the two affected tests on s390 by now (commit 956e0ac5). The issue is likely due to limitations in the pattern matching logic of barrier elision, but I do not have the proper means to debug it on s390. If you find a solution before this changeset is fully reviewed, feel free to propose a patch and I will merge it into the changeset. Otherwise, it can always be done as follow-up work. Hope this works for you! ------------- PR Comment: https://git.openjdk.org/jdk/pull/23235#issuecomment-2643162531 From tschatzl at openjdk.org Fri Feb 7 16:58:22 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 7 Feb 2025 16:58:22 GMT Subject: RFR: 8349213: G1: Clearing bitmaps during collection set merging not claimed by region [v3] In-Reply-To: References: Message-ID: > Hi all, > > please review this change that makes (optional) bitmap clearing during merging remembered sets claim regions. Otherwise every thread will do the (currently little) work themselves over and over again. > > Testing: tier1-3 > > Thanks, > Thomas Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains four commits: - Merge branch 'master' into 8349213-bitmapclear-merging-not-claiming-regions - * ayang review - * move commenty - 8349213: G1: Clearing bitmaps during collection set merging not claimed by region Hi all, please review this change that makes (optional) bitmap clearing during merging remembered sets claim regions. Otherwise every thread will do the (currently little) work themselves over and over again. Testing: tier1-3 Thanks, Thomas ------------- Changes: https://git.openjdk.org/jdk/pull/23419/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23419&range=02 Stats: 12 lines in 1 file changed: 9 ins; 3 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23419.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23419/head:pull/23419 PR: https://git.openjdk.org/jdk/pull/23419 From wkemper at openjdk.org Fri Feb 7 22:21:45 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 7 Feb 2025 22:21:45 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v4] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). 
William Kemper has updated the pull request incrementally with two additional commits since the last revision: - Simplify locking protocol - Make shutdown more robust, make better use of request lock ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/ae207480..a6513bcb Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=02-03 Stats: 133 lines in 5 files changed: 54 ins; 39 del; 40 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From wkemper at openjdk.org Fri Feb 7 22:28:25 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 7 Feb 2025 22:28:25 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v5] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). 
William Kemper has updated the pull request incrementally with one additional commit since the last revision: Fix includes ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/a6513bcb..d16f6fd0 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=03-04 Stats: 3 lines in 1 file changed: 1 ins; 2 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From kdnilsen at openjdk.org Fri Feb 7 23:59:52 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 7 Feb 2025 23:59:52 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References: Message-ID: > At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. > > For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. > > This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Respond to reviewer feedback In testing suggested refinements, I discovered a bug in original implementation. ShenandoahFreeSet::capacity() does not represent the size of young generation. It represents the total size of the young regions that had available memory at the time we most recently rebuilt the ShenandoahFreeSet. 
I am rerunning the performance tests following this suggested change. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23306/files - new: https://git.openjdk.org/jdk/pull/23306/files/a850e484..7969515d Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=00-01 Stats: 13 lines in 5 files changed: 4 ins; 0 del; 9 mod Patch: https://git.openjdk.org/jdk/pull/23306.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23306/head:pull/23306 PR: https://git.openjdk.org/jdk/pull/23306 From kdnilsen at openjdk.org Fri Feb 7 23:59:52 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 7 Feb 2025 23:59:52 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References: Message-ID: On Fri, 31 Jan 2025 01:15:01 GMT, Xiaolong Peng wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Respond to reviewer feedback >> >> In testing suggested refinements, I discovered a bug in original >> implementation. ShenandoahFreeSet::capacity() does not represent the >> size of young generation. It represents the total size of the young >> regions that had available memory at the time we most recently rebuilt >> the ShenandoahFreeSet. >> >> I am rerunning the performance tests following this suggested change. > > src/hotspot/share/gc/shenandoah/shenandoahMetrics.cpp line 52: > >> 50: size_t free_actual = free_set->available(); >> 51: // The sum of free_set->capacity() and ->reserved represents capacity of young in generational, heap in non-generational. 
>> 52: size_t free_expected = ((free_set->capacity() + free_set->reserved()) / 100) * ShenandoahCriticalFreeThreshold; > > We may pass ShenandoahGeneration as parameter to `is_good_progress` to simplify the calculation of free_expected, it should be like: > ` > generation->max_capacity() / 100 * ShenandoahCriticalFreeThreshold > ` > Good part is, free_expected might be more accurate in Full GC/Degen for global cycle, e.g. Full GC collects memory for global, `free_expected` should be calculated using the metrics from global generation. But either way, `free_expected` is not clearly defined in generational mode now, current code also works. Thanks for this suggestion. I've made change. It turns out there was actually a bug in the original implementation, so I am retesting the performance results. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1947334711 From kdnilsen at openjdk.org Fri Feb 7 23:59:52 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 7 Feb 2025 23:59:52 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References: Message-ID: On Tue, 4 Feb 2025 15:50:59 GMT, Paul Hohensee wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Respond to reviewer feedback >> >> In testing suggested refinements, I discovered a bug in original >> implementation. ShenandoahFreeSet::capacity() does not represent the >> size of young generation. It represents the total size of the young >> regions that had available memory at the time we most recently rebuilt >> the ShenandoahFreeSet. >> >> I am rerunning the performance tests following this suggested change. > > src/hotspot/share/gc/shenandoah/shenandoahMetrics.cpp line 52: > >> 50: size_t free_actual = free_set->available(); >> 51: // The sum of free_set->capacity() and ->reserved represents capacity of young in generational, heap in non-generational. 
>> 52: size_t free_expected = ((free_set->capacity() + free_set->reserved()) / 100) * ShenandoahCriticalFreeThreshold; > > As an outsider, the units involved and what exactly is being calculated is pretty opaque. Why would we divide by 100 to compute free_expected and not do the same for free_actual? Do we care about integer division truncation? The default value of ShenandoahCriticalFreeThreshold is 1, so multiplying by it is a nop by default, which seems strange. ShenandoahCriticalFreeThreshold represents a percentage of the "total size". To calculate N% of the young generation size, we divide the generation size by 100 and then multiply by ShenandoahCriticalFreeThreshold. This code is a bit different in the most recent revision. Do you think it needs a comment? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1947335561 From xpeng at openjdk.org Sat Feb 8 02:11:21 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Sat, 8 Feb 2025 02:11:21 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References: Message-ID: On Fri, 7 Feb 2025 23:54:46 GMT, Kelvin Nilsen wrote: >> src/hotspot/share/gc/shenandoah/shenandoahMetrics.cpp line 52: >> >>> 50: size_t free_actual = free_set->available(); >>> 51: // The sum of free_set->capacity() and ->reserved represents capacity of young in generational, heap in non-generational. >>> 52: size_t free_expected = ((free_set->capacity() + free_set->reserved()) / 100) * ShenandoahCriticalFreeThreshold; >> >> We may pass ShenandoahGeneration as parameter to `is_good_progress` to simplify the calculation of free_expected, it should be like: >> ` >> generation->max_capacity() / 100 * ShenandoahCriticalFreeThreshold >> ` >> Good part is, free_expected might be more accurate in Full GC/Degen for global cycle, e.g. Full GC collects memory for global, `free_expected` should be calculated using the metrics from global generation. 
But either way, `free_expected` is not clearly defined in generational mode now, current code also works. > Thanks for this suggestion. I've made change. It turns out there was actually a bug in the original implementation, so I am retesting the performance results. Thanks. Honestly, I didn't understand why `(free_set->capacity() + free_set->reserved())` represents the capacity of young in generational mode; is that the bug you found? `free_set->capacity()` excludes the regions that don't have enough capacity (it is calculated when the free set is rebuilt). Thinking about it a bit more, it makes more sense to calculate free_expected in `snap_before`: max_capacity of generations may change after collection, so free_expected should be calculated before the cycle. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1947405475 From tschatzl at openjdk.org Sat Feb 8 10:35:06 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Sat, 8 Feb 2025 10:35:06 GMT Subject: RFR: 8349213: G1: Clearing bitmaps during collection set merging not claimed by region [v4] In-Reply-To: References: Message-ID: > Hi all, > > please review this change that makes (optional) bitmap clearing during merging remembered sets claim regions. Otherwise every thread will do the (currently little) work themselves over and over again. > > Testing: tier1-3 > > Thanks, > Thomas Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains six commits: - * fix botched merge - Merge branch 'master' into 8349213-bitmapclear-merging-not-claiming-regions - Merge branch 'master' into 8349213-bitmapclear-merging-not-claiming-regions - * ayang review - * move commenty - 8349213: G1: Clearing bitmaps during collection set merging not claimed by region Hi all, please review this change that makes (optional) bitmap clearing during merging remembered sets claim regions.
Otherwise every thread will do the (currently little) work themselves over and over again. Testing: tier1-3 Thanks, Thomas ------------- Changes: https://git.openjdk.org/jdk/pull/23419/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23419&range=03 Stats: 10 lines in 1 file changed: 8 ins; 2 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23419.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23419/head:pull/23419 PR: https://git.openjdk.org/jdk/pull/23419 From jarek.odzga at gmail.com Sun Feb 9 19:54:20 2025 From: jarek.odzga at gmail.com (Jaroslaw Odzga) Date: Sun, 9 Feb 2025 11:54:20 -0800 Subject: Configurable G1 heap expansion aggressiveness Message-ID:

Context and Motivation

In multi-tenant environments, e.g. Kubernetes clusters in cloud environments, there is a strong incentive to use as little memory as possible. Lower memory usage means more processes can be packed on a single VM, which directly translates to lower cloud cost. Configuring G1 heap size in this setup is currently challenging. On the one hand, we would like to set the max heap size to a high value so that the application doesn't fail with heap OOME when faced with unexpectedly high load or organic growth. On the other hand, we need to set the max heap size to as small a value as possible because G1 is very eager to expand the heap even when tuned to collect garbage aggressively. Ideally, we would like to:
- Set the initial heap size to a small value.
- Set the max heap size to a value larger than expected usage so that the application can handle unexpected load and organic growth.
- Configure G1 GC to not expand the heap aggressively.
This is currently not possible. We propose two new JVM G1 flags that would give us more control over G1 heap expansion aggressiveness and realize significant cost savings in multi-tenant environments. At the same time, we don't want to change existing G1 behavior: with default values of the new flags, current G1 behavior would be maintained.
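As a concrete illustration of the setup described above, such a deployment might launch the JVM roughly as follows. The values and `app.jar` are placeholders for illustration, not recommendations, and all flags shown are existing HotSpot options:

```shell
# Small initial heap, generous ceiling for unexpected load or organic growth.
# Illustrative values only; tune per workload.
java -XX:+UseG1GC \
     -Xms256m \
     -Xmx4g \
     -XX:MinHeapFreeRatio=20 \
     -XX:MaxHeapFreeRatio=60 \
     -jar app.jar
```

The third item in the list above, making G1 less aggressive about heap expansion, is the part that existing flags cannot express and that this proposal addresses.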
Analysis

Currently, even with a very aggressive G1 configuration such as: -XX:-G1UseAdaptiveIHOP -XX:InitiatingHeapOccupancyPercent=20 -XX:GCTimeRatio=4 -XX:MinHeapFreeRatio=20 -XX:MaxHeapFreeRatio=60 the heap is fairly eagerly expanded. We found two culprits responsible for this in the G1HeapSizingPolicy::young_collection_expansion_amount() function. First, the scale_with_heap() function makes pause_time_threshold small in cases where the current heap size is smaller than 1/2 of the max heap size. While it is likely a desired behavior in many situations, it also causes memory usage spikes in situations where the max heap size is much larger than the current heap size. Second, the MinOverThresholdForGrowth constant equal to 4 is an arbitrary value which hardcodes the heap expansion aggressiveness. We observed that short_term_pause_time_ratio can exceed pause_time_threshold and trigger heap expansion too eagerly in many situations, especially when the allocation rate is spiky.

Proposal

We would like to introduce two new experimental flags:
- G1ScaleWithHeapPauseTimeThreshold: a binary flag that would allow disabling scale_with_heap()
- G1MinPausesOverThresholdForGrowth: a value between 1 and 10, a configurable replacement for the MinOverThresholdForGrowth constant.
We don't want to change the default behavior of G1. Default values for these flags (G1ScaleWithHeapPauseTimeThreshold=true, G1MinPausesOverThresholdForGrowth=4) would maintain the existing behavior.

Alternatives

There is currently no good alternative. Potentially we could configure G1 aggressively to trigger GC very frequently, e.g.: -XX:-G1UseAdaptiveIHOP -XX:InitiatingHeapOccupancyPercent=20 -XX:GCTimeRatio=4 -XX:MinHeapFreeRatio=20 -XX:MaxHeapFreeRatio=60 Even with this configuration we see occasional large memory spikes where the heap is quickly expanded.
Even though the expanded heap contracts eventually, this poses a significant problem because in practice we don't know if such a spike could have been avoided, so it is not obvious how much memory the application really needs. Of course, such a configuration would also consume more CPU.

Experimental results

We tested this change on patched jdk17. With the new flags we can use a far less aggressive -XX:GCTimeRatio=9 together with -XX:-G1ScaleWithHeapPauseTimeThreshold and -XX:G1MinPausesOverThresholdForGrowth=10 (this effectively disables heap expansion based on the short-term pause ratio and only depends on the long-term pause ratio). Compared to the more aggressive G1 configuration mentioned above, we see lower CPU usage and 30%-60% lower max memory usage.

Implementation

https://github.com/openjdk/jdk/pull/23534 From phh at openjdk.org Mon Feb 10 18:44:13 2025 From: phh at openjdk.org (Paul Hohensee) Date: Mon, 10 Feb 2025 18:44:13 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References: Message-ID: On Fri, 7 Feb 2025 23:56:56 GMT, Kelvin Nilsen wrote: >> src/hotspot/share/gc/shenandoah/shenandoahMetrics.cpp line 52: >> >>> 50: size_t free_actual = free_set->available(); >>> 51: // The sum of free_set->capacity() and ->reserved represents capacity of young in generational, heap in non-generational. >>> 52: size_t free_expected = ((free_set->capacity() + free_set->reserved()) / 100) * ShenandoahCriticalFreeThreshold; >> >> As an outsider, the units involved and what exactly is being calculated is pretty opaque. Why would we divide by 100 to compute free_expected and not do the same for free_actual? Do we care about integer division truncation? The default value of ShenandoahCriticalFreeThreshold is 1, so multiplying by it is a nop by default, which seems strange. > > ShenandoahCriticalFreeThreshold represents a percentage of the "total size".
To calculate N% of the young generation size, we divide the generation size by 100 and then multiply by ShenandoahCriticalFreeThreshold. This code is a bit different in the most recent revision. Do you think it needs a comment? Yes :) ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1949689186 From kdnilsen at openjdk.org Mon Feb 10 19:55:12 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 19:55:12 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v2] In-Reply-To: References: Message-ID: <68DeNcSBaX3EJo0OuQI7800ywqaQjhcCMpIjFqwdoao=.0da72a64-afa1-43bc-83bb-d4caf0d62514@github.com> On Tue, 4 Feb 2025 16:08:02 GMT, Paul Hohensee wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Respond to reviewer feedback > > src/hotspot/share/gc/shenandoah/heuristics/shenandoahHeuristics.hpp line 87: > >> 85: size_t _declined_trigger_count; // This counts how many times since previous GC finished that this >> 86: // heuristic has answered false to should_start_gc(). >> 87: size_t _previous_trigger_declinations; // This represents the value of _declined_trigger_count as captured at the > > Maybe the name should be _most_recent_declined_trigger_count, which relates it directly to _declined_trigger_count. Thanks for the suggestion. I'm making this change.
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23305#discussion_r1949788716 From kdnilsen at openjdk.org Mon Feb 10 20:02:14 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 20:02:14 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v2] In-Reply-To: References: Message-ID: On Tue, 4 Feb 2025 16:04:34 GMT, Paul Hohensee wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Respond to reviewer feedback > > src/hotspot/share/gc/shenandoah/heuristics/shenandoahHeuristics.cpp line 261: > >> 259: >> 260: void ShenandoahHeuristics::record_success_concurrent() { >> 261: _start_gc_is_pending = false; > > The name _start_gc_is_pending implies that it should be set false as soon as a gc cycle starts, not when it finishes. Maybe _gc_pending? Or maybe setting it false at the end of a gc cycle is a bug? :) You make a good point. I'll change the control flow to cancel the trigger as soon as we start up the GC. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23305#discussion_r1949798178 From kdnilsen at openjdk.org Mon Feb 10 20:28:54 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 20:28:54 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v2] In-Reply-To: References: Message-ID: On Tue, 4 Feb 2025 16:14:49 GMT, Paul Hohensee wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Respond to reviewer feedback > > src/hotspot/share/gc/shenandoah/heuristics/shenandoahAdaptiveHeuristics.cpp line 318: > >> 316: >> 317: if (ShenandoahHeuristics::should_start_gc()) { >> 318: _start_gc_is_pending = true; > > I assume there's no race here, i.e., only one thread reads/writes _start_gc_is_pending. 
If there's a race, make sure it's benign. In either case, _start_gc_is_pending is made "sticky" by this code. There is no race. A single control thread queries should_start_gc() and that is the same thread that initiates the GC. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23305#discussion_r1949828557 From kdnilsen at openjdk.org Mon Feb 10 20:28:54 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 20:28:54 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v3] In-Reply-To: References: Message-ID: <2v0axonBAvZDKo779TX8POWEXGeMCA5xaKV3KQBQo14=.fbd1e6bc-0e12-4a0c-a9f7-ba1d3c5f728d@github.com> > Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. > > We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. > > As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catch up.
The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. > > This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. Kelvin Nilsen has updated the pull request incrementally with two additional commits since the last revision: - Respond to reviewer feedback - Use generation size to determine expected free ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23305/files - new: https://git.openjdk.org/jdk/pull/23305/files/ee3cdacc..8a9e4c5e Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=01-02 Stats: 27 lines in 8 files changed: 13 ins; 3 del; 11 mod Patch: https://git.openjdk.org/jdk/pull/23305.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23305/head:pull/23305 PR: https://git.openjdk.org/jdk/pull/23305 From kdnilsen at openjdk.org Mon Feb 10 20:41:10 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 20:41:10 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References: Message-ID: On Sat, 8 Feb 2025 02:06:13 GMT, Xiaolong Peng wrote: >> Thanks for this suggestion. I've made the change. It turns out there was actually a bug in the original implementation, so I am retesting the performance results. > > Thanks; honestly, I didn't understand why `(free_set->capacity() + free_set->reserved())` represents the capacity of young in generational mode. Is it the bug you found?
`free_set->capacity()` is the capacity of all mutator regions, which also excludes the regions that don't have capacity for new object allocation (it is calculated when the free set is rebuilt). > > I thought about it a bit more; it makes more sense to calculate free_expected in `snap_before`. The max_capacity of generations may change after collection, so free_expected should be calculated before the cycle. Interesting thoughts. So young-generation size will change under these circumstances: 1. There's a lot of young-gen memory to be promoted, or we choose to promote some young-gen regions in place (by relabeling the regions as OLD without evacuating their data). In both of these cases, we may shrink young in order to expand old. 2. The GC cycle is mixed, so it has the side effect of reclaiming some old regions. These reclaimed old regions will typically be granted back to young, until such time as we need to expand old in order to hold results of promotion. While it makes sense for expected to be computed based on "original size" of young generation, the question of how much free remaining in young represents "good progress" should probably be based on the current size of young. Ultimately, we are trying to figure out if there's enough memory in young to make it worthwhile to attempt another concurrent GC cycle. I realize this thinking is a bit "fuzzy". The heuristic was originally designed for non-generational use. I'm inclined to keep it as currently implemented, but should probably add a comment to explain why. What do you think? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1949847579 From wkemper at openjdk.org Mon Feb 10 21:26:59 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 10 Feb 2025 21:26:59 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v6] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here.
> * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). William Kemper has updated the pull request incrementally with two additional commits since the last revision: - Add event for control thread state changes - Fix shutdown livelock error ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/d16f6fd0..f11584d5 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=05 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=04-05 Stats: 13 lines in 1 file changed: 6 ins; 3 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From wkemper at openjdk.org Mon Feb 10 21:54:51 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 10 Feb 2025 21:54:51 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 [v2] In-Reply-To: References: Message-ID: <73lkaeIWH7aBWahNyU_czTYSnSmCMOURWYDv55-zc4Y=.39398a24-6904-465c-8e47-7bfe32efc9db@github.com> > Non-java threads were not having their gc-state configured 
when they attach. If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. William Kemper has updated the pull request incrementally with one additional commit since the last revision: Hold the thread lock when concurrently changing gc state ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23428/files - new: https://git.openjdk.org/jdk/pull/23428/files/f402628e..1a4e3bb1 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23428&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23428&range=00-01 Stats: 13 lines in 1 file changed: 8 ins; 0 del; 5 mod Patch: https://git.openjdk.org/jdk/pull/23428.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23428/head:pull/23428 PR: https://git.openjdk.org/jdk/pull/23428 From xpeng at openjdk.org Mon Feb 10 22:27:12 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Mon, 10 Feb 2025 22:27:12 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References: Message-ID: <7B6iyKusHAIUGeaVJVEiCWQTIe64ZPJKImH-YYUB3K0=.8d7612e4-9e08-4b90-9c60-0f68d3e7c4ad@github.com> On Mon, 10 Feb 2025 20:38:35 GMT, Kelvin Nilsen wrote: >> Thanks, honest I didn't understand that why `(free_set->capacity() + free_set->reserved()` represents capacity of young in generational, is it the bug you found? `free_set->capacity()` is the capacity of all mutator regions which also excludes the regions doesn't have capacity for new object alloc(it is calculated when rebuild free set) >> >> I thought a bit more, it makes more sense to calculate free_expected in `snap_before`, max_capacity of generations may change after collection, the free_expected should be calculated before the cycle. > > Interesting thoughts. So young-generation size will change under these circumstances: > > 1. 
There's a lot of young-gen memory to be promoted, or we choose to promote some young-gen regions in place (by relabeling the regions as OLD without evacuating their data). In both of these cases, we may shrink young in order to expand old. > 2. The GC cycle is mixed, so it has the side effect of reclaiming some old regions. These reclaimed old regions will typically be granted back to young, until such time as we need to expand old in order to hold results of promotion. > > While it makes sense for expected to be computed based on "original size" of young generation, the question of how much free remaining in young represents "good progress" should probably be based on the current size of young. Ultimately, we are trying to figure out if there's enough memory in young to make it worthwhile to attempt another concurrent GC cycle. > > I realize this thinking is a bit "fuzzy". The heuristic was originally designed for non-generational use. > > I'm inclined to keep it as currently implemented, but should probably add a comment to explain why. What do you think? Thanks for the explanation; I agree it is a bit "fuzzy". I'm not sure whether we should consider the following case: a degen cycle doesn't reclaim any memory, but promotes some young regions, causing young capacity to shrink; in this case we may treat it as "good progress" when actually it is not. "Good progress" could instead be `free_actual_after > free_actual_before && free_actual_after > free_expected`; what do you think? I am not sure of all the cases that trigger a degen cycle; this might be a case that never happens.
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1949985042 From kdnilsen at openjdk.org Mon Feb 10 23:19:27 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 23:19:27 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v4] In-Reply-To: References: Message-ID: > Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. > > We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. > > As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catch up. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. > > This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism.
We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Revert "Use generation size to determine expected free" This reverts commit 94a32ebfe5fefcc0e899e09e6fbfc0585c62b4e0. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23305/files - new: https://git.openjdk.org/jdk/pull/23305/files/8a9e4c5e..ee7fe689 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=02-03 Stats: 5 lines in 4 files changed: 0 ins; 0 del; 5 mod Patch: https://git.openjdk.org/jdk/pull/23305.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23305/head:pull/23305 PR: https://git.openjdk.org/jdk/pull/23305 From kdnilsen at openjdk.org Mon Feb 10 23:32:09 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 23:32:09 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: <7B6iyKusHAIUGeaVJVEiCWQTIe64ZPJKImH-YYUB3K0=.8d7612e4-9e08-4b90-9c60-0f68d3e7c4ad@github.com> References: <7B6iyKusHAIUGeaVJVEiCWQTIe64ZPJKImH-YYUB3K0=.8d7612e4-9e08-4b90-9c60-0f68d3e7c4ad@github.com> Message-ID: <16WXn9LEVXGdRSeJ98OxomG66UfnruLxo9nnfY52ZJo=.f4acdbb1-c99b-4be8-807b-bdbf9504af81@github.com> On Mon, 10 Feb 2025 22:24:35 GMT, Xiaolong Peng wrote: >> Interesting thoughts. So young-generation size will change under these circumstances: >> >> 1. There's a lot of young-gen memory to be promoted, or we choose to promote some young-gen regions in place (by relabeling the regions as OLD without evacuating their data). In both of these cases, we may shrink young in order to expand old. >> 2. The GC cycle is mixed, so it has the side effect of reclaiming some old regions. 
These reclaimed old regions will typically be granted back to young, until such time as we need to expand old in order to hold results of promotion. >> >> While it makes sense for expected to be computed based on "original size" of young generation, the question of how much free remaining in young represents "good progress" should probably be based on the current size of young. Ultimately, we are trying to figure out if there's enough memory in young to make it worthwhile to attempt another concurrent GC cycle. >> >> I realize this thinking is a bit "fuzzy". The heuristic was originally designed for non-generational use. >> >> I'm inclined to keep it as currently implemented, but should probably add a comment to explain why. What do you think? > > Thanks for the explanation; I agree it is a bit "fuzzy". > I'm not sure whether we should consider the following case: > > A degen cycle doesn't reclaim any memory, but promotes some young regions, causing young capacity to shrink; in this case we may treat it as "good progress" when actually it is not. > > "Good progress" could instead be `free_actual_after > free_actual_before && free_actual_after > free_expected`; what do you think? I am not sure of all the cases that trigger a degen cycle; this might be a case that never happens. If we manage to pass the test "free_actual_after > free_expected" following the degen, even if young has shrunk, I think it is reasonable to pursue concurrent GC. Passing this exact test at the end of the next GC (assuming no further adjustments to generation sizes) would qualify us to continue with concurrent GC on the next cycle. In general, it is very rare that "full gc" is the right thing to do. We're in the process of deprecating it entirely. I will add a comment to clarify the thinking here.
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1950040394 From kdnilsen at openjdk.org Mon Feb 10 23:43:11 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 23:43:11 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References: Message-ID: On Fri, 7 Feb 2025 23:59:52 GMT, Kelvin Nilsen wrote: >> At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. >> >> For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. >> >> This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. > > Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: > > Respond to reviewer feedback > > In testing suggested refinements, I discovered a bug in original > implementation. ShenandoahFreeSet::capacity() does not represent the > size of young generation. It represents the total size of the young > regions that had available memory at the time we most recently rebuilt > the ShenandoahFreeSet. > > I am rerunning the performance tests following this suggested change. These are updated performance results after making the change that uses generation size to determine expected. 
This change computes a larger expected size, increasing the likelihood that a particular degenerated cycle will be considered "bad progress": ![Screenshot 2025-02-10 at 3 38 18 PM](https://github.com/user-attachments/assets/d0826502-aec1-4e30-88e7-03a4d25e5661) This represents overall improvement compared to the previously reported number. It would appear that the difference in performance might be the result of "random noise". ------------- PR Comment: https://git.openjdk.org/jdk/pull/23306#issuecomment-2649499090 From kdnilsen at openjdk.org Mon Feb 10 23:48:11 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 23:48:11 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References: Message-ID: On Fri, 7 Feb 2025 23:59:52 GMT, Kelvin Nilsen wrote: >> At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. >> >> For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. >> >> This issue was first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. > > Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: > > Respond to reviewer feedback > > In testing suggested refinements, I discovered a bug in the original > implementation. ShenandoahFreeSet::capacity() does not represent the > size of the young generation. It represents the total size of the young > regions that had available memory at the time we most recently rebuilt > the ShenandoahFreeSet. > > I am rerunning the performance tests following this suggested change.
These are the results of combining both proposed PRs into a single execution test: ![Screenshot 2025-02-10 at 3 41 31 PM](https://github.com/user-attachments/assets/c3db758c-41ea-4c0d-a91e-0a44aaefc390) This result is not as good as what was reported above. In my judgment, it still represents improvement over tip. The difference between the two runs may also be signal noise, as there is no clear correlation between the number of Full GCs and percentile latencies. The two full GCs reported in the "better both (redo)" run both result from alloc failure during evacuation. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23306#issuecomment-2649506896 From xpeng at openjdk.org Mon Feb 10 23:51:13 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Mon, 10 Feb 2025 23:51:13 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References: Message-ID: On Fri, 7 Feb 2025 23:59:52 GMT, Kelvin Nilsen wrote: >> At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. >> >> For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. >> >> This issue was first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. > > Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: > > Respond to reviewer feedback > > In testing suggested refinements, I discovered a bug in the original > implementation. ShenandoahFreeSet::capacity() does not represent the > size of the young generation.
It represents the total size of the young > regions that had available memory at the time we most recently rebuilt > the ShenandoahFreeSet. > > I am rerunning the performance tests following this suggested change. Thanks for the comprehensive tests and explanations; my approval doesn't count though :) ------------- Marked as reviewed by xpeng (Author). PR Review: https://git.openjdk.org/jdk/pull/23306#pullrequestreview-2607419434 From wkemper at openjdk.org Tue Feb 11 00:54:36 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 11 Feb 2025 00:54:36 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v7] In-Reply-To: References: Message-ID: <7vsmPKQNSOx9PxGp2C1yjC5IeEtB2ZWPRybQQ-s4YNE=.1b8ffa7e-cc6d-4885-a9c4-16a503d9d8d9@github.com> > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled).
William Kemper has updated the pull request incrementally with two additional commits since the last revision: - Do not accept requests if control thread is terminating - Notify waiters when control thread terminates ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/f11584d5..861ed699 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=06 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=05-06 Stats: 26 lines in 3 files changed: 24 ins; 2 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From kdnilsen at openjdk.org Tue Feb 11 03:39:41 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 11 Feb 2025 03:39:41 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc Message-ID: In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. 
------------- Commit messages: - Be less eager to upgrade degen to full gc Changes: https://git.openjdk.org/jdk/pull/23552/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23552&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8349766 Stats: 20 lines in 2 files changed: 17 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/23552.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23552/head:pull/23552 PR: https://git.openjdk.org/jdk/pull/23552 From kdnilsen at openjdk.org Tue Feb 11 03:39:41 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 11 Feb 2025 03:39:41 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc In-Reply-To: References: Message-ID: On Tue, 11 Feb 2025 03:31:51 GMT, Kelvin Nilsen wrote: > In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. 
Some detailed results running the workload mentioned in the JBS ticket on tip: ![Screenshot 2025-02-10 at 7 10 18 PM](https://github.com/user-attachments/assets/c06606a6-ec21-4e40-b117-915ddfc0d1f6) These are results running the same workload with the changes of this PR: ![Screenshot 2025-02-10 at 7 35 47 PM](https://github.com/user-attachments/assets/432c227e-9bf4-4f21-8099-1b39b5af364a) ------------- PR Comment: https://git.openjdk.org/jdk/pull/23552#issuecomment-2649732684 PR Comment: https://git.openjdk.org/jdk/pull/23552#issuecomment-2649733471 From kdnilsen at openjdk.org Tue Feb 11 03:53:10 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 11 Feb 2025 03:53:10 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc In-Reply-To: References: Message-ID: On Tue, 11 Feb 2025 03:31:51 GMT, Kelvin Nilsen wrote: > In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done.
While there is reason to be concerned about trial two results on the PR code, I expect that unlucky scenario, whatever it was, will be much less likely in the context of in-flight PRs to advance triggering of GC when allocation rates are accelerating and to surge GC workers whenever there is increased risk of degenerated cycles. Perhaps, we should wait until those other PRs are integrated and then retest. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23552#issuecomment-2649744182 From kdnilsen at openjdk.org Tue Feb 11 04:08:48 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 11 Feb 2025 04:08:48 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v3] In-Reply-To: References: Message-ID: <0bbrstoX8nDMn2Ku_WwSYn_NYSSLi3yXkWdg28imCHo=.ab1661a4-1ea5-4c57-9fde-0ee63ebac027@github.com> > At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. > > For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. > > This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. 
Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Add comments suggested by reviewers ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23306/files - new: https://git.openjdk.org/jdk/pull/23306/files/7969515d..8f644cdb Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=01-02 Stats: 15 lines in 1 file changed: 14 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/23306.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23306/head:pull/23306 PR: https://git.openjdk.org/jdk/pull/23306 From xpeng at openjdk.org Tue Feb 11 05:54:11 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Tue, 11 Feb 2025 05:54:11 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v3] In-Reply-To: <0bbrstoX8nDMn2Ku_WwSYn_NYSSLi3yXkWdg28imCHo=.ab1661a4-1ea5-4c57-9fde-0ee63ebac027@github.com> References: <0bbrstoX8nDMn2Ku_WwSYn_NYSSLi3yXkWdg28imCHo=.ab1661a4-1ea5-4c57-9fde-0ee63ebac027@github.com> Message-ID: On Tue, 11 Feb 2025 04:08:48 GMT, Kelvin Nilsen wrote: >> At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. >> >> For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. >> >> This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. > > Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: > > Add comments suggested by reviewers Marked as reviewed by xpeng (Author). 
------------- PR Review: https://git.openjdk.org/jdk/pull/23306#pullrequestreview-2607738788 From shade at openjdk.org Tue Feb 11 08:50:11 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Tue, 11 Feb 2025 08:50:11 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 [v2] In-Reply-To: <73lkaeIWH7aBWahNyU_czTYSnSmCMOURWYDv55-zc4Y=.39398a24-6904-465c-8e47-7bfe32efc9db@github.com> References: <73lkaeIWH7aBWahNyU_czTYSnSmCMOURWYDv55-zc4Y=.39398a24-6904-465c-8e47-7bfe32efc9db@github.com> Message-ID: On Mon, 10 Feb 2025 21:54:51 GMT, William Kemper wrote: >> Non-java threads were not having their gc-state configured when they attach. If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. > > William Kemper has updated the pull request incrementally with one additional commit since the last revision: > > Hold the thread lock when concurrently changing gc state Great find. So that means we cannot safely do `ShenandoahHeap::set_gc_state_concurrent`, unless we hold `Threads_lock` and do a handshake afterwards? I think a part of comment that you have near `MutexLocker` can go to `ShenandoahHeap::set_gc_state_concurrent` with the `assert(Threads_lock->is_locked(), ...`. ------------- PR Review: https://git.openjdk.org/jdk/pull/23428#pullrequestreview-2608045755 From iwalulya at openjdk.org Tue Feb 11 09:23:11 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Tue, 11 Feb 2025 09:23:11 GMT Subject: RFR: 8349213: G1: Clearing bitmaps during collection set merging not claimed by region [v4] In-Reply-To: References: Message-ID: On Sat, 8 Feb 2025 10:35:06 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this change that makes (optional) bitmap clearing during merging remembered sets claim regions. 
Otherwise every thread will do the (currently little) work themselves over and over again. >> >> Testing: tier1-3 >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains six commits: > > - * fix botched merge > - Merge branch 'master' into 8349213-bitmapclear-merging-not-claiming-regions > - Merge branch 'master' into 8349213-bitmapclear-merging-not-claiming-regions > - * ayang review > - * move commenty > - 8349213: G1: Clearing bitmaps during collection set merging not claimed by region > > Hi all, > > please review this change that makes (optional) bitmap clearing during merging remembered sets claim regions. Otherwise every thread will do the (currently little) work themselves over and over again. > > Testing: tier1-3 > > Thanks, > Thomas LGTM! ------------- Marked as reviewed by iwalulya (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23419#pullrequestreview-2608121652 From dholmes at openjdk.org Tue Feb 11 09:32:13 2025 From: dholmes at openjdk.org (David Holmes) Date: Tue, 11 Feb 2025 09:32:13 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: <_CnY-j8qQhI5hEydYYH1gfQQP909-QrWTboS79F6UHA=.cf2527c7-5a81-4e4d-8433-ce18f9d63982@github.com> On Wed, 5 Feb 2025 14:41:39 GMT, Albert Mingkun Yang wrote: >> Here is an attempt to simplify GCLocker implementation for Serial and Parallel. >> >> GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. 
However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. >> >> The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. >> >> Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. >> >> Test: tier1-8 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: > > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - gclocker Sorry still on my to-do list. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23367#issuecomment-2650249785 From tschatzl at openjdk.org Tue Feb 11 09:55:15 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 11 Feb 2025 09:55:15 GMT Subject: RFR: 8349213: G1: Clearing bitmaps during collection set merging not claimed by region [v2] In-Reply-To: References: Message-ID: On Tue, 4 Feb 2025 10:55:35 GMT, Albert Mingkun Yang wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> * ayang review > > Marked as reviewed by ayang (Reviewer). Thanks @albertnetymk @walulyai for your reviews ------------- PR Comment: https://git.openjdk.org/jdk/pull/23419#issuecomment-2650301620 From tschatzl at openjdk.org Tue Feb 11 09:55:16 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 11 Feb 2025 09:55:16 GMT Subject: Integrated: 8349213: G1: Clearing bitmaps during collection set merging not claimed by region In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 14:11:20 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that makes (optional) bitmap clearing during merging remembered sets claim regions. Otherwise every thread will do the (currently little) work themselves over and over again. > > Testing: tier1-3 > > Thanks, > Thomas This pull request has now been integrated. 
Changeset: 8e858294 Author: Thomas Schatzl URL: https://git.openjdk.org/jdk/commit/8e8582949669d5f3dcb68886ccb6a719393d1a9e Stats: 10 lines in 1 file changed: 8 ins; 2 del; 0 mod 8349213: G1: Clearing bitmaps during collection set merging not claimed by region Reviewed-by: iwalulya, ayang ------------- PR: https://git.openjdk.org/jdk/pull/23419 From phh at openjdk.org Tue Feb 11 14:20:12 2025 From: phh at openjdk.org (Paul Hohensee) Date: Tue, 11 Feb 2025 14:20:12 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v3] In-Reply-To: <0bbrstoX8nDMn2Ku_WwSYn_NYSSLi3yXkWdg28imCHo=.ab1661a4-1ea5-4c57-9fde-0ee63ebac027@github.com> References: <0bbrstoX8nDMn2Ku_WwSYn_NYSSLi3yXkWdg28imCHo=.ab1661a4-1ea5-4c57-9fde-0ee63ebac027@github.com> Message-ID: <2XLAHIk0VEr8Xae-jNqjMZjBtPTrHqm8nl7tn_rigS8=.155e8a5a-193a-49b8-a773-b8e60b4dc3f5@github.com> On Tue, 11 Feb 2025 04:08:48 GMT, Kelvin Nilsen wrote: >> At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. >> >> For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. >> >> This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. > > Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: > > Add comments suggested by reviewers Looks good. ------------- Marked as reviewed by phh (Reviewer). 
PR Review: https://git.openjdk.org/jdk/pull/23306#pullrequestreview-2608901033 From kdnilsen at openjdk.org Tue Feb 11 14:20:13 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 11 Feb 2025 14:20:13 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v3] In-Reply-To: References: Message-ID: On Mon, 10 Feb 2025 18:41:27 GMT, Paul Hohensee wrote: >> ShenandoahCriticalFreeThreshold represents a percentage of the "total size". To calculate N% of the young generation size, we divide the generation size by 100 and then multiply by ShenandoahCriticalFreeThreshold. This code is a bit different in the most recent revision. Do you think it needs a comment? > > Yes :) I've added a comment here. Thanks. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1950933308 From ayang at openjdk.org Tue Feb 11 15:28:25 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 11 Feb 2025 15:28:25 GMT Subject: RFR: 8348171: Refactor GenerationCounters and its subclasses [v5] In-Reply-To: References: Message-ID: <7otkT63ENoyKzZ29CbYpycLLwL89ARajYg36Mstz4tQ=.fd3c7dcf-5a8b-44be-9205-09e3d160d54d@github.com> > Simple refactoring of removing the use of `virtual` method and use concrete subclasses when needed. > > Test: tier1-5 Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains seven commits: - Merge branch 'master' into gen-counter - review - * some more refactoring - review - Merge branch 'master' into gen-counter - merge - gen-counter ------------- Changes: https://git.openjdk.org/jdk/pull/23209/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23209&range=04 Stats: 202 lines in 17 files changed: 6 ins; 160 del; 36 mod Patch: https://git.openjdk.org/jdk/pull/23209.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23209/head:pull/23209 PR: https://git.openjdk.org/jdk/pull/23209 From tschatzl at openjdk.org Tue Feb 11 16:19:54 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 11 Feb 2025 16:19:54 GMT Subject: RFR: 8349836: G1: Improve group prediction log message Message-ID: Hi all, please review this minor change to the group prediction log message printed with gc+ergo+cset=trace: * add group id to be able to refer to something concrete when discussing results * add total time * fix typo in `bytes_to_cop` Testing: gha, local verification Thanks, Thomas ------------- Commit messages: - 8349836 Changes: https://git.openjdk.org/jdk/pull/23562/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23562&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8349836 Stats: 12 lines in 1 file changed: 7 ins; 3 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/23562.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23562/head:pull/23562 PR: https://git.openjdk.org/jdk/pull/23562 From kdnilsen at openjdk.org Tue Feb 11 18:15:58 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 11 Feb 2025 18:15:58 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v4] In-Reply-To: References: Message-ID: > At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. 
> > For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. > > This issue was first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains five additional commits since the last revision: - Merge tag 'jdk-25+9' into fix-generational-no-progress-check Added tag jdk-25+9 for changeset 30f71622 - Add comments suggested by reviewers - Respond to reviewer feedback In testing suggested refinements, I discovered a bug in the original implementation. ShenandoahFreeSet::capacity() does not represent the size of young generation. It represents the total size of the young regions that had available memory at the time we most recently rebuilt the ShenandoahFreeSet. I am rerunning the performance tests following this suggested change. - Use freeset to determine goodness of progress As previously implemented, we used the heap size to measure goodness of progress. However, heap size is only appropriate for non-generational Shenandoah. Freeset abstraction works for both. - Use size-of young generation to assess progress Previously, we were using size of heap to assess progress of a generational degenerated cycle. But that is not appropriate, because the collection set is chosen based on the size of young generation.
------------- Changes: - all: https://git.openjdk.org/jdk/pull/23306/files - new: https://git.openjdk.org/jdk/pull/23306/files/8f644cdb..8c610136 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=02-03 Stats: 43531 lines in 2988 files changed: 18658 ins; 14204 del; 10669 mod Patch: https://git.openjdk.org/jdk/pull/23306.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23306/head:pull/23306 PR: https://git.openjdk.org/jdk/pull/23306 From kdnilsen at openjdk.org Tue Feb 11 18:21:09 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 11 Feb 2025 18:21:09 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v5] In-Reply-To: References: Message-ID: > Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. > > We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. > > As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. 
And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catch up. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. > > This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains nine additional commits since the last revision: - Merge tag 'jdk-25+9' into eliminate-no-fault-degen-penalties Added tag jdk-25+9 for changeset 30f71622 - Revert "Use generation size to determine expected free" This reverts commit 94a32ebfe5fefcc0e899e09e6fbfc0585c62b4e0. - Respond to reviewer feedback - Use generation size to determine expected free - Respond to reviewer feedback - Fix white space - Remove debug instrumentation - Only penalize heuristic if heuristic responsible If we degenerate through no fault of "late triggering", then do not penalize the heuristic. - Eliminate no-fault degen penalties As originally implemented, we apply penalties to the triggering heuristic every time we experience a degenerated cycle. This has the effect of forcing GC triggers to spiral out of control. This commit changes the penalty mechanism. When a degen happens through no fault of the heuristic triggering mechanism, we do not pile on additional penalties. Specifically, we consider that heuristic triggering is not responsible for a degenerated cycle that is associated with a GC that began immediately following the end of the previous GC cycle.
------------- Changes: - all: https://git.openjdk.org/jdk/pull/23305/files - new: https://git.openjdk.org/jdk/pull/23305/files/ee7fe689..3aabd4db Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=03-04 Stats: 43531 lines in 2988 files changed: 18658 ins; 14204 del; 10669 mod Patch: https://git.openjdk.org/jdk/pull/23305.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23305/head:pull/23305 PR: https://git.openjdk.org/jdk/pull/23305 From iwalulya at openjdk.org Tue Feb 11 18:29:23 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Tue, 11 Feb 2025 18:29:23 GMT Subject: RFR: 8349836: G1: Improve group prediction log message In-Reply-To: References: Message-ID: On Tue, 11 Feb 2025 15:29:03 GMT, Thomas Schatzl wrote: > Hi all, > > please review this minor change to the group prediction log message > printed with gc+ergo+cset=trace: > > * add group id to be able to refer to something concrete when discussing results > * add total time > * fix typo in `bytes_to_cop` > > Testing: gha, local verification > > `Group 10: 5 regions prediction total_time 20.0ms card_rs_length 123456 merge_scan_time 10.2ms code_root_scan_time_ms 5.5ms evac_time_ms 3.7ms other_time 0.3ms bytes_to_copy 1234567` > > instead of > > `Prediction for group with 76 regions, card_rs_length 320, merge_scan_time 0.02ms, code_root_scan_time_ms 0.00ms, evac_time_ms 9.92ms, other_time 45.60ms, bytes_to_cop 61408560` > > Thanks, > Thomas Looks good! ------------- Marked as reviewed by iwalulya (Reviewer). 
PR Review: https://git.openjdk.org/jdk/pull/23562#pullrequestreview-2609634280 From phh at openjdk.org Tue Feb 11 18:51:16 2025 From: phh at openjdk.org (Paul Hohensee) Date: Tue, 11 Feb 2025 18:51:16 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v5] In-Reply-To: References: Message-ID: <8Gt2wkVtRhYtPwLWfkuH8fWrboud7gjBRpCfzT2GeLw=.9e580aa0-34b7-4b7e-9ab7-f49cec2d3a6a@github.com> On Tue, 11 Feb 2025 18:21:09 GMT, Kelvin Nilsen wrote: >> Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. >> >> We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. >> >> As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catchup. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. 
>> >> This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. > > Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains nine additional commits since the last revision: > > - Merge tag 'jdk-25+9' into eliminate-no-fault-degen-penalties > > Added tag jdk-25+9 for changeset 30f71622 > - Revert "Use generation size to determine expected free" > > This reverts commit 94a32ebfe5fefcc0e899e09e6fbfc0585c62b4e0. > - Respond to reviewer feedback > - Use generation size to determine expected free > - Respond to reviewer feedback > - Fix white space > - Remove debug instrumentation > - Only penalize heuristic if heuristic responsible > > If we degenerate through no fault of "late triggering", then do not > penalize the heuristic. > - Eliminate no-fault degen penalties > > As originally implemented, we apply penalties to the triggering > heuristic every time we experience a degenerated cycle. This has the > effect of forcing GC triggers to spiral out of control. This commit > changes the penalty mechanism. When a degen happens through no fault of > the heuristic triggering mechanism, we do not pile on additional > penalties. Specifically, we consider that heuristic triggering is not > responsible for a degenerated cycle that is associated with a GC that > began immediately following the end of the previous GC cycle. Marked as reviewed by phh (Reviewer). 
------------- PR Review: https://git.openjdk.org/jdk/pull/23305#pullrequestreview-2609684439 From iwalulya at openjdk.org Tue Feb 11 19:14:32 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Tue, 11 Feb 2025 19:14:32 GMT Subject: RFR: 8349688: Crash assert(!_g1h->heap_region_containing(p)->is_survivor()) failed: Should have filtered out from-newly allocated survivor references already Message-ID: <8-jj1o3jeNZuavFIbg4VCh_oMXEJxP8vxqnz01cOVgg=.d7ea8a07-d5e6-46cc-9bdf-0f2e5c22632a@github.com> Hi, Please review this fix to the bug in setting a region index to the optional cset. The crash happens because the incorrect `_index_in_opt_cset` refers to a region that has already been evacuated. Testing: tier5-common-apps ------------- Commit messages: - set_index_in_opt_cset correctly Changes: https://git.openjdk.org/jdk/pull/23568/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23568&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8349688 Stats: 6 lines in 1 file changed: 4 ins; 0 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/23568.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23568/head:pull/23568 PR: https://git.openjdk.org/jdk/pull/23568 From wkemper at openjdk.org Tue Feb 11 19:31:09 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 11 Feb 2025 19:31:09 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 [v2] In-Reply-To: <73lkaeIWH7aBWahNyU_czTYSnSmCMOURWYDv55-zc4Y=.39398a24-6904-465c-8e47-7bfe32efc9db@github.com> References: <73lkaeIWH7aBWahNyU_czTYSnSmCMOURWYDv55-zc4Y=.39398a24-6904-465c-8e47-7bfe32efc9db@github.com> Message-ID: <91rj68CdahMzjrRCIMEH0mR6CxmDQayALIYHXBykJ5c=.a4164dc3-79db-43e8-9a8a-c6216e826f5b@github.com> On Mon, 10 Feb 2025 21:54:51 GMT, William Kemper wrote: >> Non-java threads were not having their gc-state configured when they attach. 
If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. > > William Kemper has updated the pull request incrementally with one additional commit since the last revision: > > Hold the thread lock when concurrently changing gc state That's right. The `on_thread_attach` callback and the thread being added to the list of threads _does_ happen under the `Thread_lock`, by the handshake mechanism (and the java thread iterator) do _not_ take the thread lock. In this particular assertion violation, the thread received a stale `gc_state` when it attached (before the control thread even entered `concurrent_prepare_for_update_refs`), however, the control thread executed the handshake _before_ the recently attached thread was actually added to the java thread list. I will update the comment and add an assert in `set_gc_state_concurrent`. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23428#issuecomment-2651866466 From wkemper at openjdk.org Tue Feb 11 19:39:25 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 11 Feb 2025 19:39:25 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 [v3] In-Reply-To: References: Message-ID: <-v8uprH0cQK06apB7HGbrHNO31cCmzOXxMiZB8ipWx4=.7ce5e5da-bcc8-4101-9bf8-23fb899d06c2@github.com> > Non-java threads were not having their gc-state configured when they attach. If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. 
William Kemper has updated the pull request incrementally with one additional commit since the last revision: Update comments, add an assertion ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23428/files - new: https://git.openjdk.org/jdk/pull/23428/files/1a4e3bb1..c57bf8a0 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23428&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23428&range=01-02 Stats: 11 lines in 1 file changed: 6 ins; 1 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/23428.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23428/head:pull/23428 PR: https://git.openjdk.org/jdk/pull/23428 From shade at openjdk.org Tue Feb 11 20:11:10 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Tue, 11 Feb 2025 20:11:10 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 [v3] In-Reply-To: <-v8uprH0cQK06apB7HGbrHNO31cCmzOXxMiZB8ipWx4=.7ce5e5da-bcc8-4101-9bf8-23fb899d06c2@github.com> References: <-v8uprH0cQK06apB7HGbrHNO31cCmzOXxMiZB8ipWx4=.7ce5e5da-bcc8-4101-9bf8-23fb899d06c2@github.com> Message-ID: On Tue, 11 Feb 2025 19:39:25 GMT, William Kemper wrote: >> Non-java threads were not having their gc-state configured when they attach. If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. > > William Kemper has updated the pull request incrementally with one additional commit since the last revision: > > Update comments, add an assertion Marked as reviewed by shade (Reviewer). 
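The race fixed in this thread can be modeled in a few lines. This is a hypothetical model with stand-in names, not HotSpot code: attaching threads snapshot a global gc-state under one lock, and concurrent state changes take the same lock and re-publish the state to every attached thread (standing in for the handshake), so a thread can never be left with a stale snapshot.

```cpp
#include <cassert>
#include <mutex>
#include <vector>

// Hypothetical model of the fix: holding one lock (cf. Threads_lock)
// around both thread attach and concurrent gc-state changes means a
// thread either snapshots the new state on attach, or is already on
// the list when the state is propagated.
struct ModelThread { int gc_state = 0; };

static std::mutex threads_lock;
static std::vector<ModelThread*> thread_list;
static int global_gc_state = 0;

void on_thread_attach(ModelThread* t) {
  std::lock_guard<std::mutex> g(threads_lock);
  t->gc_state = global_gc_state;   // snapshot under the lock
  thread_list.push_back(t);
}

void set_gc_state_concurrent(int state) {
  // cf. the assert added in this PR: the lock must be held here.
  std::lock_guard<std::mutex> g(threads_lock);
  global_gc_state = state;
  for (ModelThread* t : thread_list) {
    t->gc_state = state;           // stand-in for the handshake
  }
}
```

Without the shared lock, a thread could snapshot the old state and miss the list iteration, which is exactly the "expected gc-state 9, actual 21" violation.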
------------- PR Review: https://git.openjdk.org/jdk/pull/23428#pullrequestreview-2609916144 From wkemper at openjdk.org Tue Feb 11 20:23:14 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 11 Feb 2025 20:23:14 GMT Subject: Integrated: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 In-Reply-To: References: Message-ID: <9AZBJik8xf6tZdYSesYFvrlDs6Z8tDbEZkXvQz7Cm6s=.cb15a767-4988-40df-b87e-2e1868a15752@github.com> On Mon, 3 Feb 2025 20:28:58 GMT, William Kemper wrote: > Non-java threads were not having their gc-state configured when they attach. Additionally, we need to hold the `Threads_lock` when concurrently changing the gc state to make sure that any stale gc state observed when the thread `attaches` is fixed by the handshake when the thread list is iterated. This pull request has now been integrated. Changeset: 8c09d40d Author: William Kemper URL: https://git.openjdk.org/jdk/commit/8c09d40d6c345fda9fc7b358a53cae3b5965580b Stats: 22 lines in 2 files changed: 16 ins; 1 del; 5 mod 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 Reviewed-by: shade ------------- PR: https://git.openjdk.org/jdk/pull/23428 From wkemper at openjdk.org Tue Feb 11 23:01:58 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 11 Feb 2025 23:01:58 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v8] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. 
> * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). William Kemper has updated the pull request incrementally with one additional commit since the last revision: Make shutdown safer for threads requesting (or expecting) gc ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/861ed699..047d6ffa Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=07 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=06-07 Stats: 35 lines in 5 files changed: 9 ins; 18 del; 8 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From dholmes at openjdk.org Wed Feb 12 02:51:12 2025 From: dholmes at openjdk.org (David Holmes) Date: Wed, 12 Feb 2025 02:51:12 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Wed, 5 Feb 2025 14:41:39 GMT, Albert Mingkun Yang wrote: >> Here is an attempt to simplify GCLocker implementation for Serial and Parallel. >> >> GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). 
JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. >> >> The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. >> >> Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. >> >> Test: tier1-8 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: > > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - gclocker @albertnetymk I think that to get the correct "dekker duality" in this code you do need to have full fences between the stores and loads, not just a `storeload` barrier. ------------- Changes requested by dholmes (Reviewer). 
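The "Dekker duality" David refers to can be illustrated outside HotSpot with C++ atomics. This is a sketch of the pattern, not the GCLocker patch itself: the mutator publishes "in critical region" and then checks for a pending safepoint, while the VM side publishes "safepoint pending" and then checks for critical-region threads. With sequential consistency (a full fence between each store and the following load) it is impossible for both sides to miss the other's store; a plain release/acquire pairing gives no such guarantee.

```cpp
#include <atomic>
#include <cassert>
#include <thread>

// Store-buffering litmus test, illustrative only. seq_cst imposes one
// total order over the four accesses, so at least one of the two
// loads must observe the other thread's store.
std::atomic<int> in_critical{0};
std::atomic<int> safepoint_pending{0};

// Returns how many trials ended with BOTH sides reading 0 -- an
// outcome seq_cst forbids.
int run_trials(int n) {
  int violations = 0;
  for (int i = 0; i < n; i++) {
    in_critical.store(0);
    safepoint_pending.store(0);
    int seen_pending = -1, seen_critical = -1;
    std::thread mutator([&] {
      in_critical.store(1, std::memory_order_seq_cst);
      seen_pending = safepoint_pending.load(std::memory_order_seq_cst);
    });
    std::thread vm([&] {
      safepoint_pending.store(1, std::memory_order_seq_cst);
      seen_critical = in_critical.load(std::memory_order_seq_cst);
    });
    mutator.join();
    vm.join();
    if (seen_pending == 0 && seen_critical == 0) {
      violations++;   // both missed the other's store
    }
  }
  return violations;
}
```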
PR Review: https://git.openjdk.org/jdk/pull/23367#pullrequestreview-2610698148 From amitkumar at openjdk.org Wed Feb 12 03:08:21 2025 From: amitkumar at openjdk.org (Amit Kumar) Date: Wed, 12 Feb 2025 03:08:21 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v4] In-Reply-To: References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> <-aHCYC9iVc4eMZ3pMfiDpqaW-wGM_s3zRMiVBWoadCM=.910336cd-3be2-45b5-9874-63b71abf38f8@github.com> Message-ID: On Fri, 7 Feb 2025 14:52:42 GMT, Roberto Castañeda Lozano wrote: >>> I see TestG1BarrierGeneration.java failure :( >>> >>> [TestG1BarrierGeneration_jtr.log](https://github.com/user-attachments/files/18676532/TestG1BarrierGeneration_jtr.log) >> >> @offamitkumar thanks for the report! Most likely the test failures are only due to missing optimizations (because of limitations in the barrier elision pattern matching analysis), but if you want me to confirm please send the entire jtreg log, without truncation. You can disable output truncation running the test like this: >> `make run-test TEST="compiler/gcbarriers/TestG1BarrierGeneration.java" JTREG="MAX_OUTPUT=999999999"` >> Please double-check that the output log file does not contain any `Output overflow` message. > >> @robcasloz Sure: >> >> I can spend time on it, maybe on weekend, for now I am overloaded with some other tasks. >> >> [TestG1BarrierGeneration_jtr_no_overflow.log](https://github.com/user-attachments/files/18706090/TestG1BarrierGeneration_jtr_no_overflow.log) > > Thanks Amit, I had a look and the failures are indeed due to missing barrier elisions for atomic operations on newly created objects, which is suboptimal but safe (and in practice unlikely to make a noticeable performance difference). I just disabled IR checks for the two affected tests on s390 by now (commit 956e0ac5).
The issue is likely due to limitations in the pattern matching logic of barrier elision, but I do not have the proper means to debug it on s390. If you find a solution before this changeset is fully reviewed, feel free to propose a patch and I will merge it into the changeset. Otherwise, it can always be done as follow-up work. Hope this works for you! > > @robcasloz Sure: > > I can spend time on it, maybe on weekend, for now I am overloaded with some other tasks. > > [TestG1BarrierGeneration_jtr_no_overflow.log](https://github.com/user-attachments/files/18706090/TestG1BarrierGeneration_jtr_no_overflow.log) > > Thanks Amit, I had a look and the failures are indeed due to missing barrier elisions for atomic operations on newly created objects, which is suboptimal but safe (and in practice unlikely to make a noticeable performance difference). I just disabled IR checks for the two affected tests on s390 by now (commit [956e0ac](https://github.com/openjdk/jdk/commit/956e0ac5a7d580ad0e8850cfc4497da77cdb525c)). The issue is likely due to limitations in the pattern matching logic of barrier elision, but I do not have the proper means to debug it on s390. If you find a solution before this changeset is fully reviewed, feel free to propose a patch and I will merge it into the changeset. Otherwise, it can always be done as follow-up work. Hope this works for you! Thanks @robcasloz. Yes sure, that works totally for us. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23235#issuecomment-2652551761 From tschatzl at openjdk.org Wed Feb 12 11:52:22 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 12 Feb 2025 11:52:22 GMT Subject: RFR: 8349906: G1: Improve initial survivor rate for newly used young regions Message-ID: Hi all, please review this change that tries to improve the survivor rate initial values for newly expanded regions.
Currently G1 uses `InitialSurvivorRate` as survivor rate for such regions, but it is typically a pretty bad choice because * it's rather conservative, estimating that 40% of region contents will survive * such a conservative value is kind of bad particularly in cases for regions that are expanded late in the mutator phase because they are not frequently updated (and with our running weights changes get propagated over a very long time), i.e. this 40% sticks for a long time * it is a random value, i.e. not particularly specific to the application. The suggestion is to use the survivor rate for the last region we know the survivor rate already. Testing: gha, tier1-7 (with other changes) Hth, Thomas ------------- Commit messages: - * remove whitespace - 8349906 Changes: https://git.openjdk.org/jdk/pull/23584/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23584&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8349906 Stats: 12 lines in 1 file changed: 8 ins; 0 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/23584.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23584/head:pull/23584 PR: https://git.openjdk.org/jdk/pull/23584 From tschatzl at openjdk.org Wed Feb 12 11:57:29 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 12 Feb 2025 11:57:29 GMT Subject: RFR: 8349476: G1: Regularly print CPU usage by thread type Message-ID: Hi all, please review this change to print total cpu usage per worker thread group every gc (with `gc+cpu=debug`) to have a better overview about which threads are taking how much CPU. I considered merging with the gc worker perfcounter update close by, but the opportunity to share code is very little, and the resulting code would be riddled with checking whether the perf counters should be updated or not. I.e. the only shared code would be the closure with a one-liner calling `os::thread_cpu_time()`; most other code would be different, e.g. 
determining whether to update the perf counters or not, the actual log messages etc. Tell me if you think I should try harder to do so. Testing: gha Thanks, Thomas ------------- Commit messages: - 8349476 Changes: https://git.openjdk.org/jdk/pull/23585/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23585&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8349476 Stats: 43 lines in 2 files changed: 43 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23585.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23585/head:pull/23585 PR: https://git.openjdk.org/jdk/pull/23585 From ayang at openjdk.org Wed Feb 12 13:06:09 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 12 Feb 2025 13:06:09 GMT Subject: RFR: 8349836: G1: Improve group prediction log message In-Reply-To: References: Message-ID: On Tue, 11 Feb 2025 15:29:03 GMT, Thomas Schatzl wrote: > Hi all, > > please review this minor change to the group prediction log message > printed with gc+ergo+cset=trace: > > * add group id to be able to refer to something concrete when discussing results > * add total time > * fix typo in `bytes_to_cop` > > Testing: gha, local verification > > `Group 10: 5 regions prediction total_time 20.0ms card_rs_length 123456 merge_scan_time 10.2ms code_root_scan_time_ms 5.5ms evac_time_ms 3.7ms other_time 0.3ms bytes_to_copy 1234567` > > instead of > > `Prediction for group with 76 regions, card_rs_length 320, merge_scan_time 0.02ms, code_root_scan_time_ms 0.00ms, evac_time_ms 9.92ms, other_time 45.60ms, bytes_to_cop 61408560` > > Thanks, > Thomas I feel sth like `Prediction for Group X (Y regions): total_time ...` reads slightly better. YMMV. ------------- Marked as reviewed by ayang (Reviewer). 
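Picking up the CPU-usage RFR (8349476) above: the shared closure Thomas mentions — essentially a one-liner around `os::thread_cpu_time()` summed per worker thread group — could be sketched like this. The types and names here are hypothetical stand-ins, not the HotSpot code.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical sketch: sum per-thread CPU time over one worker group.
// thread_cpu_time() stands in for os::thread_cpu_time(); in the real
// patch this ran as a thread closure over each GC worker group and
// the total was logged under gc+cpu=debug.
struct Worker { uint64_t cpu_time_ns; };

uint64_t thread_cpu_time(const Worker& w) { return w.cpu_time_ns; }

uint64_t group_cpu_time(const std::vector<Worker>& group) {
  uint64_t total = 0;
  for (const Worker& w : group) {
    total += thread_cpu_time(w);   // the "one-liner" shared with the perf counters
  }
  return total;
}
```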
PR Review: https://git.openjdk.org/jdk/pull/23562#pullrequestreview-2611850553 From iwalulya at openjdk.org Wed Feb 12 13:53:22 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Wed, 12 Feb 2025 13:53:22 GMT Subject: RFR: 8349783: g1RemSetSummary.cpp:344:68: runtime error: member call on null pointer of type 'struct G1HeapRegion' Message-ID: Hi, Please review this cleanup to remove dead code. Testing: local testing with --enable-ubsan ------------- Commit messages: - fix Changes: https://git.openjdk.org/jdk/pull/23587/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23587&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8349783 Stats: 2 lines in 1 file changed: 0 ins; 2 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23587.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23587/head:pull/23587 PR: https://git.openjdk.org/jdk/pull/23587 From tschatzl at openjdk.org Wed Feb 12 15:35:13 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 12 Feb 2025 15:35:13 GMT Subject: RFR: 8349783: g1RemSetSummary.cpp:344:68: runtime error: member call on null pointer of type 'struct G1HeapRegion' In-Reply-To: References: Message-ID: On Wed, 12 Feb 2025 13:35:23 GMT, Ivan Walulya wrote: > Hi, > > Please review this cleanup to remove dead code. > > Testing: local testing with --enable-ubsan lgtm. ------------- Marked as reviewed by tschatzl (Reviewer). 
PR Review: https://git.openjdk.org/jdk/pull/23587#pullrequestreview-2612346685 From tschatzl at openjdk.org Wed Feb 12 15:42:49 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 12 Feb 2025 15:42:49 GMT Subject: RFR: 8349836: G1: Improve group prediction log message [v2] In-Reply-To: References: Message-ID: > Hi all, > > please review this minor change to the group prediction log message > printed with gc+ergo+cset=trace: > > * add group id to be able to refer to something concrete when discussing results > * add total time > * fix typo in `bytes_to_cop` > > Testing: gha, local verification > > `Group 10: 5 regions prediction total_time 20.0ms card_rs_length 123456 merge_scan_time 10.2ms code_root_scan_time_ms 5.5ms evac_time_ms 3.7ms other_time 0.3ms bytes_to_copy 1234567` > > instead of > > `Prediction for group with 76 regions, card_rs_length 320, merge_scan_time 0.02ms, code_root_scan_time_ms 0.00ms, evac_time_ms 9.92ms, other_time 45.60ms, bytes_to_cop 61408560` > > Thanks, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: ayang review ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23562/files - new: https://git.openjdk.org/jdk/pull/23562/files/3c9cd94f..21139f21 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23562&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23562&range=00-01 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/23562.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23562/head:pull/23562 PR: https://git.openjdk.org/jdk/pull/23562 From ayang at openjdk.org Wed Feb 12 16:02:12 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 12 Feb 2025 16:02:12 GMT Subject: RFR: 8349836: G1: Improve group prediction log message [v2] In-Reply-To: References: Message-ID: <12JEDfdeTFH0pC_2-b284HXb5Wd417w2AvCWmXycO2I=.736a9a50-818e-43f7-ae13-39b657e3a606@github.com> On Wed, 12 Feb 
2025 15:42:49 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this minor change to the group prediction log message >> printed with gc+ergo+cset=trace: >> >> * add group id to be able to refer to something concrete when discussing results >> * add total time >> * fix typo in `bytes_to_cop` >> >> Testing: gha, local verification >> >> `Group 10: 5 regions prediction total_time 20.0ms card_rs_length 123456 merge_scan_time 10.2ms code_root_scan_time_ms 5.5ms evac_time_ms 3.7ms other_time 0.3ms bytes_to_copy 1234567` >> >> instead of >> >> `Prediction for group with 76 regions, card_rs_length 320, merge_scan_time 0.02ms, code_root_scan_time_ms 0.00ms, evac_time_ms 9.92ms, other_time 45.60ms, bytes_to_cop 61408560` >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > ayang review Marked as reviewed by ayang (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23562#pullrequestreview-2612435747 From ayang at openjdk.org Wed Feb 12 16:05:13 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 12 Feb 2025 16:05:13 GMT Subject: RFR: 8349783: g1RemSetSummary.cpp:344:68: runtime error: member call on null pointer of type 'struct G1HeapRegion' In-Reply-To: References: Message-ID: On Wed, 12 Feb 2025 13:35:23 GMT, Ivan Walulya wrote: > Hi, > > Please review this cleanup to remove dead code. > > Testing: local testing with --enable-ubsan Marked as reviewed by ayang (Reviewer). 
------------- PR Review: https://git.openjdk.org/jdk/pull/23587#pullrequestreview-2612445688 From tschatzl at openjdk.org Wed Feb 12 16:14:18 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 12 Feb 2025 16:14:18 GMT Subject: RFR: 8349836: G1: Improve group prediction log message [v2] In-Reply-To: References: Message-ID: On Tue, 11 Feb 2025 18:26:15 GMT, Ivan Walulya wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> ayang review > > Looks good! Thanks @walulyai @albertnetymk for your reviews ------------- PR Comment: https://git.openjdk.org/jdk/pull/23562#issuecomment-2654184213 From tschatzl at openjdk.org Wed Feb 12 16:14:19 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 12 Feb 2025 16:14:19 GMT Subject: Integrated: 8349836: G1: Improve group prediction log message In-Reply-To: References: Message-ID: On Tue, 11 Feb 2025 15:29:03 GMT, Thomas Schatzl wrote: > Hi all, > > please review this minor change to the group prediction log message > printed with gc+ergo+cset=trace: > > * add group id to be able to refer to something concrete when discussing results > * add total time > * fix typo in `bytes_to_cop` > > Testing: gha, local verification > > `Group 10: 5 regions prediction total_time 20.0ms card_rs_length 123456 merge_scan_time 10.2ms code_root_scan_time_ms 5.5ms evac_time_ms 3.7ms other_time 0.3ms bytes_to_copy 1234567` > > instead of > > `Prediction for group with 76 regions, card_rs_length 320, merge_scan_time 0.02ms, code_root_scan_time_ms 0.00ms, evac_time_ms 9.92ms, other_time 45.60ms, bytes_to_cop 61408560` > > Thanks, > Thomas This pull request has now been integrated. 
Changeset: 73e1780a Author: Thomas Schatzl URL: https://git.openjdk.org/jdk/commit/73e1780ad0aba92ce60bb35fc66a395abccbf57e Stats: 12 lines in 1 file changed: 7 ins; 3 del; 2 mod 8349836: G1: Improve group prediction log message Reviewed-by: ayang, iwalulya ------------- PR: https://git.openjdk.org/jdk/pull/23562 From tschatzl at openjdk.org Wed Feb 12 17:46:14 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 12 Feb 2025 17:46:14 GMT Subject: RFR: 8349476: G1: Regularly print CPU usage by thread type In-Reply-To: References: Message-ID: On Wed, 12 Feb 2025 11:52:50 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change to print total cpu usage per worker thread group every gc (with `gc+cpu=debug`) to have a better overview about which threads are taking how much CPU. > > I considered merging with the gc worker perfcounter update close by, but the opportunity to share code is very little, and the resulting code would be riddled with checking whether the perf counters should be updated or not. > > I.e. the only shared code would be the closure with a one-liner calling `os::thread_cpu_time()`; most other code would be different, e.g. determining whether to update the perf counters or not, the actual log messages etc. > > Tell me if you think I should try harder to do so. > > Testing: gha > > Thanks, > Thomas Another alternative is just not doing this change: this information can be retrieved using the VM performance counters (and sufficient for the purposes I need it) too, although only available if `-XX:+UsePerfData` is enabled. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23585#issuecomment-2654432360 From tschatzl at openjdk.org Wed Feb 12 17:53:16 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 12 Feb 2025 17:53:16 GMT Subject: Withdrawn: 8349476: G1: Regularly print CPU usage by thread type In-Reply-To: References: Message-ID: On Wed, 12 Feb 2025 11:52:50 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change to print total cpu usage per worker thread group every gc (with `gc+cpu=debug`) to have a better overview about which threads are taking how much CPU. > > I considered merging with the gc worker perfcounter update close by, but the opportunity to share code is very little, and the resulting code would be riddled with checking whether the perf counters should be updated or not. > > I.e. the only shared code would be the closure with a one-liner calling `os::thread_cpu_time()`; most other code would be different, e.g. determining whether to update the perf counters or not, the actual log messages etc. > > Tell me if you think I should try harder to do so. > > Testing: gha > > Thanks, > Thomas This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.org/jdk/pull/23585 From tschatzl at openjdk.org Wed Feb 12 17:53:16 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 12 Feb 2025 17:53:16 GMT Subject: RFR: 8349476: G1: Regularly print CPU usage by thread type In-Reply-To: References: Message-ID: <7hh_XTkI-xs4dJe3s5cxb254-F_24CybdUD3YV2kYxA=.bbaa63f2-0898-4162-95c8-e85f20258418@github.com> On Wed, 12 Feb 2025 11:52:50 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change to print total cpu usage per worker thread group every gc (with `gc+cpu=debug`) to have a better overview about which threads are taking how much CPU. 
> > I considered merging with the gc worker perfcounter update close by, but the opportunity to share code is very little, and the resulting code would be riddled with checking whether the perf counters should be updated or not. > > I.e. the only shared code would be the closure with a one-liner calling `os::thread_cpu_time()`; most other code would be different, e.g. determining whether to update the perf counters or not, the actual log messages etc. > > Tell me if you think I should try harder to do so. > > Testing: gha > > Thanks, > Thomas Retracting, I think there is too little gain here, and the change isn't that nice either. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23585#issuecomment-2654449922 From wkemper at openjdk.org Wed Feb 12 21:12:39 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 12 Feb 2025 21:12:39 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v9] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). 
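The lock-coordinated request handling in the bullet list above might look roughly like the following. This is illustrative C++, not the actual ShenandoahGenerationalControlThread code; the names and the sticky-shutdown rule are assumptions drawn from the description ("the reason for cancellation is recorded as a cause" and "the shutdown sequence is simpler").

```cpp
#include <cassert>
#include <mutex>

// Hypothetical sketch: requests to the control thread are recorded as
// a single cause under a lock. Shutdown is sticky -- once requested it
// cannot be overwritten or consumed -- which makes shutdown safe for
// threads still requesting (or expecting) a gc.
enum class Cause { None, AllocFailure, ExplicitGC, Shutdown };

class ControlRequests {
  std::mutex _lock;
  Cause _requested = Cause::None;

public:
  void request(Cause c) {
    std::lock_guard<std::mutex> g(_lock);
    if (_requested != Cause::Shutdown) {  // shutdown wins over everything
      _requested = c;
    }
  }

  Cause take() {
    std::lock_guard<std::mutex> g(_lock);
    Cause c = _requested;
    if (c != Cause::Shutdown) {
      _requested = Cause::None;           // consumed; shutdown stays set
    }
    return c;
  }
};
```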
William Kemper has updated the pull request incrementally with one additional commit since the last revision: Improve message for assertion ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/047d6ffa..779492c6 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=08 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=07-08 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From kirk at kodewerk.com Wed Feb 12 22:08:09 2025 From: kirk at kodewerk.com (Kirk Pepperdine) Date: Wed, 12 Feb 2025 14:08:09 -0800 Subject: Configurable G1 heap expansion aggressiveness In-Reply-To: References: Message-ID: Hi Jaroslaw, This work is in line with work that we are doing with the Serial collector and Oracle has done with ZGC. Also, Google has started on this work with G1. Very briefly, our thinking is that we should be able to set max heap size to the amount of available memory, be that constrained by the CGroup or by the physical machine. Second to this is an awareness and ability to react to global memory pressure. IOWs, GC ergonomics will be aware of what is happening on the machine in addition to what is happening in the JVM and use that information to guide heap sizing. Properly resourced, JVMs should cooperate. If a deployment is under-resourced, GC overheads will likely be higher than desired but JVMs should still cooperate to ensure some form of fair share (at the expense of tail latencies) to avoid OOM kills. gen-ZGC has a JEP, we will propose one shortly. Kind regards, Kirk Pepperdine > On Feb 9, 2025, at 11:54 AM, Jaroslaw Odzga wrote: > > Context and Motivation > In multi-tenant environments e.g.
Kubernetes clusters in cloud > environments there is a strong incentive to use as little memory as > possible. Lower memory usage means more processes can be packed on a > single VM which directly translates to lower cloud cost. > Configuring G1 heap size in this setup is currently challenging. On > the one hand we would like to set the max heap size to a high value so > that application doesn't fail with heap OOME when faced with > unexpectedly high load or organic growth. On the other hand we need to > set max heap size to as small a value as possible because G1 is very > eager to expand heap even when tuned to collect garbage aggressively. > > Ideally, we would like to: > - Set the initial heap size to a small value. > - Set the max heap size to a value larger than expected usage so that > application can handle unexpected load and organic growth. > - Configure G1 GC to not expand heap aggressively. This is currently > not possible. > > We propose two new JVM G1 flags that would give us more control over > G1 heap expansion aggressiveness and realize significant cost savings > in multi-tenant environments. > At the same time we don't want to change existing G1 behavior - with > default values of the new flags current G1 behavior would be > maintained. > > Analysis > Currently even with very aggressive G1 configuration such as: > -XX:-G1UseAdaptiveIHOP -XX:InitiatingHeapOccupancyPercent=20 > -XX:GCTimeRatio=4 -XX:MinHeapFreeRatio=20 -XX:MaxHeapFreeRatio=60 > the heap is fairly eagerly expanded. > > We found two culprits responsible for this in > G1HeapSizingPolicy::young_collection_expansion_amount() function. > First, the scale_with_heap() function makes pause_time_threshold small > in cases where current heap size is smaller than 1/2 of max heap size. > While it is likely a desired behavior in many situations, it also > causes memory usage spikes in situations where max heap size is much > larger than current heap size.
> Second, the MinOverThresholdForGrowth constant equal to 4 is an > arbitrary value which hardcodes the heap expansion aggressiveness. We > observed that short_term_pause_time_ratio can exceed > pause_time_threshold and trigger heap expansion too eagerly in many > situations, especially when allocation rate is spiky. > > Proposal > We would like to introduce two new experimental flags: > - G1ScaleWithHeapPauseTimeThreshold: a binary flag that would allow > disabling scale_with_heap() > - G1MinPausesOverThresholdForGrowth: a value between 1 and 10, a > configurable replacement for the MinOverThresholdForGrowth constant. > > We don't want to change the default behavior of G1. Default values for > these flags (G1ScaleWithHeapPauseTimeThreshold=true, > G1MinPausesOverThresholdForGrowth=4) would maintain the existing > behavior. > > Alternatives > There is currently no good alternative. Potentially we could configure > G1 aggressively to trigger GC very frequently e.g.: > -XX:-G1UseAdaptiveIHOP -XX:InitiatingHeapOccupancyPercent=20 > -XX:GCTimeRatio=4 -XX:MinHeapFreeRatio=20 -XX:MaxHeapFreeRatio=60 > Even with this configuration we see occasional large memory spikes > where heap is quickly expanded. Even though the expanded heap > contracts eventually, this poses a significant problem because in > practice we don't know if such a spike could have been avoided so it > is not obvious how much memory the application really needs. Of course > such configuration would also consume more CPU. > > Experimental results > We tested this change on patched jdk17. > With new flags we can use far less aggressive -XX:GCTimeRatio=9 > together with -XX:-G1ScaleWithHeapPauseTimeThreshold and > -XX:G1MinPausesOverThresholdForGrowth=10 (this effectively disables > heap expansion based on short time pause ratio and only depends on > long time pause ratio). > Compared to more aggressive G1 configuration mentioned above we see > lower CPU usage, and 30%-60% lower max memory usage.
> > Implementation > https://github.com/openjdk/jdk/pull/23534 From wkemper at openjdk.org Thu Feb 13 00:20:40 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 13 Feb 2025 00:20:40 GMT Subject: RFR: 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) Message-ID: Restore weak roots rendezvous handshake. This is necessary to have mutators complete the LRB before the concurrent GC invalidates any oop handles that may exist in native stacks. ------------- Commit messages: - Restore weak roots rendezvous handshake Changes: https://git.openjdk.org/jdk/pull/23604/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23604&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8348092 Stats: 19 lines in 1 file changed: 14 ins; 0 del; 5 mod Patch: https://git.openjdk.org/jdk/pull/23604.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23604/head:pull/23604 PR: https://git.openjdk.org/jdk/pull/23604 From shade at openjdk.org Thu Feb 13 08:34:14 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Thu, 13 Feb 2025 08:34:14 GMT Subject: RFR: 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) In-Reply-To: References: Message-ID: On Thu, 13 Feb 2025 00:15:43 GMT, William Kemper wrote: > Restore weak roots rendezvous handshake. This is necessary to have mutators complete the LRB before the concurrent GC invalidates any oop handles that may exist in native stacks. Marked as reviewed by shade (Reviewer). 
------------- PR Review: https://git.openjdk.org/jdk/pull/23604#pullrequestreview-2614219213 From ayang at openjdk.org Thu Feb 13 09:23:27 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 13 Feb 2025 09:23:27 GMT Subject: RFR: 8348171: Refactor GenerationCounters and its subclasses [v5] In-Reply-To: <7otkT63ENoyKzZ29CbYpycLLwL89ARajYg36Mstz4tQ=.fd3c7dcf-5a8b-44be-9205-09e3d160d54d@github.com> References: <7otkT63ENoyKzZ29CbYpycLLwL89ARajYg36Mstz4tQ=.fd3c7dcf-5a8b-44be-9205-09e3d160d54d@github.com> Message-ID: On Tue, 11 Feb 2025 15:28:25 GMT, Albert Mingkun Yang wrote: >> Simple refactoring of removing the use of `virtual` method and use concrete subclasses when needed. >> >> Test: tier1-5 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains seven commits: > > - Merge branch 'master' into gen-counter > - review > - * some more refactoring > - review > - Merge branch 'master' into gen-counter > - merge > - gen-counter Any suggestions/comments/objections from Shenandoah team? I'd like to merge this patch, if none. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23209#issuecomment-2655989267 From iwalulya at openjdk.org Thu Feb 13 09:49:27 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Thu, 13 Feb 2025 09:49:27 GMT Subject: RFR: 8349783: g1RemSetSummary.cpp:344:68: runtime error: member call on null pointer of type 'struct G1HeapRegion' In-Reply-To: References: Message-ID: On Wed, 12 Feb 2025 16:03:03 GMT, Albert Mingkun Yang wrote: >> Hi, >> >> Please review this cleanup to remove dead code. >> >> Testing: local testing with --enable-ubsan > > Marked as reviewed by ayang (Reviewer). Thanks @albertnetymk and @tschatzl for the reviews! 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23587#issuecomment-2656050875 From iwalulya at openjdk.org Thu Feb 13 09:49:28 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Thu, 13 Feb 2025 09:49:28 GMT Subject: Integrated: 8349783: g1RemSetSummary.cpp:344:68: runtime error: member call on null pointer of type 'struct G1HeapRegion' In-Reply-To: References: Message-ID: On Wed, 12 Feb 2025 13:35:23 GMT, Ivan Walulya wrote: > Hi, > > Please review this cleanup to remove dead code. > > Testing: local testing with --enable-ubsan This pull request has now been integrated. Changeset: 24b7f815 Author: Ivan Walulya URL: https://git.openjdk.org/jdk/commit/24b7f815ae4ca2a228dff2694993b5ebc2192382 Stats: 2 lines in 1 file changed: 0 ins; 2 del; 0 mod 8349783: g1RemSetSummary.cpp:344:68: runtime error: member call on null pointer of type 'struct G1HeapRegion' Reviewed-by: tschatzl, ayang ------------- PR: https://git.openjdk.org/jdk/pull/23587 From iwalulya at openjdk.org Thu Feb 13 10:12:14 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Thu, 13 Feb 2025 10:12:14 GMT Subject: RFR: 8349906: G1: Improve initial survivor rate for newly used young regions In-Reply-To: References: Message-ID: On Wed, 12 Feb 2025 10:55:46 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that tries to improve the survivor rate initial values for newly expanded regions. > > Currently G1 uses `InitialSurvivorRate` as survivor rate for such regions, but it is typically a pretty bad choice because > > * it's rather conservative, estimating that 40% of region contents will survive > * such a conservative value is kind of bad particularly in cases for regions that are expanded late in the mutator phase because they are not frequently updated (and with our running weights changes get propagated over a very long time), i.e. this 40% sticks for a long time > * it is a random value, i.e. not particularly specific to the application. 
> > The suggestion is to use the survivor rate for the last region we know the survivor rate already. > > Testing: gha, tier1-7 (with other changes) > > Hth, > Thomas Marked as reviewed by iwalulya (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23584#pullrequestreview-2614502592 From thomas.schatzl at oracle.com Thu Feb 13 10:48:51 2025 From: thomas.schatzl at oracle.com (Thomas Schatzl) Date: Thu, 13 Feb 2025 11:48:51 +0100 Subject: Configurable G1 heap expansion aggressiveness In-Reply-To: References: Message-ID: <553f4d95-14a4-4736-b10d-02b8bb3af686@oracle.com> Hi Jaroslaw, thank you for contributing and speaking up with an itch of yours! The motivation, and analysis are spot on: we agree that the aggressiveness of G1 heap expansion paired with reluctance to give back memory can make it hard to configure G1 as you would want in this situation. However we do not think that the proposed solution (adding even more customizability) is where we want to go. More background below, inline: On 09.02.25 20:54, Jaroslaw Odzga wrote: > Context and Motivation > In multi-tenant environments e.g. Kubernetes clusters in cloud > environments there is a strong incentive to use as little memory as > possible. Lower memory usage means more processes can be packed on a > single VM which directly translates to lower cloud cost. > Configuring G1 heap size in this setup is currently challenging. On > the one hand we would like to set the max heap size to a high value so > that application doesn't fail with heap OOME when faced with > unexpectedly high load or organic growth. On the other hand we need to > set max heap size to as small a value as possible because G1 is very > eager to expand heap even when tuned to collect garbage aggressively. > > Ideally, we would like to: > - Set the initial heap size to a small value. > - Set the max heap size to a value larger than expected usage so that > application can handle unexpected load and organic growth.
> - Configure G1 GC to not expand heap aggressively. This is currently > not possible. > > We propose two new JVM G1 flags that would give us more control over > G1 heap expansion aggressiveness and realize significant cost savings > in multi-tenant environments. Understood. We are generally very reluctant in exposing more flags in basically any collector due to maintenance overhead. We understand that these are experimental flags that can be removed at a whim, but still doing that if/when they are in use is awkward. > At the same time we don't want to change existing G1 behavior - with > default values of the new flags current G1 behavior would be > maintained. > > Analysis > Currently even with very aggressive G1 configuration such as: > -XX:-G1UseAdaptiveIHOP -XX:InitiatingHeapOccupancyPercent=20 > -XX:GCTimeRatio=4 -XX:MinHeapFreeRatio=20 -XX:MaxHeapFreeRatio=60 > the heap is fairly eagerly expanded. > > We found two culprits responsible for this in > G1HeapSizingPolicy::young_collection_expansion_amount() function. > First, the scale_with_heap() function makes pause_time_threshold small > in cases where current heap size is smaller than 1/2 of max heap size. > While it is likely a desired behavior in many situations, it also > causes memory usage spikes in situations where max heap size is much > larger than current heap size. > Second, the MinOverThresholdForGrowth constant equal to 4 is an > arbitrary value which hardcodes the heap expansion aggressiveness. We > observed that short_term_pause_time_ratio can exceed > pause_time_threshold and trigger heap expansion too eagerly in many > situations, especially when allocation rate is spiky. > > Proposal > We would like to introduce two new experimental flags: > - G1ScaleWithHeapPauseTimeThreshold: a binary flag that would allow > disabling scale_with_heap() > - G1MinPausesOverThresholdForGrowth: a value between 1 and 10, a > configurable replacement for the MinOverThresholdForGrowth constant.
> > We don't want to change the default behavior of G1. Default values for > these flags (G1ScaleWithHeapPauseTimeThreshold=true, > G1MinPausesOverThresholdForGrowth=4) would maintain the existing > behavior. > > Alternatives > There is currently no good alternative. Potentially we could configure > G1 aggressively to trigger GC very frequently e.g.: > -XX:-G1UseAdaptiveIHOP -XX:InitiatingHeapOccupancyPercent=20 > -XX:GCTimeRatio=4 -XX:MinHeapFreeRatio=20 -XX:MaxHeapFreeRatio=60 > Even with this configuration we see occasional large memory spikes > where heap is quickly expanded. Even though the expanded heap > contracts eventually, this poses a significant problem because in > practice we don't know if such a spike could have been avoided so it > is not obvious how much memory the application really needs. Of course > such configuration would also consume more CPU. The suggestion changes a) the aggressiveness of expansion if it has been decided that G1 should expand (G1ScaleWithHeapPauseTimeThreshold); looking at this particular piece of code, this behavior actually seems strange and unexpected. I.e. given that the user sets a GCTimeRatio, for some reason allow G1 to basically override it to a large extent. The reason is mostly historical: I collected thoughts in https://bugs.openjdk.org/browse/JDK-8349978. Note that just removing this behavior has quite a few unintended consequences as heap sizing is very much interconnected with general performance behavior. b) makes G1 more lazy about determining whether it needs to expand (G1MinPausesOverThresholdForGrowth) by increasing the number of consecutive GCs that GCTimeRatio needs to be over the threshold to cause expansion. (That's just exposing an internal constant :)) These changes cover expansion behavior, but not shrinking again.
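[Editor's note: the arithmetic behind points a) and b) in the discussion above can be made concrete with a small sketch. This is not the actual HotSpot code — it is an illustrative model under stated assumptions: GCTimeRatio=N budgets roughly 1/(1+N) of time for GC, scale_with_heap() lowers that threshold while the heap is below half of max (a linear scaling is assumed here for simplicity), and MinOverThresholdForGrowth — the constant the proposal would turn into G1MinPausesOverThresholdForGrowth — gates how many recent over-threshold pauses are needed before expansion.]

```cpp
#include <cassert>
#include <cstddef>

// Illustrative model only -- not the HotSpot implementation.
struct SizingKnobs {
  bool scale_with_heap;        // models G1ScaleWithHeapPauseTimeThreshold
  int  min_pauses_for_growth;  // models MinOverThresholdForGrowth / proposed flag
  int  gc_time_ratio;          // models -XX:GCTimeRatio
};

// GCTimeRatio=N budgets roughly 1/(1+N) of total time for GC.
double pause_time_threshold(const SizingKnobs& k,
                            std::size_t committed, std::size_t max_heap) {
  double threshold = 1.0 / (1.0 + k.gc_time_ratio);
  // Point a): while the heap is below half of max, the threshold is scaled
  // down, which makes expansion trigger sooner (linear scaling assumed here).
  if (k.scale_with_heap && committed < max_heap / 2) {
    threshold *= static_cast<double>(committed) / (max_heap / 2);
  }
  return threshold;
}

// Point b): expansion is only considered once the short-term pause time
// ratio has been over the threshold in enough recent GCs.
bool should_expand(const SizingKnobs& k, double short_term_pause_ratio,
                   double threshold, int recent_pauses_over_threshold) {
  return short_term_pause_ratio > threshold &&
         recent_pauses_over_threshold >= k.min_pauses_for_growth;
}
```

Under this toy model, the lazier settings from the proposal (scaling disabled, min pauses at 10) leave a pause history untouched that would trigger expansion under the defaults (scaling enabled, min pauses at 4).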
I believe that still the other slew of options mentioned above (-XX:-G1UseAdaptiveIHOP -XX:InitiatingHeapOccupancyPercent=20 -XX:MinHeapFreeRatio=20 -XX:MaxHeapFreeRatio=60) is needed to keep the heap stable and shrinking again over time (it may work with just changing GCTimeRatio in your particular case). That seems awfully complicated for an end user, and indicative of papering over the problem. We would like to avoid this. As Kirk in his other email in the thread indicates, there is work underway to make the VM (and G1) aware of other memory consumers in the VM. Not sure if that would also fix your problem in a more user friendly (and hopefully generic) way. Wouldn't the option to make G1 to keep GCTimeRatio better (e.g. https://bugs.openjdk.org/browse/JDK-8238687), and/or some configurable soft heap size goal (https://bugs.openjdk.org/browse/JDK-8236073) that the collector will keep also solve your issue while being easier to configure? (There're a lot of connected problems in the bug tracker, so make sure to follow related issues). Maybe you are interested and can find something to work on in that area; there has actually already been a lot of investigation (and some resulting, unfinished patches) in that area, so feel free to ask. Thanks, Thomas Fwiw, we tried to label issues related to this area, see https://bugs.openjdk.org/issues/?jql=labels%20%3D%20gc-g1-heap-resizing . From jarek.odzga at gmail.com Thu Feb 13 13:24:17 2025 From: jarek.odzga at gmail.com (Jaroslaw Odzga) Date: Thu, 13 Feb 2025 05:24:17 -0800 Subject: Configurable G1 heap expansion aggressiveness In-Reply-To: <553f4d95-14a4-4736-b10d-02b8bb3af686@oracle.com> References: <553f4d95-14a4-4736-b10d-02b8bb3af686@oracle.com> Message-ID: Thank you Kirk and Thomas for your answers! What Kirk describes sounds great, is the right long term approach and I can't wait for it to be shipped. It also sounds like a feature we might need to wait for a while (please correct me if I am wrong). 
My proposal is just a tiny stopgap that might help alleviate some of the problems but does not attempt to be a holistic solution and, as you pointed out, has downsides. I totally agree with your assessment: it is just exposing internal constants but the fact that these are constants is part of the problem because they bake in an eager heap expansion behavior which is not necessarily desired. I share your reluctance to add more obscure tuning flags: it has maintenance cost and a risk of misuse. I would not recommend anyone tuning these flags without reading the source code and understanding the tradeoffs. These are not silver bullets and, as you pointed out, probably would have to be used together with other tuning parameters to achieve reasonable results. To clarify, the way we plan to use these flags is to establish a constant set of tuning parameters that achieve a good tradeoff between latency, throughput and footprint and apply it to a large number of services. We want to avoid tuning each service individually because it is hard to scale. Example configuration (used with jdk17): -XX:+UnlockExperimentalVMOptions -XX:+G1PeriodicGCInvokesConcurrent -XX:G1PeriodicGCInterval=60000 -XX:G1PeriodicGCSystemLoadThreshold=0 -XX:GCTimeRatio=9 -XX:G1MixedGCLiveThresholdPercent=85 -XX:MinHeapFreeRatio=20 -XX:MaxHeapFreeRatio=60 -XX:MaxGCPauseMillis=200 -XX:GCPauseIntervalMillis=1000 -XX:-G1UsePreventiveGC -XX:-G1ScaleWithHeapPauseTimeThreshold -XX:G1MinPausesOverThresholdForGrowth=10 From experiments so far it seems that we can leave the adaptive IHOP on because even if it mispredicted, e.g. due to allocation spikes, the heap is not aggressively expanded. On the plus side, the change itself is tiny, very localized and could be trivially backported e.g. all the way to jdk17.
At Databricks we run hundreds of JVM services and initial results are very promising. Or should I treat this proposal as officially rejected? > Wouldn't the option to make G1 to keep GCTimeRatio better (e.g. > https://bugs.openjdk.org/browse/JDK-8238687), and/or some configurable > soft heap size goal (https://bugs.openjdk.org/browse/JDK-8236073) that > the collector will keep also solve your issue while being easier to > configure? Thanks for sharing these. The JDK-8238687 focuses on uncommit while the heap expansion hurts the most. The SoftMaxHeapSize could be used as a building block towards a solution. I think there still would have to be some controller that adjusts the value of SoftMaxHeapSize based on GC behavior e.g. increase it when GC pressure is too high. Best regards, Jaroslaw On Thu, Feb 13, 2025 at 2:49?AM Thomas Schatzl wrote: > > Hi Jaroslaw, > > thank you for contributing and speaking up with an itch of yours! > > The motivation, and analysis are spot on: we agree that the > aggressiveness of G1 heap expansion paired with reluctance to give back > memory can make it hard to configure G1 as you would want in this situation. > > However we do not think that the proposed solution (adding even more > customizability) is where we want to go. > > More background below, inline: > > On 09.02.25 20:54, Jaroslaw Odzga wrote: > > Context and Motivation > > In multi-tenant environments e.g. Kubernetes clusters in cloud > > environments there is a strong incentive to use as little memory as > > possible. Lower memory usage means more processes can be packed on a > > single VM which directly translates to lower cloud cost. > > Configuring G1 heap size in this setup is currently challenging. On > > the one hand we would like to set the max heap size to a high value so > > that application doesn?t fail with heap OOME when faced with > > unexpectedly high load or organic growth. 
On the other hand we need to > > set max heap size to as small a value as possible because G1 is very > > eager to expand heap even when tuned to collect garbage aggressively. > > > > Ideally, we would like to: > > - Set the initial heap size to a small value. > > - Set the max heap size to a value larger than expected usage so that > > application can handle unexpected load and organic growth. > > - Configure G1 GC to not expand heap aggressively. This is currently > > not possible. > > > > We propose two new JVM G1 flags that would give us more control over > > G1 heap expansion aggressiveness and realize significant cost savings > > in multi-tenant environments. > > Understood. > > We are generally very reluctant in exposing more flags in basically any > collector due to maintenance overhead. We understand that these are > experimental flags that can be removed at a whim, but still doing that > if/when they are in use is awkward. > > > > At the same time we don't want to change existing G1 behavior - with > > default values of the new flags current G1 behavior would be > > maintained. > > > > Analysis > > Currently even with very aggressive G1 configuration such as: > > -XX:-G1UseAdaptiveIHOP -XX:InitiatingHeapOccupancyPercent=20 > > -XX:GCTimeRatio=4 -XX:MinHeapFreeRatio=20 -XX:MaxHeapFreeRatio=60 > > the heap is fairly eagerly expanded. > > > > We found two culprits responsible for this in > > G1HeapSizingPolicy::young_collection_expansion_amount() function. > > First, the scale_with_heap() function makes pause_time_threshold small > > in cases where current heap size is smaller than 1/2 of max heap size. > > While it is likely a desired behavior in many situations, it also > > causes memory usage spikes in situations where max heap size is much > > larger than current heap size. > > Second, the MinOverThresholdForGrowth constant equal to 4 is an > > arbitrary value which hardcodes the heap expansion aggressiveness.
We > > observed that short_term_pause_time_ratio can exceed > > pause_time_threshold and trigger heap expansion too eagerly in many > > situations, especially when allocation rate is spiky. > > > > Proposal > > We would like to introduce two new experimental flags: > > - G1ScaleWithHeapPauseTimeThreshold: a binary flag that would allow > > disabling scale_with_heap() > > - G1MinPausesOverThresholdForGrowth: a value between 1 and 10, a > > configurable replacement for the MinOverThresholdForGrowth constant. > > > > We don't want to change the default behavior of G1. Default values for > > these flags (G1ScaleWithHeapPauseTimeThreshold=true, > > G1MinPausesOverThresholdForGrowth=4) would maintain the existing > > behavior. > > > > Alternatives > > There is currently no good alternative. Potentially we could configure > > G1 aggressively to trigger GC very frequently e.g.: > > -XX:-G1UseAdaptiveIHOP -XX:InitiatingHeapOccupancyPercent=20 > > -XX:GCTimeRatio=4 -XX:MinHeapFreeRatio=20 -XX:MaxHeapFreeRatio=60 > > Even with this configuration we see occasional large memory spikes > > where heap is quickly expanded. Even though the expanded heap > > contracts eventually, this poses a significant problem because in > > practice we don't know if such a spike could have been avoided so it > > is not obvious how much memory the application really needs. Of course > > such configuration would also consume more CPU. > > The suggestion changes > > a) the aggressiveness of expansion if it has been decided that G1 should > expand (G1ScaleWithHeapPauseTimeThreshold); looking at this particular > piece of code, this behavior actually seems strange and unexpected. I.e. > given that the user sets a GCTimeRatio, for some reason allow G1 to > basically override it to a large extent. > > The reason is mostly historical: I collected thoughts in > https://bugs.openjdk.org/browse/JDK-8349978.
> > Note that just removing this behavior has quite a few unintended > consequences as heap sizing is very much interconnected with general > performance behavior. > > b) makes G1 more lazy about determining whether it needs to expand > (G1MinPausesOverThresholdForGrowth) by increasing the number of > consecutive GCs that GCTimeRatio needs to be over the threshold to cause > expansion. > (That's just exposing an internal constant :)) > > > These changes cover expansion behavior, but not shrinking again. I > believe that still the other slew of options mentioned above > > (-XX:-G1UseAdaptiveIHOP -XX:InitiatingHeapOccupancyPercent=20 > -XX:MinHeapFreeRatio=20 -XX:MaxHeapFreeRatio=60) > > is needed to keep the heap stable and shrinking again over time (it may > work with just changing GCTimeRatio in your particular case). > > That seems awfully complicated for an end user, and indicative of > papering over the problem. We would like to avoid this. > > > As Kirk in his other email in the thread indicates, there is work > underway to make the VM (and G1) aware of other memory consumers in the > VM. Not sure if that would also fix your problem in a more user friendly > (and hopefully generic) way. > > > > Wouldn't the option to make G1 to keep GCTimeRatio better (e.g. > https://bugs.openjdk.org/browse/JDK-8238687), and/or some configurable > soft heap size goal (https://bugs.openjdk.org/browse/JDK-8236073) that > the collector will keep also solve your issue while being easier to > configure? > > (There're a lot of connected problems in the bug tracker, so make sure > to follow related issues). > > Maybe you are interested and can find something to work on in that area; > there has actually already been a lot of investigation (and some > resulting, unfinished patches) in that area, so feel free to ask. > > Thanks, > Thomas > > Fwiw, we tried to label issues related to this area, see > https://bugs.openjdk.org/issues/?jql=labels%20%3D%20gc-g1-heap-resizing . 
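[Editor's note: the controller idea floated above — some component that adjusts a SoftMaxHeapSize-style goal based on observed GC behavior, raising it when GC pressure is too high — might look roughly like the following. This is a speculative sketch: the thresholds, step sizes, and the controller itself are assumptions for illustration, not anything currently shipped in G1.]

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>

// Speculative sketch of a feedback controller for a soft heap size goal.
// Thresholds and step sizes are illustrative assumptions only.
class SoftMaxController {
  std::size_t _soft_max;  // current soft heap goal, bytes
  std::size_t _hard_max;  // -Xmx equivalent; never exceeded
  std::size_t _floor;     // never shrink below this
public:
  SoftMaxController(std::size_t initial, std::size_t hard_max, std::size_t floor)
      : _soft_max(initial), _hard_max(hard_max), _floor(floor) {}

  std::size_t soft_max() const { return _soft_max; }

  // gc_cpu_fraction: recent share of CPU time spent in GC, in [0, 1].
  // Raise the goal when GC pressure is high, lower it when there is headroom.
  void update(double gc_cpu_fraction) {
    const double too_high = 0.10;  // assumed acceptable GC overhead ceiling
    const double low      = 0.02;  // assumed "plenty of headroom" level
    if (gc_cpu_fraction > too_high) {
      _soft_max = std::min(_hard_max, _soft_max + _soft_max / 5);  // grow 20%
    } else if (gc_cpu_fraction < low) {
      _soft_max = std::max(_floor, _soft_max - _soft_max / 10);    // shrink 10%
    }
  }
};
```

The point of such a design is that the user states a GC overhead goal rather than heap expansion internals, and the goal value drifts only in response to sustained pressure.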
From kirk at kodewerk.com Thu Feb 13 14:35:36 2025 From: kirk at kodewerk.com (Kirk Pepperdine) Date: Thu, 13 Feb 2025 06:35:36 -0800 Subject: Configurable G1 heap expansion aggressiveness In-Reply-To: References: <553f4d95-14a4-4736-b10d-02b8bb3af686@oracle.com> Message-ID: <60612080-69DD-46DC-AA5B-ED078C3A3793@kodewerk.com> Hi Jaroslaw, > On Feb 13, 2025, at 5:24 AM, Jaroslaw Odzga wrote: > > Thank you Kirk and Thomas for your answers! > > What Kirk describes sounds great, is the right long term approach and > I can't wait for it to be shipped. It also sounds like a feature we > might need to wait for a while (please correct me if I am wrong). If you look at the ZGC code as a model I believe you'll find that it's something that can be achieved by making the appropriate adjustments to the ergonomics. So while the knowledge needed to make the changes is non-trivial, the actual coding effort isn't something that makes this a 'long term approach'. Our decision to focus on Serial was twofold. First, work on G1 is already taking place and given the progress there we thought best to focus on the Serial collector. This is because the Serial collector is default for small deployments which are fairly common. I personally see AHS to be a stepping stone to being able to 'hibernate' idle JVMs, something that isn't really possible at the moment. Being able to wake up a hibernated JVM should be far cheaper than spinning up a new one taking into account all of the container costs. The data that I've collected suggests that starting a JVM is only a small fraction of the total costs of spinning up a new container. And that doesn't include warmup. The complication with the Serial collector is in how heap is structured and consequently, where data resides in memory after a collection cycle.
We have rearranged where the generations reside so that ergonomics has the freedom to resize individual generational spaces without having to take on the cost of copying data about to accommodate that resizing. This work will land as soon as I address Thomas's concerns in the JBS. This work sets us up for the next steps which I believe should come more quickly now that we've set the foundation for it. What we're looking to do is safely resize each generation according to its current needs while taking into account global memory pressure. In my experience, a lot more memory than is needed gets committed to Java heap simply to accommodate the current sizing policies. Resizing generational spaces individually allows us to end up with heap configurations that are currently unsafe. For example, it is common that GC log data tells me that Eden should be 2 or 3x the size of tenured. Currently, configuring Java heap to accommodate this need risks OOME being thrown or unnecessarily enlarging heap (Tenured) to safely allow for a much larger Eden. Getting this internal tuning right reduces both GC overhead and memory footprint. This also allows us to easily completely collapse heap should a JVM become idle. While there are significant differences between G1 and the Serial collector, there are also similarities with the tuning strategies. In my opinion, the work needed for G1 is easier than it is for the Serial collector simply because of how Java heap is structured. That said, a tuning strategy for G1 is more complicated because the costs of transients is quite different in G1 than it is with the Serial/Parallel collectors. But I believe it is achievable using existing flags/structures and the addition of the SoftMaxHeapSize. If I might add, in large homogenous deployments, you'd think you'd see a one-size-fits-all optimal GC configuration. Unfortunately my look into this has shown that there are often multiple optimal configurations.
The only way to combat this is with smarter ergonomics in the runtime. > > My proposal is just a tiny stopgap that might help alleviate some of > the problems but does not attempt to be a holistic solution and, as > you pointed out, has downsides. > I totally agree with your assessment: it is just exposing internal > constants but the fact that these are constants is part of the problem > because they bake in an eager heap expansion behavior which is not > necessarily desired. > I share your reluctance to adding more obscure tuning flags: it has > maintenance cost and a risk of misuse. I would not recommend anyone > tuning these flags without reading the source code and understanding > the tradeoffs. > These are not silver bullets and, as you pointed out, probably would > have to be used together with other tuning parameters to achieve > reasonable results. > To clarify, the way we plan to use these flags is to establish a > constant set of tuning parameters that achieve a good tradeoff between > latency, throughput and footprint and apply it to a large number of > services. > We want to avoid tuning each service individually because it is hard > to scale. Example configuration (used with jdk17): > -XX:+UnlockExperimentalVMOptions -XX:+G1PeriodicGCInvokesConcurrent > -XX:G1PeriodicGCInterval=60000 -XX:G1PeriodicGCSystemLoadThreshold=0 > -XX:GCTimeRatio=9 -XX:G1MixedGCLiveThresholdPercent=85 > -XX:MinHeapFreeRatio=20 -XX:MaxHeapFreeRatio=60 > -XX:MaxGCPauseMillis=200 -XX:GCPauseIntervalMillis=1000 > -XX:-G1UsePreventiveGC -XX:-G1ScaleWithHeapPauseTimeThreshold > -XX:G1MinPausesOverThresholdForGrowth=10 A nightmare that can be avoided with smarter ergonomics. > From experiments so far it seems that we can leave the adaptive IHOP > on because even if it mispredicted, e.g. due to allocation spikes, the > heap is not aggressively expanded. > > On the plus side, the change itself is tiny, very localized and could > be trivially backported e.g. all the way to jdk17. 
Most importantly, > it seems to enable significant cost savings. > > At the end of the day it is a tradeoff. Would it help if I provided > examples of the impact this change had on real life applications? At > Databricks we run hundreds of JVM services and initial results are > very promising. Or should I treat this proposal as officially > rejected? > >> Wouldn't the option to make G1 to keep GCTimeRatio better (e.g. >> https://bugs.openjdk.org/browse/JDK-8238687), and/or some configurable >> soft heap size goal (https://bugs.openjdk.org/browse/JDK-8236073) that >> the collector will keep also solve your issue while being easier to >> configure? > Thanks for sharing these. The JDK-8238687 focuses on uncommit while > the heap expansion hurts the most. > The SoftMaxHeapSize could be used as a building block towards a > solution. I think there still would have to be some controller that > adjusts the value of SoftMaxHeapSize based on GC behavior e.g. > increase it when GC pressure is too high. Having more data is always a good thing so I would welcome anything you can share. I pub'ed a table that suggests that GC CPU utilization, and not allocation rates, is a key metric to drive heap sizing. The other key metric is availability of RAM. Again, ZGC has this worked out so we're integrating that work into ours. Kind regards, Kirk -------------- next part -------------- An HTML attachment was scrubbed... URL: From wkemper at openjdk.org Thu Feb 13 16:37:21 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 13 Feb 2025 16:37:21 GMT Subject: Integrated: 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) In-Reply-To: References: Message-ID: On Thu, 13 Feb 2025 00:15:43 GMT, William Kemper wrote: > Restore weak roots rendezvous handshake.
This is necessary to have mutators complete the LRB before the concurrent GC invalidates any oop handles that may exist in native stacks. This pull request has now been integrated. Changeset: 28e744dc Author: William Kemper URL: https://git.openjdk.org/jdk/commit/28e744dc642db8ebe376403f28630438a5ee3f44 Stats: 19 lines in 1 file changed: 14 ins; 0 del; 5 mod 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) Reviewed-by: shade ------------- PR: https://git.openjdk.org/jdk/pull/23604 From wkemper at openjdk.org Thu Feb 13 16:59:08 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 13 Feb 2025 16:59:08 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v10] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). William Kemper has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. 
The pull request contains 28 additional commits since the last revision: - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads - Improve message for assertion - Make shutdown safer for threads requesting (or expecting) gc - Do not accept requests if control thread is terminating - Notify waiters when control thread terminates - Add event for control thread state changes - Fix shutdown livelock error - Fix includes - Simplify locking protocol - Make shutdown more robust, make better use of request lock - ... and 18 more: https://git.openjdk.org/jdk/compare/06ea83a4...51d09207 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/779492c6..51d09207 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=09 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=08-09 Stats: 12604 lines in 600 files changed: 8568 ins; 1551 del; 2485 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From jarek.odzga at gmail.com Thu Feb 13 20:36:52 2025 From: jarek.odzga at gmail.com (Jaroslaw Odzga) Date: Thu, 13 Feb 2025 12:36:52 -0800 Subject: Configurable G1 heap expansion aggressiveness In-Reply-To: <60612080-69DD-46DC-AA5B-ED078C3A3793@kodewerk.com> References: <553f4d95-14a4-4736-b10d-02b8bb3af686@oracle.com> <60612080-69DD-46DC-AA5B-ED078C3A3793@kodewerk.com> Message-ID: Hi Kirk, Thanks for the detailed answer, I appreciate your time on this. Would you mind sharing more details on the work you are referring to (Serial collector changes, the ongoing G1 work) so I could learn more about it? It sounds like the "hibernation" feature you are talking about is different from Azul's CRaC that uses CRIU? Can you elaborate? 
> That said, a tuning strategy for G1 is more complicated because the costs of transients is quite different in G1 than it is with the Serial/Parallel collectors. But I believe it is achievable using existing flags/structures and the addition of the SoftMaxHeapSize. Can you share more on how SoftMaxHeapSize fits into this strategy? Doesn't it require some "controller" that would dynamically adjust SoftMaxHeapSize at runtime based on signals like GC CPU usage, VM memory pressure etc? > If I might add, in large homogenous deployments, you'd think you'd see a 1 size fits all optimal GC configuration. Unfortunately my look into this has shown that there are often multiple optimal configurations. The only way to combat this is with smarter ergonomics in the runtime. Thanks for the insight. I believe this to be true. My claim is that for the majority of applications in certain domains (e.g. backend services in multi-tenant environments running in the cloud) the existing default G1 configuration and ergonomics work well only if the max heap size is correctly sized (because of the greedy heap expansion). Sizing heaps is challenging at scale and the most common result is setting max heap too high. This leads to a lot of resource waste that is hard to detect and realize for many because "the heap is used". I guess the question boils down to: is it worth exposing two more internal G1 parameters in the "short term" as experimental tunables to allow some high leverage optimizations. I think we agree on the cost of doing it (although it might be hard to quantify): maintenance, potential misuse, additional complexity.
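The controller being discussed, one that nudges SoftMaxHeapSize up when GC pressure is too high and down when it is low, might compute its next value along these lines. This is only a rough sketch: every name, the 10% step, and the threshold logic are invented for illustration and are not existing HotSpot code.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>

// Hypothetical proportional step for a SoftMaxHeapSize controller:
// raise the soft heap goal when GC CPU overhead exceeds a target,
// lower it when GC is cheap, clamped to [min_bytes, max_bytes].
std::size_t next_soft_max(std::size_t current, double gc_cpu_pct,
                          double target_pct, std::size_t min_bytes,
                          std::size_t max_bytes) {
  std::size_t step = current / 10;  // 10% step size, an arbitrary choice
  std::size_t next = (gc_cpu_pct > target_pct)
                         ? current + step                      // GC too busy: give it room
                         : current - std::min(step, current);  // GC cheap: reclaim memory
  return std::clamp(next, min_bytes, max_bytes);
}
```

For collectors where SoftMaxHeapSize already exists as a manageable flag (ZGC), such a value could be applied at runtime, e.g. via jcmd VM.set_flag; for G1 the flag itself is still prospective, as discussed above.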
Taking into account that the additional flags do not change existing default behavior, so they would be completely transparent unless someone decides to go down the rabbit hole of tuning experimental GC flags, maybe it is something worth considering? Best regards, Jaroslaw On Thu, Feb 13, 2025 at 6:35?AM Kirk Pepperdine wrote: > > Hi Jaroslaw, > > > > On Feb 13, 2025, at 5:24?AM, Jaroslaw Odzga wrote: > > Thank you Kirk and Thomas for your answers! > > What Kirk describes sounds great, is the right long term approach and > I can't wait for it to be shipped. It also sounds like a feature we > might need to wait for a while (please correct me if I am wrong). > > > If you look at the ZGC code as a model I believe you?ll find that it?s something that can be achieved by making the appropriate adjustments to the ergonomics. So while the knowledge needed to make the changes is non-trivial, the actual coding effort isn?t something that makes this a ?long term approach?. > > Our decision to focus on Serial was two fold. First, work on G1 is already taking place and given the progress there we thought best to focus on the Serial collector. This is because the Serial collector is default for small deployments which are fairly common. I personally see AHS to be a stepping stone to being able to ?hibernate? idle JVMs, something that isn?t really possible at the moment. Being able to wake up a hibernated JVM should be far cheaper than spinning up a new one taking into account all of the container costs. The data that I?ve collected suggests that starting a JVM is only a small fraction of the total costs of spinning up a new container. And that doesn?t include warmup. > > The complication with the Serial collector is in how heap is structured and consequently, where data resides in memory after a collection cycle. 
We have rearranged where the generations reside so that ergonomics has the freedom to resize individual generational spaces without having to take on the cost of copying data about to accommodate that resizing. This work will land as soon as I address Thomas's concerns in the JBS. > > This work sets us up for the next steps which I believe should come more quickly now that we've set the foundation for it. What we're looking to do is safely resize each generation according to its current needs while taking into account global memory pressure. In my experience, a lot more memory than is needed gets committed to Java heap simply to accommodate the current sizing policies. Resizing generational spaces individually allows us to end up with heap configurations that are currently unsafe. For example, it is common that GC log data tells me that Eden should be 2 or 3x the size of tenured. Currently, configuring Java heap to accommodate this need risks OOME being thrown or unnecessarily enlarging heap (Tenured) to safely allow for a much larger Eden. Getting this internal tuning right reduces both GC overhead and memory footprint. This also allows us to easily completely collapse heap should a JVM become idle. > > While there are significant differences between G1 and the Serial collector, there are also similarities with the tuning strategies. In my opinion, the work needed for G1 is easier than it is for the Serial collector simply because of how Java heap is structured. That said, a tuning strategy for G1 is more complicated because the costs of transients is quite different in G1 than it is with the Serial/Parallel collectors. But I believe it is achievable using existing flags/structures and the addition of the SoftMaxHeapSize. > > If I might add, in large homogenous deployments, you'd think you'd see a 1 size fits all optimal GC configuration. Unfortunately my look into this has shown that there are often multiple optimal configurations.
The only way to combat this is with smarter ergonomics in the runtime. > > > My proposal is just a tiny stopgap that might help alleviate some of > the problems but does not attempt to be a holistic solution and, as > you pointed out, has downsides. > I totally agree with your assessment: it is just exposing internal > constants but the fact that these are constants is part of the problem > because they bake in an eager heap expansion behavior which is not > necessarily desired. > I share your reluctance to adding more obscure tuning flags: it has > maintenance cost and a risk of misuse. I would not recommend anyone > tuning these flags without reading the source code and understanding > the tradeoffs. > These are not silver bullets and, as you pointed out, probably would > have to be used together with other tuning parameters to achieve > reasonable results. > To clarify, the way we plan to use these flags is to establish a > constant set of tuning parameters that achieve a good tradeoff between > latency, throughput and footprint and apply it to a large number of > services. > We want to avoid tuning each service individually because it is hard > to scale. Example configuration (used with jdk17): > -XX:+UnlockExperimentalVMOptions -XX:+G1PeriodicGCInvokesConcurrent > -XX:G1PeriodicGCInterval=60000 -XX:G1PeriodicGCSystemLoadThreshold=0 > -XX:GCTimeRatio=9 -XX:G1MixedGCLiveThresholdPercent=85 > -XX:MinHeapFreeRatio=20 -XX:MaxHeapFreeRatio=60 > -XX:MaxGCPauseMillis=200 -XX:GCPauseIntervalMillis=1000 > -XX:-G1UsePreventiveGC -XX:-G1ScaleWithHeapPauseTimeThreshold > -XX:G1MinPausesOverThresholdForGrowth=10 > > > A nightmare that can be avoided with smarter ergonomics. > > > From experiments so far it seems that we can leave the adaptive IHOP > on because even if it mispredicted, e.g. due to allocation spikes, the > heap is not aggressively expanded. > > On the plus side, the change itself is tiny, very localized and could > be trivially backported e.g. 
all the way to jdk17. Most importantly, > it seems to enable significant cost savings. > > At the end of the day it is a tradeoff. Would it help if I provided > examples of the impact this change had on real life applications? At > Databricks we run hundreds of JVM services and initial results are > very promising. Or should I treat this proposal as officially > rejected? > > Wouldn't the option to make G1 to keep GCTimeRatio better (e.g. > https://bugs.openjdk.org/browse/JDK-8238687), and/or some configurable > soft heap size goal (https://bugs.openjdk.org/browse/JDK-8236073) that > the collector will keep also solve your issue while being easier to > configure? > > Thanks for sharing these. The JDK-8238687 focuses on uncommit while > the heap expansion hurts the most. > The SoftMaxHeapSize could be used as a building block towards a > solution. I think there still would have to be some controller that > adjusts the value of SoftMaxHeapSize based on GC behavior e.g. > increase it when GC pressure is too high. > > > Having more data is always a good thing so I would welcome anything you can share. > > I pub'ed a table that suggests that GC CPU utilization, and not allocation rates, is a key metric to drive heap sizing. The other key metric is availability of RAM. Again, ZGC has this worked out so we're integrating that work into ours. > > Kind regards, > Kirk > From wkemper at openjdk.org Thu Feb 13 22:39:08 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 13 Feb 2025 22:39:08 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc In-Reply-To: References: Message-ID: On Tue, 11 Feb 2025 03:31:51 GMT, Kelvin Nilsen wrote: > In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also.
But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. Looks good to me. ------------- Marked as reviewed by wkemper (Committer). PR Review: https://git.openjdk.org/jdk/pull/23552#pullrequestreview-2616371490 From kdnilsen at openjdk.org Thu Feb 13 23:27:48 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 13 Feb 2025 23:27:48 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc [v2] In-Reply-To: References: Message-ID: > In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. 
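The policy quoted above (only escalate to a full GC after two consecutive degenerated cycles that both made bad progress) can be sketched as follows. The class and method names are invented for illustration; this is not the actual Shenandoah code.

```cpp
#include <cassert>

// Track consecutive degenerated cycles with bad progress; upgrade to a
// full GC only once two in a row have failed to make good progress.
class DegenUpgradePolicySketch {
  int _consecutive_bad_degens = 0;
public:
  // Called at the end of each degenerated cycle.
  bool should_upgrade_to_full(bool made_good_progress) {
    if (made_good_progress) {
      _consecutive_bad_degens = 0;  // any good progress resets the streak
      return false;
    }
    // After a single bad degen, retry another concurrent cycle (which will
    // likely degenerate again but reclaims young-gen floating garbage
    // faster than a full GC would); only a second bad degen escalates.
    return ++_consecutive_bad_degens >= 2;
  }
};
```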
The pull request contains three additional commits since the last revision: - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) Reviewed-by: shade - Merge tag 'jdk-25+10' into defer-generational-full-gc Added tag jdk-25+10 for changeset a637ccf2 - Be less eager to upgrade degen to full gc ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23552/files - new: https://git.openjdk.org/jdk/pull/23552/files/17e5e919..8d662e10 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23552&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23552&range=00-01 Stats: 9730 lines in 593 files changed: 6779 ins; 1405 del; 1546 mod Patch: https://git.openjdk.org/jdk/pull/23552.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23552/head:pull/23552 PR: https://git.openjdk.org/jdk/pull/23552 From kdnilsen at openjdk.org Fri Feb 14 01:18:01 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 14 Feb 2025 01:18:01 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v5] In-Reply-To: References: Message-ID: > At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. > > For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. > > This issue was first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress fell short of desired by 10-25%. Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase.
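The fix described above changes the denominator of the progress ratio. In sketch form (the function and parameter names are invented here, not the actual code):

```cpp
#include <cassert>
#include <cstddef>

// Judge degenerated-GC progress against the capacity the collection set
// was actually chosen from: young-generation capacity in generational
// mode, whole-heap capacity otherwise.
bool degen_progress_is_good(std::size_t free_after,
                            std::size_t reference_capacity,
                            double required_ratio) {
  return static_cast<double>(free_after) >=
         required_ratio * static_cast<double>(reference_capacity);
}
```

With the whole heap as the reference, the same amount of reclaimed young-gen memory can look like "no progress" even though the young collection set did fine, which is exactly the spurious upgrade-to-full-GC pattern described in the logs.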
The pull request contains seven additional commits since the last revision: - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) Reviewed-by: shade - Merge tag 'jdk-25+10' into fix-generational-no-progress-check Added tag jdk-25+10 for changeset a637ccf2 - Merge tag 'jdk-25+9' into fix-generational-no-progress-check Added tag jdk-25+9 for changeset 30f71622 - Add comments suggested by reviewers - Respond to reviewer feedback In testing suggested refinements, I discovered a bug in original implementation. ShenandoahFreeSet::capacity() does not represent the size of young generation. It represents the total size of the young regions that had available memory at the time we most recently rebuilt the ShenandoahFreeSet. I am rerunning the performance tests following this suggested change. - Use freeset to determine goodness of progress As previously implemented, we used the heap size to measure goodness of progress. However, heap size is only appropriate for non-generational Shenandoah. Freeset abstraction works for both. - Use size-of young generation to assess progress Previously, we were using size of heap to asses progress of generational degenerated cycle. But that is not appropriate, because the collection set is chosen based on the size of young generation. 
------------- Changes: - all: https://git.openjdk.org/jdk/pull/23306/files - new: https://git.openjdk.org/jdk/pull/23306/files/8c610136..0e86c5bd Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=03-04 Stats: 12378 lines in 689 files changed: 8313 ins; 1890 del; 2175 mod Patch: https://git.openjdk.org/jdk/pull/23306.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23306/head:pull/23306 PR: https://git.openjdk.org/jdk/pull/23306 From kdnilsen at openjdk.org Fri Feb 14 01:35:51 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 14 Feb 2025 01:35:51 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v6] In-Reply-To: References: Message-ID: > Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. > > We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. > > As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. 
And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catch up. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. > > This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 11 additional commits since the last revision: - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) Reviewed-by: shade - Merge tag 'jdk-25+10' into eliminate-no-fault-degen-penalties Added tag jdk-25+10 for changeset a637ccf2 - Merge tag 'jdk-25+9' into eliminate-no-fault-degen-penalties Added tag jdk-25+9 for changeset 30f71622 - Revert "Use generation size to determine expected free" This reverts commit 94a32ebfe5fefcc0e899e09e6fbfc0585c62b4e0. - Respond to reviewer feedback - Use generation size to determine expected free - Respond to reviewer feedback - Fix white space - Remove debug instrumentation - Only penalize heuristic if heuristic responsible If we degenerate through no fault of "late triggering", then do not penalize the heuristic. - ...
and 1 more: https://git.openjdk.org/jdk/compare/961a87d9...0d85e341 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23305/files - new: https://git.openjdk.org/jdk/pull/23305/files/3aabd4db..0d85e341 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=05 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=04-05 Stats: 12378 lines in 689 files changed: 8313 ins; 1890 del; 2175 mod Patch: https://git.openjdk.org/jdk/pull/23305.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23305/head:pull/23305 PR: https://git.openjdk.org/jdk/pull/23305 From wkemper at openjdk.org Fri Feb 14 01:48:58 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 14 Feb 2025 01:48:58 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v11] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). 
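The first two bullets above (a single recorded GCCause driving all cancellation handling, replacing several independent flags) might look roughly like this. The names are invented, and the sketch uses std::atomic rather than HotSpot's Atomic:: wrappers:

```cpp
#include <atomic>
#include <cassert>

enum class GCCause { _no_gc, _allocation_failure,
                     _humongous_allocation_failure, _service_shutdown };

// One atomic cause stands in for the removed "graceful shutdown",
// "alloc failure", "humongous alloc failure" and "preemption" flags.
class CancellationStateSketch {
  std::atomic<GCCause> _cancelled_gc{GCCause::_no_gc};
public:
  // First cancellation wins; a later request must not overwrite the
  // original reason while it is still being handled.
  bool try_cancel(GCCause cause) {
    GCCause expected = GCCause::_no_gc;
    return _cancelled_gc.compare_exchange_strong(expected, cause);
  }
  bool is_cancelled() const { return _cancelled_gc.load() != GCCause::_no_gc; }
  GCCause cause()     const { return _cancelled_gc.load(); }
  void clear()              { _cancelled_gc.store(GCCause::_no_gc); }
};
```

Folding the flags into one value is what lets the handling be "driven entirely by the cancellation cause": there is a single source of truth to branch on, and the compare-exchange keeps concurrent requesters from racing each other.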
> > ## Testing > * jtreg hotspot_gc_shenandoah > * dacapo, extremem, diluvian, specjbb2015, specjvm2018, heapothesys William Kemper has updated the pull request incrementally with one additional commit since the last revision: Old gen bootstrap cycle must make it to init mark ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/51d09207..82f96090 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=10 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=09-10 Stats: 5 lines in 1 file changed: 0 ins; 5 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From xpeng at openjdk.org Fri Feb 14 06:58:37 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Fri, 14 Feb 2025 06:58:37 GMT Subject: RFR: 8338737: Shenandoah: Reset marking bitmaps after the cycle [v8] In-Reply-To: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com> References: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com> Message-ID: > Reset marking bitmaps after the collection cycle; for GenShen, only do this for the young generation. Also choose not to do this for Degen and full GC, since both run at a safepoint and we should leave the safepoint ASAP. > > I have run the same workload for 30s with Shenandoah in generational mode and classic mode; the average time of concurrent reset dropped significantly, since in most cases the bitmap for young gen should have been reset after the previous concurrent cycle finishes if there is no need to preserve bitmap states.
> > GenShen: > Before: > > [33.342s][info][gc,stats ] Concurrent Reset = 0.023 s (a = 1921 us) (n = 12) (lvls, us = 133, 385, 1191, 1836, 8878) > > > After: > > [33.597s][info][gc,stats ] Concurrent Reset = 0.004 s (a = 317 us) (n = 13) (lvls, us = 58, 119, 217, 410, 670) > [33.597s][info][gc,stats ] Concurrent Reset After Collect = 0.018 s (a = 1365 us) (n = 13) (lvls, us = 91, 186, 818, 1836, 3872) > > > Shenandoah: > Before: > > [33.144s][info][gc,stats ] Concurrent Reset = 0.014 s (a = 1067 us) (n = 13) (lvls, us = 139, 277, 898, 1328, 2118) > > After: > > [33.128s][info][gc,stats ] Concurrent Reset = 0.003 s (a = 225 us) (n = 13) (lvls, us = 32, 92, 137, 295, 542) > [33.128s][info][gc,stats ] Concurrent Reset After Collect = 0.009 s (a = 661 us) (n = 13) (lvls, us = 92, 160, 594, 896, 1661) > > > Additional changes: > * Remove `ShenandoahResetBitmapClosure` and `ShenandoahPrepareForMarkClosure`, merge the code with `ShenandoahResetBitmapClosure`, saving one iteration over all the regions. > * Use API `ShenandoahGeneration::parallel_heap_region_iterate_free` to iterate the regions, two benefits from this: > - Underneath it calls `ShenandoahHeap::parallel_heap_region_iterate`, which is faster for very light tasks, see https://bugs.openjdk.org/browse/JDK-8337154 > - `ShenandoahGeneration::parallel_heap_region_iterate_free` decorate the closure with `ShenandoahExcludeRegionClosure`, which simplifies the code in closure. > * When `_do_old_gc_bootstrap is true`, instead of reset mark bitmap for old gen separately, simply reset the global generations, so we don't need walk the all regions twice. > * Clean up FullGC code, remove duplicate code. > > Additional tests: > - [x] CONF=macosx-aarch64-server-fastdebug make test T... Xiaolong Peng has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. 
The pull request contains 24 additional commits since the last revision: - Merge branch 'openjdk:master' into reset-bitmap - Merge branch 'openjdk:master' into reset-bitmap - Merge branch 'openjdk:master' into reset-bitmap - Merge branch 'openjdk:master' into reset-bitmap - Adding condition "!_do_old_gc_bootstrap && !heap->is_concurrent_old_mark_in_progress()" back and address some PR comments - Remove entry_reset_after_collect from ShenandoahOldGC - Remove condition check !_do_old_gc_bootstrap && !heap->is_concurrent_old_mark_in_progress() from op_reset_after_collect - Merge branch 'openjdk:master' into reset-bitmap - Address review comments - Merge branch 'openjdk:master' into reset-bitmap - ... and 14 more: https://git.openjdk.org/jdk/compare/a90afca6...c7e9bff3 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/22778/files - new: https://git.openjdk.org/jdk/pull/22778/files/92c63159..c7e9bff3 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=22778&range=07 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=22778&range=06-07 Stats: 66728 lines in 3845 files changed: 34690 ins; 16712 del; 15326 mod Patch: https://git.openjdk.org/jdk/pull/22778.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22778/head:pull/22778 PR: https://git.openjdk.org/jdk/pull/22778 From thomas.schatzl at oracle.com Fri Feb 14 09:21:27 2025 From: thomas.schatzl at oracle.com (Thomas Schatzl) Date: Fri, 14 Feb 2025 10:21:27 +0100 Subject: G1 AHS [Was: Re: Configurable G1 heap expansion aggressiveness] In-Reply-To: References: <553f4d95-14a4-4736-b10d-02b8bb3af686@oracle.com> <60612080-69DD-46DC-AA5B-ED078C3A3793@kodewerk.com> Message-ID: Hi, On 13.02.25 21:36, Jaroslaw Odzga wrote: > Hi Kirk, > > Thanks for the detailed answer, I appreciate your time on this. > [...] > >> That said, a tuning strategy for G1 is more complicated because the costs of transients is quite different in G1 than it is with the Serial/Parallel collectors. 
But I believe it is achievable using existing flags/structures and the addition of the SoftMaxHeapSize. > Can you share more on how SoftMaxHeapSize fits into this strategy? > Doesn't it require some "controller" that would dynamically adjust > SoftMaxHeapSize at runtime based on signals like GC CPU usage, VM > memory pressure etc?

Giving a rough summary about the system envisioned for G1 (warning: fixed size font needed ahead):

Inputs:

  Min/Max/Initial-
  HeapSize (1)

  CPU based heap
  sizing (2)                                  Current committed heap size

  Min/MaxHeapFree-    ----> Controller ---->
  Ratio (3)
                                              Current target heap size
  CurrentMaxHeap-
  Size (4)

  SoftMaxHeapSize (5)

  "AHS" (6)

(1) Existing, kept.

(2) Improve current CPU based heap sizing; partially done. Largest improvement is to size down the heap based on GCTimeRatio. To some degree this is JDK-8238687; there is a prototype patch for that.

(3) The current function of the flags will be removed, if not the flags themselves. They are just in the way all the time.

(4) New functionality (JDK-8204088; may not be a JEP). Patch from Google available. Allows the user to control max heap size given external direction, using information not known (or impossible to know) by the VM. (I.e. the VM will OOME if going over that, but smaller than the real MaxHeapSize.) Optional.

(5) New functionality (JDK-8236073). A guide for G1 to try "hard" to keep that amount of memory. Will not make the VM go OOME if exceeding that. Prototype patch attached to the CR.

(6) "AHS": Similar to ZGC's efforts to be a good citizen within a given environment, probing currently available memory and adjusting committed size (https://openjdk.org/jeps/8329758).

There are some other, relatively minor issues that should/could be fixed, also collected using the `gc-g1-heap-resizing` label in JIRA (https://bugs.openjdk.org/issues/?jql=labels%20%3D%20gc-g1-heap-resizing). Some of them deal with the unnecessarily aggressive boosting of heap sizing.
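One way to read the diagram is that the controller is a function from the inputs on the left to the sizes on the right. A deliberately naive sketch follows; all names, the struct layout, and the priority order of the clamps are invented here for illustration, not a design commitment:

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>

struct HeapSizingInputs {
  std::size_t min_heap;       // (1) MinHeapSize
  std::size_t gc_cpu_target;  // (2) size suggested by GC CPU / GCTimeRatio
  std::size_t current_max;    // (4) CurrentMaxHeapSize: hard cap, OOME beyond
  std::size_t soft_max;       // (5) SoftMaxHeapSize: soft goal, 0 = unset
  std::size_t env_available;  // (6) what "AHS" thinks the environment allows
};

// Combine the inputs into a single target heap size for the policy.
std::size_t target_heap_size(const HeapSizingInputs& in) {
  std::size_t target = in.gc_cpu_target;        // start from the GC signal
  if (in.soft_max != 0) {
    target = std::min(target, in.soft_max);     // soft goal only pulls down
  }
  target = std::min(target, in.env_available);  // be a good citizen
  target = std::min(target, in.current_max);    // never past the hard cap
  return std::max(target, in.min_heap);         // never below the minimum
}
```

The real design differs in at least one important way: per (5), SoftMaxHeapSize must still allow allocation above the goal instead of capping it outright, so the actual controller would treat it as a pressure signal rather than the hard min shown here.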
I'm not sure whether there should be user control for the response time. Even if so, I would rather have a more abstract "inertia" knob than directly allowing control of the sample length. Maybe "ZGCPressure" as in the AHS JEP may also control that. (Don't let it bother you that most of the CRs are currently assigned to me. I should unassign myself for the time being because I'm working on something else right now.) Given all these inputs about what the heap size should be, some component, let's call it the "controller", will decide current committed/target/max heap size. Hth, Thomas From thomas.schatzl at oracle.com Fri Feb 14 10:04:13 2025 From: thomas.schatzl at oracle.com (Thomas Schatzl) Date: Fri, 14 Feb 2025 11:04:13 +0100 Subject: G1 AHS [Was: Re: Configurable G1 heap expansion aggressiveness] In-Reply-To: References: <553f4d95-14a4-4736-b10d-02b8bb3af686@oracle.com> <60612080-69DD-46DC-AA5B-ED078C3A3793@kodewerk.com> Message-ID: Hi, On 13.02.25 21:36, Jaroslaw Odzga wrote: > Hi Kirk, > > Thanks for the detailed answer, I appreciate your time on this. > > Would you mind sharing more details on the work you are referring to > (Serial collector changes, the ongoing G1 work) so I could learn more > about it? > It sounds like the "hibernation" feature you are talking about is > different from Azul's CRaC that uses CRIU? Can you elaborate? > >> That said, a tuning strategy for G1 is more complicated because the costs of transients is quite different in G1 than it is with the Serial/Parallel collectors. But I believe it is achievable using existing flags/structures and the addition of the SoftMaxHeapSize. > Can you share more on how SoftMaxHeapSize fits into this strategy? > Doesn't it require some "controller" that would dynamically adjust > SoftMaxHeapSize at runtime based on signals like GC CPU usage, VM > memory pressure etc?
>

Some rough outline of the structure of heap sizing and what is
envisioned for the G1 work; it's fairly straightforward: given some
input, a "controller" mashes all of it together and produces output.
Some ascii-art with a bit more detail:

  Min/MaxInitialHeap-
  size (1)
                                             Current max heap size
  GC CPU based heap
  sizing (2)                                 Current committed size

  Min/MaxHeapFree-   ---->  Controller  ---->
  Ratio (3)
                                             Current target heap size
  CurrentMaxHeapSize (4)

  SoftMaxHeapSize (5)

  AHS (6)

Comments:

(1) will stay
(2) Partially implemented; the main part that is missing is somehow
    giving feedback on shrinking the heap. JDK-8238687 mostly covers
    that, there is a (fairly old now) prototype.
(3) Their functionality will be fairly reduced if not completely
    scrapped. They get in the way all the time, at least given their
    current definition.
(4) Set a current MaxHeapSize, i.e. a maximum heap size after which the
    VM will OOME. Allows setting the max heap size according to input
    that the VM can't detect. Prototype from Google available;
    JDK-8204088 contains a link to some mail thread containing another
    prototype.
(5) End user guiding a current target heap size, i.e. a heap size that
    G1 tries "hard" to follow, but will allow allocation above that.
    JDK-8236073 contains an old prototype.
(6) "AHS": be a nice citizen to other memory consumers in the same
    environment, similar to ZGC's JEP (JDK-8329758):
    - tiny initial heap size, large max heap size
    - startup boost for heap sizing
    - taking into account available memory in the environment
    - "ZGCPressure" - some knob to tune weights between all these
      inputs (i.e. memory/performance, response time/inertia).

More (minor) items in this area are collected using the
"gc-g1-heap-resizing" label
(https://bugs.openjdk.org/issues/?jql=labels%20%3D%20gc-g1-heap-resizing).

(That most of these CRs are assigned is misleading.
That's historical, I'm working on something else for the foreseeable
future.)

The ZGCPressure equivalent should cover all the capabilities covered by
your changes, both the extent of the response as well as the response
time itself.

Hth,
  Thomas

P.S.: I'll respond to the other part of your email(s) in a bit.

From thomas.schatzl at oracle.com Fri Feb 14 10:06:30 2025
From: thomas.schatzl at oracle.com (Thomas Schatzl)
Date: Fri, 14 Feb 2025 11:06:30 +0100
Subject: G1 AHS [Was: Re: Configurable G1 heap expansion aggressiveness]
In-Reply-To:
References: <553f4d95-14a4-4736-b10d-02b8bb3af686@oracle.com>
 <60612080-69DD-46DC-AA5B-ED078C3A3793@kodewerk.com>
Message-ID:

Sorry for effectively sending this email twice; due to some email
client error I thought it had not been sent (and not even been saved
anywhere, hence the rewrite with minor differences).

Apologies,
  Thomas

From thomas.schatzl at oracle.com Fri Feb 14 10:32:24 2025
From: thomas.schatzl at oracle.com (Thomas Schatzl)
Date: Fri, 14 Feb 2025 11:32:24 +0100
Subject: Configurable G1 heap expansion aggressiveness
In-Reply-To:
References: <553f4d95-14a4-4736-b10d-02b8bb3af686@oracle.com>
 <60612080-69DD-46DC-AA5B-ED078C3A3793@kodewerk.com>
Message-ID:

Hi,

On 13.02.25 21:36, Jaroslaw Odzga wrote:
> Hi Kirk,
[...]
>> If I might add, in large homogeneous deployments, you'd think you'd
>> see a one-size-fits-all optimal GC configuration. Unfortunately my
>> look into this has shown that there are often multiple optimal
>> configurations. The only way to combat this is with smarter
>> ergonomics in the runtime.
> Thanks for the insight. I believe this to be true. My claim is that
> for the majority of applications in certain domains (e.g. backend
> services in multi-tenant environments running in the cloud) the
> existing default G1 configuration and ergonomics work well only if the
> max heap size is correctly sized (because of the greedy heap
> expansion).
> Sizing heaps is challenging at scale and the most common result is
> setting max heap too high. This leads to a lot of resource waste that
> is hard to detect and realize for many because "the heap is used".
>
> I guess the question boils down to: is it worth exposing two more
> internal G1 parameters in the "short term" as experimental tunables to
> allow some high-leverage optimizations.
> I think we agree on the cost of doing it (although it might be hard to
> quantify): maintenance, potential misuse, additional complexity.

We (in the Oracle GC team) have had really, really bad experience with
haphazardly exposing functionality using flags. They tend to live
longer than expected, and due to a responsibility for backwards
compatibility they just hinder progress (e.g. in this area:
Min/MaxHeapFreeRatio).

Further complicating matters is that you seem to expect this to be
backported all the way back to JDK 17 ("On the plus side, the change
itself is tiny, very localized and could be trivially backported e.g.
all the way to jdk17."), which is an even greater ask.

> On a benefit side, the initial results suggest we could significantly
> increase bin-packing of JVMs per VM because, when bin-packing, we have
> to account for memory usage spikes due to temporary aggressive heap
> expansions. Rough estimates suggest 30%-60% smaller memory spikes. At
> large scale this could lead to big cost savings with a little effort.

At this time it is only you asking for this, although the general
problem has been known for a long time. Given that, and that a lot of
the issues related to that were filed around 2020 by me
(https://bugs.openjdk.org/issues/?jql=labels%20%3D%20gc-g1-heap-resizing),
after first concerns had been voiced in 2018, it tells me that it has
not been that great of an itch to invest effort into this until now.
Thanks go to Google and MS who are looking into this in some capacity at this time :) > Taking into account that the additional flags do not change existing > default behavior, so they would be completely transparent unless > someone decides to go down the rabbit hole of tuning experimental GC > flags, maybe it is something worth considering? We at Oracle currently think it is better to invest the time and effort, as little as may be, in a proper solution than band-aiding heap sizing another time. So yeah, barring other compelling supportive input/opinions/data, we at Oracle would reject this proposal for jdk 25 (and even more backporting this to jdk17). Hth, Thomas From phh at openjdk.org Fri Feb 14 10:41:13 2025 From: phh at openjdk.org (Paul Hohensee) Date: Fri, 14 Feb 2025 10:41:13 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v6] In-Reply-To: References: Message-ID: On Fri, 14 Feb 2025 01:35:51 GMT, Kelvin Nilsen wrote: >> Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. >> >> We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. 
During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. >> >> As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catch up. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. >> >> This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. > > Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 11 additional commits since the last revision: > > - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) > > Reviewed-by: shade > - Merge tag 'jdk-25+10' into eliminate-no-fault-degen-penalties > > Added tag jdk-25+10 for changeset a637ccf2 > - Merge tag 'jdk-25+9' into eliminate-no-fault-degen-penalties > > Added tag jdk-25+9 for changeset 30f71622 > - Revert "Use generation size to determine expected free" > > This reverts commit 94a32ebfe5fefcc0e899e09e6fbfc0585c62b4e0. > - Respond to reviewer feedback > - Use generation size to determine expected free > - Respond to reviewer feedback > - Fix white space > - Remove debug instrumentation > - Only penalize heuristic if heuristic responsible > > If we degenerate through no fault of "late triggering", then do not > penalize the heuristic.
> - ... and 1 more: https://git.openjdk.org/jdk/compare/0a5fcdaf...0d85e341 Marked as reviewed by phh (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23305#pullrequestreview-2617399579 From phh at openjdk.org Fri Feb 14 10:46:15 2025 From: phh at openjdk.org (Paul Hohensee) Date: Fri, 14 Feb 2025 10:46:15 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v5] In-Reply-To: References: Message-ID: On Fri, 14 Feb 2025 01:18:01 GMT, Kelvin Nilsen wrote: >> At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. >> >> For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. >> >> This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. > > Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains seven additional commits since the last revision: > > - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) > > Reviewed-by: shade > - Merge tag 'jdk-25+10' into fix-generational-no-progress-check > > Added tag jdk-25+10 for changeset a637ccf2 > - Merge tag 'jdk-25+9' into fix-generational-no-progress-check > > Added tag jdk-25+9 for changeset 30f71622 > - Add comments suggested by reviewers > - Respond to reviewer feedback > > In testing suggested refinements, I discovered a bug in original > implementation. 
ShenandoahFreeSet::capacity() does not represent the > size of young generation. It represents the total size of the young > regions that had available memory at the time we most recently rebuilt > the ShenandoahFreeSet. > > I am rerunning the performance tests following this suggested change. > - Use freeset to determine goodness of progress > > As previously implemented, we used the heap size to measure goodness of > progress. However, heap size is only appropriate for non-generational > Shenandoah. Freeset abstraction works for both. > - Use size-of young generation to assess progress > > Previously, we were using size of heap to assess progress of generational > degenerated cycle. But that is not appropriate, because the collection > set is chosen based on the size of young generation. Marked as reviewed by phh (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23306#pullrequestreview-2617414395 From duke at openjdk.org Fri Feb 14 15:14:15 2025 From: duke at openjdk.org (duke) Date: Fri, 14 Feb 2025 15:14:15 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v5] In-Reply-To: References: Message-ID: On Fri, 14 Feb 2025 01:18:01 GMT, Kelvin Nilsen wrote: >> At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. >> >> For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. >> >> This issue was first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. > > Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase.
The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains seven additional commits since the last revision: > > - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) > > Reviewed-by: shade > - Merge tag 'jdk-25+10' into fix-generational-no-progress-check > > Added tag jdk-25+10 for changeset a637ccf2 > - Merge tag 'jdk-25+9' into fix-generational-no-progress-check > > Added tag jdk-25+9 for changeset 30f71622 > - Add comments suggested by reviewers > - Respond to reviewer feedback > > In testing suggested refinements, I discovered a bug in original > implementation. ShenandoahFreeSet::capacity() does not represent the > size of young generation. It represents the total size of the young > regions that had available memory at the time we most recently rebuilt > the ShenandoahFreeSet. > > I am rerunning the performance tests following this suggested change. > - Use freeset to determine goodness of progress > > As previously implemented, we used the heap size to measure goodness of > progress. However, heap size is only appropriate for non-generational > Shenandoah. Freeset abstraction works for both. > - Use size-of young generation to assess progress > > Previously, we were using size of heap to assess progress of generational > degenerated cycle. But that is not appropriate, because the collection > set is chosen based on the size of young generation. @kdnilsen Your change (at version 0e86c5bd1ae330522daa9652f7843342fef9f83e) is now ready to be sponsored by a Committer.
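The shape of the progress check being discussed can be sketched as
follows (the divisor and all names are invented for illustration; the
real heuristic lives in the Shenandoah sources and uses its own
threshold):

```cpp
#include <cstddef>

// Fraction of capacity that must be free after a degenerated GC for it
// to count as "good progress". The divisor is invented for this sketch.
static const size_t kProgressDivisor = 10;

static bool good_progress(size_t free_bytes, size_t reference_capacity) {
  return free_bytes >= reference_capacity / kProgressDivisor;
}

// Non-generational mode: the whole heap is the reference capacity,
// which is what the code did before this fix.
static bool good_progress_global(size_t free_bytes, size_t heap_bytes) {
  return good_progress(free_bytes, heap_bytes);
}

// Generational mode (the fix): the collection set is chosen from the
// young generation, so progress is judged against young capacity only.
static bool good_progress_generational(size_t free_bytes, size_t young_bytes) {
  return good_progress(free_bytes, young_bytes);
}
```

With a large heap and a comparatively small young generation, the same
amount of reclaimed memory fails the whole-heap check but passes the
young-generation check, which is exactly the mismatch the PR describes.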
------------- PR Comment: https://git.openjdk.org/jdk/pull/23306#issuecomment-2659585115 From duke at openjdk.org Fri Feb 14 15:16:25 2025 From: duke at openjdk.org (duke) Date: Fri, 14 Feb 2025 15:16:25 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v6] In-Reply-To: References: Message-ID: On Fri, 14 Feb 2025 01:35:51 GMT, Kelvin Nilsen wrote: >> Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. >> >> We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. >> >> As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catchup. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. >> >> This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. 
We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. > > Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 11 additional commits since the last revision: > > - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) > > Reviewed-by: shade > - Merge tag 'jdk-25+10' into eliminate-no-fault-degen-penalties > > Added tag jdk-25+10 for changeset a637ccf2 > - Merge tag 'jdk-25+9' into eliminate-no-fault-degen-penalties > > Added tag jdk-25+9 for changeset 30f71622 > - Revert "Use generation size to determine expected free" > > This reverts commit 94a32ebfe5fefcc0e899e09e6fbfc0585c62b4e0. > - Respond to reviewer feedback > - Use generation size to determine expected free > - Respond to reviewer feedback > - Fix white space > - Remove debug instrumentation > - Only penalize heuristic if heuristic responsible > > If we degenerate through no fault of "late triggering", then do not > penalize the heuristic. > - ... and 1 more: https://git.openjdk.org/jdk/compare/a1bdb2da...0d85e341 @kdnilsen Your change (at version 0d85e34107d74e471a791e0523cabc403e02178c) is now ready to be sponsored by a Committer. 
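The penalty policy described in this PR summary, as a rough sketch (the
type, method names, and increment are all invented; this is not the
real ShenandoahHeuristics code):

```cpp
#include <cstddef>

// Sketch of the triggering-penalty policy. The accumulated penalty
// makes future GC triggers fire earlier.
struct DegenPenalty {
  size_t value = 0;

  // Charge a penalty only when the degeneration can be attributed to a
  // consciously late trigger by this heuristic. A degenerated cycle
  // that cascades from an earlier one is not the trigger's fault, and
  // piling on penalties would only drive GC frequency higher.
  void on_degenerated_cycle(bool trigger_was_late) {
    if (trigger_was_late) {
      value += 10;  // invented increment
    }
  }
};
```

The design point is that the penalty now tracks fault, not mere
occurrence, so cascading degenerations stop compounding it.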
------------- PR Comment: https://git.openjdk.org/jdk/pull/23305#issuecomment-2659590224 From kdnilsen at openjdk.org Fri Feb 14 16:43:17 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 14 Feb 2025 16:43:17 GMT Subject: Integrated: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic In-Reply-To: References: Message-ID: On Fri, 24 Jan 2025 18:18:25 GMT, Kelvin Nilsen wrote: > Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. > > We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. > > As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catchup. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. > > This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. 
We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. This pull request has now been integrated. Changeset: 38322407 Author: Kelvin Nilsen URL: https://git.openjdk.org/jdk/commit/38322407cd1664115e975c7fd9cb61e40d9557b5 Stats: 82 lines in 12 files changed: 78 ins; 0 del; 4 mod 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic Reviewed-by: phh, wkemper ------------- PR: https://git.openjdk.org/jdk/pull/23305 From kdnilsen at openjdk.org Fri Feb 14 16:44:16 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 14 Feb 2025 16:44:16 GMT Subject: Integrated: 8348595: GenShen: Fix generational free-memory no-progress check In-Reply-To: References: Message-ID: On Fri, 24 Jan 2025 18:30:02 GMT, Kelvin Nilsen wrote: > At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. > > For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. > > This issue was first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. This pull request has now been integrated.
Changeset: ba6c9659 Author: Kelvin Nilsen URL: https://git.openjdk.org/jdk/commit/ba6c96599aac1a6c08cb66c611474f83bbc9b260 Stats: 27 lines in 5 files changed: 21 ins; 0 del; 6 mod 8348595: GenShen: Fix generational free-memory no-progress check Reviewed-by: phh, xpeng ------------- PR: https://git.openjdk.org/jdk/pull/23306 From wkemper at openjdk.org Fri Feb 14 17:43:48 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 14 Feb 2025 17:43:48 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). > > ## Testing > * jtreg hotspot_gc_shenandoah > * dacapo, extremem, diluvian, specjbb2015, specjvm2018, heapothesys William Kemper has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains 30 commits: - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads - Old gen bootstrap cycle must make it to init mark - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads - Improve message for assertion - Make shutdown safer for threads requesting (or expecting) gc - Do not accept requests if control thread is terminating - Notify waiters when control thread terminates - Add event for control thread state changes - Fix shutdown livelock error - Fix includes - ... and 20 more: https://git.openjdk.org/jdk/compare/ba6c9659...915ffbda ------------- Changes: https://git.openjdk.org/jdk/pull/23475/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=11 Stats: 892 lines in 18 files changed: 285 ins; 281 del; 326 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From kdnilsen at openjdk.org Fri Feb 14 18:37:54 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 14 Feb 2025 18:37:54 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc [v3] In-Reply-To: References: Message-ID: > In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. 
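The upgrade rule described in this PR summary could be sketched like so
(illustrative only; the function and parameter names are invented, not
the actual Shenandoah sources):

```cpp
// Upgrade a degenerated GC to full GC in generational mode only when
// it is the second consecutive degenerated cycle with bad progress;
// otherwise retry a concurrent cycle, which reclaims floating garbage
// in the young generation faster than a full GC would.
static bool should_upgrade_to_full_gc(bool generational,
                                      bool bad_progress_now,
                                      bool bad_progress_before) {
  if (!generational) {
    return bad_progress_now;  // single-generation behavior (sketch)
  }
  return bad_progress_now && bad_progress_before;
}
```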
Kelvin Nilsen has updated the pull request incrementally with two additional commits since the last revision: - Fix typo in merge conflict resolution - 8348595: GenShen: Fix generational free-memory no-progress check Reviewed-by: phh, xpeng ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23552/files - new: https://git.openjdk.org/jdk/pull/23552/files/8d662e10..0f5051a1 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23552&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23552&range=01-02 Stats: 27 lines in 5 files changed: 21 ins; 0 del; 6 mod Patch: https://git.openjdk.org/jdk/pull/23552.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23552/head:pull/23552 PR: https://git.openjdk.org/jdk/pull/23552 From kdnilsen at openjdk.org Fri Feb 14 18:51:31 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 14 Feb 2025 18:51:31 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc [v4] In-Reply-To: References: Message-ID: > In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains six commits: - Merge master - Fix typo in merge conflict resolution - 8348595: GenShen: Fix generational free-memory no-progress check Reviewed-by: phh, xpeng - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) Reviewed-by: shade - Merge tag 'jdk-25+10' into defer-generational-full-gc Added tag jdk-25+10 for changeset a637ccf2 - Be less eager to upgrade degen to full gc ------------- Changes: https://git.openjdk.org/jdk/pull/23552/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23552&range=03 Stats: 20 lines in 2 files changed: 17 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/23552.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23552/head:pull/23552 PR: https://git.openjdk.org/jdk/pull/23552 From kdnilsen at openjdk.org Fri Feb 14 19:41:16 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 14 Feb 2025 19:41:16 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References: Message-ID: On Fri, 14 Feb 2025 17:43:48 GMT, William Kemper wrote: >> There are several changes to the operation of Shenandoah's control threads here. >> * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. 
>> * The cancellation handling is driven entirely by the cancellation cause >> * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed >> * The shutdown sequence is simpler >> * The generational control thread uses a lock to coordinate updates to the requested cause and generation >> * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance >> * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles >> * The control thread doesn't loop on its own (unless the pacer is enabled). >> >> ## Testing >> * jtreg hotspot_gc_shenandoah >> * dacapo, extremem, diluvian, specjbb2015, specjvm2018, heapothesys > > William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 30 commits: > > - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads > - Old gen bootstrap cycle must make it to init mark > - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads > - Improve message for assertion > - Make shutdown safer for threads requesting (or expecting) gc > - Do not accept requests if control thread is terminating > - Notify waiters when control thread terminates > - Add event for control thread state changes > - Fix shutdown livelock error > - Fix includes > - ... and 20 more: https://git.openjdk.org/jdk/compare/ba6c9659...915ffbda Thank you. This looks very clean to me. src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.cpp line 98: > 96: } > 97: > 98: // In case any threads are waiting for a cycle to happen, let them know it isn't. maybe "it isn't happening", or "it won't happen". ------------- Marked as reviewed by kdnilsen (Author). 
PR Review: https://git.openjdk.org/jdk/pull/23475#pullrequestreview-2618641262 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956626274 From dlong at openjdk.org Fri Feb 14 23:47:12 2025 From: dlong at openjdk.org (Dean Long) Date: Fri, 14 Feb 2025 23:47:12 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: <8JUfZWRWpAhYCG9qO7Jxfj5k6d1iUNpRdawRn-veiBQ=.4b70e450-14e5-429a-aa95-08599673afba@github.com> On Wed, 5 Feb 2025 14:41:39 GMT, Albert Mingkun Yang wrote: >> Here is an attempt to simplify GCLocker implementation for Serial and Parallel. >> >> GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. >> >> The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. 
>> >> Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. >> >> Test: tier1-8 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: > > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - gclocker src/hotspot/share/gc/parallel/parallelScavengeHeap.cpp line 385: > 383: > 384: HeapWord* ParallelScavengeHeap::mem_allocate_old_gen(size_t size) { > 385: if (!should_alloc_in_eden(size) || GCLocker::is_active()) { I don't understand why we are checking is_active() here. The value is not reliable if we aren't at a safepoint, and iterating over all threads seems expensive. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1956881801 From ysr at openjdk.org Sat Feb 15 01:55:20 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Sat, 15 Feb 2025 01:55:20 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References: Message-ID: On Fri, 14 Feb 2025 17:43:48 GMT, William Kemper wrote: >> There are several changes to the operation of Shenandoah's control threads here. >> * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. 
>> * The cancellation handling is driven entirely by the cancellation cause >> * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed >> * The shutdown sequence is simpler >> * The generational control thread uses a lock to coordinate updates to the requested cause and generation >> * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance >> * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles >> * The control thread doesn't loop on its own (unless the pacer is enabled). >> >> ## Testing >> * jtreg hotspot_gc_shenandoah >> * dacapo, extremem, diluvian, specjbb2015, specjvm2018, heapothesys > > William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 30 commits: > > - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads > - Old gen bootstrap cycle must make it to init mark > - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads > - Improve message for assertion > - Make shutdown safer for threads requesting (or expecting) gc > - Do not accept requests if control thread is terminating > - Notify waiters when control thread terminates > - Add event for control thread state changes > - Fix shutdown livelock error > - Fix includes > - ... and 20 more: https://git.openjdk.org/jdk/compare/ba6c9659...915ffbda Flushing the comments at EOD; will complete review later. src/hotspot/share/gc/shenandoah/heuristics/shenandoahOldHeuristics.hpp line 188: > 186: > 187: bool should_start_gc() override; > 188: bool resume_old_cycle(); Documentation comment please, especially explaining the return value. For things that may return `false` and not do anything, it's better to use `try_` prefix. 
In fact, the method doesn't actually resume the cycle, but checks if we are in a state such that we should resume it. So, I'd name it `should_resume_old_cycle()`, consistent with the name `should_start_gc()` for the previous method. src/hotspot/share/gc/shenandoah/shenandoahCollectorPolicy.hpp line 101: > 99: || cause == GCCause::_shenandoah_allocation_failure_evac > 100: || cause == GCCause::_shenandoah_humongous_allocation_failure; > 101: } Would it make sense to move this implementation also to the .cpp file like the other static `is_...` methods below? src/hotspot/share/gc/shenandoah/shenandoahController.hpp line 42: > 40: volatile size_t _allocs_seen; > 41: shenandoah_padding(1); > 42: volatile size_t _gc_id; // A monotonically increasing GC count. src/hotspot/share/gc/shenandoah/shenandoahController.hpp line 66: > 64: > 65: // This cancels the collection cycle and has an option to block > 66: // until another cycle runs and clears the alloc failure gc flag. But "the alloc failure gc flag" is gone above. The comment should be updated as well. A public API's description should avoid talking about its internal implementation details here. It's OK to talk about implementation details in the implementation of the method, not in the header spec here. src/hotspot/share/gc/shenandoah/shenandoahController.hpp line 87: > 85: // Returns the internal gc count used by the control thread. Probably > 86: // doesn't need to be exposed. > 87: size_t get_gc_id(); As far as I can tell, there's a single non-internal (public) use of this, and it's from `ShenandoahOldGeneration::handle_failed_promotion()` where it's being used for reducing logging data. If we do need to expose this through a public API, I'd elide the "Probably doesn't need to be exposed", and update the comment to: // Return the value of a monotonically increasing GC count, maintained by the control thread.
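The contract suggested in this comment (a count that only the control thread advances and that any thread may read) can be sketched with standard C++ atomics. This is illustrative only; the class and method names are hypothetical stand-ins, not Shenandoah code:

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>

// Hypothetical sketch of the suggested get_gc_id() contract: the counter is
// monotonically increasing, advanced only by the control thread, and safely
// readable from any thread. Not HotSpot code.
class ControlThreadSketch {
public:
    // Control thread only: called at the start of each GC cycle.
    void on_cycle_start() { _gc_id.fetch_add(1, std::memory_order_relaxed); }

    // Any thread: returns a value that never decreases over time.
    std::size_t get_gc_id() const { return _gc_id.load(std::memory_order_relaxed); }

private:
    std::atomic<std::size_t> _gc_id{0};
};
```

Relaxed ordering suffices in this sketch because readers (like the logging-reduction use mentioned above) only need a monotone value, not synchronization with the cycle's side effects.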
src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.cpp line 64: > 62: void ShenandoahGenerationalControlThread::run_service() { > 63: > 64: const int64_t wait_ms = ShenandoahPacing ? ShenandoahControlIntervalMin : 0; So we are supporting ShenandoahPacing with GenShen (at least till we pull it in the future), but don't want to enable it by default, correct? src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.hpp line 64: > 62: private: > 63: // This lock is used to coordinate setting the _requested_gc_cause, _requested generation > 64: // and _gc_mode. It is important that these be changed together and have a consistent view. In that case, for ease of maintenance, I'd move the declaration of all of the 3 data members that this lock protects next to this lock, either immediately preceding or immediately succeeding its declaration in the body of this class. Are these data members always both read and written under this lock? If so, then `_gc_mode` below doesn't need to be defined `volatile`. src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.hpp line 88: > 86: uint _age_period; > 87: > 88: // The mode is read frequently by requesting threads and only ever written by the control thread. Do requesting threads lock the mutex when reading? I am trying to square your comment that it's protected by the mutex with the field being declared `volatile`. src/hotspot/share/gc/shenandoah/shenandoahHeap.hpp line 450: > 448: > 449: void cancel_concurrent_mark(); > 450: bool cancel_gc(GCCause::Cause cause); // Returns true if and only if cancellation request was successfully communicated.
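A "first cancellation wins" protocol matching the comment quoted above can be sketched as follows. This is an illustrative stand-in, not the Shenandoah implementation; the enum values and class name are hypothetical:

```cpp
#include <atomic>
#include <cassert>

// Hypothetical sketch of cancel_gc() semantics: the cancellation cause is
// installed with a compare-and-swap, so the call returns true if and only if
// this caller's request is the one actually communicated. Not HotSpot code.
enum class Cause { None, AllocFailure, HumongousAllocFailure, Shutdown };

class CancellationState {
public:
    // Returns true if and only if this call installed `cause`; if a
    // cancellation is already pending, the new request is not communicated.
    bool cancel_gc(Cause cause) {
        Cause expected = Cause::None;
        return _cancelled.compare_exchange_strong(expected, cause);
    }

    bool is_cancelled() const { return _cancelled.load() != Cause::None; }
    Cause cause() const { return _cancelled.load(); }

    // Reset by the cycle that serviced the request.
    void clear() { _cancelled.store(Cause::None); }

private:
    std::atomic<Cause> _cancelled{Cause::None};
};
```

Recording the cause atomically this way also gives every thread a consistent answer to "why was this cycle cancelled" without extra flags, which is the simplification the PR description claims.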
------------- PR Review: https://git.openjdk.org/jdk/pull/23475#pullrequestreview-2618968208 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956962731 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956965579 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956944585 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956918529 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956929734 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956981955 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956816268 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956824150 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956952381 From ysr at openjdk.org Sat Feb 15 01:55:21 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Sat, 15 Feb 2025 01:55:21 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References: Message-ID: On Sat, 15 Feb 2025 01:10:51 GMT, Y. Srinivas Ramakrishna wrote: >> William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 30 commits: >> >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Old gen bootstrap cycle must make it to init mark >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Improve message for assertion >> - Make shutdown safer for threads requesting (or expecting) gc >> - Do not accept requests if control thread is terminating >> - Notify waiters when control thread terminates >> - Add event for control thread state changes >> - Fix shutdown livelock error >> - Fix includes >> - ... 
and 20 more: https://git.openjdk.org/jdk/compare/ba6c9659...915ffbda > > src/hotspot/share/gc/shenandoah/shenandoahCollectorPolicy.hpp line 101: > >> 99: || cause == GCCause::_shenandoah_allocation_failure_evac >> 100: || cause == GCCause::_shenandoah_humongous_allocation_failure; >> 101: } > > Would it make sense to move this implementation also to the .cpp file like the other static `is_...` methods below? Or is this guaranteeing inlining into the caller's body, which you might prefer for the callers? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956968182 From ysr at openjdk.org Sat Feb 15 01:55:22 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Sat, 15 Feb 2025 01:55:22 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References: Message-ID: On Fri, 14 Feb 2025 19:28:01 GMT, Kelvin Nilsen wrote: >> William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 30 commits: >> >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Old gen bootstrap cycle must make it to init mark >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Improve message for assertion >> - Make shutdown safer for threads requesting (or expecting) gc >> - Do not accept requests if control thread is terminating >> - Notify waiters when control thread terminates >> - Add event for control thread state changes >> - Fix shutdown livelock error >> - Fix includes >> - ... and 20 more: https://git.openjdk.org/jdk/compare/ba6c9659...915ffbda > > src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.cpp line 98: > >> 96: } >> 97: >> 98: // In case any threads are waiting for a cycle to happen, let them know it isn't. > > maybe "it isn't happening", or "it won't happen". This is interesting. 
If GC is stopping prior to shutting down the VM, is there any point in notifying these waiting threads. Why not let them wait, and quietly shut things down? Are there JCK or other tests that would fail in that case? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956979069 From ayang at openjdk.org Sat Feb 15 11:44:44 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Sat, 15 Feb 2025 11:44:44 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v3] In-Reply-To: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: > Here is an attempt to simplify GCLocker implementation for Serial and Parallel. > > GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. > > The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). 
Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. > > Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. > > Test: tier1-8 Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains six additional commits since the last revision: - Merge branch 'master' into gclocker - review - Merge branch 'master' into gclocker - review - Merge branch 'master' into gclocker - gclocker ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23367/files - new: https://git.openjdk.org/jdk/pull/23367/files/1b6f908b..005087e3 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23367&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23367&range=01-02 Stats: 18668 lines in 693 files changed: 10993 ins; 4307 del; 3368 mod Patch: https://git.openjdk.org/jdk/pull/23367.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23367/head:pull/23367 PR: https://git.openjdk.org/jdk/pull/23367 From ayang at openjdk.org Sat Feb 15 11:49:13 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Sat, 15 Feb 2025 11:49:13 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: <8JUfZWRWpAhYCG9qO7Jxfj5k6d1iUNpRdawRn-veiBQ=.4b70e450-14e5-429a-aa95-08599673afba@github.com> References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> <8JUfZWRWpAhYCG9qO7Jxfj5k6d1iUNpRdawRn-veiBQ=.4b70e450-14e5-429a-aa95-08599673afba@github.com> Message-ID: On Fri, 14 Feb 2025 23:44:25 GMT, Dean Long wrote: >> Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a 
rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: >> >> - Merge branch 'master' into gclocker >> - review >> - Merge branch 'master' into gclocker >> - gclocker > > src/hotspot/share/gc/parallel/parallelScavengeHeap.cpp line 385: > >> 383: >> 384: HeapWord* ParallelScavengeHeap::mem_allocate_old_gen(size_t size) { >> 385: if (!should_alloc_in_eden(size) || GCLocker::is_active()) { > > I don't understand why we are checking is_active() here. The value is not reliable if we aren't at a safepoint, and iterating over all threads seems expensive. The intention is to avoid blocking Java threads if possible, but there is no fundamental reason why it has to be this way. I have removed it for simpler (or less magical) code. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23367#discussion_r1957098815 From ayang at openjdk.org Sat Feb 15 11:52:14 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Sat, 15 Feb 2025 11:52:14 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Fri, 7 Feb 2025 06:43:25 GMT, David Holmes wrote: > But in any case adding the atomic load to in_critical() is basically a no-op (loads are atomic) so no need to add a new API just to do that. I have removed the new API, and switched to use the original `in_critical()`. > I think that to get the correct "dekker duality" in this code you do need to have full fences between the stores and loads, not just a storeload barrier. I have changed to `fence` for simpler reasoning. (In our codebase, the two have the same implementation, so perf should be the same.)
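For readers following the fence discussion above, the Dekker-style protocol can be sketched with standard C++ atomics. This is a simplified illustration, not the proposed HotSpot code: a per-mutator flag and seq_cst accesses stand in for the thread-local state and full fences discussed in the PR.

```cpp
#include <atomic>
#include <cassert>
#include <thread>
#include <vector>

// Simplified sketch of the store-fence-load ("Dekker duality") protocol
// discussed above; not the actual HotSpot implementation. Each side stores
// its own flag and then, with full-fence semantics, loads the other side's
// flag. At most one side can miss the other's store: either the mutator
// sees the pending safepoint and backs off, or the GC side sees the mutator
// inside the critical region and waits.
struct Mutator {
    std::atomic<bool> in_critical{false};
};

std::atomic<bool> safepoint_pending{false};

// Mutator side: returns true if the critical region was entered, or false
// if a pending safepoint was observed and the caller must take a slow path.
bool try_enter_critical(Mutator& m) {
    m.in_critical.store(true, std::memory_order_seq_cst);     // store, then fence ...
    if (safepoint_pending.load(std::memory_order_seq_cst)) {  // ... then load
        m.in_critical.store(false, std::memory_order_seq_cst); // back off
        return false;
    }
    return true;
}

void exit_critical(Mutator& m) {
    m.in_critical.store(false, std::memory_order_seq_cst);
}

// GC side: announce the safepoint, then spin until every mutator has left
// its critical region. The atomic load keeps the compiler from hoisting the
// check out of the loop (the concern raised about in_critical() above).
void wait_for_critical_regions(std::vector<Mutator>& mutators) {
    safepoint_pending.store(true, std::memory_order_seq_cst);
    for (Mutator& m : mutators) {
        while (m.in_critical.load(std::memory_order_seq_cst)) {
            std::this_thread::yield();
        }
    }
}
```

With plain, unfenced accesses either side could observe stale values and both could proceed at once; the full fence between each store and the following load is what rules that out.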
------------- PR Comment: https://git.openjdk.org/jdk/pull/23367#issuecomment-2660886740 From jarek.odzga at gmail.com Mon Feb 17 20:51:43 2025 From: jarek.odzga at gmail.com (Jaroslaw Odzga) Date: Mon, 17 Feb 2025 12:51:43 -0800 Subject: G1 AHS [Was: Re: Configurable G1 heap expansion aggressiveness] In-Reply-To: References: <553f4d95-14a4-4736-b10d-02b8bb3af686@oracle.com> <60612080-69DD-46DC-AA5B-ED078C3A3793@kodewerk.com> Message-ID: Thank you Thomas and Kirk for your time discussing this and sharing the resources! I also noticed that Kirk dropped this JEP over the weekend ;-) https://openjdk.org/jeps/8350152 The work that is currently ongoing is very exciting. It sounds like the right decision to focus on a zero-configuration solution that just works; exposing existing tunables as flags would certainly not help with that. Best regards, Jaroslaw On Fri, Feb 14, 2025 at 2:07 AM Thomas Schatzl wrote: > > Sorry for effectively sending this email twice, due to some email client > error I thought it had not been sent (and not even been saved anywhere, > so the rewrite with minor differences). > > Apologies, > Thomas > From dholmes at openjdk.org Tue Feb 18 01:28:20 2025 From: dholmes at openjdk.org (David Holmes) Date: Tue, 18 Feb 2025 01:28:20 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Sat, 15 Feb 2025 11:49:53 GMT, Albert Mingkun Yang wrote: > I have removed the new API, and switched to use the original in_critical().
You still need it to be an atomic load together with whatever compiler barriers that implies, otherwise it can be hoisted out of the spin-loop: while (cur->in_critical()) { spin_yield.wait(); } ------------- PR Comment: https://git.openjdk.org/jdk/pull/23367#issuecomment-2664351707 From dholmes at openjdk.org Tue Feb 18 01:40:22 2025 From: dholmes at openjdk.org (David Holmes) Date: Tue, 18 Feb 2025 01:40:22 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v3] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Sat, 15 Feb 2025 11:44:44 GMT, Albert Mingkun Yang wrote: >> Here is an attempt to simplify GCLocker implementation for Serial and Parallel. >> >> GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. >> >> The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). 
Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. >> >> Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. >> >> Test: tier1-8 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains six additional commits since the last revision: > > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - gclocker The GCLocker behaviour would be easier to discern if all of the `thread` parameters/variables that have to be the current thread were actually called `current` (with a few suitably placed assertions). ------------- PR Review: https://git.openjdk.org/jdk/pull/23367#pullrequestreview-2622203973 From aboldtch at openjdk.org Tue Feb 18 06:26:12 2025 From: aboldtch at openjdk.org (Axel Boldt-Christmas) Date: Tue, 18 Feb 2025 06:26:12 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v6] In-Reply-To: <2jrzusvVl-XI8K734YlChq4ObRX75yovTq7mWTf8ZlA=.0e75781a-5d52-4919-ad28-c5e91ec3a47f@github.com> References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> <2jrzusvVl-XI8K734YlChq4ObRX75yovTq7mWTf8ZlA=.0e75781a-5d52-4919-ad28-c5e91ec3a47f@github.com> Message-ID: On Fri, 7 Feb 2025 14:48:51 GMT, Roberto Castañeda Lozano wrote: >> G1 barriers can be safely elided from writes to newly allocated objects as long as no safepoint is taken between the allocation and the write.
This changeset complements early G1 barrier elision (performed by the platform-independent phases of C2, and limited to writes immediately following allocations) with a more general elision pass done at a late stage. >> >> The late elision pass exploits that it runs at a stage where the relative order of memory accesses and safepoints cannot change anymore to elide barriers from initialization writes that do not immediately follow the corresponding allocation, e.g. in conditional initialization writes: >> >> >> o = new MyObject(); >> if (...) { >> o.myField = ...; // barrier elided only after this changeset >> // (assuming no safepoint in the if condition) >> } >> >> >> or in initialization writes placed after exception-throwing checks: >> >> >> o = new MyObject(); >> if (...) { >> throw new Exception(""); >> } >> o.myField = ...; // barrier elided only after this changeset >> // (assuming no safepoint in the above if condition) >> >> >> These patterns are commonly found in Java code, e.g. in the core libraries: >> >> - [conditional initialization](https://github.com/openjdk/jdk/blob/25fecaaf87400af535c242fe50296f1f89ceeb16/src/java.base/share/classes/java/lang/String.java#L4850), or >> >> - [initialization after exception-throwing checks (in the superclass constructor)](https://github.com/openjdk/jdk/blob/master/src/java.base/share/classes/java/nio/X-Buffer.java.template#L324). 
>> >> The optimization also enhances barrier elision for array initialization writes, for example eliding barriers from small array initialization loops (for which safepoints are not inserted): >> >> >> Object[] a = new Object[...]; >> for (int i = 0; i < a.length; i++) { >> a[i] = ...; // barrier elided only after this changeset >> } >> >> >> or eliding barriers from array initialization writes with unknown array index: >> >> >> Object[] a = new Object[...]; >> a[index] = ...; // barrier elided only after this changeset >> >> >> The logic used to perform this additional barrier elision is a subset of a pre-existing ZGC-specific optimization. This changeset simply reuses the relevant subset (barrier elision for writes to newly-allocated objects) by extracting the core of the optimization logic from `zBarrierSetC2.cpp` into the GC-shared file `barrierSetC2.cpp`. The functions `block_has_safepoint`, `block_inde... > > Roberto Castañeda Lozano has updated the pull request incrementally with one additional commit since the last revision: > > Disable test IR checks for cases where barrier elision analysis fails to elide on s390 The ZGC refactoring lgtm. ------------- Marked as reviewed by aboldtch (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23235#pullrequestreview-2622554105 From rcastanedalo at openjdk.org Tue Feb 18 08:29:12 2025 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Tue, 18 Feb 2025 08:29:12 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v6] In-Reply-To: References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> <2jrzusvVl-XI8K734YlChq4ObRX75yovTq7mWTf8ZlA=.0e75781a-5d52-4919-ad28-c5e91ec3a47f@github.com> Message-ID: On Tue, 18 Feb 2025 06:23:20 GMT, Axel Boldt-Christmas wrote: > The ZGC refactoring lgtm. Thanks, Axel!
------------- PR Comment: https://git.openjdk.org/jdk/pull/23235#issuecomment-2664921534 From iwalulya at openjdk.org Tue Feb 18 09:12:50 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Tue, 18 Feb 2025 09:12:50 GMT Subject: RFR: 8349688: Crash assert(!_g1h->heap_region_containing(p)->is_survivor()) failed: Should have filtered out from-newly allocated survivor references already [v2] In-Reply-To: <8-jj1o3jeNZuavFIbg4VCh_oMXEJxP8vxqnz01cOVgg=.d7ea8a07-d5e6-46cc-9bdf-0f2e5c22632a@github.com> References: <8-jj1o3jeNZuavFIbg4VCh_oMXEJxP8vxqnz01cOVgg=.d7ea8a07-d5e6-46cc-9bdf-0f2e5c22632a@github.com> Message-ID: > Hi, > > Please review this fix to the bug in setting a region index to the optional cset. The crash happens because the incorrect `_index_in_opt_cset` refers to a region that has already been evacuated. > > Testing: tier5-common-apps Ivan Walulya has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. 
The pull request contains two additional commits since the last revision: - Merge remote-tracking branch 'upstream/master' into JDK-8349688-OptionalRegions - set_index_in_opt_cset correctly ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23568/files - new: https://git.openjdk.org/jdk/pull/23568/files/e66a43e1..7c5147a2 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23568&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23568&range=00-01 Stats: 11779 lines in 500 files changed: 8619 ins; 1331 del; 1829 mod Patch: https://git.openjdk.org/jdk/pull/23568.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23568/head:pull/23568 PR: https://git.openjdk.org/jdk/pull/23568 From ayang at openjdk.org Tue Feb 18 09:20:57 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 18 Feb 2025 09:20:57 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v4] In-Reply-To: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: > Here is an attempt to simplify GCLocker implementation for Serial and Parallel. > > GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. > > The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. 
This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. > > Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. > > Test: tier1-8 Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains eight additional commits since the last revision: - Merge branch 'master' into gclocker - review - Merge branch 'master' into gclocker - review - Merge branch 'master' into gclocker - review - Merge branch 'master' into gclocker - gclocker ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23367/files - new: https://git.openjdk.org/jdk/pull/23367/files/005087e3..78f91d4f Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23367&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23367&range=02-03 Stats: 1461 lines in 94 files changed: 848 ins; 266 del; 347 mod Patch: https://git.openjdk.org/jdk/pull/23367.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23367/head:pull/23367 PR: https://git.openjdk.org/jdk/pull/23367 From ayang at openjdk.org Tue Feb 18 09:24:15 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 18 Feb 2025 09:24:15 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Tue, 18 
Feb 2025 01:25:12 GMT, David Holmes wrote: > You still need it to be an atomic load Then, maybe the logic is easier to read if the "atomic" access is visible directly from that context, instead of hiding it inside `in_critical`. Therefore, it probably makes more sense to introduce a new API. WDYT? > The GCLocker behaviour would be easier to discern ... Renamed to `current_thread` in `enter`, `exit`, and `enter_slow`. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23367#issuecomment-2665044825 From tschatzl at openjdk.org Tue Feb 18 10:18:20 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 18 Feb 2025 10:18:20 GMT Subject: RFR: 8346280: C2: implement late barrier elision for G1 [v6] In-Reply-To: <2jrzusvVl-XI8K734YlChq4ObRX75yovTq7mWTf8ZlA=.0e75781a-5d52-4919-ad28-c5e91ec3a47f@github.com> References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> <2jrzusvVl-XI8K734YlChq4ObRX75yovTq7mWTf8ZlA=.0e75781a-5d52-4919-ad28-c5e91ec3a47f@github.com> Message-ID: On Fri, 7 Feb 2025 14:48:51 GMT, Roberto Castañeda Lozano wrote: >> G1 barriers can be safely elided from writes to newly allocated objects as long as no safepoint is taken between the allocation and the write. This changeset complements early G1 barrier elision (performed by the platform-independent phases of C2, and limited to writes immediately following allocations) with a more general elision pass done at a late stage. >> >> The late elision pass exploits that it runs at a stage where the relative order of memory accesses and safepoints cannot change anymore to elide barriers from initialization writes that do not immediately follow the corresponding allocation, e.g. in conditional initialization writes: >> >> >> o = new MyObject(); >> if (...)
{ >> o.myField = ...; // barrier elided only after this changeset >> // (assuming no safepoint in the if condition) >> } >> >> >> or in initialization writes placed after exception-throwing checks: >> >> >> o = new MyObject(); >> if (...) { >> throw new Exception(""); >> } >> o.myField = ...; // barrier elided only after this changeset >> // (assuming no safepoint in the above if condition) >> >> >> These patterns are commonly found in Java code, e.g. in the core libraries: >> >> - [conditional initialization](https://github.com/openjdk/jdk/blob/25fecaaf87400af535c242fe50296f1f89ceeb16/src/java.base/share/classes/java/lang/String.java#L4850), or >> >> - [initialization after exception-throwing checks (in the superclass constructor)](https://github.com/openjdk/jdk/blob/master/src/java.base/share/classes/java/nio/X-Buffer.java.template#L324). >> >> The optimization also enhances barrier elision for array initialization writes, for example eliding barriers from small array initialization loops (for which safepoints are not inserted): >> >> >> Object[] a = new Object[...]; >> for (int i = 0; i < a.length; i++) { >> a[i] = ...; // barrier elided only after this changeset >> } >> >> >> or eliding barriers from array initialization writes with unknown array index: >> >> >> Object[] a = new Object[...]; >> a[index] = ...; // barrier elided only after this changeset >> >> >> The logic used to perform this additional barrier elision is a subset of a pre-existing ZGC-specific optimization. This changeset simply reuses the relevant subset (barrier elision for writes to newly-allocated objects) by extracting the core of the optimization logic from `zBarrierSetC2.cpp` into the GC-shared file `barrierSetC2.cpp`. The functions `block_has_safepoint`, `block_inde... 
> > Roberto Casta?eda Lozano has updated the pull request incrementally with one additional commit since the last revision: > > Disable test IR checks for cases where barrier elision analysis fails to elide on s390 Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23235#pullrequestreview-2623060636 From rcastanedalo at openjdk.org Tue Feb 18 10:26:20 2025 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Tue, 18 Feb 2025 10:26:20 GMT Subject: Integrated: 8346280: C2: implement late barrier elision for G1 In-Reply-To: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> References: <3eOK-nFYQbKn1w81CWHUY14wk0gyWMT5ULHgZ-ih5-w=.8be51ad0-f412-4aad-b73a-436ccdb8181a@github.com> Message-ID: On Wed, 22 Jan 2025 15:20:19 GMT, Roberto Casta?eda Lozano wrote: > G1 barriers can be safely elided from writes to newly allocated objects as long as no safepoint is taken between the allocation and the write. This changeset complements early G1 barrier elision (performed by the platform-independent phases of C2, and limited to writes immediately following allocations) with a more general elision pass done at a late stage. > > The late elision pass exploits that it runs at a stage where the relative order of memory accesses and safepoints cannot change anymore to elide barriers from initialization writes that do not immediately follow the corresponding allocation, e.g. in conditional initialization writes: > > > o = new MyObject(); > if (...) { > o.myField = ...; // barrier elided only after this changeset > // (assuming no safepoint in the if condition) > } > > > or in initialization writes placed after exception-throwing checks: > > > o = new MyObject(); > if (...) 
{ > throw new Exception(""); > } > o.myField = ...; // barrier elided only after this changeset > // (assuming no safepoint in the above if condition) > > > These patterns are commonly found in Java code, e.g. in the core libraries: > > - [conditional initialization](https://github.com/openjdk/jdk/blob/25fecaaf87400af535c242fe50296f1f89ceeb16/src/java.base/share/classes/java/lang/String.java#L4850), or > > - [initialization after exception-throwing checks (in the superclass constructor)](https://github.com/openjdk/jdk/blob/master/src/java.base/share/classes/java/nio/X-Buffer.java.template#L324). > > The optimization also enhances barrier elision for array initialization writes, for example eliding barriers from small array initialization loops (for which safepoints are not inserted): > > > Object[] a = new Object[...]; > for (int i = 0; i < a.length; i++) { > a[i] = ...; // barrier elided only after this changeset > } > > > or eliding barriers from array initialization writes with unknown array index: > > > Object[] a = new Object[...]; > a[index] = ...; // barrier elided only after this changeset > > > The logic used to perform this additional barrier elision is a subset of a pre-existing ZGC-specific optimization. This changeset simply reuses the relevant subset (barrier elision for writes to newly-allocated objects) by extracting the core of the optimization logic from `zBarrierSetC2.cpp` into the GC-shared file `barrierSetC2.cpp`. The functions `block_has_safepoint`, `block_index`, `look_through_node`, `is_{undefined|unknown|concrete}`, `get_base_and_offset`, `is_array... This pull request has now been integrated. 
Changeset: 8193e0d5 Author: Roberto Castañeda Lozano URL: https://git.openjdk.org/jdk/commit/8193e0d53ac806d6974e2aacc7b7476aeb52a5fd Stats: 957 lines in 9 files changed: 669 ins; 264 del; 24 mod 8346280: C2: implement late barrier elision for G1 Reviewed-by: tschatzl, aboldtch, mdoerr ------------- PR: https://git.openjdk.org/jdk/pull/23235 From tschatzl at openjdk.org Tue Feb 18 10:51:11 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 18 Feb 2025 10:51:11 GMT Subject: RFR: 8349688: Crash assert(!_g1h->heap_region_containing(p)->is_survivor()) failed: Should have filtered out from-newly allocated survivor references already [v2] In-Reply-To: References: <8-jj1o3jeNZuavFIbg4VCh_oMXEJxP8vxqnz01cOVgg=.d7ea8a07-d5e6-46cc-9bdf-0f2e5c22632a@github.com> Message-ID: On Tue, 18 Feb 2025 09:12:50 GMT, Ivan Walulya wrote: >> Hi, >> >> Please review this fix to the bug in setting a region index to the optional cset. The crash happens because the incorrect `_index_in_opt_cset` refers to a region that has already been evacuated. >> >> Testing: tier5-common-apps > > Ivan Walulya has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: > > - Merge remote-tracking branch 'upstream/master' into JDK-8349688-OptionalRegions > - set_index_in_opt_cset correctly Seems good, thanks for catching this. However, it would be nice to rename the PR and CR to something understandable, like "G1: Wrong initial optional region index when selecting candidates from retained regions" or so.
src/hotspot/share/gc/g1/g1CollectionSet.cpp line 486: > 484: void G1CollectionSet::select_candidates_from_retained(double time_remaining_ms) { > 485: uint num_initial_regions = 0; > 486: uint prev_num_optional_regions = _optional_groups.num_regions(); It would be great if the code in `G1CollectionSet::select_candidates_from_marking` would check that the number of optional regions is zero as well when initializing its `num_optional_regions`. ------------- PR Review: https://git.openjdk.org/jdk/pull/23568#pullrequestreview-2623141503 PR Review Comment: https://git.openjdk.org/jdk/pull/23568#discussion_r1959497709 From iwalulya at openjdk.org Tue Feb 18 19:13:57 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Tue, 18 Feb 2025 19:13:57 GMT Subject: RFR: 8349688: G1: Wrong initial optional region index when selecting candidates from retained regions [v3] In-Reply-To: <8-jj1o3jeNZuavFIbg4VCh_oMXEJxP8vxqnz01cOVgg=.d7ea8a07-d5e6-46cc-9bdf-0f2e5c22632a@github.com> References: <8-jj1o3jeNZuavFIbg4VCh_oMXEJxP8vxqnz01cOVgg=.d7ea8a07-d5e6-46cc-9bdf-0f2e5c22632a@github.com> Message-ID: > Hi, > > Please review this fix to the bug in setting a region index to the optional cset. The crash happens because the incorrect `_index_in_opt_cset` refers to a region that has already been evacuated. 
> > Testing: tier5-common-apps Ivan Walulya has updated the pull request incrementally with one additional commit since the last revision: Thomas suggestion ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23568/files - new: https://git.openjdk.org/jdk/pull/23568/files/7c5147a2..7fdf01fc Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23568&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23568&range=01-02 Stats: 2 lines in 1 file changed: 2 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23568.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23568/head:pull/23568 PR: https://git.openjdk.org/jdk/pull/23568 From iwalulya at openjdk.org Tue Feb 18 19:13:58 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Tue, 18 Feb 2025 19:13:58 GMT Subject: RFR: 8349688: G1: Wrong initial optional region index when selecting candidates from retained regions [v2] In-Reply-To: References: <8-jj1o3jeNZuavFIbg4VCh_oMXEJxP8vxqnz01cOVgg=.d7ea8a07-d5e6-46cc-9bdf-0f2e5c22632a@github.com> Message-ID: On Tue, 18 Feb 2025 10:47:05 GMT, Thomas Schatzl wrote: >> Ivan Walulya has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: >> >> - Merge remote-tracking branch 'upstream/master' into JDK-8349688-OptionalRegions >> - set_index_in_opt_cset correctly > > src/hotspot/share/gc/g1/g1CollectionSet.cpp line 486: > >> 484: void G1CollectionSet::select_candidates_from_retained(double time_remaining_ms) { >> 485: uint num_initial_regions = 0; >> 486: uint prev_num_optional_regions = _optional_groups.num_regions(); > > It would be great if the code in `G1CollectionSet::select_candidates_from_marking` would check that the number of optional regions is zero as well when initializing its `num_optional_regions`. Added, running it through testing. 
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23568#discussion_r1960061849 From kdnilsen at openjdk.org Tue Feb 18 19:28:28 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 18 Feb 2025 19:28:28 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc [v5] In-Reply-To: References: Message-ID: > In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains seven commits: - Merge branch 'master' of https://git.openjdk.org/jdk into defer-generational-full-gc - Merge master - Fix typo in merge conflict resolution - 8348595: GenShen: Fix generational free-memory no-progress check Reviewed-by: phh, xpeng - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) Reviewed-by: shade - Merge tag 'jdk-25+10' into defer-generational-full-gc Added tag jdk-25+10 for changeset a637ccf2 - Be less eager to upgrade degen to full gc ------------- Changes: https://git.openjdk.org/jdk/pull/23552/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23552&range=04 Stats: 20 lines in 2 files changed: 17 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/23552.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23552/head:pull/23552 PR: https://git.openjdk.org/jdk/pull/23552 From kdnilsen at openjdk.org Tue Feb 18 19:49:54 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 18 Feb 2025 19:49:54 GMT Subject: RFR: 8347804: GenShen: Crash with small GCCardSizeInBytes and small Java heap 
In-Reply-To: References: Message-ID: On Thu, 30 Jan 2025 18:55:53 GMT, Kelvin Nilsen wrote: > Original implementation was not robust to overriding of CardSizeInBytes, especially to smaller values. This fixes that issue. Internal pipelines reveal a regression. Changing to draft mode while I chase this down. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23373#issuecomment-2625332964 From kdnilsen at openjdk.org Tue Feb 18 19:49:54 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 18 Feb 2025 19:49:54 GMT Subject: RFR: 8347804: GenShen: Crash with small GCCardSizeInBytes and small Java heap Message-ID: Original implementation was not robust to overriding of CardSizeInBytes, especially to smaller values. This fixes that issue. ------------- Commit messages: - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) - Merge tag 'jdk-25+10' into fix-small-card-size - Remove SIZE_FORMAT usage - Merge tag 'jdk-25+9' into fix-small-card-size - Remove debug instrumentation - Fix several bookkeeping errors - Revert "Remove debug instrumentation" - Remove debug instrumentation - Use snprintf instead of sprintf - Add a jtreg test for small card size - ... 
and 1 more: https://git.openjdk.org/jdk/compare/a637ccf2...7120cdf3 Changes: https://git.openjdk.org/jdk/pull/23373/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23373&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8347804 Stats: 107 lines in 6 files changed: 79 ins; 4 del; 24 mod Patch: https://git.openjdk.org/jdk/pull/23373.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23373/head:pull/23373 PR: https://git.openjdk.org/jdk/pull/23373 From duke at openjdk.org Wed Feb 19 06:21:33 2025 From: duke at openjdk.org (sli-x) Date: Wed, 19 Feb 2025 06:21:33 GMT Subject: RFR: 8340434: Excessive Young GCs Triggered by CodeCache GC Threshold Message-ID: The trigger of _codecache_GC_threshold in CodeCache::gc_on_allocation is the key to this problem.

if (used_ratio > threshold) {
  // After threshold is reached, scale it by free_ratio so that more aggressive
  // GC is triggered as we approach code cache exhaustion
  threshold *= free_ratio;
}
// If code cache has been allocated without any GC at all, let's make sure
// it is eventually invoked to avoid trouble.
if (allocated_since_last_ratio > threshold) {
  // In case the GC is concurrent, we make sure only one thread requests the GC.
  if (Atomic::cmpxchg(&_unloading_threshold_gc_requested, false, true) == false) {
    log_info(codecache)("Triggering threshold (%.3f%%) GC due to allocating %.3f%% since last unloading (%.3f%% used -> %.3f%% used)",
                        threshold * 100.0, allocated_since_last_ratio * 100.0, last_used_ratio * 100.0, used_ratio * 100.0);
    Universe::heap()->collect(GCCause::_codecache_GC_threshold);
  }
}

Here, with a limited codecache size, the free_ratio will get lower and lower (and so will the threshold) if no methods can be swept, which leads to more and more frequent collections. Since each collection happens in a stop-the-world pause, the overall performance of the GC is also degraded.

So a simple solution is to delete the scaling logic here. However, I think there lie some problems worth further exploring.
There're two options to control the code cache sweeper, StartAggressiveSweepingAt and SweeperThreshold. StartAggressiveSweepingAt is a sweep triggered when little space is left in the codeCache and does little harm. However, SweeperThreshold, first introduced by [JDK-8244660](https://bugs.openjdk.org/browse/JDK-8244660), was designed for a regular sweep of the codecache, back when the codeCache sweeper and heap collection were still independent. After [JDK-8290025](https://bugs.openjdk.org/browse/JDK-8290025) and some related patches, the old codeCache sweeper mechanism was merged into concurrent heap collection. So the code cache sweeper heuristics and the unloading behavior are now guaranteed by the concurrent collection. There are no longer any "zombie" methods to be counted. Considering it will introduce lots of useless collection jobs, I think SweeperThreshold should be deleted now. ------------- Commit messages: - remove SweeperThreshold and set it to Obselete Changes: https://git.openjdk.org/jdk/pull/21084/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=21084&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8340434 Stats: 55 lines in 14 files changed: 1 ins; 53 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/21084.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21084/head:pull/21084 PR: https://git.openjdk.org/jdk/pull/21084 From tschatzl at openjdk.org Wed Feb 19 06:21:33 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 19 Feb 2025 06:21:33 GMT Subject: RFR: 8340434: Excessive Young GCs Triggered by CodeCache GC Threshold In-Reply-To: References: Message-ID: On Thu, 19 Sep 2024 08:43:50 GMT, sli-x wrote: > Here with the limited codecache size, the free_ratio will get lower and lower (so as the threshold) if no methods can be swept and thus leads to a more and more frequent collection behavior. Since the collection happens in stw, the whole performance of gc will also be degraded. > >So a simple solution is to delete the scaling logic here.
However, I think here lies some problems worth further exploring. > >There're two options to control a code cache sweeper, StartAggressiveSweepingAt and SweeperThreshold. StartAggressiveSweepingAt is a sweeper triggered for little space in codeCache and does little harm. However, SweeperThreshold, first introduced by [JDK-8244660](https://bugs.openjdk.org/browse/JDK-8244660), was designed for a regular sweep for codecache, when codeCache sweeper and heap collection are actually individual. After [JDK-8290025](https://bugs.openjdk.org/browse/JDK-8290025) and some patches related, the old mechanism of codeCache sweeper is merged into a concurrent heap collection. So the Code cache sweeper heuristics and the unloading behavior will be promised by the concurrent collection. There's no longer any "zombie" methods to be counted. Considering it will introduce lots of useless collection jobs, I think SweeperThreshold should be deleted now.

I think the general concern pointed out by the code

> // After threshold is reached, scale it by free_ratio so that more aggressive
> // GC is triggered as we approach code cache exhaustion

is still valid. How this is implemented also makes some sense: occupancy changes are the trigger for collections, and the emptier the code cache is, the larger the change allowed before trying to clean it out. It tries to limit code cache memory usage by doing increasingly frequent collections the more occupied the code cache becomes, i.e. some kind of backpressure on code cache usage. Your use case of limiting the code cache size (and setting initial == max) seems to be a relatively unusual one to me, and apparently does not fit that model, as it seems that you set the code cache size close to actual max usage. Removing `SweeperThreshold` would affect the regular case as well in a significant way (allocate until bumping into the `StartAggressiveSweepingAt` threshold). I do not think removing this part of the heuristic is good (or desired at all).
Maybe an alternative could be to only not do this heuristic part in your case; and even then I am not sure that waiting until hitting the `StartAggressiveSweepingAt` threshold is a good idea; it may be too late to avoid disabling the compiler at least temporarily. And even then, as long as the memory usage keeps being larger than the threshold, this will result in continuous code cache sweeps (_every time_ _any_ memory is allocated in the code cache). From the [JDK-8244660](https://bugs.openjdk.org/browse/JDK-8244660) CR: > This is because users with different sized code caches might want different thresholds. (Otherwise there would be no way to control the sweepers intensity). Which means that one could just take that suggestion literally and not only change the initial/max code cache size but also that threshold in your use case. Stepping back a little, this situation very much resembles issues with G1's `InitiatingHeapOccupancyPercent` pre [JDK-8136677](https://bugs.openjdk.org/browse/JDK-8136677) where a one-size-fits-all value also did not work, and many, many people tuned `InitiatingHeapOccupancyPercent` manually in the past. Maybe a similar mechanism at least taking the actual code cache allocation rate into account ("when will the current watermark be hit?") would be preferable to replace both options (note that since I'm not an expert in code cache, there may be other reasons to clean out the code cache than just the occupancy threshold)? Thomas ------------- PR Comment: https://git.openjdk.org/jdk/pull/21084#issuecomment-2383475220 From robilad at openjdk.org Wed Feb 19 06:21:33 2025 From: robilad at openjdk.org (Dalibor Topic) Date: Wed, 19 Feb 2025 06:21:33 GMT Subject: RFR: 8340434: Excessive Young GCs Triggered by CodeCache GC Threshold In-Reply-To: References: Message-ID: On Thu, 19 Sep 2024 08:43:50 GMT, sli-x wrote: > The trigger of _codecache_GC_threshold in CodeCache::gc_on_allocation is the key to this problem.
>
> if (used_ratio > threshold) {
>   // After threshold is reached, scale it by free_ratio so that more aggressive
>   // GC is triggered as we approach code cache exhaustion
>   threshold *= free_ratio;
> }
> // If code cache has been allocated without any GC at all, let's make sure
> // it is eventually invoked to avoid trouble.
> if (allocated_since_last_ratio > threshold) {
>   // In case the GC is concurrent, we make sure only one thread requests the GC.
>   if (Atomic::cmpxchg(&_unloading_threshold_gc_requested, false, true) == false) {
>     log_info(codecache)("Triggering threshold (%.3f%%) GC due to allocating %.3f%% since last unloading (%.3f%% used -> %.3f%% used)",
>                         threshold * 100.0, allocated_since_last_ratio * 100.0, last_used_ratio * 100.0, used_ratio * 100.0);
>     Universe::heap()->collect(GCCause::_codecache_GC_threshold);
>   }
> }
>
> Here with the limited codecache size, the free_ratio will get lower and lower (so as the threshold) if no methods can be swept and thus leads to a more and more frequent collection behavior. Since the collection happens in stw, the whole performance of gc will also be degraded.
>
> So a simple solution is to delete the scaling logic here. However, I think here lies some problems worth further exploring.
>
> There're two options to control a code cache sweeper, StartAggressiveSweepingAt and SweeperThreshold. StartAggressiveSweepingAt is a sweeper triggered for little space in codeCache and does little harm. However, SweeperThreshold, first introduced by [JDK-8244660](https://bugs.openjdk.org/browse/JDK-8244660), was designed for a regular sweep for codecache, when codeCache sweeper and heap collection are actually individual. After [JDK-8290025](https://bugs.openjdk.org/browse/JDK-8290025) and some patches related, the old mechanism of codeCache sweeper is merged into a concurrent heap collection. So the Code cache sweeper heuristics and the unloading behavior will be promised by the concurrent collection.
There's no longer any "zombie" methods to be counted. Considering it will introduce lots of useless collection jobs, I think SweeperThreshold should be deleted now. Hi, please send an e-mail to dalibor.topic at oracle.com so that I can verify your account in Skara. ------------- PR Comment: https://git.openjdk.org/jdk/pull/21084#issuecomment-2427338142 From dholmes at openjdk.org Wed Feb 19 06:31:53 2025 From: dholmes at openjdk.org (David Holmes) Date: Wed, 19 Feb 2025 06:31:53 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v2] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Tue, 18 Feb 2025 09:21:18 GMT, Albert Mingkun Yang wrote: > Then, maybe the logic is easier to read if the "atomic" access is visible directly from that context, instead of hiding it inside in_critical. Therefore, it probably makes more sense to introduce a new API. WDYT? Okay. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23367#issuecomment-2667619218 From ayang at openjdk.org Wed Feb 19 13:01:55 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 19 Feb 2025 13:01:55 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v4] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Tue, 18 Feb 2025 09:20:57 GMT, Albert Mingkun Yang wrote: >> Here is an attempt to simplify GCLocker implementation for Serial and Parallel. >> >> GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. 
However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. >> >> The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. >> >> Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. >> >> Test: tier1-8 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains eight additional commits since the last revision: > > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - gclocker All suggestions/comments are addressed. Tier1-8 pass. It's ready for another round of review. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23367#issuecomment-2668584881 From thomas.schatzl at oracle.com Wed Feb 19 13:16:38 2025 From: thomas.schatzl at oracle.com (Thomas Schatzl) Date: Wed, 19 Feb 2025 14:16:38 +0100 Subject: RFC: G1 as default collector (for real this time) Message-ID: <74d05686-9c57-4262-881d-31c269f34bc5@oracle.com> Hi all,

there have been some recent discussions around making G1 the default for all use-cases, both internally at Oracle and at the OpenJDK Committers Workshop. With this e-mail I want to bring this subject to a wider audience to gather feedback around potential problems with such a move.

As you all may know, G1 is the default collector in the Hotspot VM. However, in some situations, some say in (too) many situations, the VM selects Serial GC "by default" anyway. :) From what I understand, there are the following reasons to keep Serial GC _as default option_ in the context of "small" environments:

* throughput: G1's large write barrier supports the argument that its throughput is noticeably worse. Ongoing efforts ([1], which we plan for JDK 25) show that the difference is going to be much smaller, if it ever was that large. Further, as soon as Serial GC runs for longer, this advantage diminishes a lot due to full collections and can result in G1 actually performing better.

* (native) memory footprint: G1 has made great strides in native memory usage. In the past, particularly remembered sets were of concern, but their memory usage has been reduced significantly over the past few years. E.g. with the above change, the entire young gen remembered set is also managed on the card table, exactly like in Serial GC. [I would also like to state that I would be surprised if remembered sets, with a recent JDK and G1, are ever an issue with heaps Serial GC targets] Heap management tends to be worse with Serial GC, mostly due to its strict generational boundaries. G1's region based layout avoids wasting memory.

* latency: if this has ever been a disadvantage, Serial GC's full collections are worse compared to G1's incremental collections.

* startup: the time to start up the VM is not that different between these two collectors. Other components are much more relevant here.

* historical inertia: at the time there was a need to select a default, there was nothing but Serial and Parallel GC. JDK 9 simply replaced Parallel GC as default for "server class" machines, probably as the path of least resistance and because of shortcomings known at the time in some of the above areas. Some initial testing showed that Serial GC performs much better than G1 when constraining it to the same environment (single thread, heaps < 1.7g).

At the same time, looking at the current situation from an end user's point of view, it is very confusing to get a different garbage collector depending on the environment, based on some somewhat arguable criteria. This change would also make the expectations ("g1 is default since jdk 9") match the actual behavior.

I am looking forward to hearing your opinion about making G1 unconditionally default.

Thanks,
Thomas

[1] https://bugs.openjdk.org/browse/JDK-8340827

From tschatzl at openjdk.org Wed Feb 19 14:17:53 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 19 Feb 2025 14:17:53 GMT Subject: RFR: 8349688: G1: Wrong initial optional region index when selecting candidates from retained regions [v3] In-Reply-To: References: <8-jj1o3jeNZuavFIbg4VCh_oMXEJxP8vxqnz01cOVgg=.d7ea8a07-d5e6-46cc-9bdf-0f2e5c22632a@github.com> Message-ID: On Tue, 18 Feb 2025 19:13:57 GMT, Ivan Walulya wrote: >> Hi, >> >> Please review this fix to the bug in setting a region index to the optional cset. The crash happens because the incorrect `_index_in_opt_cset` refers to a region that has already been evacuated.
>> >> Testing: tier5-common-apps > > Ivan Walulya has updated the pull request incrementally with one additional commit since the last revision: > > Thomas suggestion lgtm ------------- Marked as reviewed by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23568#pullrequestreview-2626954045 From ayang at openjdk.org Wed Feb 19 14:18:05 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 19 Feb 2025 14:18:05 GMT Subject: RFR: 8348171: Refactor GenerationCounters and its subclasses [v5] In-Reply-To: <7otkT63ENoyKzZ29CbYpycLLwL89ARajYg36Mstz4tQ=.fd3c7dcf-5a8b-44be-9205-09e3d160d54d@github.com> References: <7otkT63ENoyKzZ29CbYpycLLwL89ARajYg36Mstz4tQ=.fd3c7dcf-5a8b-44be-9205-09e3d160d54d@github.com> Message-ID: On Tue, 11 Feb 2025 15:28:25 GMT, Albert Mingkun Yang wrote: >> Simple refactoring of removing the use of `virtual` method and use concrete subclasses when needed. >> >> Test: tier1-5 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains seven commits: > > - Merge branch 'master' into gen-counter > - review > - * some more refactoring > - review > - Merge branch 'master' into gen-counter > - merge > - gen-counter Thanks for review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23209#issuecomment-2668780554 From ayang at openjdk.org Wed Feb 19 14:18:05 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 19 Feb 2025 14:18:05 GMT Subject: Integrated: 8348171: Refactor GenerationCounters and its subclasses In-Reply-To: References: Message-ID: On Tue, 21 Jan 2025 09:53:07 GMT, Albert Mingkun Yang wrote: > Simple refactoring of removing the use of `virtual` method and use concrete subclasses when needed. > > Test: tier1-5 This pull request has now been integrated. 
Changeset: c6e47fd5 Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/c6e47fd5812997e3428249be1c77c60e7b05a5df Stats: 202 lines in 17 files changed: 6 ins; 160 del; 36 mod 8348171: Refactor GenerationCounters and its subclasses Co-authored-by: Thomas Schatzl Reviewed-by: gli, tschatzl, zgu ------------- PR: https://git.openjdk.org/jdk/pull/23209 From ayang at openjdk.org Wed Feb 19 14:22:58 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 19 Feb 2025 14:22:58 GMT Subject: RFR: 8349688: G1: Wrong initial optional region index when selecting candidates from retained regions [v3] In-Reply-To: References: <8-jj1o3jeNZuavFIbg4VCh_oMXEJxP8vxqnz01cOVgg=.d7ea8a07-d5e6-46cc-9bdf-0f2e5c22632a@github.com> Message-ID: On Tue, 18 Feb 2025 19:13:57 GMT, Ivan Walulya wrote: >> Hi, >> >> Please review this fix to the bug in setting a region index to the optional cset. The crash happens because the incorrect `_index_in_opt_cset` refers to a region that has already been evacuated. >> >> Testing: tier5-common-apps > > Ivan Walulya has updated the pull request incrementally with one additional commit since the last revision: > > Thomas suggestion Marked as reviewed by ayang (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23568#pullrequestreview-2626971229 From iwalulya at openjdk.org Wed Feb 19 14:29:59 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Wed, 19 Feb 2025 14:29:59 GMT Subject: RFR: 8349688: G1: Wrong initial optional region index when selecting candidates from retained regions [v3] In-Reply-To: References: <8-jj1o3jeNZuavFIbg4VCh_oMXEJxP8vxqnz01cOVgg=.d7ea8a07-d5e6-46cc-9bdf-0f2e5c22632a@github.com> Message-ID: On Wed, 19 Feb 2025 14:20:21 GMT, Albert Mingkun Yang wrote: >> Ivan Walulya has updated the pull request incrementally with one additional commit since the last revision: >> >> Thomas suggestion > > Marked as reviewed by ayang (Reviewer). Thanks @albertnetymk and @tschatzl for the reviews! 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23568#issuecomment-2668813998 From iwalulya at openjdk.org Wed Feb 19 14:30:00 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Wed, 19 Feb 2025 14:30:00 GMT Subject: Integrated: 8349688: G1: Wrong initial optional region index when selecting candidates from retained regions In-Reply-To: <8-jj1o3jeNZuavFIbg4VCh_oMXEJxP8vxqnz01cOVgg=.d7ea8a07-d5e6-46cc-9bdf-0f2e5c22632a@github.com> References: <8-jj1o3jeNZuavFIbg4VCh_oMXEJxP8vxqnz01cOVgg=.d7ea8a07-d5e6-46cc-9bdf-0f2e5c22632a@github.com> Message-ID: On Tue, 11 Feb 2025 18:33:42 GMT, Ivan Walulya wrote: > Hi, > > Please review this fix to the bug in setting a region index to the optional cset. The crash happens because the incorrect `_index_in_opt_cset` refers to a region that has already been evacuated. > > Testing: tier5-common-apps This pull request has now been integrated. Changeset: efbad00c Author: Ivan Walulya URL: https://git.openjdk.org/jdk/commit/efbad00c4d7931177ccc5e9bce3b30dfbac94010 Stats: 8 lines in 1 file changed: 6 ins; 0 del; 2 mod 8349688: G1: Wrong initial optional region index when selecting candidates from retained regions Reviewed-by: tschatzl, ayang ------------- PR: https://git.openjdk.org/jdk/pull/23568 From tschatzl at openjdk.org Wed Feb 19 15:06:56 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 19 Feb 2025 15:06:56 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v4] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Tue, 18 Feb 2025 09:20:57 GMT, Albert Mingkun Yang wrote: >> Here is an attempt to simplify GCLocker implementation for Serial and Parallel. >> >> GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). 
JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. >> >> The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. >> >> Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. >> >> Test: tier1-8 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains eight additional commits since the last revision: > > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - gclocker Marked as reviewed by tschatzl (Reviewer). 
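The readers-writer scheme summarized in the quoted RFR can be sketched with standard C++ primitives; `std::shared_mutex` here stands in for the actual VM synchronization, and all names are illustrative rather than the real GCLocker/ZJNICritical API:

```cpp
#include <atomic>
#include <shared_mutex>

// Java threads hold the lock in shared mode while inside a JNI critical
// region; the VM thread takes it exclusively before a GC safepoint. By the
// time the exclusive lock is granted, no thread is still inside a critical
// region, so the GC cycle at the safepoint cannot be blocked by GCLocker.
static std::shared_mutex critical_lock;
static std::atomic<int> in_critical{0};

void enter_jni_critical() {          // Java thread side
  critical_lock.lock_shared();
  in_critical.fetch_add(1);
}

void leave_jni_critical() {
  in_critical.fetch_sub(1);
  critical_lock.unlock_shared();
}

bool gc_can_start() {                // VM thread side, before the safepoint
  critical_lock.lock();              // blocks until all readers have left
  bool no_criticals = (in_critical.load() == 0);
  critical_lock.unlock();
  return no_criticals;
}
```

Note that, per the quoted summary, the real patch avoids an atomic on a shared variable in the enter/leave fast path (it uses an existing thread-local variable plus a store-load barrier); the counter above exists only to make the invariant visible in this sketch.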
------------- PR Review: https://git.openjdk.org/jdk/pull/23367#pullrequestreview-2627110375 From shade at openjdk.org Wed Feb 19 20:58:05 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 19 Feb 2025 20:58:05 GMT Subject: RFR: 8350285: Regression caused by ShenandoahLock under extreme contention In-Reply-To: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> Message-ID: <8ggezlgzCLo_ux2lTJ1UJzVD5VgwSlSo35ze0tQ8XcI=.b4d84743-5e40-4478-82ab-b0de3505e6ab@github.com> On Wed, 19 Feb 2025 15:58:01 GMT, Xiaolong Peng wrote: > We have noticed there is significant regression in at-safepoint time with recent changes made to ShenandoahLock, more specifically this [PR](https://github.com/openjdk/jdk/pull/19570), a local reproducer was written to reproduce the issue, here is the top N at-safepoint time in `ns` comparison: > > Tip: > > 94069776 > 50993550 > 49321667 > 33903446 > 32291313 > 30587810 > 27759958 > 25080997 > 24657404 > 23874338 > > Tip + reverting [PR](https://github.com/openjdk/jdk/pull/19570) > > 58428998 > 44410618 > 30788370 > 20636942 > 15986465 > 15307468 > 9686426 > 9432094 > 7473938 > 6854014 > > Note: command line for the test: > > java -Xms256m -Xmx256m -XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC -XX:-ShenandoahPacing -XX:-UseTLAB -Xlog:gc -Xlog:safepoint ~/Alloc.java | grep -Po "At safepoint: \d+ ns" | grep -Po "\d+" | sort -nr > > > With further digging, we found the real problem is more runnable threads after the [PR](https://github.com/openjdk/jdk/pull/19570) causes longer time for VM_Thread to call disarm wait barrier when leaving safepoint. Fixing in the issue in VM_Thread benefits other GCs as well but it is more complicated(see the details here https://bugs.openjdk.org/browse/JDK-8350324). 
> With some tweaks in ShenandoahLock, we could mitigate the regression caused by [PR](https://github.com/openjdk/jdk/pull/19570), also improve the long tails of at-saftpoint time by more than 10x, here is the result from the same test with this changes of this PR: > > > 1890706 > 1222180 > 1042758 > 853157 > 792057 > 785697 > 780627 > 757817 > 740607 > 736646 > 725727 > 725596 > 724106 > > > ### Other test > - [x] `make test TEST=hotspot_gc_shenandoah` > - [x] Tier 2 All right, assuming performance results are good. Consider a minor nit: src/hotspot/share/gc/shenandoah/shenandoahLock.cpp line 71: > 69: while (SafepointSynchronize::is_synchronizing() && > 70: !SafepointMechanism::local_poll_armed(java_thread)) { > 71: short_sleep(); Why not `yield_or_sleep` here? src/hotspot/share/gc/shenandoah/shenandoahLock.hpp line 48: > 46: void yield_or_sleep(int &yields) { > 47: if (yields < 5) { > 48: os::naked_yield(); Need `#include "runtime/os.hpp"` for this. There is likely a transitive dependency now, but it is cleaner to depend explicitly. Or, maybe move this definition to `shenandoahLock.cpp`, it would be even cleaner then, I think. src/hotspot/share/gc/shenandoah/shenandoahLock.hpp line 61: > 59: #else > 60: os::naked_short_nanosleep(100000); > 61: #endif Any context where this is coming from? This looks like from `SpinYield`? If so, should we target `SpinYield::default_sleep_ns=1000`? ------------- Marked as reviewed by shade (Reviewer). 
PR Review: https://git.openjdk.org/jdk/pull/23701#pullrequestreview-2627707591 PR Review Comment: https://git.openjdk.org/jdk/pull/23701#discussion_r1962048735 PR Review Comment: https://git.openjdk.org/jdk/pull/23701#discussion_r1962211656 PR Review Comment: https://git.openjdk.org/jdk/pull/23701#discussion_r1962050750 From xpeng at openjdk.org Wed Feb 19 20:58:04 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 19 Feb 2025 20:58:04 GMT Subject: RFR: 8350285: Regression caused by ShenandoahLock under extreme contention Message-ID: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> We have noticed there is a significant regression in at-safepoint time with recent changes made to ShenandoahLock, more specifically this [PR](https://github.com/openjdk/jdk/pull/19570). A local reproducer was written to reproduce the issue; here are the top N at-safepoint times in `ns` for comparison: Tip: 94069776 50993550 49321667 33903446 32291313 30587810 27759958 25080997 24657404 23874338 Tip + reverting [PR](https://github.com/openjdk/jdk/pull/19570) 58428998 44410618 30788370 20636942 15986465 15307468 9686426 9432094 7473938 6854014 Note: command line for the test: java -Xms256m -Xmx256m -XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC -XX:-ShenandoahPacing -XX:-UseTLAB -Xlog:gc -Xlog:safepoint ~/Alloc.java | grep -Po "At safepoint: \d+ ns" | grep -Po "\d+" | sort -nr With further digging, we found the real problem is that the larger number of runnable threads after the [PR](https://github.com/openjdk/jdk/pull/19570) causes a longer time for the VM_Thread to disarm the wait barrier when leaving the safepoint. Fixing the issue in the VM_Thread benefits other GCs as well, but it is more complicated (see the details here https://bugs.openjdk.org/browse/JDK-8350324).
With some tweaks in ShenandoahLock, we could mitigate the regression caused by the [PR](https://github.com/openjdk/jdk/pull/19570) and also improve the long tails of at-safepoint time by more than 10x; here is the result from the same test with the changes of this PR: 1890706 1222180 1042758 853157 792057 785697 780627 757817 740607 736646 725727 725596 724106 ### Other test - [x] `make test TEST=hotspot_gc_shenandoah` - [x] Tier 2 ------------- Commit messages: - Move impl of yield_or_sleep to cpp file - Address review comments - Reset yields count to 0 after short sleep - Merge branch 'openjdk:master' into shenandoah-lock-fix - Put thread to sleep after it yield up to 5 times to contend for ShenandoahLock w/o luck Changes: https://git.openjdk.org/jdk/pull/23701/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23701&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8350285 Stats: 16 lines in 2 files changed: 13 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/23701.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23701/head:pull/23701 PR: https://git.openjdk.org/jdk/pull/23701 From xpeng at openjdk.org Wed Feb 19 20:58:05 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 19 Feb 2025 20:58:05 GMT Subject: RFR: 8350285: Regression caused by ShenandoahLock under extreme contention In-Reply-To: <8ggezlgzCLo_ux2lTJ1UJzVD5VgwSlSo35ze0tQ8XcI=.b4d84743-5e40-4478-82ab-b0de3505e6ab@github.com> References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> <8ggezlgzCLo_ux2lTJ1UJzVD5VgwSlSo35ze0tQ8XcI=.b4d84743-5e40-4478-82ab-b0de3505e6ab@github.com> Message-ID: <5au_4m8XLah7rypwO90JKB5C41b7meh_IVRXYuOYveY=.daf14a11-252e-4dbf-846f-acccae09af18@github.com> On Wed, 19 Feb 2025 16:56:14 GMT, Aleksey Shipilev wrote: >> We have noticed there is significant regression in at-safepoint time with recent changes made to ShenandoahLock, more specifically this [PR](https://github.com/openjdk/jdk/pull/19570), a
local reproducer was written to reproduce the issue, here is the top N at-safepoint time in `ns` comparison: >> >> Tip: >> >> 94069776 >> 50993550 >> 49321667 >> 33903446 >> 32291313 >> 30587810 >> 27759958 >> 25080997 >> 24657404 >> 23874338 >> >> Tip + reverting [PR](https://github.com/openjdk/jdk/pull/19570) >> >> 58428998 >> 44410618 >> 30788370 >> 20636942 >> 15986465 >> 15307468 >> 9686426 >> 9432094 >> 7473938 >> 6854014 >> >> Note: command line for the test: >> >> java -Xms256m -Xmx256m -XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC -XX:-ShenandoahPacing -XX:-UseTLAB -Xlog:gc -Xlog:safepoint ~/Alloc.java | grep -Po "At safepoint: \d+ ns" | grep -Po "\d+" | sort -nr >> >> >> With further digging, we found the real problem is more runnable threads after the [PR](https://github.com/openjdk/jdk/pull/19570) causes longer time for VM_Thread to call disarm wait barrier when leaving safepoint. Fixing in the issue in VM_Thread benefits other GCs as well but it is more complicated(see the details here https://bugs.openjdk.org/browse/JDK-8350324). >> With some tweaks in ShenandoahLock, we could mitigate the regression caused by [PR](https://github.com/openjdk/jdk/pull/19570), also improve the long tails of at-saftpoint time by more than 10x, here is the result from the same test with this changes of this PR: >> >> >> 1890706 >> 1222180 >> 1042758 >> 853157 >> 792057 >> 785697 >> 780627 >> 757817 >> 740607 >> 736646 >> 725727 >> 725596 >> 724106 >> >> >> ### Other test >> - [x] `make test TEST=hotspot_gc_shenandoah` >> - [x] Tier 2 > > src/hotspot/share/gc/shenandoah/shenandoahLock.cpp line 71: > >> 69: while (SafepointSynchronize::is_synchronizing() && >> 70: !SafepointMechanism::local_poll_armed(java_thread)) { >> 71: short_sleep(); > > Why not `yield_or_sleep` here? It can be `yield_or_sleep` here now, I'll rerun a test to verify, I have updated `yield_or_sleep` to reset the counter after `short_sleep`. 
In our test last year, we noticed a Java thread may run this loop over 20k times in the worst case, and in the older version the `yields` counter won't be reset, so it is possible that after a safepoint some Java thread will only do `short_sleep`, which may increase allocation latency; I don't want waiting on the safepoint poll to impact the allocation path after the safepoint. > src/hotspot/share/gc/shenandoah/shenandoahLock.hpp line 48: > >> 46: void yield_or_sleep(int &yields) { >> 47: if (yields < 5) { >> 48: os::naked_yield(); > > Need `#include "runtime/os.hpp"` for this. There is likely a transitive dependency now, but it is cleaner to depend explicitly. Or, maybe move this definition to `shenandoahLock.cpp`, it would be even cleaner then, I think. Thanks, just moved the definition to `shenandoahLock.cpp`, it can be static as well. > src/hotspot/share/gc/shenandoah/shenandoahLock.hpp line 61: > >> 59: #else >> 60: os::naked_short_nanosleep(100000); >> 61: #endif > > Any context where this is coming from? This looks like from `SpinYield`? If so, should we target `SpinYield::default_sleep_ns=1000`? It is just a magic number we tested, not from `SpinYield`. I chose 100us because a Shenandoah GC pause is usually less than 1ms; I also tested 10us, but 100us was better in the test.
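For reference, the yield-then-sleep backoff being discussed can be sketched in portable C++; the 5-yield bound and the ~100us sleep are the values quoted in this thread, while `std::this_thread` stands in for `os::naked_yield()` / `os::naked_short_nanosleep()`, which this sketch does not use:

```cpp
#include <chrono>
#include <thread>

// Bounded backoff: yield the CPU a few times first (cheap when another
// runnable thread exists), then fall back to a short sleep. Resetting the
// counter after each sleep means a thread that contends again later starts
// over with cheap yields instead of going straight to sleeping.
static void yield_or_sleep(int& yields) {
  if (yields < 5) {
    std::this_thread::yield();
    yields++;
  } else {
    std::this_thread::sleep_for(std::chrono::nanoseconds(100000)); // ~100us
    yields = 0;  // next round of contention begins with yields again
  }
}
```

The reset is the behavioral change discussed above: without it, a thread that exhausted its yields during a long safepoint would keep sleeping on every later contention, adding latency to the allocation path.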
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23701#discussion_r1962088642 PR Review Comment: https://git.openjdk.org/jdk/pull/23701#discussion_r1962243559 PR Review Comment: https://git.openjdk.org/jdk/pull/23701#discussion_r1962094983 From xpeng at openjdk.org Wed Feb 19 20:58:06 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 19 Feb 2025 20:58:06 GMT Subject: RFR: 8350285: Regression caused by ShenandoahLock under extreme contention In-Reply-To: <5au_4m8XLah7rypwO90JKB5C41b7meh_IVRXYuOYveY=.daf14a11-252e-4dbf-846f-acccae09af18@github.com> References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> <8ggezlgzCLo_ux2lTJ1UJzVD5VgwSlSo35ze0tQ8XcI=.b4d84743-5e40-4478-82ab-b0de3505e6ab@github.com> <5au_4m8XLah7rypwO90JKB5C41b7meh_IVRXYuOYveY=.daf14a11-252e-4dbf-846f-acccae09af18@github.com> Message-ID: On Wed, 19 Feb 2025 17:22:59 GMT, Xiaolong Peng wrote: >> src/hotspot/share/gc/shenandoah/shenandoahLock.cpp line 71: >> >>> 69: while (SafepointSynchronize::is_synchronizing() && >>> 70: !SafepointMechanism::local_poll_armed(java_thread)) { >>> 71: short_sleep(); >> >> Why not `yield_or_sleep` here? > > It can be `yield_or_sleep` here now, I'll rerun a test to verify, I have updated `yield_or_sleep` to reset the counter after `short_sleep`. > > In our test last year, we noticed Java tread may run this loop over 20k times in worse case, the older version `yields` counter won't be reset, so it is possible after safepoint some Java thread will only do `short_sleep`, which may increase allocation latency, I don't want waiting on safepoint poll impact the allocation path after safepoint. I have update code to use yield_or_sleep here. 
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23701#discussion_r1962201794 From shade at openjdk.org Wed Feb 19 20:58:06 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 19 Feb 2025 20:58:06 GMT Subject: RFR: 8350285: Regression caused by ShenandoahLock under extreme contention In-Reply-To: <5au_4m8XLah7rypwO90JKB5C41b7meh_IVRXYuOYveY=.daf14a11-252e-4dbf-846f-acccae09af18@github.com> References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> <8ggezlgzCLo_ux2lTJ1UJzVD5VgwSlSo35ze0tQ8XcI=.b4d84743-5e40-4478-82ab-b0de3505e6ab@github.com> <5au_4m8XLah7rypwO90JKB5C41b7meh_IVRXYuOYveY=.daf14a11-252e-4dbf-846f-acccae09af18@github.com> Message-ID: On Wed, 19 Feb 2025 17:27:15 GMT, Xiaolong Peng wrote: >> src/hotspot/share/gc/shenandoah/shenandoahLock.hpp line 61: >> >>> 59: #else >>> 60: os::naked_short_nanosleep(100000); >>> 61: #endif >> >> Any context where this is coming from? This looks like from `SpinYield`? If so, should we target `SpinYield::default_sleep_ns=1000`? > > It is just a magic number we tested, not from `SpinYield`. I chose 100us because Shenandoah GC pause is usually less then 1ms, I also tested 10us but 100us was better in the test. I looked around HS sources, and I think the closest primitive we have is `HandshakeSpinYield`, which does 10us sleeps. How much worse is 10us in comparison to 100us in your tests? I would prefer to do 10us for all platforms, if performance tests allow us. This would also allow you to inline `short_sleep()`. 
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23701#discussion_r1962150403 From xpeng at openjdk.org Wed Feb 19 20:58:06 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 19 Feb 2025 20:58:06 GMT Subject: RFR: 8350285: Regression caused by ShenandoahLock under extreme contention In-Reply-To: References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> <8ggezlgzCLo_ux2lTJ1UJzVD5VgwSlSo35ze0tQ8XcI=.b4d84743-5e40-4478-82ab-b0de3505e6ab@github.com> <5au_4m8XLah7rypwO90JKB5C41b7meh_IVRXYuOYveY=.daf14a11-252e-4dbf-846f-acccae09af18@github.com> Message-ID: On Wed, 19 Feb 2025 18:05:50 GMT, Aleksey Shipilev wrote: >> It is just a magic number we tested, not from `SpinYield`. I chose 100us because a Shenandoah GC pause is usually less than 1ms; I also tested 10us, but 100us was better in the test. > > I looked around HS sources, and I think the closest primitive we have is `HandshakeSpinYield`, which does 10us sleeps. How much worse is 10us in comparison to 100us in your tests? I would prefer to do 10us for all platforms, if performance tests allow us. This would also allow you to inline `short_sleep()`. It is more than 2x worse, but still much better (10x) than tip and the revert of JDK-8331411; here are the top 10 at-safepoint times for comparison: 10 us: 7982953 5043319 5008139 4597156 4580556 4429245 4403175 4047891 3677389 3582308 100 us: 4553716 1703093 1046248 1038148 780447 778786 778436 778276 728716 721856 I can change it to the same sleep time for all platforms, but Windows doesn't really support nanosecond-resolution sleep; the JVM uses a combination of yield and spin pause to approximate nanosecond-resolution sleep on Windows. It should still be fine since it won't be worse than only `yield` as before. In short, I am not expecting any improvement for Windows.
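The 10us-vs-100us comparison above is dominated by effective sleep granularity: the OS typically rounds a short sleep request up to its timer resolution, so the measured interval, not the literal argument, drives the backoff behavior. A small portable check (illustrative, not the HotSpot `os::` API):

```cpp
#include <chrono>
#include <thread>

// Returns how long a requested sleep actually took, in nanoseconds.
// Most platforms round short sleeps up to their scheduler/timer granularity,
// so the measured value is at least the request and often much larger.
long long measured_sleep_ns(long long request_ns) {
  auto t0 = std::chrono::steady_clock::now();
  std::this_thread::sleep_for(std::chrono::nanoseconds(request_ns));
  auto t1 = std::chrono::steady_clock::now();
  return std::chrono::duration_cast<std::chrono::nanoseconds>(t1 - t0).count();
}
```

Running this with 10000 vs 100000 ns requests on a given machine shows how close the two actually are once granularity is accounted for, which is one way to sanity-check the choice of sleep constant per platform.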
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23701#discussion_r1962180175 From xpeng at openjdk.org Wed Feb 19 20:58:06 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 19 Feb 2025 20:58:06 GMT Subject: RFR: 8350285: Regression caused by ShenandoahLock under extreme contention In-Reply-To: References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> <8ggezlgzCLo_ux2lTJ1UJzVD5VgwSlSo35ze0tQ8XcI=.b4d84743-5e40-4478-82ab-b0de3505e6ab@github.com> <5au_4m8XLah7rypwO90JKB5C41b7meh_IVRXYuOYveY=.daf14a11-252e-4dbf-846f-acccae09af18@github.com> Message-ID: On Wed, 19 Feb 2025 18:29:04 GMT, Xiaolong Peng wrote: >> I looked around HS sources, and I think the closest primitive we have is `HandshakeSpinYield`, which does 10us sleeps. How much worse is 10us in comparison to 100us in your tests? I would prefer to do 10us for all platforms, if performance tests allow us. This would also allow you to inline `short_sleep()`. > > It is like more than 2x worse, but still much better(10x) than tip and revert of JDK-8331411, here are the top 10 at-safepoint time for comparison: > 10 us: > > 7982953 > 5043319 > 5008139 > 4597156 > 4580556 > 4429245 > 4403175 > 4047891 > 3677389 > 3582308 > > > 100 us: > > 4553716 > 1703093 > 1046248 > 1038148 > 780447 > 778786 > 778436 > 778276 > 728716 > 721856 > > > I can change it to same seep time for all platforms, but Windows doesn't really support nanosecond resolution sleep, JVM use combination of yield and spin pause to approximate nanosecond resolution sleep for Windows, it should be still fine since it won't be worse then only `yield` as before. In short, I am not expecting any improvement for Windows. I have changed to be same for all platforms, but still keep 100us sleep duration given that it did perform better. 
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23701#discussion_r1962203617 From kdnilsen at openjdk.org Wed Feb 19 21:23:54 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 19 Feb 2025 21:23:54 GMT Subject: RFR: 8350285: Regression caused by ShenandoahLock under extreme contention In-Reply-To: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> Message-ID: <_qWdKYXFMjPhwL_I2udsXUTyRYKUzOpKMZ3Bx3x-hWQ=.e5b6d1fc-cfda-46af-9c6e-2aedca880353@github.com> On Wed, 19 Feb 2025 15:58:01 GMT, Xiaolong Peng wrote: > We have noticed there is significant regression in at-safepoint time with recent changes made to ShenandoahLock, more specifically this [PR](https://github.com/openjdk/jdk/pull/19570), a local reproducer was written to reproduce the issue, here is the top N at-safepoint time in `ns` comparison: > > Tip: > > 94069776 > 50993550 > 49321667 > 33903446 > 32291313 > 30587810 > 27759958 > 25080997 > 24657404 > 23874338 > > Tip + reverting [PR](https://github.com/openjdk/jdk/pull/19570) > > 58428998 > 44410618 > 30788370 > 20636942 > 15986465 > 15307468 > 9686426 > 9432094 > 7473938 > 6854014 > > Note: command line for the test: > > java -Xms256m -Xmx256m -XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC -XX:-ShenandoahPacing -XX:-UseTLAB -Xlog:gc -Xlog:safepoint ~/Alloc.java | grep -Po "At safepoint: \d+ ns" | grep -Po "\d+" | sort -nr > > > With further digging, we found the real problem is more runnable threads after the [PR](https://github.com/openjdk/jdk/pull/19570) causes longer time for VM_Thread to call `futex(FUTEX_WAKE_PRIVATE)` to disarm wait barrier when leaving safepoint. Fixing in the issue in VM_Thread benefits other GCs as well but it is more complicated(see the details here https://bugs.openjdk.org/browse/JDK-8350324). 
> With some tweaks in ShenandoahLock, we could mitigate the regression caused by [PR](https://github.com/openjdk/jdk/pull/19570), also improve the long tails of at-saftpoint time by more than 10x, here is the result from the same test with this changes of this PR: > > > 1890706 > 1222180 > 1042758 > 853157 > 792057 > 785697 > 780627 > 757817 > 740607 > 736646 > 725727 > 725596 > 724106 > > > ### Other test > - [x] `make test TEST=hotspot_gc_shenandoah` > - [x] Tier 2 Yielding 5x for every 1 nanosleep seems a bit "arbitrary". I assume you found that the number 5 delivered the "best performance" compared to other numbers you might have chosen. I wonder if different architectures with different numbers of cores, different operating systems, and/or different test applications that have different numbers of runnable threads would also perform best with this same magic number 5. Could we at least add a comment explaining how/why we chose 5 here? ------------- PR Review: https://git.openjdk.org/jdk/pull/23701#pullrequestreview-2628009211 From xpeng at openjdk.org Wed Feb 19 21:30:52 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 19 Feb 2025 21:30:52 GMT Subject: RFR: 8350285: Regression caused by ShenandoahLock under extreme contention In-Reply-To: <_qWdKYXFMjPhwL_I2udsXUTyRYKUzOpKMZ3Bx3x-hWQ=.e5b6d1fc-cfda-46af-9c6e-2aedca880353@github.com> References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> <_qWdKYXFMjPhwL_I2udsXUTyRYKUzOpKMZ3Bx3x-hWQ=.e5b6d1fc-cfda-46af-9c6e-2aedca880353@github.com> Message-ID: On Wed, 19 Feb 2025 21:21:42 GMT, Kelvin Nilsen wrote: > Yielding 5x for every 1 nanosleep seems a bit "arbitrary". I assume you found that the number 5 delivered the "best performance" compared to other numbers you might have chosen. 
I wonder if different architectures with different numbers of cores, different operating systems, and/or different test applications that have different numbers of runnable threads would also perform best with this same magic number 5. > > Could we at least add a comment explaining how/why we chose 5 here? 5 is from the old implementation, the old implementation was copied from https://github.com/openjdk/jdk/blob/master/src/hotspot/share/runtime/thread.cpp#L577 I can do a bit more test on this, and add some comments to explain why we choose the magic number 5. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23701#issuecomment-2669800369 From xpeng at openjdk.org Wed Feb 19 22:38:53 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 19 Feb 2025 22:38:53 GMT Subject: RFR: 8350285: Regression caused by ShenandoahLock under extreme contention In-Reply-To: References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> <_qWdKYXFMjPhwL_I2udsXUTyRYKUzOpKMZ3Bx3x-hWQ=.e5b6d1fc-cfda-46af-9c6e-2aedca880353@github.com> Message-ID: On Wed, 19 Feb 2025 21:28:40 GMT, Xiaolong Peng wrote: > > Yielding 5x for every 1 nanosleep seems a bit "arbitrary". I assume you found that the number 5 delivered the "best performance" compared to other numbers you might have chosen. I wonder if different architectures with different numbers of cores, different operating systems, and/or different test applications that have different numbers of runnable threads would also perform best with this same magic number 5. > > Could we at least add a comment explaining how/why we chose 5 here? > > 5 is from the old implementation, the old implementation was copied from https://github.com/openjdk/jdk/blob/master/src/hotspot/share/runtime/thread.cpp#L577 > > I can do a bit more test on this, and add some comments to explain why we choose the magic number 5. 
I have tested 3/5/7, safepoint time is very close in the tests with 3 or 5 yields(3 is slightly better), but it is worse with 7, I can change it to 3. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23701#issuecomment-2669916924 From xpeng at openjdk.org Wed Feb 19 22:49:22 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 19 Feb 2025 22:49:22 GMT Subject: RFR: 8350285: Regression caused by ShenandoahLock under extreme contention [v2] In-Reply-To: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> Message-ID: > We have noticed there is significant regression in at-safepoint time with recent changes made to ShenandoahLock, more specifically this [PR](https://github.com/openjdk/jdk/pull/19570), a local reproducer was written to reproduce the issue, here is the top N at-safepoint time in `ns` comparison: > > Tip: > > 94069776 > 50993550 > 49321667 > 33903446 > 32291313 > 30587810 > 27759958 > 25080997 > 24657404 > 23874338 > > Tip + reverting [PR](https://github.com/openjdk/jdk/pull/19570) > > 58428998 > 44410618 > 30788370 > 20636942 > 15986465 > 15307468 > 9686426 > 9432094 > 7473938 > 6854014 > > Note: command line for the test: > > java -Xms256m -Xmx256m -XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC -XX:-ShenandoahPacing -XX:-UseTLAB -Xlog:gc -Xlog:safepoint ~/Alloc.java | grep -Po "At safepoint: \d+ ns" | grep -Po "\d+" | sort -nr > > > With further digging, we found the real problem is more runnable threads after the [PR](https://github.com/openjdk/jdk/pull/19570) causes longer time for VM_Thread to call `futex(FUTEX_WAKE_PRIVATE)` to disarm wait barrier when leaving safepoint. Fixing in the issue in VM_Thread benefits other GCs as well but it is more complicated(see the details here https://bugs.openjdk.org/browse/JDK-8350324). 
> With some tweaks in ShenandoahLock, we could mitigate the regression caused by [PR](https://github.com/openjdk/jdk/pull/19570), also improve the long tails of at-saftpoint time by more than 10x, here is the result from the same test with this changes of this PR: > > > 1890706 > 1222180 > 1042758 > 853157 > 792057 > 785697 > 780627 > 757817 > 740607 > 736646 > 725727 > 725596 > 724106 > > > ### Other test > - [x] `make test TEST=hotspot_gc_shenandoah` > - [x] Tier 2 Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: Address review comments ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23701/files - new: https://git.openjdk.org/jdk/pull/23701/files/756f7820..68e1b985 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23701&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23701&range=00-01 Stats: 4 lines in 1 file changed: 2 ins; 0 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/23701.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23701/head:pull/23701 PR: https://git.openjdk.org/jdk/pull/23701 From kdnilsen at openjdk.org Thu Feb 20 14:22:56 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 20 Feb 2025 14:22:56 GMT Subject: RFR: 8350285: Shenandoah: Regression caused by ShenandoahLock under extreme contention [v2] In-Reply-To: References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> Message-ID: <3X_Qwm8n-a9jDEWsqcXPQF2piMQ8Pbf7OtjXCYeZB6A=.9ec22dd1-e75f-4696-b3da-48eb4f75bd9a@github.com> On Wed, 19 Feb 2025 22:49:22 GMT, Xiaolong Peng wrote: >> We have noticed there is significant regression in at-safepoint time with recent changes made to ShenandoahLock, more specifically this [PR](https://github.com/openjdk/jdk/pull/19570), a local reproducer was written to reproduce the issue, here is the top N at-safepoint time in `ns` comparison: >> >> Tip: >> >> 94069776 >> 50993550 >> 49321667 >> 33903446 >> 
32291313 >> 30587810 >> 27759958 >> 25080997 >> 24657404 >> 23874338 >> >> Tip + reverting [PR](https://github.com/openjdk/jdk/pull/19570) >> >> 58428998 >> 44410618 >> 30788370 >> 20636942 >> 15986465 >> 15307468 >> 9686426 >> 9432094 >> 7473938 >> 6854014 >> >> Note: command line for the test: >> >> java -Xms256m -Xmx256m -XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC -XX:-ShenandoahPacing -XX:-UseTLAB -Xlog:gc -Xlog:safepoint ~/Alloc.java | grep -Po "At safepoint: \d+ ns" | grep -Po "\d+" | sort -nr >> >> >> With further digging, we found the real problem is more runnable threads after the [PR](https://github.com/openjdk/jdk/pull/19570) causes longer time for VM_Thread to call `futex(FUTEX_WAKE_PRIVATE)` to disarm wait barrier when leaving safepoint. Fixing in the issue in VM_Thread benefits other GCs as well but it is more complicated(see the details here https://bugs.openjdk.org/browse/JDK-8350324). >> With some tweaks in ShenandoahLock, we could mitigate the regression caused by [PR](https://github.com/openjdk/jdk/pull/19570), also improve the long tails of at-saftpoint time by more than 10x, here is the result from the same test with this changes of this PR: >> >> >> 1890706 >> 1222180 >> 1042758 >> 853157 >> 792057 >> 785697 >> 780627 >> 757817 >> 740607 >> 736646 >> 725727 >> 725596 >> 724106 >> >> >> ### Other test >> - [x] `make test TEST=hotspot_gc_shenandoah` >> - [x] Tier 2 > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Address review comments Thanks for followup. ------------- Marked as reviewed by kdnilsen (Committer). 
PR Review: https://git.openjdk.org/jdk/pull/23701#pullrequestreview-2630011438 From xpeng at openjdk.org Fri Feb 21 06:50:53 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Fri, 21 Feb 2025 06:50:53 GMT Subject: RFR: 8350285: Shenandoah: Regression caused by ShenandoahLock under extreme contention [v2] In-Reply-To: References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> Message-ID: <3TP8q0-9_fuI5p0CjFlHYzr7UPeLhLsV7hSv0Np3s7I=.f506318a-3243-4efd-b838-f46f9573492e@github.com> On Wed, 19 Feb 2025 22:49:22 GMT, Xiaolong Peng wrote: >> We have noticed a significant regression in at-safepoint time with recent changes made to ShenandoahLock, more specifically this [PR](https://github.com/openjdk/jdk/pull/19570). A local reproducer was written to reproduce the issue; here is the top N at-safepoint time in `ns` comparison: >> >> Tip: >> >> 94069776 >> 50993550 >> 49321667 >> 33903446 >> 32291313 >> 30587810 >> 27759958 >> 25080997 >> 24657404 >> 23874338 >> >> Tip + reverting [PR](https://github.com/openjdk/jdk/pull/19570) >> >> 58428998 >> 44410618 >> 30788370 >> 20636942 >> 15986465 >> 15307468 >> 9686426 >> 9432094 >> 7473938 >> 6854014 >> >> Note: command line for the test: >> >> java -Xms256m -Xmx256m -XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC -XX:-ShenandoahPacing -XX:-UseTLAB -Xlog:gc -Xlog:safepoint ~/Alloc.java | grep -Po "At safepoint: \d+ ns" | grep -Po "\d+" | sort -nr >> >> >> With further digging, we found the real problem is that more runnable threads after the [PR](https://github.com/openjdk/jdk/pull/19570) cause a longer time for the VM_Thread to call `futex(FUTEX_WAKE_PRIVATE)` to disarm the wait barrier when leaving a safepoint. Fixing the issue in VM_Thread benefits other GCs as well, but it is more complicated (see the details here https://bugs.openjdk.org/browse/JDK-8350324).
>> With some tweaks in ShenandoahLock, we could mitigate the regression caused by the [PR](https://github.com/openjdk/jdk/pull/19570) and also improve the long tails of at-safepoint time by more than 10x; here is the result from the same test with the changes of this PR: >> >> >> 1890706 >> 1222180 >> 1042758 >> 853157 >> 792057 >> 785697 >> 780627 >> 757817 >> 740607 >> 736646 >> 725727 >> 725596 >> 724106 >> >> >> ### Other test >> - [x] `make test TEST=hotspot_gc_shenandoah` >> - [x] Tier 2 > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Address review comments Thanks for the reviews! ------------- PR Comment: https://git.openjdk.org/jdk/pull/23701#issuecomment-2673624414 From xpeng at openjdk.org Fri Feb 21 06:54:53 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Fri, 21 Feb 2025 06:54:53 GMT Subject: RFR: 8350285: Shenandoah: Regression caused by ShenandoahLock under extreme contention [v2] In-Reply-To: References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> Message-ID: On Wed, 19 Feb 2025 22:49:22 GMT, Xiaolong Peng wrote: >> We have noticed a significant regression in at-safepoint time with recent changes made to ShenandoahLock, more specifically this [PR](https://github.com/openjdk/jdk/pull/19570). A local reproducer was written to reproduce the issue; here is the top N at-safepoint time in `ns` comparison: >> >> Tip: >> >> 94069776 >> 50993550 >> 49321667 >> 33903446 >> 32291313 >> 30587810 >> 27759958 >> 25080997 >> 24657404 >> 23874338 >> >> Tip + reverting [PR](https://github.com/openjdk/jdk/pull/19570) >> >> 58428998 >> 44410618 >> 30788370 >> 20636942 >> 15986465 >> 15307468 >> 9686426 >> 9432094 >> 7473938 >> 6854014 >> >> Note: command line for the test: >> >> java -Xms256m -Xmx256m -XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC -XX:-ShenandoahPacing -XX:-UseTLAB -Xlog:gc -Xlog:safepoint ~/Alloc.java | grep -Po "At safepoint: \d+ ns" | grep -Po "\d+" | sort -nr
>> >> >> With further digging, we found the real problem is that more runnable threads after the [PR](https://github.com/openjdk/jdk/pull/19570) cause a longer time for the VM_Thread to call `futex(FUTEX_WAKE_PRIVATE)` to disarm the wait barrier when leaving a safepoint. Fixing the issue in VM_Thread benefits other GCs as well, but it is more complicated (see the details here https://bugs.openjdk.org/browse/JDK-8350324). >> With some tweaks in ShenandoahLock, we could mitigate the regression caused by the [PR](https://github.com/openjdk/jdk/pull/19570) and also improve the long tails of at-safepoint time by more than 10x; here is the result from the same test with the changes of this PR: >> >> >> 1890706 >> 1222180 >> 1042758 >> 853157 >> 792057 >> 785697 >> 780627 >> 757817 >> 740607 >> 736646 >> 725727 >> 725596 >> 724106 >> >> >> ### Other test >> - [x] `make test TEST=hotspot_gc_shenandoah` >> - [x] Tier 2 > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Address review comments > > @pengxiaolong This pull request has not yet been marked as ready for integration.
> Sorry I misread it ------------- PR Comment: https://git.openjdk.org/jdk/pull/23701#issuecomment-2673660240 From shade at openjdk.org Fri Feb 21 08:49:55 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Fri, 21 Feb 2025 08:49:55 GMT Subject: RFR: 8350285: Shenandoah: Regression caused by ShenandoahLock under extreme contention [v2] In-Reply-To: References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> Message-ID: On Wed, 19 Feb 2025 22:49:22 GMT, Xiaolong Peng wrote: >> We have noticed a significant regression in at-safepoint time with recent changes made to ShenandoahLock, more specifically this [PR](https://github.com/openjdk/jdk/pull/19570). A local reproducer was written to reproduce the issue; here is the top N at-safepoint time in `ns` comparison: >> >> Tip: >> >> 94069776 >> 50993550 >> 49321667 >> 33903446 >> 32291313 >> 30587810 >> 27759958 >> 25080997 >> 24657404 >> 23874338 >> >> Tip + reverting [PR](https://github.com/openjdk/jdk/pull/19570) >> >> 58428998 >> 44410618 >> 30788370 >> 20636942 >> 15986465 >> 15307468 >> 9686426 >> 9432094 >> 7473938 >> 6854014 >> >> Note: command line for the test: >> >> java -Xms256m -Xmx256m -XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC -XX:-ShenandoahPacing -XX:-UseTLAB -Xlog:gc -Xlog:safepoint ~/Alloc.java | grep -Po "At safepoint: \d+ ns" | grep -Po "\d+" | sort -nr >> >> >> With further digging, we found the real problem is that more runnable threads after the [PR](https://github.com/openjdk/jdk/pull/19570) cause a longer time for the VM_Thread to call `futex(FUTEX_WAKE_PRIVATE)` to disarm the wait barrier when leaving a safepoint. Fixing the issue in VM_Thread benefits other GCs as well, but it is more complicated (see the details here https://bugs.openjdk.org/browse/JDK-8350324).
>> With some tweaks in ShenandoahLock, we could mitigate the regression caused by the [PR](https://github.com/openjdk/jdk/pull/19570) and also improve the long tails of at-safepoint time by more than 10x; here is the result from the same test with the changes of this PR: >> >> >> 1890706 >> 1222180 >> 1042758 >> 853157 >> 792057 >> 785697 >> 780627 >> 757817 >> 740607 >> 736646 >> 725727 >> 725596 >> 724106 >> >> >> ### Other test >> - [x] `make test TEST=hotspot_gc_shenandoah` >> - [x] Tier 2 > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Address review comments Marked as reviewed by shade (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23701#pullrequestreview-2632396522 From erik.osterlund at oracle.com Fri Feb 21 14:02:39 2025 From: erik.osterlund at oracle.com (Erik Osterlund) Date: Fri, 21 Feb 2025 14:02:39 +0000 Subject: RFC: G1 as default collector (for real this time) In-Reply-To: <74d05686-9c57-4262-881d-31c269f34bc5@oracle.com> References: <74d05686-9c57-4262-881d-31c269f34bc5@oracle.com> Message-ID: <61CEE33A-6718-479D-A498-697C1063B5AA@oracle.com> Hi Thomas, When G1 came, the main focus was making it scale better to larger workloads. It did so with a more well rounded balance between latency, throughput and memory footprint than other collectors, and became a good fit for the default GC. Given its initial focus as a region based collector that could scale better, it made sense that Serial was still better in smaller environments where that scaling wasn't really needed. When G1 was made default, it made sense to stay away from the small machine realm. Since then, the gap between Serial and G1 has diminished over time, and it still keeps on diminishing. So it certainly makes sense to consider the option of changing Serial -> G1 by default in the small realm.
The result of doing that is, as you point out, that there is a single default GC invariant of the environment scale, instead of having an exception for small environments. That's nice. There is however a flip side for that argument on the other side of the scaling spectrum, where ZGC is probably a better fit on the even larger scale. So while it's true that the effect of a Serial -> G1 default change is a static default GC, I just think we should mind the fact that there is more uncertainty on the larger end of the scale. I'm not proposing any changes, just saying that maybe we should be careful about stressing the importance of having a static default GC, if we don't know if that is the better strategy on the larger end of the scale or not, going forward. /Erik > On 19 Feb 2025, at 14:16, Thomas Schatzl wrote: > > Hi all, > > > there have been some recent discussions around making G1 the default for all use-cases, both internally at Oracle and at the OpenJDK Committers Workshop. With this e-mail I want to bring this subject to a wider audience to gather feedback around potential problems with such a move. > > > As you all may know, G1 is the default collector in the Hotspot VM. However in some situations, some say in (too) many situations, the VM selects Serial GC "by default" anyway. :) > > From what I understand there are the following reasons to keep Serial GC _as default option_ in the context of "small" environments: > > * throughput: G1's large write barrier makes an argument about throughput being too far off and noticeable. Ongoing efforts ([1] which we plan for JDK 25) show that the difference is going to be much smaller if it ever was. > > Further, as soon as Serial GC is running for longer this advantage diminishes a lot due to full collections and can result in G1 actually performing better. > > * (native) memory footprint: G1 has made great strides in native memory usage.
> > In the past particularly remembered sets were of concern, but their memory usage has been reduced significantly over the past few years. > E.g. with the above change the entire young gen remembered set is also managed on the card table exactly like in Serial GC. > > [I would also like to state that I would be surprised if remembered sets, with a recent JDK and G1, are ever an issue with heaps Serial GC targets] > > Heap management tends to be worse with Serial GC, mostly due to its strict generational boundaries. G1's region based layout avoids wasting memory. > > * latency: if this has ever been a disadvantage, Serial GC's full collections are worse compared to G1's incremental collections. > > * startup: the time to start up the VM is not that different between these two collectors. Other components are much more relevant here. > > * historical inertia: at the time there was a need to select a default, there has been nothing but Serial and Parallel GC. JDK 9 simply replaced Parallel GC as default for "server class" machines, probably as the path of lesser resistance and because of the shortcomings known at the time in some of the above areas. > > Some initial testing showed that Serial GC performs much better when constraining it to the same environment (single thread, heaps < 1.7g) than G1. > > At the same time, looking at the current situation from an end user's point of view, it is very much confusing for them, getting a different garbage collector depending on environment, based on some somewhat arguable criteria. > > This change would also make the expectations ("g1 is default since jdk 9") match the actual behavior. > > I am looking forward to hearing your opinion about making G1 unconditionally default.
> > Thanks, > Thomas > > [1] https://bugs.openjdk.org/browse/JDK-8340827 From duke at openjdk.org Fri Feb 21 16:34:54 2025 From: duke at openjdk.org (duke) Date: Fri, 21 Feb 2025 16:34:54 GMT Subject: RFR: 8350285: Shenandoah: Regression caused by ShenandoahLock under extreme contention [v2] In-Reply-To: References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> Message-ID: <8pPDE34XMhzcayRu3NZhW4JUmANZblcUq9-QZZOFe2g=.95047351-a914-44bf-ac66-b35985b73b18@github.com> On Wed, 19 Feb 2025 22:49:22 GMT, Xiaolong Peng wrote: >> We have noticed a significant regression in at-safepoint time with recent changes made to ShenandoahLock, more specifically this [PR](https://github.com/openjdk/jdk/pull/19570). A local reproducer was written to reproduce the issue; here is the top N at-safepoint time in `ns` comparison: >> >> Tip: >> >> 94069776 >> 50993550 >> 49321667 >> 33903446 >> 32291313 >> 30587810 >> 27759958 >> 25080997 >> 24657404 >> 23874338 >> >> Tip + reverting [PR](https://github.com/openjdk/jdk/pull/19570) >> >> 58428998 >> 44410618 >> 30788370 >> 20636942 >> 15986465 >> 15307468 >> 9686426 >> 9432094 >> 7473938 >> 6854014 >> >> Note: command line for the test: >> >> java -Xms256m -Xmx256m -XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC -XX:-ShenandoahPacing -XX:-UseTLAB -Xlog:gc -Xlog:safepoint ~/Alloc.java | grep -Po "At safepoint: \d+ ns" | grep -Po "\d+" | sort -nr >> >> >> With further digging, we found the real problem is that more runnable threads after the [PR](https://github.com/openjdk/jdk/pull/19570) cause a longer time for the VM_Thread to call `futex(FUTEX_WAKE_PRIVATE)` to disarm the wait barrier when leaving a safepoint. Fixing the issue in VM_Thread benefits other GCs as well, but it is more complicated (see the details here https://bugs.openjdk.org/browse/JDK-8350324).
>> With some tweaks in ShenandoahLock, we could mitigate the regression caused by the [PR](https://github.com/openjdk/jdk/pull/19570) and also improve the long tails of at-safepoint time by more than 10x; here is the result from the same test with the changes of this PR: >> >> >> 1890706 >> 1222180 >> 1042758 >> 853157 >> 792057 >> 785697 >> 780627 >> 757817 >> 740607 >> 736646 >> 725727 >> 725596 >> 724106 >> >> >> ### Other test >> - [x] `make test TEST=hotspot_gc_shenandoah` >> - [x] Tier 2 > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Address review comments @pengxiaolong Your change (at version 68e1b985b939dba0f4dc12a71901bb063769c1f1) is now ready to be sponsored by a Committer. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23701#issuecomment-2675015842 From xpeng at openjdk.org Fri Feb 21 16:41:58 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Fri, 21 Feb 2025 16:41:58 GMT Subject: Integrated: 8350285: Shenandoah: Regression caused by ShenandoahLock under extreme contention In-Reply-To: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> References: <2ZCZUKE71ToGyKRHVR2TNpmgoubol7j2MVENy3p4kdo=.3e005c72-39ca-4fbe-852a-ac90bbfeb63a@github.com> Message-ID: On Wed, 19 Feb 2025 15:58:01 GMT, Xiaolong Peng wrote: > We have noticed a significant regression in at-safepoint time with recent changes made to ShenandoahLock, more specifically this [PR](https://github.com/openjdk/jdk/pull/19570). A local reproducer was written to reproduce the issue; here is the top N at-safepoint time in `ns` comparison: > > Tip: > > 94069776 > 50993550 > 49321667 > 33903446 > 32291313 > 30587810 > 27759958 > 25080997 > 24657404 > 23874338 > > Tip + reverting [PR](https://github.com/openjdk/jdk/pull/19570) > > 58428998 > 44410618 > 30788370 > 20636942 > 15986465 > 15307468 > 9686426 > 9432094 > 7473938 > 6854014 > > Note: command line for the test: > >
java -Xms256m -Xmx256m -XX:+UnlockExperimentalVMOptions -XX:+UseShenandoahGC -XX:-ShenandoahPacing -XX:-UseTLAB -Xlog:gc -Xlog:safepoint ~/Alloc.java | grep -Po "At safepoint: \d+ ns" | grep -Po "\d+" | sort -nr > > > With further digging, we found the real problem is that more runnable threads after the [PR](https://github.com/openjdk/jdk/pull/19570) cause a longer time for the VM_Thread to call `futex(FUTEX_WAKE_PRIVATE)` to disarm the wait barrier when leaving a safepoint. Fixing the issue in VM_Thread benefits other GCs as well, but it is more complicated (see the details here https://bugs.openjdk.org/browse/JDK-8350324). > With some tweaks in ShenandoahLock, we could mitigate the regression caused by the [PR](https://github.com/openjdk/jdk/pull/19570) and also improve the long tails of at-safepoint time by more than 10x; here is the result from the same test with the changes of this PR: > > > 1890706 > 1222180 > 1042758 > 853157 > 792057 > 785697 > 780627 > 757817 > 740607 > 736646 > 725727 > 725596 > 724106 > > > ### Other test > - [x] `make test TEST=hotspot_gc_shenandoah` > - [x] Tier 2 This pull request has now been integrated.
Changeset: bd8ad309 Author: Xiaolong Peng Committer: Aleksey Shipilev URL: https://git.openjdk.org/jdk/commit/bd8ad309b59bceb3073a8d6411cca74e73508885 Stats: 18 lines in 2 files changed: 15 ins; 0 del; 3 mod 8350285: Shenandoah: Regression caused by ShenandoahLock under extreme contention Reviewed-by: shade, kdnilsen ------------- PR: https://git.openjdk.org/jdk/pull/23701 From thomas.schatzl at oracle.com Mon Feb 24 08:33:32 2025 From: thomas.schatzl at oracle.com (Thomas Schatzl) Date: Mon, 24 Feb 2025 09:33:32 +0100 Subject: RFC: G1 as default collector (for real this time) In-Reply-To: <61CEE33A-6718-479D-A498-697C1063B5AA@oracle.com> References: <74d05686-9c57-4262-881d-31c269f34bc5@oracle.com> <61CEE33A-6718-479D-A498-697C1063B5AA@oracle.com> Message-ID: Hi, On 21.02.25 15:02, Erik Osterlund wrote: > Hi Thomas, > [...]> There is however a flip side for that argument on the other side of the scaling spectrum, where ZGC is probably a better fit on the even larger scale. So while it's true that the effect of a Serial -> G1 default change is a static default GC, I just think we should mind the fact that there is more uncertainty on the larger end of the scale. I'm not proposing any changes, just saying that maybe we should be careful about stressing the importance of having a static default GC, if we don't know if that is the better strategy on the larger end of the scale or not, going forward. +1 Thomas From iwalulya at openjdk.org Mon Feb 24 09:53:55 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Mon, 24 Feb 2025 09:53:55 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v4] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Tue, 18 Feb 2025 09:20:57 GMT, Albert Mingkun Yang wrote: >> Here is an attempt to simplify GCLocker implementation for Serial and Parallel.
>> >> GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. >> >> The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. >> >> Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. >> >> Test: tier1-8 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains eight additional commits since the last revision: > > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - gclocker Marked as reviewed by iwalulya (Reviewer). 
------------- PR Review: https://git.openjdk.org/jdk/pull/23367#pullrequestreview-2636497189 From dholmes at openjdk.org Mon Feb 24 11:28:01 2025 From: dholmes at openjdk.org (David Holmes) Date: Mon, 24 Feb 2025 11:28:01 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v4] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Tue, 18 Feb 2025 09:20:57 GMT, Albert Mingkun Yang wrote: >> Here is an attempt to simplify GCLocker implementation for Serial and Parallel. >> >> GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. >> >> The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. >> >> Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. 
>> >> Test: tier1-8 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains eight additional commits since the last revision: > > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - gclocker Nothing further from me. Thanks ------------- PR Review: https://git.openjdk.org/jdk/pull/23367#pullrequestreview-2636773637 From kbarrett at openjdk.org Mon Feb 24 13:02:54 2025 From: kbarrett at openjdk.org (Kim Barrett) Date: Mon, 24 Feb 2025 13:02:54 GMT Subject: RFR: 8349906: G1: Improve initial survivor rate for newly used young regions In-Reply-To: References: Message-ID: On Wed, 12 Feb 2025 10:55:46 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that tries to improve the survivor rate initial values for newly expanded regions. > > Currently G1 uses `InitialSurvivorRate` as survivor rate for such regions, but it is typically a pretty bad choice because > > * it's rather conservative, estimating that 40% of region contents will survive > * such a conservative value is kind of bad particularly in cases for regions that are expanded late in the mutator phase because they are not frequently updated (and with our running weights changes get propagated over a very long time), i.e. this 40% sticks for a long time (*) > * it is a random value, i.e. not particularly specific to the application. > > The suggestion is to use the survivor rate for the last region we know the survivor rate already. > > (*) to clarify this a little: G1 keeps track of `[0...m]` survivor rate predictors. For a given garbage collection, `[0...n]` of those are updated (`n` is the number of eden/survivor regions depending on the rate group). 
However, those for `]n...m]` are not; in particular, for the seldom-allocated regions in that range, the predictors are updated very infrequently. Now the young gen sizing uses these predictions "at the end" of the predictor anyway, and since they are infrequently updated and their values are very conservative, G1 won't expand young gen as much as it could/should. > > Testing: gha, tier1-7 (with other changes) > > Hth, > Thomas Changes requested by kbarrett (Reviewer). src/hotspot/share/gc/g1/g1SurvRateGroup.cpp line 79: > 77: : InitialSurvivorRate; > 78: > 79: for (size_t i = _stats_arrays_length; i < _num_added_regions; ++i) { Shouldn't this iteration variable be similarly updated as was done in fill_in_last_surv_rates, to avoid some implicit narrowing? Similarly for the loop on line 49 in the destructor? Basically, I'm suggesting doing all or none of these in this PR (with maybe a preference for none, and do a separate sweep). ------------- PR Review: https://git.openjdk.org/jdk/pull/23584#pullrequestreview-2637013120 PR Review Comment: https://git.openjdk.org/jdk/pull/23584#discussion_r1967598618 From rsunderbabu at openjdk.org Mon Feb 24 15:33:11 2025 From: rsunderbabu at openjdk.org (Ramkumar Sunderbabu) Date: Mon, 24 Feb 2025 15:33:11 GMT Subject: RFR: 8314840: 3 gc/epsilon tests ignore external vm options Message-ID: These tests do not pass Java/JVM test command line options (flags) to the child process. More details in JBS. Tiers 1 to 3 tested, along with various flag combinations.
------------- Commit messages: - prepend test java opts Changes: https://git.openjdk.org/jdk/pull/23751/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23751&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8314840 Stats: 9 lines in 3 files changed: 0 ins; 0 del; 9 mod Patch: https://git.openjdk.org/jdk/pull/23751.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23751/head:pull/23751 PR: https://git.openjdk.org/jdk/pull/23751 From wkemper at openjdk.org Mon Feb 24 17:39:54 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 24 Feb 2025 17:39:54 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References: Message-ID: On Sat, 15 Feb 2025 01:41:48 GMT, Y. Srinivas Ramakrishna wrote: >> src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.cpp line 98: >> >>> 96: } >>> 97: >>> 98: // In case any threads are waiting for a cycle to happen, let them know it isn't. >> >> maybe "it isn't happening", or "it won't happen". > > This is interesting. If GC is stopping prior to shutting down the VM, is there any point in notifying these waiting threads. Why not let them wait, and quietly shut things down? Are there JCK or other tests that would fail in that case? The waiting threads will remain waiting and prevent the JVM from shutting down. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1968113830 From wkemper at openjdk.org Mon Feb 24 17:53:01 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 24 Feb 2025 17:53:01 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References: Message-ID: On Fri, 14 Feb 2025 22:56:20 GMT, Y. Srinivas Ramakrishna wrote: >> William Kemper has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains 30 commits: >> >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Old gen bootstrap cycle must make it to init mark >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Improve message for assertion >> - Make shutdown safer for threads requesting (or expecting) gc >> - Do not accept requests if control thread is terminating >> - Notify waiters when control thread terminates >> - Add event for control thread state changes >> - Fix shutdown livelock error >> - Fix includes >> - ... and 20 more: https://git.openjdk.org/jdk/compare/ba6c9659...915ffbda > > src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.hpp line 64: > >> 62: private: >> 63: // This lock is used to coordinate setting the _requested_gc_cause, _requested generation >> 64: // and _gc_mode. It is important that these be changed together and have a consistent view. > > In that case, for ease of maintenance, I'd move the declaration of all of the 3 data members that this lock protects next to this lock, either immediately preceding or immediately succeeding its declaration in the body of this class. > > Are these data members always both read and written under this lock? If so, then `_gc_mode` below doesn't need to be defined `volatile`. The `_gc_mode` is read without the lock by the regulator thread. However, the regulator thread does take the lock and reads `_gc_mode` again under the lock before making any state changes. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1968132168 From wkemper at openjdk.org Mon Feb 24 17:55:57 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 24 Feb 2025 17:55:57 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References: Message-ID: On Sat, 15 Feb 2025 00:05:46 GMT, Y. 
Srinivas Ramakrishna wrote: >> William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 30 commits: >> >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Old gen bootstrap cycle must make it to init mark >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Improve message for assertion >> - Make shutdown safer for threads requesting (or expecting) gc >> - Do not accept requests if control thread is terminating >> - Notify waiters when control thread terminates >> - Add event for control thread state changes >> - Fix shutdown livelock error >> - Fix includes >> - ... and 20 more: https://git.openjdk.org/jdk/compare/ba6c9659...915ffbda > > src/hotspot/share/gc/shenandoah/shenandoahController.hpp line 66: > >> 64: >> 65: // This cancels the collection cycle and has an option to block >> 66: // until another cycle runs and clears the alloc failure gc flag. > > But "the alloc failure gc flag" is gone above. The comment should be updated as well. A public API's description should avoid talking about its internal implementation details here. It's OK to talk about implementation details in the implementation of the method, not in the header spec here. Fixed. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1968135382 From wkemper at openjdk.org Mon Feb 24 18:02:55 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 24 Feb 2025 18:02:55 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References: Message-ID: On Sat, 15 Feb 2025 00:59:37 GMT, Y. Srinivas Ramakrishna wrote: >> William Kemper has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains 30 commits: >> >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Old gen bootstrap cycle must make it to init mark >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Improve message for assertion >> - Make shutdown safer for threads requesting (or expecting) gc >> - Do not accept requests if control thread is terminating >> - Notify waiters when control thread terminates >> - Add event for control thread state changes >> - Fix shutdown livelock error >> - Fix includes >> - ... and 20 more: https://git.openjdk.org/jdk/compare/ba6c9659...915ffbda > > src/hotspot/share/gc/shenandoah/heuristics/shenandoahOldHeuristics.hpp line 188: > >> 186: >> 187: bool should_start_gc() override; >> 188: bool resume_old_cycle(); > > Documentation comment please, especially explaining the return value. > For things that may return `false` and not do anything, it's better to use `try_` prefix. In fact, the method doesn't actually resume the cycle, but checks if we are in a state such that we should resume it. So, I'd name it `should_resume_old_cycle()`, consistent with the name `should_start_gc()` for the previous method. That makes sense. Will do. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1968146250 From wkemper at openjdk.org Mon Feb 24 18:23:56 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 24 Feb 2025 18:23:56 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References: Message-ID: On Sat, 15 Feb 2025 01:14:03 GMT, Y. 
Srinivas Ramakrishna wrote: >> src/hotspot/share/gc/shenandoah/shenandoahCollectorPolicy.hpp line 101: >> >>> 99: || cause == GCCause::_shenandoah_allocation_failure_evac >>> 100: || cause == GCCause::_shenandoah_humongous_allocation_failure; >>> 101: } >> >> Would it make sense to move this implementation also to the .cpp file like the other static `is_...` methods below? > > Or is this guaranteeing inlining into the caller's body, which you might prefer for the callers? I moved the implementation. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1968193533 From wkemper at openjdk.org Mon Feb 24 18:38:58 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 24 Feb 2025 18:38:58 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc [v5] In-Reply-To: References: Message-ID: On Tue, 18 Feb 2025 19:28:28 GMT, Kelvin Nilsen wrote: >> In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. > > Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains seven commits: > > - Merge branch 'master' of https://git.openjdk.org/jdk into defer-generational-full-gc > - Merge master > - Fix typo in merge conflict resolution > - 8348595: GenShen: Fix generational free-memory no-progress check > > Reviewed-by: phh, xpeng > - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) > > Reviewed-by: shade > - Merge tag 'jdk-25+10' into defer-generational-full-gc > > Added tag jdk-25+10 for changeset a637ccf2 > - Be less eager to upgrade degen to full gc Marked as reviewed by wkemper (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23552#pullrequestreview-2638092681 From wkemper at openjdk.org Mon Feb 24 20:54:37 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 24 Feb 2025 20:54:37 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v13] In-Reply-To: References: Message-ID: <7e3ebblQE8huN6LKQWeMxBHqJwaN-H6PangQDk57k4g=.1c9d19d0-22e4-4748-9484-87f52055491a@github.com> > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. 
> * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). > > ## Testing > * jtreg hotspot_gc_shenandoah > * dacapo, extremem, diluvian, specjbb2015, specjvm2018, heapothesys William Kemper has updated the pull request incrementally with one additional commit since the last revision: Address review feedback (better comments, better names) ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/915ffbda..1d887fcb Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=12 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=11-12 Stats: 56 lines in 9 files changed: 24 ins; 17 del; 15 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From wkemper at openjdk.org Tue Feb 25 01:43:15 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 25 Feb 2025 01:43:15 GMT Subject: RFR: 8350605: assert(!heap->is_uncommit_in_progress()) failed: Cannot uncommit bitmaps while resetting them Message-ID: The protocol which is meant to prevent regions from being uncommitted while their bitmaps are being reset may fail. This happens when the control thread attempts to wait for the uncommit thread to finish, but the uncommit thread has not yet indicated that it has started. 
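The race described here is the classic one where a waiter checks an in-progress flag the worker has not yet published. A minimal sketch of the lock-based scheme the fix describes — illustrative names only, with `std::mutex`/`std::condition_variable` standing in for HotSpot's Monitor API:

```cpp
#include <cassert>
#include <condition_variable>
#include <mutex>
#include <thread>

// Hypothetical model of the fixed protocol: the in-progress flag is only
// ever read or written under the lock, so the control thread can neither
// miss the uncommit thread's start nor its finish.
class UncommitGate {
  std::mutex lock_;
  std::condition_variable cv_;
  bool in_progress_ = false;

public:
  void begin() {                      // uncommit thread: announce start
    std::lock_guard<std::mutex> g(lock_);
    in_progress_ = true;
  }
  void end() {                        // uncommit thread: announce finish
    {
      std::lock_guard<std::mutex> g(lock_);
      in_progress_ = false;
    }
    cv_.notify_all();
  }
  void await_idle() {                 // control thread: block until no uncommit runs
    std::unique_lock<std::mutex> g(lock_);
    cv_.wait(g, [this] { return !in_progress_; });
  }
  bool in_progress() {
    std::lock_guard<std::mutex> g(lock_);
    return in_progress_;
  }
};
```

Because the predicate is re-checked under the lock inside `wait`, it does not matter whether `end()` runs before or after the control thread starts waiting.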
------------- Commit messages: - Use lock to protect in progress flag, remove unnecessary stop lock and flag Changes: https://git.openjdk.org/jdk/pull/23760/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23760&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8350605 Stats: 74 lines in 2 files changed: 36 ins; 24 del; 14 mod Patch: https://git.openjdk.org/jdk/pull/23760.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23760/head:pull/23760 PR: https://git.openjdk.org/jdk/pull/23760 From ayang at openjdk.org Tue Feb 25 11:17:06 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 25 Feb 2025 11:17:06 GMT Subject: RFR: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME [v4] In-Reply-To: References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Tue, 18 Feb 2025 09:20:57 GMT, Albert Mingkun Yang wrote: >> Here is an attempt to simplify GCLocker implementation for Serial and Parallel. >> >> GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. >> >> The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. 
The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. >> >> Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. >> >> Test: tier1-8 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains eight additional commits since the last revision: > > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - review > - Merge branch 'master' into gclocker > - gclocker Thanks for review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23367#issuecomment-2681608369 From ayang at openjdk.org Tue Feb 25 11:17:07 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 25 Feb 2025 11:17:07 GMT Subject: Integrated: 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME In-Reply-To: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> References: <8Vqsu8qf5wAN8pZF-8zu8zNhryQa42EZux3nMRChX5k=.63c53ac1-ca69-4a45-a924-9a454e24ea3f@github.com> Message-ID: On Thu, 30 Jan 2025 12:12:29 GMT, Albert Mingkun Yang wrote: > Here is an attempt to simplify GCLocker implementation for Serial and Parallel. > > GCLocker prevents GC when Java threads are in a critical region (i.e., calling JNI critical APIs). 
JDK-7129164 introduces an optimization that updates a shared variable (used to track the number of threads in the critical region) only if there is a pending GC request. However, this also means that after reaching a GC safepoint, we may discover that GCLocker is active, preventing a GC cycle from being invoked. The inability to perform GC at a safepoint adds complexity -- for example, a caller must retry allocation if the request fails due to GC being inhibited by GCLocker. > > The proposed patch uses a readers-writer lock to ensure that all Java threads exit the critical region before reaching a GC safepoint. This guarantees that once inside the safepoint, we can successfully invoke a GC cycle. The approach takes inspiration from `ZJNICritical`, but some regressions were observed in j2dbench (on Windows) and the micro-benchmark in [JDK-8232575](https://bugs.openjdk.org/browse/JDK-8232575). Therefore, instead of relying on atomic operations on a global variable when entering or leaving the critical region, this PR uses an existing thread-local variable with a store-load barrier for synchronization. > > Performance is neutral for all benchmarks tested: DaCapo, SPECjbb2005, SPECjbb2015, SPECjvm2008, j2dbench, and CacheStress. > > Test: tier1-8 This pull request has now been integrated. 
Changeset: a9c9f7f0 Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/a9c9f7f0cbb2f2395fef08348bf867ffa8875d73 Stats: 985 lines in 41 files changed: 50 ins; 822 del; 113 mod 8192647: GClocker induced GCs can starve threads requiring memory leading to OOME Reviewed-by: tschatzl, iwalulya, egahlin ------------- PR: https://git.openjdk.org/jdk/pull/23367 From tschatzl at openjdk.org Tue Feb 25 12:24:21 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 25 Feb 2025 12:24:21 GMT Subject: RFR: 8350643: G1: Make loop iteration variable type correspond to limit in G1SurvRateGroup Message-ID: Hi all, in the G1SurvRateGroup class there are three loops that use a `size_t` loop iteration variable that is compared to `uint`s in the condition, causing some implicit narrowing. Change the loop variables to also use `uints`. Pointed out by @kimbarrett during review of JDK-8349906. Testing: local compilation, GHA, checking other loop variables to match type Thanks, Thomas ------------- Commit messages: - 8350643 Changes: https://git.openjdk.org/jdk/pull/23773/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23773&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8350643 Stats: 4 lines in 1 file changed: 0 ins; 0 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/23773.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23773/head:pull/23773 PR: https://git.openjdk.org/jdk/pull/23773 From tschatzl at openjdk.org Tue Feb 25 15:04:28 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 25 Feb 2025 15:04:28 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier Message-ID: Hi all, please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. 
The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight, but we would like to have this ready by JDK 25.

### Current situation

With this change, G1 will reduce the post write barrier to much more closely resemble Parallel GC's, as described in the JEP. The reason is that G1 lags behind Parallel/Serial GC in throughput due to its larger barrier.

The main reason for the current barrier is how G1 implements concurrent refinement:
* G1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations.
* For correctness, dirty card updates require fine-grained synchronization between mutator and refinement threads.
* Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible.

These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code:

// Filtering
if (region(@x.a) == region(y)) goto done; // same region check
if (y == null) goto done; // null value check
if (card(@x.a) == young_card) goto done; // write to young gen check
StoreLoad; // synchronize
if (card(@x.a) == dirty_card) goto done;

*card(@x.a) = dirty

// Card tracking
enqueue(card-address(@x.a)) into thread-local-dcq;
if (thread-local-dcq is not full) goto done;

call runtime to move thread-local-dcq into dcqs

done:

Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for Parallel and Serial GC.

The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining.
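For illustration, the filtering portion of the pseudo code above can be modeled as a small runnable sketch. Constants and names are made up for the example (512-byte cards and 1 MiB regions are assumptions, not HotSpot's actual configuration), and the real barrier's enqueue/runtime-call tail is reduced to a comment:

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Illustrative constants -- not HotSpot's actual values.
constexpr int kLogRegionBytes = 20;   // assume 1 MiB regions
constexpr int kCardShift      = 9;    // assume 512-byte cards
constexpr uint8_t kCleanCard = 0xff, kDirtyCard = 0, kYoungCard = 2;

// Returns true iff the write of 'value' to 'field' must dirty a card,
// i.e. none of the filters fires and the card is not yet dirty.
bool old_barrier_dirties_card(std::vector<uint8_t>& cards,
                              uintptr_t field, uintptr_t value) {
  if ((field >> kLogRegionBytes) == (value >> kLogRegionBytes))
    return false;                           // same region check
  if (value == 0) return false;             // null value check
  uint8_t& card = cards[field >> kCardShift];
  if (card == kYoungCard) return false;     // write to young gen check
  // <-- the real barrier issues a StoreLoad fence here
  if (card == kDirtyCard) return false;     // already dirty
  card = kDirtyCard;                        // dirty the card; real G1 also
  return true;                              // enqueues its address into the dcq
}
```

Note how the second write to the same card is filtered out — this is the property the dirty card queues rely on to bound the number of enqueued card addresses.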
There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links).

The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse-grained synchronization based on atomically switching card tables. Mutators only work on the "primary" card table, refinement threads on a second card table ("refinement table"). The second card table also replaces the dirty card queue. In that scheme the fine-grained synchronization is unnecessary because mutator and refinement threads always write to different memory areas (and no concurrent write where an update can be lost can occur). This removes the necessity for synchronization on every reference write. Also, no card enqueuing is required any more. Only the filters and the card mark remain.

### How this works

In the beginning both the card table and the refinement table are completely unmarked (contain "clean" cards). The mutator dirties the card table until G1 heuristics think that enough cards have been dirtied, based on how much time is allocated for scanning them during the garbage collection. At that point, the card table and the refinement table are exchanged "atomically" using handshakes. The mutator keeps dirtying the card table (the previous, clean refinement table), while the refinement threads look for and refine dirty cards on the refinement table as before.

Refinement of cards is very similar to before: if an interesting reference in a dirty card has been found, G1 records it in the appropriate remembered sets. In this implementation there is an exception for references to the current collection set (typically young gen) - the refinement threads redirty that card on the card table with a special `to-collection-set` value.
This is valid because races with the mutator for that write do not matter - the entire card will eventually be rescanned anyway, regardless of whether it ends up as dirty or to-collection-set. The advantage of marking to-collection-set cards specially is that the next time the card tables are swapped, the refinement threads will not re-refine them, on the assumption that the reference to the collection set will not change. This decreases refinement work substantially. If refinement gets interrupted by GC, the refinement table will be merged with the card table before card scanning, which works as before.

New barrier pseudo-code for an assignment `x.a = y`:

// Filtering
if (region(@x.a) == region(y)) goto done; // same region check
if (y == null) goto done; // null value check
if (card(@x.a) != clean_card) goto done; // skip already non-clean cards

*card(@x.a) = dirty

This is basically the Serial/Parallel GC barrier with additional filters to keep the number of dirty cards as small as possible.

A few more comments about the barrier:
* the barrier now loads the card table base offset from a thread local instead of inlining it. This is necessary for this mechanism to work, as the card table to dirty changes over time; it may even be faster on some architectures (code size), and some architectures already do this.
* all existing pre-filters were kept. Benchmarks showed some significant regressions with respect to pause times and even throughput compared to G1 in master. Using the Parallel GC barrier (just the dirty card write) would be possible, and further investigation on stripping parts will be made as a follow-up.
* the final check tests for non-clean cards to avoid overwriting existing cards, in particular the "to-collection-set" cards described above.

Current G1 marks the cards corresponding to young gen regions as all "young" so that the original barrier could potentially avoid the `StoreLoad`.
This implementation removes this facility (which might be re-introduced later), but measurements showed that pre-dirtying the young generation regions' cards as "dirty" (G1 does not need to use an extra "young" value) did not yield any measurable performance difference.

### Refinement process

The goal of the refinement (threads) is to make sure that the number of cards to scan in the garbage collection stays below a particular threshold. The prototype changes the refinement threads into a single control thread and a set of (refinement) worker threads. Unlike in the previous implementation, the control thread does not do any refinement, but only executes the heuristics to start a calculated number of worker threads and to track refinement progress.

The refinement trigger is based on the currently known number of pending (i.e. dirty) cards on the card table and a pending card generation rate, fairly similar to the previous algorithm. After the refinement control thread determines that it is time to do refinement, it starts the following sequence:

1) **Swap the card table**. This consists of several steps:
    1) **Swap the global card table** - the global card table pointer is swapped; newly created threads and runtime calls will eventually use the new values, at the latest after the next two steps.
    2) **Update the pointers in all JavaThreads'** TLS storage to the new card table pointer using a handshake operation.
    3) **Update the pointers in the GC threads'** TLS storage to the new card table pointer using the SuspendibleThreadSet mechanism.
2) **Snapshot the heap** - determine the extent of work needed for all regions where the refinement threads need to do some work on the refinement table (the previous card table). The snapshot stores the work progress for each region so that work can be interrupted and continued at any time.
This work either consists of refinement of the particular card (old generation regions) or clearing the cards (next collection set/young generation regions).
3) **Sweep the refinement table** by activating the refinement worker threads. The threads refine dirty cards using the heap snapshot, where worker threads claim parts of regions to process.
    * Cards with references to the young generation are not added to the young generation's card-based remembered set. Instead these cards are marked as to-collection-set in the card table and any remaining refinement of that card is skipped.
    * If refinement encounters a card that is already marked as to-collection-set, it is not refined and re-marked as to-collection-set on the card table.
    * During refinement, the refinement table is also cleared (in bulk for collection set regions as they do not need any refinement, and in other regions as they are refined for the non-clean cards).
    * Dirty cards within unparsable heap areas are forwarded to/redirtied on the card table as-is.
4) **Completion work**, mostly statistics.

If the work is interrupted by a non-garbage collection synchronization point, work is suspended temporarily and resumed later using the heap snapshot. After the refinement process the refinement table is all-clean again and ready to be swapped again.

### Garbage collection pause changes

Since a garbage collection (young or full gc) pause may occur at any point during the refinement process, the garbage collection needs some compensating work for the not yet swept parts of the refinement table. Note that this situation is very rare, as the heuristics try to avoid it, so in most cases nothing needs to be done as the refinement table is all clean. If this happens, young collections add a new phase called `Merge Refinement Table` in the garbage collection pause right before the `Merge Heap Roots` phase.
This compensating phase does the following:

0) (Optional) Snapshot the heap if not done yet (if the process has been interrupted between steps 1 and 3 of the refinement process)
1) Merge the refinement table into the card table - in this step the dirty cards of interesting regions are
2) Completion work (statistics)

If a full collection interrupts concurrent refinement, the refinement table is simply cleared and all dirty cards thrown away.

A garbage collection generates new cards (e.g. references from promoted objects into the young generation) on the refinement table. This acts similarly to the extra DCQS used in the previous implementation to record these interesting references/cards and redirty the card table using them. G1 swaps the card tables at the end of the collection to keep the post-condition of the refinement table being all clean (and any to-be-refined cards on the card table) at the end of garbage collection.

### Performance metrics

Following is an overview of the changes in behavior. Some numbers are provided in the CR in the first comment.

#### Native memory usage

The refinement table takes an additional 0.2% of the Java heap size of native memory compared to JDK 21 and above (in JDK 21 we removed one card table sized data structure, so this is a non-issue when updating from before). Some of that additional memory usage is automatically reclaimed by removing the dirty card queues. Additional memory is reclaimed by managing the cards containing to-collection-set references on the card table, by dropping the explicit remembered sets for the young generation completely as well as any remembered set entries which would otherwise be duplicated into the other regions' remembered sets. In some applications/benchmarks these gains completely offset the additional card table; however, most of the time this is not the case, particularly for throughput applications currently.
It is possible to allocate the refinement table lazily, which means that since these applications often do not need any concurrent refinement, there is no overhead at all but actually a net reduction of native memory usage. This is not implemented in this prototype.

#### Latency ("Pause times")

Not affected or slightly better. Pause times decrease due to a shorter "Merge remembered sets" phase, since no work is required for the remembered sets of the young generation - they are always already on the card table! However, merging of the refinement table into the card table is extremely fast and in my measurements always faster than merging remembered sets for the young gen. Since this work is linearly scanning some memory, it is embarrassingly parallel too.

The cards created during garbage collection do not need to be redirtied, so that phase has also been removed.

The card table swap is based on predictions for mutator card dirtying rate and refinement rate as before, and the policy is actually fairly similar to before. It is still rather aggressive, but in most cases takes less cpu resources than the previous one, mostly because refining takes less cpu time. Many applications do not do any refinement at all, as before. More investigation could be done to improve this in the future.

#### Throughput

This change always increases throughput in my measurements; depending on benchmark/application it may not actually show up in scores, though. Due to the pre-barrier and the additional filters in the barrier, G1 is still slower than Parallel on raw throughput benchmarks, but is typically somewhere half-way to Parallel GC or closer.

### Platform support

Since the post write barrier changed, additional work for some platforms is required to allow this change to proceed. At this time all work for all platforms is done, but needs testing:
- GraalVM (contributed by the GraalVM team)
- S390 (contributed by A. Kumar from IBM)
- PPC (contributed by M. Doerr, from SAP)
- ARM (should work, HelloWorld compiles and runs)
- RISCV (should work, HelloWorld compiles and runs)
- x86 (should work, build/HelloWorld compiles and runs)

None of the above mentioned platforms implement the barrier method to write cards for a reference array (aarch64 and x64 are fully implemented); they call the runtime as before. I believe it is doable fairly easily now with this simplified barrier for some extra performance, but not necessary.

### Alternatives

The JEP text extensively discusses alternatives.

### Reviewing

The change can be roughly divided into these fairly isolated parts:
* platform specific changes to the barrier
* refinement and refinement control thread changes; this is best reviewed starting from the `G1ConcurrentRefineThread::run_service` method
* changes to garbage collection: `merge_refinement_table()` in `g1RemSet.cpp`
* policy modifications, typically related to code around the calls to `G1Policy::record_dirtying_stats`

Further information is available in the [JEP draft](https://bugs.openjdk.org/browse/JDK-8340827); there is also a bit more extensive discussion of the change on my [blog](https://tschatzl.github.io/2025/02/21/new-write-barriers.html).

Some additional comments:
* the pre-marking of young generation cards has been removed. Benchmarks did not show any significant difference either way. To me this makes some sense, because the entire young gen will quickly get marked anyway. I.e. one only saves a single additional card table write (for every card). With the old barrier the cost of a card table mark was much higher.
* G1 sets `UseCondCardMark` to true by default. The conditional card mark corresponds to the third filter in the write barrier now, and since I decided to keep all filters for this change, it makes sense to directly use this mechanism.

If there are any questions, feel free to ask.
Testing: tier1-7 (multiple tier1-7, tier1-8 with slightly older versions) Thanks, Thomas ------------- Commit messages: - * only provide byte map base for JavaThreads - * mdoerr review: fix comments in ppc code - * fix crash when writing dirty cards for memory regions during card table switching - * remove mention of "enqueue" or "enqueuing" for actions related to post barrier - * remove some commented out debug code - Card table as DCQ Changes: https://git.openjdk.org/jdk/pull/23739/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23739&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8342382 Stats: 6543 lines in 103 files changed: 2162 ins; 3461 del; 920 mod Patch: https://git.openjdk.org/jdk/pull/23739.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23739/head:pull/23739 PR: https://git.openjdk.org/jdk/pull/23739 From mdoerr at openjdk.org Tue Feb 25 15:04:29 2025 From: mdoerr at openjdk.org (Martin Doerr) Date: Tue, 25 Feb 2025 15:04:29 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier In-Reply-To: References: Message-ID: On Sun, 23 Feb 2025 18:53:33 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. > > The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. > > ### Current situation > > With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. 
> > The main reason for the current barrier is how g1 implements concurrent refinement: > * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. > * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, > * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. > > These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: > > > // Filtering > if (region(@x.a) == region(y)) goto done; // same region check > if (y == null) goto done; // null value check > if (card(@x.a) == young_card) goto done; // write to young gen check > StoreLoad; // synchronize > if (card(@x.a) == dirty_card) goto done; > > *card(@x.a) = dirty > > // Card tracking > enqueue(card-address(@x.a)) into thread-local-dcq; > if (thread-local-dcq is not full) goto done; > > call runtime to move thread-local-dcq into dcqs > > done: > > > Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. > > The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. > > There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). > > The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching card tables. Mutators only work on the "primary" card table, refinement threads on a se... PPC64 code looks great! 
Thanks for doing this! Only some comments are no longer correct. src/hotspot/cpu/ppc/gc/g1/g1BarrierSetAssembler_ppc.cpp line 244: > 242: > 243: __ xorr(R0, store_addr, new_val); // tmp1 := store address ^ new value > 244: __ srdi_(R0, R0, G1HeapRegion::LogOfHRGrainBytes); // tmp1 := ((store address ^ new value) >> LogOfHRGrainBytes) Comment: R0 is used instead of tmp1 src/hotspot/cpu/ppc/gc/g1/g1BarrierSetAssembler_ppc.cpp line 259: > 257: > 258: __ ld(tmp1, G1ThreadLocalData::card_table_base_offset(), thread); > 259: __ srdi(tmp2, store_addr, CardTable::card_shift()); // tmp1 := card address relative to card table base Comment: tmp2 is used, here src/hotspot/cpu/ppc/gc/g1/g1BarrierSetAssembler_ppc.cpp line 261: > 259: __ srdi(tmp2, store_addr, CardTable::card_shift()); // tmp1 := card address relative to card table base > 260: if (UseCondCardMark) { > 261: __ lbzx(R0, tmp1, tmp2); // tmp1 := card address Can you remove the comment, please? It's wrong. ------------- PR Review: https://git.openjdk.org/jdk/pull/23739#pullrequestreview-2637143540 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1967669777 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1967670850 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1967671593 From duke at openjdk.org Tue Feb 25 15:04:29 2025 From: duke at openjdk.org (Piotr Tarsa) Date: Tue, 25 Feb 2025 15:04:29 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier In-Reply-To: References: Message-ID: On Sun, 23 Feb 2025 18:53:33 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. > > The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. 
> > ### Current situation > > With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. > > The main reason for the current barrier is how g1 implements concurrent refinement: > * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. > * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, > * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. > > These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: > > > // Filtering > if (region(@x.a) == region(y)) goto done; // same region check > if (y == null) goto done; // null value check > if (card(@x.a) == young_card) goto done; // write to young gen check > StoreLoad; // synchronize > if (card(@x.a) == dirty_card) goto done; > > *card(@x.a) = dirty > > // Card tracking > enqueue(card-address(@x.a)) into thread-local-dcq; > if (thread-local-dcq is not full) goto done; > > call runtime to move thread-local-dcq into dcqs > > done: > > > Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. > > The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. > > There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). 
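A minimal, compilable model of the filtering sequence in the pseudocode quoted above; the region size, card values and flat card table here are assumptions for illustration, not the actual HotSpot implementation:

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Model of the G1 post-write barrier filters described in the quoted PR text.
// All constants and names are illustrative assumptions; HotSpot emits the real
// barrier as compiled code, not as a C++ function.
constexpr uintptr_t kLogRegionBytes = 21;  // assume 2 MiB heap regions
constexpr uintptr_t kCardShift      = 9;   // 512-byte cards
constexpr uint8_t kDirtyCard = 0, kYoungCard = 1, kCleanCard = 0xff;

static uint8_t card_table[1u << 16];       // fake flat card table

inline uint8_t* card_for(uintptr_t addr) {
  return &card_table[(addr >> kCardShift) % sizeof(card_table)];
}

// Returns true iff the card was dirtied, i.e. no filter applied.
bool post_write_barrier(uintptr_t field_addr, uintptr_t new_val) {
  if (new_val == 0) return false;                                      // null value check
  if (((field_addr ^ new_val) >> kLogRegionBytes) == 0) return false;  // same region check
  uint8_t* card = card_for(field_addr);
  if (*card == kYoungCard) return false;                               // write to young gen
  // The real barrier issues a StoreLoad fence here, then:
  if (*card == kDirtyCard) return false;                               // already dirty
  *card = kDirtyCard;
  // The current barrier would now enqueue the card address into a DCQ;
  // the proposed barrier stops here, much like Parallel GC's.
  return true;
}
```

Counting how often each filter fires on a workload is a cheap way to see why the same-region and already-dirty checks, and the removal of the enqueue, pay off.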
> > The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching card tables. Mutators only work on the "primary" card table, refinement threads on a se... in this PR you've written: if (region(@x.a) != region(y)) goto done; // same region check but on https://tschatzl.github.io/2025/02/21/new-write-barriers.html you wrote: (1) if (region(x.a) == region(y)) goto done; // Ignore references within the same region/area I guess the second one is correct. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23739#issuecomment-2677075290 From stuefe at openjdk.org Tue Feb 25 15:04:29 2025 From: stuefe at openjdk.org (Thomas Stuefe) Date: Tue, 25 Feb 2025 15:04:29 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier In-Reply-To: References: Message-ID: On Sun, 23 Feb 2025 18:53:33 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. > > The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. > > ### Current situation > > With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. > > The main reason for the current barrier is how g1 implements concurrent refinement: > * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations.
> * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, > * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. > > These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: > > > // Filtering > if (region(@x.a) == region(y)) goto done; // same region check > if (y == null) goto done; // null value check > if (card(@x.a) == young_card) goto done; // write to young gen check > StoreLoad; // synchronize > if (card(@x.a) == dirty_card) goto done; > > *card(@x.a) = dirty > > // Card tracking > enqueue(card-address(@x.a)) into thread-local-dcq; > if (thread-local-dcq is not full) goto done; > > call runtime to move thread-local-dcq into dcqs > > done: > > > Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. > > The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. > > There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). > > The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching card tables. Mutators only work on the "primary" card table, refinement threads on a se... @tschatzl I did not contribute the ppc port. Did you mean @TheRealMDoerr or @reinrich ? 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23739#issuecomment-2677512780 From tschatzl at openjdk.org Tue Feb 25 15:13:43 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 25 Feb 2025 15:13:43 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v2] In-Reply-To: References: Message-ID: > Hi all, > > please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. > > The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. > > ### Current situation > > With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. > > The main reason for the current barrier is how g1 implements concurrent refinement: > * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. > * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, > * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. 
> > These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: > > > // Filtering > if (region(@x.a) == region(y)) goto done; // same region check > if (y == null) goto done; // null value check > if (card(@x.a) == young_card) goto done; // write to young gen check > StoreLoad; // synchronize > if (card(@x.a) == dirty_card) goto done; > > *card(@x.a) = dirty > > // Card tracking > enqueue(card-address(@x.a)) into thread-local-dcq; > if (thread-local-dcq is not full) goto done; > > call runtime to move thread-local-dcq into dcqs > > done: > > > Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. > > The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. > > There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). > > The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching card tables. Mutators only work on the "primary" card table, refinement threads on a se... 
Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: * remove unnecessarily added logging ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23739/files - new: https://git.openjdk.org/jdk/pull/23739/files/0100d8e2..9ef9c5f4 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23739&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23739&range=00-01 Stats: 4 lines in 4 files changed: 0 ins; 1 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/23739.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23739/head:pull/23739 PR: https://git.openjdk.org/jdk/pull/23739 From tschatzl at openjdk.org Tue Feb 25 16:46:33 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 25 Feb 2025 16:46:33 GMT Subject: RFR: 8349906: G1: Improve initial survivor rate for newly used young regions [v2] In-Reply-To: References: Message-ID: > Hi all, > > please review this change that tries to improve the survivor rate initial values for newly expanded regions. > > Currently G1 uses `InitialSurvivorRate` as survivor rate for such regions, but it is typically a pretty bad choice because > > * it's rather conservative, estimating that 40% of region contents will survive > * such a conservative value is kind of bad particularly in cases for regions that are expanded late in the mutator phase because they are not frequently updated (and with our running weights changes get propagated over a very long time), i.e. this 40% sticks for a long time (*) > * it is a random value, i.e. not particularly specific to the application. > > The suggestion is to use the survivor rate for the last region we know the survivor rate already. > > (*) to clarify this a little: G1 keeps track of `[0...m]` survivor rate predictors. For a given garbage collection, `[0...n]` of those are updated (`n` is the number of eden/survivor regions depending on the rate group). 
However those for `]n...m]` are not, particularly those in that range that are seldom allocated, the predictors are not updated very frequently. Now the young gen sizing uses these predictions "at the end" of the predictor anyway, and since they are infrequently updated and their values are very conservative, G1 won't expand young gen as much as it could/should. > > Testing: gha, tier1-7 (with other changes) > > Hth, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: * kbarrett review: do not change the type of loop variable * ayang review: use actual last value instead of prediction for newly allocated survivor rate groups ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23584/files - new: https://git.openjdk.org/jdk/pull/23584/files/5c4ded01..a09bc25e Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23584&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23584&range=00-01 Stats: 10 lines in 1 file changed: 7 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/23584.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23584/head:pull/23584 PR: https://git.openjdk.org/jdk/pull/23584 From shade at openjdk.org Tue Feb 25 20:13:01 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Tue, 25 Feb 2025 20:13:01 GMT Subject: RFR: 8348278: Trim InitialRAMPercentage to improve startup in default modes In-Reply-To: References: Message-ID: On Thu, 23 Jan 2025 11:27:46 GMT, Aleksey Shipilev wrote: > See bug for discussion. This is the code change, which is simple. What is not simple is deciding what the new value should be. The change would probably require CSR, which I can file after we agree on the value. 
> > I think cutting to 0.2% of RAM size gets us into good sweet spot: > - On huge 1024G machine, this yields 2G initial heap > - On reasonably sized 128G machine, this gives 256M initial heap > - On smaller 1G container, this gives 2M initial heap > > Additional testing: > - [x] Linux AArch64 server fastdebug, `all` Not now, bot. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23262#issuecomment-2683175870 From wkemper at openjdk.org Tue Feb 25 22:06:19 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 25 Feb 2025 22:06:19 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v14] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). > > ## Testing > * jtreg hotspot_gc_shenandoah > * dacapo, extremem, diluvian, specjbb2015, specjvm2018, heapothesys William Kemper has updated the pull request with a new target base due to a merge or a rebase. 
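The three 0.2%-of-RAM sizing examples quoted just above are all consistent with taking RAM/512 (just under 0.2%); a quick arithmetic check, not the JVM's actual sizing code:

```cpp
#include <cassert>
#include <cstdint>

// Sanity-check the quoted InitialRAMPercentage examples. RAM/512 (~0.195%)
// reproduces all three figures exactly; this is plain arithmetic only.
constexpr uint64_t M = 1024ull * 1024;
constexpr uint64_t G = 1024 * M;

constexpr uint64_t approx_initial_heap(uint64_t ram_bytes) {
  return ram_bytes / 512;
}
```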
The pull request now contains 32 commits: - Merge tag 'jdk-25+11' into fix-control-regulator-threads Added tag jdk-25+11 for changeset 0131c1bf - Address review feedback (better comments, better names) - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads - Old gen bootstrap cycle must make it to init mark - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads - Improve message for assertion - Make shutdown safer for threads requesting (or expecting) gc - Do not accept requests if control thread is terminating - Notify waiters when control thread terminates - Add event for control thread state changes - ... and 22 more: https://git.openjdk.org/jdk/compare/0131c1bf...d7858deb ------------- Changes: https://git.openjdk.org/jdk/pull/23475/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=13 Stats: 927 lines in 18 files changed: 302 ins; 291 del; 334 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From xpeng at openjdk.org Tue Feb 25 22:47:16 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Tue, 25 Feb 2025 22:47:16 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings Message-ID: The change is to improve the observability of Shenandoah GC, basically there are two changes for Shenandoah GC timings in this PR: 1. Net GC pause timings include the time to propagate GC state to Java threads 2. 
Add new timing "Propagate GC state" in Shenandoah GC timing logs With the change, the new GC timing log will be like: [11.056s][info][gc,stats ] Concurrent Reset 89 us [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us [11.056s][info][gc,stats ] Update Region States 3 us [11.056s][info][gc,stats ] Propagate GC state 1 us [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x [11.056s][info][gc,stats ] CMR: 456 us [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x [11.057s][info][gc,stats ] CM: 3043 us [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, [11.057s][info][gc,stats ] Flush SATB 204 us [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x [11.057s][info][gc,stats ] Propagate GC state 2 us [11.057s][info][gc,stats ] Update Region States 12 us [11.057s][info][gc,stats ] Choose Collection Set 25 us [11.057s][info][gc,stats ] Rebuild Free Set 29 us [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x [11.057s][info][gc,stats ] CWRF: 17 us [11.057s][info][gc,stats ] CWRF: Weak References 17 us, workers (us): 15, 1, 0, ---, ---, ---, [11.057s][info][gc,stats ] Concurrent Weak Roots 413 us [11.057s][info][gc,stats ] Roots 203 us, parallelism: 1.95x [11.057s][info][gc,stats ] CWR: 396 us [11.057s][info][gc,stats ] CWR: Code Cache Roots 295 us, workers (us): 90, 96, 109, ---, ---, ---, [11.057s][info][gc,stats ] CWR: VM Weak Roots 100 us, workers (us): 48, 37, 14, ---, ---, ---, 
[11.057s][info][gc,stats ] CWR: CLDG Roots 2 us, workers (us): ---, ---, 2, ---, ---, ---, [11.058s][info][gc,stats ] Rendezvous 197 us [11.058s][info][gc,stats ] Concurrent Cleanup 35 us [11.058s][info][gc,stats ] Concurrent Class Unloading 486 us [11.058s][info][gc,stats ] Unlink Stale 398 us [11.058s][info][gc,stats ] System Dictionary 5 us [11.058s][info][gc,stats ] Weak Class Links 0 us [11.058s][info][gc,stats ] Code Roots 391 us [11.058s][info][gc,stats ] Rendezvous 69 us [11.058s][info][gc,stats ] Purge Unlinked 4 us [11.058s][info][gc,stats ] Code Roots 0 us [11.058s][info][gc,stats ] CLDG 3 us [11.058s][info][gc,stats ] Pause Final Roots (G) 272 us [11.058s][info][gc,stats ] Pause Final Roots (N) 18 us [11.058s][info][gc,stats ] Propagate GC state 3 us ### Test - [x] make test TEST=hotspot_gc_shenandoah ------------- Commit messages: - Log time to propagate GC state - 8350314: Shenandoah: Capture thread state sync times in GC timings Changes: https://git.openjdk.org/jdk/pull/23759/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23759&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8350314 Stats: 47 lines in 5 files changed: 40 ins; 7 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23759.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23759/head:pull/23759 PR: https://git.openjdk.org/jdk/pull/23759 From shade at openjdk.org Tue Feb 25 22:47:16 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Tue, 25 Feb 2025 22:47:16 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings In-Reply-To: References: Message-ID: On Tue, 25 Feb 2025 01:20:35 GMT, Xiaolong Peng wrote: > The change is to improve the observability of Shenandoah GC, basically there are two changes for Shenandoah GC timings in this PR: > > 1. Net GC pause timings include the time to propagate GC state to Java threads > 2. 
Add new timing "Propagate GC state" in Shenandoah GC timing logs > > With the change, the new GC timing log will be like: > > [11.056s][info][gc,stats ] Concurrent Reset 89 us > [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us > [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us > [11.056s][info][gc,stats ] Update Region States 3 us > [11.056s][info][gc,stats ] Propagate GC state 1 us > [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x > [11.056s][info][gc,stats ] CMR: 456 us > [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, > [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x > [11.057s][info][gc,stats ] CM: 3043 us > [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, > [11.057s][info][gc,stats ] Flush SATB 204 us > [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us > [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us > [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x > [11.057s][info][gc,stats ] Propagate GC state 2 us > [11.057s][info][gc,stats ] Update Region States 12 us > [11.057s][info][gc,stats ] Choose Collection Set 25 us > [11.057s][info][gc,stats ] Rebuild Free Set 29 us > [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x > [11.057s][info][gc,stats ] CWRF: 17 us > [11.057s][info][gc,stats ] CWRF: Weak References 17 us, workers (us): 15, 1, 0, ---, ---, ---, > [11.057s][info][gc,stats ] Concurrent Weak Roots 413 us > [11.057s][info][gc,stats ... 
src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 308: > 306: > 307: op_init_mark(); > 308: ShenandoahHeap::heap()->propagate_gc_state_to_all_threads(); I would say move it downwards into `op_init_mark` and wrap with its own subtimer, e.g. with `ShenandoahGCPhase phase(ShenandoahPhaseTimings::init_propagate_gcstate);`. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1969328009 From wkemper at openjdk.org Tue Feb 25 23:03:53 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 25 Feb 2025 23:03:53 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings In-Reply-To: References: Message-ID: On Tue, 25 Feb 2025 01:20:35 GMT, Xiaolong Peng wrote: > The change is to improve the observability of Shenandoah GC, basically there are two changes for Shenandoah GC timings in this PR: > > 1. Net GC pause timings include the time to propagate GC state to Java threads > 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs > > With the change, the new GC timing log will be like: > > [11.056s][info][gc,stats ] Concurrent Reset 89 us > [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us > [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us > [11.056s][info][gc,stats ] Update Region States 3 us > [11.056s][info][gc,stats ] Propagate GC state 1 us > [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x > [11.056s][info][gc,stats ] CMR: 456 us > [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, > [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x > [11.057s][info][gc,stats ] CM: 3043 us > [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, > 
[11.057s][info][gc,stats ] Flush SATB 204 us > [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us > [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us > [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x > [11.057s][info][gc,stats ] Propagate GC state 2 us > [11.057s][info][gc,stats ] Update Region States 12 us > [11.057s][info][gc,stats ] Choose Collection Set 25 us > [11.057s][info][gc,stats ] Rebuild Free Set 29 us > [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x > [11.057s][info][gc,stats ] CWRF: 17 us > [11.057s][info][gc,stats ] CWRF: Weak References 17 us, workers (us): 15, 1, 0, ---, ---, ---, > [11.057s][info][gc,stats ] Concurrent Weak Roots 413 us > [11.057s][info][gc,stats ... Did we really see `propagate_gc_state_to_all_threads` taking a long time? Or was it exiting the safepoint (i.e., after the state had been propagated) that took a long time? src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 694: > 692: { > 693: ShenandoahGCPhase phase(ShenandoahPhaseTimings::init_propagate_gc_state); > 694: ShenandoahHeap::heap()->propagate_gc_state_to_all_threads(); A nit, but can we use the `heap` variable that is already in scope for these `propagate_gc_state_to_all_threads` calls? ------------- Changes requested by wkemper (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23759#pullrequestreview-2642654296 PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1970660730 From xpeng at openjdk.org Tue Feb 25 23:22:56 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Tue, 25 Feb 2025 23:22:56 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings In-Reply-To: References: Message-ID: On Tue, 25 Feb 2025 23:01:24 GMT, William Kemper wrote: > Did we really see `propagate_gc_state_to_all_threads` taking a long time? Or was it exiting the safepoint (i.e., after the state had been propagated) that took a long time? 
No, we discussed it last week in the Slack channel: `propagate_gc_state_to_all_threads` usually takes less than 10 ns for ~1k threads in our tests, so it is not a problem. It was the `futex` call when exiting the safepoint that took a long time. We have a [fix](https://bugs.openjdk.org/browse/JDK-8350285) in ShenandoahLock as a mitigation, since the change we made last year affects the scheduler; meanwhile Aleksey is working on https://bugs.openjdk.org/browse/JDK-8350324, which should improve the time to leave a safepoint and has much broader impact. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23759#issuecomment-2683510642 From xpeng at openjdk.org Tue Feb 25 23:27:22 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Tue, 25 Feb 2025 23:27:22 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v2] In-Reply-To: References: Message-ID: > The change is to improve the observability of Shenandoah GC, basically there are two changes for Shenandoah GC timings in this PR: > > 1. Net GC pause timings include the time to propagate GC state to Java threads > 2.
Add new timing "Propagate GC state" in Shenandoah GC timing logs > > With the change, the new GC timing log will be like: > > [11.056s][info][gc,stats ] Concurrent Reset 89 us > [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us > [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us > [11.056s][info][gc,stats ] Update Region States 3 us > [11.056s][info][gc,stats ] Propagate GC state 1 us > [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x > [11.056s][info][gc,stats ] CMR: 456 us > [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, > [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x > [11.057s][info][gc,stats ] CM: 3043 us > [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, > [11.057s][info][gc,stats ] Flush SATB 204 us > [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us > [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us > [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x > [11.057s][info][gc,stats ] Propagate GC state 2 us > [11.057s][info][gc,stats ] Update Region States 12 us > [11.057s][info][gc,stats ] Choose Collection Set 25 us > [11.057s][info][gc,stats ] Rebuild Free Set 29 us > [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x > [11.057s][info][gc,stats ] CWRF: 17 us > [11.057s][info][gc,stats ] CWRF: Weak References 17 us, workers (us): 15, 1, 0, ---, ---, ---, > [11.057s][info][gc,stats ] Concurrent Weak Roots 413 us > [11.057s][info][gc,stats ... 
Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: Address review comments ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23759/files - new: https://git.openjdk.org/jdk/pull/23759/files/4940e451..cad4a434 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23759&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23759&range=00-01 Stats: 7 lines in 3 files changed: 0 ins; 0 del; 7 mod Patch: https://git.openjdk.org/jdk/pull/23759.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23759/head:pull/23759 PR: https://git.openjdk.org/jdk/pull/23759 From xpeng at openjdk.org Tue Feb 25 23:27:22 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Tue, 25 Feb 2025 23:27:22 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v2] In-Reply-To: References: Message-ID: On Tue, 25 Feb 2025 23:00:28 GMT, William Kemper wrote: >> Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: >> >> Address review comments > > src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 694: > >> 692: { >> 693: ShenandoahGCPhase phase(ShenandoahPhaseTimings::init_propagate_gc_state); >> 694: ShenandoahHeap::heap()->propagate_gc_state_to_all_threads(); > > A nit, but can we use the `heap` variable that is already in scope for these `propagate_gc_state_to_all_threads` calls? Thanks, I have updated the PR to use `heap` variable in scope. 
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1970679549 From wkemper at openjdk.org Tue Feb 25 23:49:53 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 25 Feb 2025 23:49:53 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v2] In-Reply-To: References: Message-ID: On Tue, 25 Feb 2025 23:27:22 GMT, Xiaolong Peng wrote: >> The change is to improve the observability of Shenandoah GC, basically there are two changes for Shenandoah GC timings in this PR: >> >> 1. Net GC pause timings include the time to propagate GC state to Java threads >> 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs >> >> With the change, the new GC timing log will be like: >> >> [11.056s][info][gc,stats ] Concurrent Reset 89 us >> [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us >> [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us >> [11.056s][info][gc,stats ] Update Region States 3 us >> [11.056s][info][gc,stats ] Propagate GC state 1 us >> [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x >> [11.056s][info][gc,stats ] CMR: 456 us >> [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, >> [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x >> [11.057s][info][gc,stats ] CM: 3043 us >> [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, >> [11.057s][info][gc,stats ] Flush SATB 204 us >> [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us >> [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us >> [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x >> [11.057s][info][gc,stats ] Propagate GC state 2 us >> [11.057s][info][gc,stats ] Update 
Region States 12 us >> [11.057s][info][gc,stats ] Choose Collection Set 25 us >> [11.057s][info][gc,stats ] Rebuild Free Set 29 us >> [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x >> [11.057s][info][gc,stats ] CWRF: 17 us >> [11.057s][info][gc,stats ] CWRF: Weak References 17 us, workers (us): 15, 1, 0, ---, ---, ---, >> [11.057s][info][gc,stats ] Concu... > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Address review comments Propagating gc state for init update refs is vestigial. src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 1081: > 1079: heap->pacer()->setup_for_update_refs(); > 1080: } > 1081: { This one isn't necessary. This safepoint only happens when the pacer or the verifier is enabled. We moved init update ref gc state propagation into a handshake (see `ShenandoahHeap::concurrent_prepare_for_update_refs`). Sorry, I should have caught this in my first review. ------------- Changes requested by wkemper (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23759#pullrequestreview-2642707574 PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1970695230 From wkemper at openjdk.org Tue Feb 25 23:49:54 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 25 Feb 2025 23:49:54 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v2] In-Reply-To: References: Message-ID: On Tue, 25 Feb 2025 23:45:21 GMT, William Kemper wrote: >> Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: >> >> Address review comments > > src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 1081: > >> 1079: heap->pacer()->setup_for_update_refs(); >> 1080: } >> 1081: { > > This one isn't necessary. This safepoint only happens when the pacer or the verifier is enabled. 
We moved init update ref gc state propagation into a handshake (see `ShenandoahHeap::concurrent_prepare_for_update_refs`). Sorry, I should have caught this in my first review. I'm also currently working on removing the `final_roots` safepoint and fixing up the phase timings for the newish concurrent init update refs phase. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1970696214 From xpeng at openjdk.org Wed Feb 26 00:05:03 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 26 Feb 2025 00:05:03 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v2] In-Reply-To: References: Message-ID: On Tue, 25 Feb 2025 23:46:49 GMT, William Kemper wrote: >> src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 1081: >> >>> 1079: heap->pacer()->setup_for_update_refs(); >>> 1080: } >>> 1081: { >> >> This one isn't necessary. This safepoint only happens when the pacer or the verifier is enabled. We moved init update ref gc state propagation into a handshake (see `ShenandoahHeap::concurrent_prepare_for_update_refs`). Sorry, I should have caught this in my first review. > > I'm also currently working on removing the `final_roots` safepoint and fixing up the phase timings for the newish concurrent init update refs phase. Sorry, I should have read the code in `op_init_update_refs`; it doesn't change GC state, so we can remove the `propagate_gc_state_to_all_threads` call to clean up the code a little bit after you replaced the init_update_refs pause with a handshake.
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1970706937 From xpeng at openjdk.org Wed Feb 26 00:08:57 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 26 Feb 2025 00:08:57 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v2] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 00:02:45 GMT, Xiaolong Peng wrote: >> I'm also currently working on removing the `final_roots` safepoint and fixing up the phase timings for the newish concurrent init update refs phase. > > Sorry, I should have read the code in `op_init_update_refs`; it doesn't change GC state, so we can remove the call of `propagate_gc_state_to_all_threads` to clean up the code a little bit. It was missed when you replaced the init_update_refs pause with a handshake. For `final_roots`, I think I should leave it as it is in this PR; later you will remove the timings and gc state propagation into a handshake anyway.
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1970726624 From wkemper at openjdk.org Wed Feb 26 00:34:51 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 26 Feb 2025 00:34:51 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v3] In-Reply-To: References: Message-ID: <0kZCj6U-om8Gz_iMeXiQf1jSitVjhTvy2PYwLjllvGM=.5321e4f5-7e39-4d5f-adb4-4b41cf23dce8@github.com> On Wed, 26 Feb 2025 00:01:12 GMT, Xiaolong Peng wrote: >> The change is to improve the observability of Shenandoah GC, basically there are two changes for Shenandoah GC timings in this PR: >> >> 1. Net GC pause timings include the time to propagate GC state to Java threads >> 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs >> >> With the change, the new GC timing log will be like: >> >> [11.056s][info][gc,stats ] Concurrent Reset 89 us >> [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us >> [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us >> [11.056s][info][gc,stats ] Update Region States 3 us >> [11.056s][info][gc,stats ] Propagate GC state 1 us >> [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x >> [11.056s][info][gc,stats ] CMR: 456 us >> [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, >> [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x >> [11.057s][info][gc,stats ] CM: 3043 us >> [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, >> [11.057s][info][gc,stats ] Flush SATB 204 us >> [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us >> [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us >> [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x 
>> [11.057s][info][gc,stats ] Propagate GC state 2 us >> [11.057s][info][gc,stats ] Update Region States 12 us >> [11.057s][info][gc,stats ] Choose Collection Set 25 us >> [11.057s][info][gc,stats ] Rebuild Free Set 29 us >> [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x >> [11.057s][info][gc,stats ] CWRF: 17 us >> [11.057s][info][gc,stats ] CWRF: Weak References 17 us, workers (us): 15, 1, 0, ---, ---, ---, >> [11.057s][info][gc,stats ] Concu... > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Remove propagate_gc_state_to_all_threads call from op_init_update_refs Thank you for the changes. ------------- Marked as reviewed by wkemper (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23759#pullrequestreview-2642754796 From ysr at openjdk.org Wed Feb 26 01:32:03 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 26 Feb 2025 01:32:03 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v14] In-Reply-To: References: Message-ID: On Tue, 25 Feb 2025 22:06:19 GMT, William Kemper wrote: >> There are several changes to the operation of Shenandoah's control threads here. >> * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. 
>> * The cancellation handling is driven entirely by the cancellation cause >> * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed >> * The shutdown sequence is simpler >> * The generational control thread uses a lock to coordinate updates to the requested cause and generation >> * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance >> * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles >> * The control thread doesn't loop on its own (unless the pacer is enabled). >> >> ## Testing >> * jtreg hotspot_gc_shenandoah >> * dacapo, extremem, diluvian, specjbb2015, specjvm2018, heapothesys > > William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 32 commits: > > - Merge tag 'jdk-25+11' into fix-control-regulator-threads > > Added tag jdk-25+11 for changeset 0131c1bf > - Address review feedback (better comments, better names) > - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads > - Old gen bootstrap cycle must make it to init mark > - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads > - Improve message for assertion > - Make shutdown safer for threads requesting (or expecting) gc > - Do not accept requests if control thread is terminating > - Notify waiters when control thread terminates > - Add event for control thread state changes > - ... and 22 more: https://git.openjdk.org/jdk/compare/0131c1bf...d7858deb A few random comments, mostly on documentation, and a few assertion suggestions. Rest looks good. Thanks for tightening up the code via your changes in this PR. I don't think I should need to re-review any changes you make stemming from my (really quite minor) comments. Reviewed.
src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.cpp line 114: > 112: } > 113: > 114: void ShenandoahGenerationalControlThread::check_for_request(ShenandoahGCRequest& request) { Comment: // Fill in the cause, generation requested, and set gc_mode, all under the lock. // Make any relevant changes to the control state of heuristics or policy objects. Just as an aside, these internal work methods have cascading effects on a bunch of states, all covered by the control lock. It would be interesting to see which threads contend for this lock and whether there are circumstances under which the lock might become hot or contended. src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.cpp line 125: > 123: if (request.cause == GCCause::_shenandoah_concurrent_gc) { > 124: request.generation = _heap->young_generation(); > 125: _heap->clear_cancelled_gc(false); label name of parameter to the call to help the reader. `/* clear_oom_handler */` src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.cpp line 717: > 715: > 716: void ShenandoahGenerationalControlThread::notify_control_thread(MonitorLocker& ml, GCCause::Cause cause, ShenandoahGeneration* generation) { > 717: assert(_control_lock.is_locked(), "Request lock must be held here"); Somewhat paranoid suggestion: I'd use the stronger, if slightly more expensive, `owned_by_self()` rather than the weaker `is_locked()`. You can alternatively use `assert_lock_strong(...)`. src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.cpp line 720: > 718: log_debug(gc, thread)("Notify control (%s): %s, %s", gc_mode_name(gc_mode()), GCCause::to_string(cause), generation->name()); > 719: _requested_gc_cause = cause; > 720: _requested_generation = generation; This _may_ be a good spot to add any potential invariant (sanity) checks for the consistency of `cause` and `generation` to catch any potential issues.
For example, I might check I am not overwriting legit previous requests (?), and that any new request I am creating is sensible. Such assertion/invariant checks may be relegated into a work method to avoid cluttering this method. src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.cpp line 730: > 728: > 729: void ShenandoahGenerationalControlThread::notify_cancellation(MonitorLocker& ml, GCCause::Cause cause) { > 730: assert(_heap->cancelled_gc(), "GC should already be cancelled"); Not sure about this, but if cancellations can happen only when `request*` fields are clear (or maybe not?), then this would be a good place to do such invariant checks (much as the suggestion above in `notify_control_thread()`). src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.cpp line 795: > 793: if (_mode != new_mode) { > 794: log_debug(gc, thread)("Transition from: %s to: %s", gc_mode_name(_mode), gc_mode_name(new_mode)); > 795: EventMark event("Control thread transition from: %s, to %s", gc_mode_name(_mode), gc_mode_name(new_mode)); As in my suggestions above for `notify_control_thread` and `notify_cancellation`, this might be an opportune time to do any sanity/consistency checks for `gc_mode` updates (wrt the other two `_request*` fields). src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.hpp line 75: > 73: > 74: // The mode is read frequently by requesting threads and only ever written by the control thread. > 75: volatile GCMode _mode; A bit of a nit: Any reason not to just call it `_gc_mode` which seems now to be its _de facto_ name due to its accessor method name and how it's referred to in comments? src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.hpp line 132: > 130: > 131: void set_gc_mode(GCMode new_mode); > 132: void set_gc_mode(MonitorLocker& ml, GCMode new_mode); // Set gc mode under lock and post a notification. The second variant is called from // contexts where the lock is already held.
src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.hpp line 135: > 133: static const char* gc_mode_name(GCMode mode); > 134: > 135: // Takes the request lock and updates the requested cause and generation, then notifies the control thread. // The second variant is used from contexts where the lock is already held. src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.hpp line 139: > 137: void notify_control_thread(MonitorLocker& ml, GCCause::Cause cause, ShenandoahGeneration* generation); > 138: > 139: // Notifies the control thread, but does not update the requested cause or generation. ```// The second variant ...``` src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.hpp line 143: > 141: void notify_cancellation(MonitorLocker& ml, GCCause::Cause cause); > 142: > 143: void maybe_set_aging_cycle(); 1 line documentation comment for the private APIs. src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.hpp line 150: > 148: GCMode prepare_for_explicit_gc_request(ShenandoahGCRequest &request); > 149: > 150: GCMode prepare_for_concurrent_gc_request(ShenandoahGCRequest &request); Documentation of private APIs. Bit of a nit: These all seem to take a request type, fill in some yet unfilled fields, and return a GC mode. So they look to me like `prepare_request_for_` rather than `prepare_for__request`. src/hotspot/share/gc/shenandoah/shenandoahHeap.hpp line 443: > 441: > 442: public: > 443: // Returns true if and only if cancellation request was successfully communicated.
This comment should probably go at line 455, and here you could say ```// Has GC been cancelled?``` src/hotspot/share/gc/shenandoah/shenandoahHeap.hpp line 687: > 685: HeapWord* allocate_from_gclab_slow(Thread* thread, size_t size); > 686: HeapWord* allocate_new_gclab(size_t min_size, size_t word_size, size_t* actual_size); > 687: bool retry_allocation(size_t original_full_gc_count) const; It feels like this should be called `should_retry_allocation()`. Also a 1-line documentation comment: ``` // We want to retry an unsuccessful attempt at allocation until at least a full gc. ``` ------------- Marked as reviewed by ysr (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23475#pullrequestreview-2642645029 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970692572 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970689728 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970730527 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970747110 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970750529 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970752575 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970669392 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970673227 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970754002 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970754196 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970674398 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970678568 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970758140 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970761571 From ysr at openjdk.org Wed Feb 26 01:32:04 2025 From: ysr at openjdk.org (Y. 
Srinivas Ramakrishna) Date: Wed, 26 Feb 2025 01:32:04 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References: Message-ID: On Mon, 24 Feb 2025 17:50:38 GMT, William Kemper wrote: >> src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.hpp line 64: >> >>> 62: private: >>> 63: // This lock is used to coordinate setting the _requested_gc_cause, _requested generation >>> 64: // and _gc_mode. It is important that these be changed together and have a consistent view. >> >> In that case, for ease of maintenance, I'd move the declaration of all of the 3 data members that this lock protects next to this lock, either immediately preceding or immediately succeeding its declaration in the body of this class. >> >> Are these data members always both read and written under this lock? If so, then `_gc_mode` below doesn't need to be defined `volatile`. > > The `_gc_mode` is read without the lock by the regulator thread. However, the regulator thread does take the lock and reads `_gc_mode` again under the lock before making any state changes. Makes sense -- so the regulator uses a double-checked locking idiom. Maybe note that somewhere where you talk about this lock and the volatile _gc_mode. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970654476 From ysr at openjdk.org Wed Feb 26 01:32:05 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 26 Feb 2025 01:32:05 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References: Message-ID: On Sat, 15 Feb 2025 00:34:59 GMT, Y. Srinivas Ramakrishna wrote: >> William Kemper has updated the pull request with a new target base due to a merge or a rebase.
The pull request now contains 30 commits: >> >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Old gen bootstrap cycle must make it to init mark >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Improve message for assertion >> - Make shutdown safer for threads requesting (or expecting) gc >> - Do not accept requests if control thread is terminating >> - Notify waiters when control thread terminates >> - Add event for control thread state changes >> - Fix shutdown livelock error >> - Fix includes >> - ... and 20 more: https://git.openjdk.org/jdk/compare/ba6c9659...915ffbda > > src/hotspot/share/gc/shenandoah/shenandoahHeap.hpp line 450: > >> 448: >> 449: void cancel_concurrent_mark(); >> 450: bool cancel_gc(GCCause::Cause cause); > > // Returns true if and only if cancellation request was successfully communicated. See new comment at line 443 above. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970759318 From ysr at openjdk.org Wed Feb 26 01:32:06 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 26 Feb 2025 01:32:06 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v14] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 01:22:05 GMT, Y. Srinivas Ramakrishna wrote: >> William Kemper has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains 32 commits: >> >> - Merge tag 'jdk-25+11' into fix-control-regulator-threads >> >> Added tag jdk-25+11 for changeset 0131c1bf >> - Address review feedback (better comments, better names) >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Old gen bootstrap cycle must make it to init mark >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Improve message for assertion >> - Make shutdown safer for threads requesting (or expecting) gc >> - Do not accept requests if control thread is terminating >> - Notify waiters when control thread terminates >> - Add event for control thread state changes >> - ... and 22 more: https://git.openjdk.org/jdk/compare/0131c1bf...d7858deb > > src/hotspot/share/gc/shenandoah/shenandoahHeap.hpp line 687: > >> 685: HeapWord* allocate_from_gclab_slow(Thread* thread, size_t size); >> 686: HeapWord* allocate_new_gclab(size_t min_size, size_t word_size, size_t* actual_size); >> 687: bool retry_allocation(size_t original_full_gc_count) const; > > It feels like this should be called `should_retry_allocation()`. Also a 1-line documentation comment: ``` // We want to retry an unsuccessful attempt at allocation until at least a full gc. ``` PS: do we know that a full gc will reclaim all soft refs? I suppose that is Shenandoah policy? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1970762980 From xpeng at openjdk.org Wed Feb 26 02:17:36 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 26 Feb 2025 02:17:36 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v4] In-Reply-To: References: Message-ID: > The change is to improve the observability of Shenandoah GC, basically there are three changes for Shenandoah GC timings in this PR: > > 1. Net GC pause timings include the time to propagate GC state to Java threads > 2. 
Add new timing "Propagate GC state" in Shenandoah GC timing logs > 3. Removal of the call of `propagate_gc_state_to_all_threads` from "init_update_refs", which handles gc state in handshake already. > > With the change, the new GC timing log will be like: > > [11.056s][info][gc,stats ] Concurrent Reset 89 us > [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us > [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us > [11.056s][info][gc,stats ] Update Region States 3 us > [11.056s][info][gc,stats ] Propagate GC state 1 us > [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x > [11.056s][info][gc,stats ] CMR: 456 us > [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, > [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x > [11.057s][info][gc,stats ] CM: 3043 us > [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, > [11.057s][info][gc,stats ] Flush SATB 204 us > [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us > [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us > [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x > [11.057s][info][gc,stats ] Propagate GC state 2 us > [11.057s][info][gc,stats ] Update Region States 12 us > [11.057s][info][gc,stats ] Choose Collection Set 25 us > [11.057s][info][gc,stats ] Rebuild Free Set 29 us > [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x > [11.057s][info][gc,stats ] CWRF: 17 us > [11.057s][info][gc,stats ] CWRF: Weak References 17 us, workers (... 
Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: Set _gc_state_changed to false at the end of concurrent_prepare_for_update_refs ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23759/files - new: https://git.openjdk.org/jdk/pull/23759/files/2e7b2cea..54ae1c0e Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23759&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23759&range=02-03 Stats: 1 line in 1 file changed: 1 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23759.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23759/head:pull/23759 PR: https://git.openjdk.org/jdk/pull/23759 From ysr at openjdk.org Wed Feb 26 02:17:36 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 26 Feb 2025 02:17:36 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v4] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 02:14:22 GMT, Xiaolong Peng wrote: >> The change is to improve the observability of Shenandoah GC, basically there are three changes for Shenandoah GC timings in this PR: >> >> 1. Net GC pause timings include the time to propagate GC state to Java threads >> 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs >> 3. Removal of the call of `propagate_gc_state_to_all_threads` from "init_update_refs", which handles gc state in handshake already. 
>> >> With the change, the new GC timing log will be like: >> >> [11.056s][info][gc,stats ] Concurrent Reset 89 us >> [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us >> [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us >> [11.056s][info][gc,stats ] Update Region States 3 us >> [11.056s][info][gc,stats ] Propagate GC state 1 us >> [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x >> [11.056s][info][gc,stats ] CMR: 456 us >> [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, >> [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x >> [11.057s][info][gc,stats ] CM: 3043 us >> [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, >> [11.057s][info][gc,stats ] Flush SATB 204 us >> [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us >> [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us >> [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x >> [11.057s][info][gc,stats ] Propagate GC state 2 us >> [11.057s][info][gc,stats ] Update Region States 12 us >> [11.057s][info][gc,stats ] Choose Collection Set 25 us >> [11.057s][info][gc,stats ] Rebuild Free Set 29 us >> [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x >> [11.057s][info][gc,stats ] CWRF: 17 us >> [11.057s][info][gc,... > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Set _gc_state_changed to false at the end of concurrent_prepare_for_update_refs LGTM ------------- Marked as reviewed by ysr (Reviewer). 
PR Review: https://git.openjdk.org/jdk/pull/23759#pullrequestreview-2642858719 From xpeng at openjdk.org Wed Feb 26 02:17:36 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 26 Feb 2025 02:17:36 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v2] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 00:31:03 GMT, William Kemper wrote: >> For `final_roots`, I think I should leave it as it is in this PR, later you will remove the timings and gc state propagation into a handshake anyway. > > Yep, I agree. Thank you! To remove propagate_gc_state_to_all_threads, `_gc_state_changed` has to be set to false in `ShenandoahHeap::concurrent_prepare_for_update_refs`; I have updated the PR to fix it. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1970795124 From ysr at openjdk.org Wed Feb 26 02:17:36 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 26 Feb 2025 02:17:36 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v3] In-Reply-To: References: Message-ID: <7HdJx1s4PkFH9L1AdYjiQMjR4nfl0RM4FDHtdlLDge4=.cf536d80-1679-44f8-b36c-9aab738f7cd8@github.com> On Wed, 26 Feb 2025 00:01:12 GMT, Xiaolong Peng wrote: >> The change is to improve the observability of Shenandoah GC, basically there are three changes for Shenandoah GC timings in this PR: >> >> 1. Net GC pause timings include the time to propagate GC state to Java threads >> 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs >> 3. Removal of the call of `propagate_gc_state_to_all_threads` from "init_update_refs", which handles gc state in handshake already.
>> >> With the change, the new GC timing log will be like: >> >> [11.056s][info][gc,stats ] Concurrent Reset 89 us >> [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us >> [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us >> [11.056s][info][gc,stats ] Update Region States 3 us >> [11.056s][info][gc,stats ] Propagate GC state 1 us >> [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x >> [11.056s][info][gc,stats ] CMR: 456 us >> [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, >> [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x >> [11.057s][info][gc,stats ] CM: 3043 us >> [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, >> [11.057s][info][gc,stats ] Flush SATB 204 us >> [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us >> [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us >> [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x >> [11.057s][info][gc,stats ] Propagate GC state 2 us >> [11.057s][info][gc,stats ] Update Region States 12 us >> [11.057s][info][gc,stats ] Choose Collection Set 25 us >> [11.057s][info][gc,stats ] Rebuild Free Set 29 us >> [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x >> [11.057s][info][gc,stats ] CWRF: 17 us >> [11.057s][info][gc,... > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Remove propagate_gc_state_to_all_threads call from op_init_update_refs Changes are fine. This jumped out in yr sample output: ... [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x ... 
[11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x which seemed kinda interesting. I assume this is just a consequence of the very little work (and extremely brief time in this phase) here, and can be ignored in this sample output from likely a toy GC, or one where you may have artificially boosted the number of worker threads. Still I thought I'd ask in case you've seen this with bigger timings or more work in any of these phases with low fractional speed-ups. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23759#issuecomment-2683721058 From ysr at openjdk.org Wed Feb 26 02:25:52 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 26 Feb 2025 02:25:52 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v4] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 02:17:36 GMT, Xiaolong Peng wrote: >> The change is to improve the observability of Shenandoah GC, basically there are three changes for Shenandoah GC timings in this PR: >> >> 1. Net GC pause timings include the time to propagate GC state to Java threads >> 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs >> 3. Removal of the call of `propagate_gc_state_to_all_threads` from "init_update_refs", which handles gc state in handshake already. 
>> >> With the change, the new GC timing log will be like: >> >> [11.056s][info][gc,stats ] Concurrent Reset 89 us >> [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us >> [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us >> [11.056s][info][gc,stats ] Update Region States 3 us >> [11.056s][info][gc,stats ] Propagate GC state 1 us >> [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x >> [11.056s][info][gc,stats ] CMR: 456 us >> [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, >> [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x >> [11.057s][info][gc,stats ] CM: 3043 us >> [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, >> [11.057s][info][gc,stats ] Flush SATB 204 us >> [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us >> [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us >> [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x >> [11.057s][info][gc,stats ] Propagate GC state 2 us >> [11.057s][info][gc,stats ] Update Region States 12 us >> [11.057s][info][gc,stats ] Choose Collection Set 25 us >> [11.057s][info][gc,stats ] Rebuild Free Set 29 us >> [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x >> [11.057s][info][gc,stats ] CWRF: 17 us >> [11.057s][info][gc,... > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Set _gc_state_changed to false at the end of concurrent_prepare_for_update_refs src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 1288: > 1286: > 1287: _update_refs_iterator.reset(); > 1288: _gc_state_changed = false; Sorry, what is this and why do we need it. When was it set? 
In `set_update_refs_in_progress()`? Maybe we need a better abstraction to toggle the `state_changed` boolean. This seems error-prone. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1970802802 From kbarrett at openjdk.org Wed Feb 26 06:37:56 2025 From: kbarrett at openjdk.org (Kim Barrett) Date: Wed, 26 Feb 2025 06:37:56 GMT Subject: RFR: 8350643: G1: Make loop iteration variable type correspond to limit in G1SurvRateGroup In-Reply-To: References: Message-ID: On Tue, 25 Feb 2025 10:42:32 GMT, Thomas Schatzl wrote: > Hi all, > > in the G1SurvRateGroup class there are three loops that use a `size_t` loop iteration variable that is compared to `uint`s in the condition, causing some implicit narrowing. > > Change the loop variables to also use `uints`. > > Pointed out by @kimbarrett during review of JDK-8349906. > > Testing: local compilation, GHA, checking other loop variables to match type > > Thanks, > Thomas Looks good. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23773#pullrequestreview-2643242784 From duke at openjdk.org Wed Feb 26 06:59:09 2025 From: duke at openjdk.org (Saint Wesonga) Date: Wed, 26 Feb 2025 06:59:09 GMT Subject: RFR: 8350722: Remove duplicate SerialGC logic for detecting pointers in young gen Message-ID: Checking whether a pointer is in the young generation is currently done by comparing the pointer to the end of the young generation reserved space. The duplication of these checks in various places complicates any changes to the layout of the young generation since all these locations need to be updated. This PR replaces the duplicated logic with the DefNewGeneration::is_in_reserved method.
------------- Commit messages: - Remove duplicate logic for detecting pointers in young gen Changes: https://git.openjdk.org/jdk/pull/23792/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23792&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8350722 Stats: 18 lines in 3 files changed: 4 ins; 7 del; 7 mod Patch: https://git.openjdk.org/jdk/pull/23792.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23792/head:pull/23792 PR: https://git.openjdk.org/jdk/pull/23792 From xpeng at openjdk.org Wed Feb 26 07:53:09 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 26 Feb 2025 07:53:09 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v4] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 02:23:15 GMT, Y. Srinivas Ramakrishna wrote: >> Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: >> >> Set _gc_state_changed to false at the end of concurrent_prepare_for_update_refs > > src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 1288: > >> 1286: >> 1287: _update_refs_iterator.reset(); >> 1288: _gc_state_changed = false; > > Sorry, what is this and why do we need it. When was it set? > > In `set_update_refs_in_progress()` ? > > Maybe we need a better abstraction to toggle the `state_changed` boolean. This seems error-prone. You are right, we don't need to set _gc_state_changed to false here: `concurrent_prepare_for_update_refs` sets the gc state via `ShenandoahHeap::set_gc_state_concurrent`, which doesn't change `_gc_state_changed` to true, so there is no need to reset it to false here.
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1971094948 From xpeng at openjdk.org Wed Feb 26 07:53:08 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 26 Feb 2025 07:53:08 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v5] In-Reply-To: References: Message-ID: > The change is to improve the observability of Shenandoah GC, basically there are three changes for Shenandoah GC timings in this PR: > > 1. Net GC pause timings include the time to propagate GC state to Java threads > 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs > 3. Removal of the call of `propagate_gc_state_to_all_threads` from "init_update_refs", which handles gc state in handshake already. > > With the change, the new GC timing log will be like: > > [11.056s][info][gc,stats ] Concurrent Reset 89 us > [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us > [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us > [11.056s][info][gc,stats ] Update Region States 3 us > [11.056s][info][gc,stats ] Propagate GC state 1 us > [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x > [11.056s][info][gc,stats ] CMR: 456 us > [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, > [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x > [11.057s][info][gc,stats ] CM: 3043 us > [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, > [11.057s][info][gc,stats ] Flush SATB 204 us > [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us > [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us > [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x > [11.057s][info][gc,stats ] Propagate GC 
state 2 us > [11.057s][info][gc,stats ] Update Region States 12 us > [11.057s][info][gc,stats ] Choose Collection Set 25 us > [11.057s][info][gc,stats ] Rebuild Free Set 29 us > [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x > [11.057s][info][gc,stats ] CWRF: 17 us > [11.057s][info][gc,stats ] CWRF: Weak References 17 us, workers (... Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: Fix test failure ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23759/files - new: https://git.openjdk.org/jdk/pull/23759/files/54ae1c0e..30647a9c Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23759&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23759&range=03-04 Stats: 6 lines in 2 files changed: 5 ins; 1 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23759.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23759/head:pull/23759 PR: https://git.openjdk.org/jdk/pull/23759 From xpeng at openjdk.org Wed Feb 26 08:12:31 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 26 Feb 2025 08:12:31 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v6] In-Reply-To: References: Message-ID: > The change is to improve the observability of Shenandoah GC, basically there are three changes for Shenandoah GC timings in this PR: > > 1. Net GC pause timings include the time to propagate GC state to Java threads > 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs > 3. Removal of the call of `propagate_gc_state_to_all_threads` from "init_update_refs", which handles gc state in handshake already. 
> > With the change, the new GC timing log will be like: > > [11.056s][info][gc,stats ] Concurrent Reset 89 us > [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us > [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us > [11.056s][info][gc,stats ] Update Region States 3 us > [11.056s][info][gc,stats ] Propagate GC state 1 us > [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x > [11.056s][info][gc,stats ] CMR: 456 us > [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, > [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x > [11.057s][info][gc,stats ] CM: 3043 us > [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, > [11.057s][info][gc,stats ] Flush SATB 204 us > [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us > [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us > [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x > [11.057s][info][gc,stats ] Propagate GC state 2 us > [11.057s][info][gc,stats ] Update Region States 12 us > [11.057s][info][gc,stats ] Choose Collection Set 25 us > [11.057s][info][gc,stats ] Rebuild Free Set 29 us > [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x > [11.057s][info][gc,stats ] CWRF: 17 us > [11.057s][info][gc,stats ] CWRF: Weak References 17 us, workers (... 
Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: Remove trailing whitespace ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23759/files - new: https://git.openjdk.org/jdk/pull/23759/files/30647a9c..891dd11d Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23759&range=05 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23759&range=04-05 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/23759.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23759/head:pull/23759 PR: https://git.openjdk.org/jdk/pull/23759 From tschatzl at openjdk.org Wed Feb 26 08:31:33 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 26 Feb 2025 08:31:33 GMT Subject: RFR: 8349906: G1: Improve initial survivor rate for newly used young regions [v3] In-Reply-To: References: Message-ID: > Hi all, > > please review this change that tries to improve the survivor rate initial values for newly expanded regions. > > Currently G1 uses `InitialSurvivorRate` as survivor rate for such regions, but it is typically a pretty bad choice because > > * it's rather conservative, estimating that 40% of region contents will survive > * such a conservative value is kind of bad particularly in cases for regions that are expanded late in the mutator phase because they are not frequently updated (and with our running weights changes get propagated over a very long time), i.e. this 40% sticks for a long time (*) > * it is a random value, i.e. not particularly specific to the application. > > The suggestion is to use the survivor rate for the last region we know the survivor rate already. > > (*) to clarify this a little: G1 keeps track of `[0...m]` survivor rate predictors. For a given garbage collection, `[0...n]` of those are updated (`n` is the number of eden/survivor regions depending on the rate group). 
However those for `]n...m]` are not, particularly those in that range that are seldom allocated, the predictors are not updated very frequently. Now the young gen sizing uses these predictions "at the end" of the predictor anyway, and since they are infrequently updated and their values are very conservative, G1 won't expand young gen as much as it could/should. > > Testing: gha, tier1-7 (with other changes) > > Hth, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: * kbarrett review - remove include previously used for debugging ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23584/files - new: https://git.openjdk.org/jdk/pull/23584/files/a09bc25e..fc2dde0c Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23584&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23584&range=01-02 Stats: 2 lines in 1 file changed: 0 ins; 2 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23584.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23584/head:pull/23584 PR: https://git.openjdk.org/jdk/pull/23584 From tschatzl at openjdk.org Wed Feb 26 08:31:33 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 26 Feb 2025 08:31:33 GMT Subject: RFR: 8349906: G1: Improve initial survivor rate for newly used young regions [v2] In-Reply-To: <05XFWM5YjizGeiZm1Jb5OCYmMR5QabJrfLV5E7IFzxY=.6dbc460d-9ea5-4590-aaa9-dc2d6e337f18@github.com> References: <05XFWM5YjizGeiZm1Jb5OCYmMR5QabJrfLV5E7IFzxY=.6dbc460d-9ea5-4590-aaa9-dc2d6e337f18@github.com> Message-ID: On Wed, 26 Feb 2025 06:58:18 GMT, Kim Barrett wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> * kbarrett review: do not change the type of loop variable >> * ayang review: use actual last value instead of prediction for newly allocated survivor rate groups > > src/hotspot/share/gc/g1/g1SurvRateGroup.cpp line 31: > >> 29: #include 
"memory/allocation.hpp" >> 30: >> 31: #include "logging/logStream.hpp" > Left-over debugging include? I don't see any uses, but if I missed some then this needs to be put into proper > sort order. Leftover debug code. Removed. :( ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23584#discussion_r1971145004 From tschatzl at openjdk.org Wed Feb 26 08:48:58 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 26 Feb 2025 08:48:58 GMT Subject: RFR: 8350722: Remove duplicate SerialGC logic for detecting pointers in young gen In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 06:54:19 GMT, Saint Wesonga wrote: > Checking whether a pointer is in the young generation is currently done by comparing the pointer to the end of the young generation reserved space. The duplication of these checks in various places complicates any changes to the layout of the young generation since all these locations need to be updated. This PR replaces the duplicated logic with the DefNewGeneration::is_in_reserved method. src/hotspot/share/gc/serial/defNewGeneration.cpp line 150: > 148: bool do_object_b(oop p) { > 149: HeapWord* heap_word_ptr = cast_from_oop(p); > 150: bool is_in_young_gen = _young_gen->is_in_reserved((void*)heap_word_ptr); The check for only the boundary is intentional, and guided by performance. The single comparison/memory load in the original code is faster than the two memory loads/comparisons (both bounds of the reserved area) plus the eventual indirect load via `_young_gen`. That is also why most of the places it is used store a local copy of the generation boundary. This adds up for hundreds of thousands of checks/references to evacuate; at least it did last time, years ago. If the goal is just to factor out the check so that all locations change together whenever the heap layout changes, I would prefer adding a helper method in `SerialHeap`, for example, that is easily inlinable for the compiler.
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23792#discussion_r1971169800 From tschatzl at openjdk.org Wed Feb 26 09:07:52 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 26 Feb 2025 09:07:52 GMT Subject: RFR: 8314840: 3 gc/epsilon tests ignore external vm options In-Reply-To: References: Message-ID: On Mon, 24 Feb 2025 15:27:46 GMT, Ramkumar Sunderbabu wrote: > These tests do not pass Java/JVM test command line options (flags) to the child process. More details in JBS. > Tiers 1 to 3 tested. Along with various flag combinations. lgtm. ------------- Marked as reviewed by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23751#pullrequestreview-2643679216 From kbarrett at openjdk.org Wed Feb 26 10:03:00 2025 From: kbarrett at openjdk.org (Kim Barrett) Date: Wed, 26 Feb 2025 10:03:00 GMT Subject: RFR: 8349906: G1: Improve initial survivor rate for newly used young regions [v3] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 08:31:33 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this change that tries to improve the survivor rate initial values for newly expanded regions. >> >> Currently G1 uses `InitialSurvivorRate` as survivor rate for such regions, but it is typically a pretty bad choice because >> >> * it's rather conservative, estimating that 40% of region contents will survive >> * such a conservative value is kind of bad particularly in cases for regions that are expanded late in the mutator phase because they are not frequently updated (and with our running weights changes get propagated over a very long time), i.e. this 40% sticks for a long time (*) >> * it is a random value, i.e. not particularly specific to the application. >> >> The suggestion is to use the survivor rate for the last region we know the survivor rate already. >> >> (*) to clarify this a little: G1 keeps track of `[0...m]` survivor rate predictors. 
For a given garbage collection, `[0...n]` of those are updated (`n` is the number of eden/survivor regions depending on the rate group). However those for `]n...m]` are not, particularly those in that range that are seldom allocated, the predictors are not updated very frequently. Now the young gen sizing uses these predictions "at the end" of the predictor anyway, and since they are infrequently updated and their values are very conservative, G1 won't expand young gen as much as it could/should. >> >> Testing: gha, tier1-7 (with other changes) >> >> Hth, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > * kbarrett review - remove include previously used for debugging Looks good. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23584#pullrequestreview-2643873048 From ayang at openjdk.org Wed Feb 26 10:27:59 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 26 Feb 2025 10:27:59 GMT Subject: RFR: 8350643: G1: Make loop iteration variable type correspond to limit in G1SurvRateGroup In-Reply-To: References: Message-ID: On Tue, 25 Feb 2025 10:42:32 GMT, Thomas Schatzl wrote: > Hi all, > > in the G1SurvRateGroup class there are three loops that use a `size_t` loop iteration variable that is compared to `uint`s in the condition, causing some implicit narrowing. > > Change the loop variables to also use `uints`. > > Pointed out by @kimbarrett during review of JDK-8349906. > > Testing: local compilation, GHA, checking other loop variables to match type > > Thanks, > Thomas Marked as reviewed by ayang (Reviewer). 
------------- PR Review: https://git.openjdk.org/jdk/pull/23773#pullrequestreview-2643952250 From tschatzl at openjdk.org Wed Feb 26 10:33:10 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 26 Feb 2025 10:33:10 GMT Subject: RFR: 8349906: G1: Improve initial survivor rate for newly used young regions [v3] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 10:00:04 GMT, Kim Barrett wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> * kbarrett review - remove include previously used for debugging > > Looks good. Thanks @kimbarrett @walulyai for your reviews ------------- PR Comment: https://git.openjdk.org/jdk/pull/23584#issuecomment-2684556124 From tschatzl at openjdk.org Wed Feb 26 10:33:23 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 26 Feb 2025 10:33:23 GMT Subject: RFR: 8350643: G1: Make loop iteration variable type correspond to limit in G1SurvRateGroup In-Reply-To: References: Message-ID: <6JHMc8SLpKwwopjHfgh32_d4B3CLmwhyKgCg_2OlOmM=.621c5e92-8064-4a76-97e8-c895fa482b81@github.com> On Wed, 26 Feb 2025 10:25:20 GMT, Albert Mingkun Yang wrote: >> Hi all, >> >> in the G1SurvRateGroup class there are three loops that use a `size_t` loop iteration variable that is compared to `uint`s in the condition, causing some implicit narrowing. >> >> Change the loop variables to also use `uints`. >> >> Pointed out by @kimbarrett during review of JDK-8349906. >> >> Testing: local compilation, GHA, checking other loop variables to match type >> >> Thanks, >> Thomas > > Marked as reviewed by ayang (Reviewer). 
Thanks @albertnetymk @kimbarrett for your reviews ------------- PR Comment: https://git.openjdk.org/jdk/pull/23773#issuecomment-2684558378 From tschatzl at openjdk.org Wed Feb 26 10:33:10 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 26 Feb 2025 10:33:10 GMT Subject: Integrated: 8349906: G1: Improve initial survivor rate for newly used young regions In-Reply-To: References: Message-ID: On Wed, 12 Feb 2025 10:55:46 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that tries to improve the survivor rate initial values for newly expanded regions. > > Currently G1 uses `InitialSurvivorRate` as survivor rate for such regions, but it is typically a pretty bad choice because > > * it's rather conservative, estimating that 40% of region contents will survive > * such a conservative value is kind of bad particularly in cases for regions that are expanded late in the mutator phase because they are not frequently updated (and with our running weights changes get propagated over a very long time), i.e. this 40% sticks for a long time (*) > * it is a random value, i.e. not particularly specific to the application. > > The suggestion is to use the survivor rate for the last region we know the survivor rate already. > > (*) to clarify this a little: G1 keeps track of `[0...m]` survivor rate predictors. For a given garbage collection, `[0...n]` of those are updated (`n` is the number of eden/survivor regions depending on the rate group). However those for `]n...m]` are not, particularly those in that range that are seldom allocated, the predictors are not updated very frequently. Now the young gen sizing uses these predictions "at the end" of the predictor anyway, and since they are infrequently updated and their values are very conservative, G1 won't expand young gen as much as it could/should. > > Testing: gha, tier1-7 (with other changes) > > Hth, > Thomas This pull request has now been integrated. 
Changeset: aac9cb45 Author: Thomas Schatzl URL: https://git.openjdk.org/jdk/commit/aac9cb4537b13a4af123ae76f29359e851dc4c82 Stats: 16 lines in 1 file changed: 13 ins; 0 del; 3 mod 8349906: G1: Improve initial survivor rate for newly used young regions Reviewed-by: kbarrett, iwalulya ------------- PR: https://git.openjdk.org/jdk/pull/23584 From tschatzl at openjdk.org Wed Feb 26 11:09:56 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 26 Feb 2025 11:09:56 GMT Subject: Withdrawn: 8350643: G1: Make loop iteration variable type correspond to limit in G1SurvRateGroup In-Reply-To: References: Message-ID: On Tue, 25 Feb 2025 10:42:32 GMT, Thomas Schatzl wrote: > Hi all, > > in the G1SurvRateGroup class there are three loops that use a `size_t` loop iteration variable that is compared to `uint`s in the condition, causing some implicit narrowing. > > Change the loop variables to also use `uints`. > > Pointed out by @kimbarrett during review of JDK-8349906. > > Testing: local compilation, GHA, checking other loop variables to match type > > Thanks, > Thomas This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.org/jdk/pull/23773 From tschatzl at openjdk.org Wed Feb 26 11:20:15 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 26 Feb 2025 11:20:15 GMT Subject: RFR: 8350643: G1: Make loop iteration variable type correspond to limit in G1SurvRateGroup [v2] In-Reply-To: References: Message-ID: > Hi all, > > in the G1SurvRateGroup class there are three loops that use a `size_t` loop iteration variable that is compared to `uint`s in the condition, causing some implicit narrowing. > > Change the loop variables to also use `uints`. > > Pointed out by @kimbarrett during review of JDK-8349906. > > Testing: local compilation, GHA, checking other loop variables to match type > > Thanks, > Thomas Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains two commits: - Merge branch 'master' into 8350643-loop-counter-variable-type - 8350643 Hi all, in the G1SurvRateGroup class there are three loops that use a `size_t` loop iteration variable that is compared to `uint`s in the condition, causing some implicit narrowing. Change the loop variables to also use `uints`. Pointed out by @kimbarrett during review of JDK-8349906. Testing: local compilation, github testing, checking other loop variables to match type Thanks, Thomas ------------- Changes: https://git.openjdk.org/jdk/pull/23773/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23773&range=01 Stats: 4 lines in 1 file changed: 0 ins; 0 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/23773.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23773/head:pull/23773 PR: https://git.openjdk.org/jdk/pull/23773 From tschatzl at openjdk.org Wed Feb 26 11:20:15 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 26 Feb 2025 11:20:15 GMT Subject: RFR: 8350643: G1: Make loop iteration variable type correspond to limit in G1SurvRateGroup In-Reply-To: References: Message-ID: <8WXY_blbp0N6EQ-iqT1i8V_Zv6BwZ2kP1hW5Z8Pwsac=.c39ac0e7-877e-47dd-9410-30edc6dd3d08@github.com> On Tue, 25 Feb 2025 10:42:32 GMT, Thomas Schatzl wrote: > Hi all, > > in the G1SurvRateGroup class there are three loops that use a `size_t` loop iteration variable that is compared to `uint`s in the condition, causing some implicit narrowing. > > Change the loop variables to also use `uints`. > > Pointed out by @kimbarrett during review of JDK-8349906. > > Testing: local compilation, GHA, checking other loop variables to match type > > Thanks, > Thomas This branch needed merging before integration, so needs a re-review. Nothing changed. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23773#issuecomment-2684667705 From ayang at openjdk.org Wed Feb 26 11:26:54 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 26 Feb 2025 11:26:54 GMT Subject: RFR: 8350643: G1: Make loop iteration variable type correspond to limit in G1SurvRateGroup [v2] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 11:20:15 GMT, Thomas Schatzl wrote: >> Hi all, >> >> in the G1SurvRateGroup class there are three loops that use a `size_t` loop iteration variable that is compared to `uint`s in the condition, causing some implicit narrowing. >> >> Change the loop variables to also use `uints`. >> >> Pointed out by @kimbarrett during review of JDK-8349906. >> >> Testing: local compilation, GHA, checking other loop variables to match type >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains two commits: > > - Merge branch 'master' into 8350643-loop-counter-variable-type > - 8350643 > > Hi all, > > in the G1SurvRateGroup class there are three loops that use a `size_t` loop iteration variable that is compared to `uint`s in the condition, causing some implicit narrowing. > > Change the loop variables to also use `uints`. > > Pointed out by @kimbarrett during review of JDK-8349906. > > Testing: local compilation, github testing, checking other loop variables to match type > > Thanks, > Thomas Marked as reviewed by ayang (Reviewer). 
------------- PR Review: https://git.openjdk.org/jdk/pull/23773#pullrequestreview-2644127650 From tschatzl at openjdk.org Wed Feb 26 11:34:58 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 26 Feb 2025 11:34:58 GMT Subject: RFR: 8350643: G1: Make loop iteration variable type correspond to limit in G1SurvRateGroup [v2] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 11:24:03 GMT, Albert Mingkun Yang wrote: >> Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains two commits: >> >> - Merge branch 'master' into 8350643-loop-counter-variable-type >> - 8350643 >> >> Hi all, >> >> in the G1SurvRateGroup class there are three loops that use a `size_t` loop iteration variable that is compared to `uint`s in the condition, causing some implicit narrowing. >> >> Change the loop variables to also use `uints`. >> >> Pointed out by @kimbarrett during review of JDK-8349906. >> >> Testing: local compilation, github testing, checking other loop variables to match type >> >> Thanks, >> Thomas > > Marked as reviewed by ayang (Reviewer). Thanks @albertnetymk @kimbarrett for your reviews ------------- PR Comment: https://git.openjdk.org/jdk/pull/23773#issuecomment-2684696957 From tschatzl at openjdk.org Wed Feb 26 11:34:59 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 26 Feb 2025 11:34:59 GMT Subject: Integrated: 8350643: G1: Make loop iteration variable type correspond to limit in G1SurvRateGroup In-Reply-To: References: Message-ID: On Tue, 25 Feb 2025 10:42:32 GMT, Thomas Schatzl wrote: > Hi all, > > in the G1SurvRateGroup class there are three loops that use a `size_t` loop iteration variable that is compared to `uint`s in the condition, causing some implicit narrowing. > > Change the loop variables to also use `uints`. > > Pointed out by @kimbarrett during review of JDK-8349906. 
> > Testing: local compilation, GHA, checking other loop variables to match type > > Thanks, > Thomas This pull request has now been integrated. Changeset: a0dd5654 Author: Thomas Schatzl URL: https://git.openjdk.org/jdk/commit/a0dd56543219343306aea99b684b5e2cb04c7d76 Stats: 4 lines in 1 file changed: 0 ins; 0 del; 4 mod 8350643: G1: Make loop iteration variable type correspond to limit in G1SurvRateGroup Reviewed-by: ayang, kbarrett ------------- PR: https://git.openjdk.org/jdk/pull/23773 From xpeng at openjdk.org Wed Feb 26 18:58:54 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 26 Feb 2025 18:58:54 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v3] In-Reply-To: <7HdJx1s4PkFH9L1AdYjiQMjR4nfl0RM4FDHtdlLDge4=.cf536d80-1679-44f8-b36c-9aab738f7cd8@github.com> References: <7HdJx1s4PkFH9L1AdYjiQMjR4nfl0RM4FDHtdlLDge4=.cf536d80-1679-44f8-b36c-9aab738f7cd8@github.com> Message-ID: On Wed, 26 Feb 2025 02:13:07 GMT, Y. Srinivas Ramakrishna wrote: > Changes are fine. > > This jumped out in yr sample output: > > ``` > ... > [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x > ... > [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x > ``` > > which seemed kinda interesting. I assume this is just a consequence of the very little work (and extremely brief time in this phase) here, and can be ignored in this sample output from likely a toy GC, or one where you may have artificially boosted the number of worker threads. Still I thought I'd ask in case you've seen this with bigger timings or more work in any of these phases with low fractional speed-ups. I'm not sure how parallelism is calculated, but I think it is caused by the test I was running, the test is very simple and there are only small number of live objects after mark. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23759#issuecomment-2685925735 From shade at openjdk.org Wed Feb 26 19:44:55 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 26 Feb 2025 19:44:55 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v6] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 08:12:31 GMT, Xiaolong Peng wrote: >> The change is to improve the observability of Shenandoah GC, basically there are three changes for Shenandoah GC timings in this PR: >> >> 1. Net GC pause timings include the time to propagate GC state to Java threads >> 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs >> 3. Removal of the call of `propagate_gc_state_to_all_threads` from "init_update_refs", which handles gc state in handshake already. >> >> With the change, the new GC timing log will be like: >> >> [11.056s][info][gc,stats ] Concurrent Reset 89 us >> [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us >> [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us >> [11.056s][info][gc,stats ] Update Region States 3 us >> [11.056s][info][gc,stats ] Propagate GC state 1 us >> [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x >> [11.056s][info][gc,stats ] CMR: 456 us >> [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, >> [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x >> [11.057s][info][gc,stats ] CM: 3043 us >> [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, >> [11.057s][info][gc,stats ] Flush SATB 204 us >> [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us >> [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us >> [11.057s][info][gc,stats ] 
Finish Mark 129 us, parallelism: 0.01x >> [11.057s][info][gc,stats ] Propagate GC state 2 us >> [11.057s][info][gc,stats ] Update Region States 12 us >> [11.057s][info][gc,stats ] Choose Collection Set 25 us >> [11.057s][info][gc,stats ] Rebuild Free Set 29 us >> [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x >> [11.057s][info][gc,stats ] CWRF: 17 us >> [11.057s][info][gc,... > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Remove trailing whitespace src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 693: > 691: > 692: { > 693: ShenandoahGCPhase phase(ShenandoahPhaseTimings::init_propagate_gc_state); Here and later, I messed up my suggestion. I think these should be `ShenandoahTimingsTracker`, not `ShenandoahGCPhase`. `ShenandoahGCPhase` does more stuff we don't need. `ShenandoahTimingsTracker` does only the basic stuff. src/hotspot/share/gc/shenandoah/shenandoahDegeneratedGC.cpp line 86: > 84: op_degenerated(); > 85: heap->set_degenerated_gc_in_progress(false); > 86: { A bit sad we need to do this due to `op_degenerated` early returns, but fine. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1972273776 PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1972269107 From wkemper at openjdk.org Wed Feb 26 19:50:59 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 26 Feb 2025 19:50:59 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v14] In-Reply-To: References: Message-ID: On Tue, 25 Feb 2025 23:10:45 GMT, Y. Srinivas Ramakrishna wrote: >> William Kemper has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains 32 commits: >> >> - Merge tag 'jdk-25+11' into fix-control-regulator-threads >> >> Added tag jdk-25+11 for changeset 0131c1bf >> - Address review feedback (better comments, better names) >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Old gen bootstrap cycle must make it to init mark >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Improve message for assertion >> - Make shutdown safer for threads requesting (or expecting) gc >> - Do not accept requests if control thread is terminating >> - Notify waiters when control thread terminates >> - Add event for control thread state changes >> - ... and 22 more: https://git.openjdk.org/jdk/compare/0131c1bf...d7858deb > > src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.hpp line 75: > >> 73: >> 74: // The mode is read frequently by requesting threads and only ever written by the control thread. >> 75: volatile GCMode _mode; > > A bit of a nit: Any reason not to just call it `_gc_mode` which seems now to be its _de facto_ name due to its accessor method name and how it's referred to in comments? Renamed it. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1972289732 From wkemper at openjdk.org Wed Feb 26 20:03:56 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 26 Feb 2025 20:03:56 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v14] In-Reply-To: References: Message-ID: <-UXwakwdCebRDEfV0p5VLeRJqByHrZENp_NwYt30Af8=.4bc68e66-0de3-4b43-867a-9b6e6ba7a045@github.com> On Tue, 25 Feb 2025 23:22:48 GMT, Y. Srinivas Ramakrishna wrote: >> William Kemper has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains 32 commits: >> >> - Merge tag 'jdk-25+11' into fix-control-regulator-threads >> >> Added tag jdk-25+11 for changeset 0131c1bf >> - Address review feedback (better comments, better names) >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Old gen bootstrap cycle must make it to init mark >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Improve message for assertion >> - Make shutdown safer for threads requesting (or expecting) gc >> - Do not accept requests if control thread is terminating >> - Notify waiters when control thread terminates >> - Add event for control thread state changes >> - ... and 22 more: https://git.openjdk.org/jdk/compare/0131c1bf...d7858deb > > src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.hpp line 150: > >> 148: GCMode prepare_for_explicit_gc_request(ShenandoahGCRequest &request); >> 149: >> 150: GCMode prepare_for_concurrent_gc_request(ShenandoahGCRequest &request); > > Documentation of private APIs. > > Bit of a nit: These all seem to take a request type, fill in some yet unfilled fields, and return a GC mode. So they look to me like `prepare_request_for_` rather than `prepare_for__request`. Hmm, they do modify the request, so I see your point. But they also touch `ShenandoahHeap`, `ShenandoahPolicy` and `ShenandoahHeuristics`. How about we just drop `_request` from their name? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1972319562 From wkemper at openjdk.org Wed Feb 26 20:19:59 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 26 Feb 2025 20:19:59 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v14] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 01:24:18 GMT, Y.
Srinivas Ramakrishna wrote: >> src/hotspot/share/gc/shenandoah/shenandoahHeap.hpp line 687: >> >>> 685: HeapWord* allocate_from_gclab_slow(Thread* thread, size_t size); >>> 686: HeapWord* allocate_new_gclab(size_t min_size, size_t word_size, size_t* actual_size); >>> 687: bool retry_allocation(size_t original_full_gc_count) const; >> >> It feels like this should be called `should_retry_allocation()`. Also a 1-line documentation comment: ``` // We want to retry an unsuccessful attempt at allocation until at least a full gc. ``` > > PS: do we know that a full gc will reclaim all soft refs? I suppose that is Shenandoah policy? Yes, I like that name better. And yes, full gc will reclaim all soft refs. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1972342529 From xpeng at openjdk.org Wed Feb 26 20:24:06 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 26 Feb 2025 20:24:06 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v6] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 19:42:10 GMT, Aleksey Shipilev wrote: >> Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: >> >> Remove trailing whitespace > > src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 693: > >> 691: >> 692: { >> 693: ShenandoahGCPhase phase(ShenandoahPhaseTimings::init_propagate_gc_state); > > Here and later, I messed up my suggestion. I think these should be `ShenandoahTimingsTracker`, not `ShenandoahGCPhase`. `ShenandoahGCPhase` does more stuff we don't need. `ShenandoahTimingsTracker` does only the basic stuff. Thanks for pointing out, I'll update it and move them to `ShenandoahTimingsTracker` and test it again. 
> src/hotspot/share/gc/shenandoah/shenandoahDegeneratedGC.cpp line 86: > >> 84: op_degenerated(); >> 85: heap->set_degenerated_gc_in_progress(false); >> 86: { > > A bit sad we need to do this due to `op_degenerated` early returns, but fine. Yeah, this part is not very clean due to op_degenerated early returns. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1972349321 PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1972347797 From wkemper at openjdk.org Wed Feb 26 21:20:20 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 26 Feb 2025 21:20:20 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v15] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). 
> > ## Testing > * jtreg hotspot_gc_shenandoah > * dacapo, extremem, diluvian, specjbb2015, specjvm2018, heapothesys William Kemper has updated the pull request incrementally with one additional commit since the last revision: Improve names and comments ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/d7858deb..fb7819d0 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=14 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=13-14 Stats: 57 lines in 4 files changed: 24 ins; 2 del; 31 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From xpeng at openjdk.org Wed Feb 26 21:54:32 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 26 Feb 2025 21:54:32 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v7] In-Reply-To: References: Message-ID: > The change is to improve the observability of Shenandoah GC, basically there are three changes for Shenandoah GC timings in this PR: > > 1. Net GC pause timings include the time to propagate GC state to Java threads > 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs > 3. Removal of the call of `propagate_gc_state_to_all_threads` from "init_update_refs", which handles gc state in handshake already. 
> > With the change, the new GC timing log will be like: > > [11.056s][info][gc,stats ] Concurrent Reset 89 us > [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us > [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us > [11.056s][info][gc,stats ] Update Region States 3 us > [11.056s][info][gc,stats ] Propagate GC state 1 us > [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x > [11.056s][info][gc,stats ] CMR: 456 us > [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, > [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x > [11.057s][info][gc,stats ] CM: 3043 us > [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, > [11.057s][info][gc,stats ] Flush SATB 204 us > [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us > [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us > [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x > [11.057s][info][gc,stats ] Propagate GC state 2 us > [11.057s][info][gc,stats ] Update Region States 12 us > [11.057s][info][gc,stats ] Choose Collection Set 25 us > [11.057s][info][gc,stats ] Rebuild Free Set 29 us > [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x > [11.057s][info][gc,stats ] CWRF: 17 us > [11.057s][info][gc,stats ] CWRF: Weak References 17 us, workers (... 
Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: Use ShenandoahTimingsTracker instead of ShenandoahGCPhase to track the timings of gc state propagation ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23759/files - new: https://git.openjdk.org/jdk/pull/23759/files/891dd11d..858ebc39 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23759&range=06 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23759&range=05-06 Stats: 7 lines in 4 files changed: 0 ins; 0 del; 7 mod Patch: https://git.openjdk.org/jdk/pull/23759.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23759/head:pull/23759 PR: https://git.openjdk.org/jdk/pull/23759 From shade at openjdk.org Wed Feb 26 22:17:59 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 26 Feb 2025 22:17:59 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v7] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 21:54:32 GMT, Xiaolong Peng wrote: >> The change is to improve the observability of Shenandoah GC, basically there are three changes for Shenandoah GC timings in this PR: >> >> 1. Net GC pause timings include the time to propagate GC state to Java threads >> 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs >> 3. Removal of the call of `propagate_gc_state_to_all_threads` from "init_update_refs", which handles gc state in handshake already. 
>> >> With the change, the new GC timing log will be like: >> >> [11.056s][info][gc,stats ] Concurrent Reset 89 us >> [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us >> [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us >> [11.056s][info][gc,stats ] Update Region States 3 us >> [11.056s][info][gc,stats ] Propagate GC state 1 us >> [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x >> [11.056s][info][gc,stats ] CMR: 456 us >> [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, >> [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x >> [11.057s][info][gc,stats ] CM: 3043 us >> [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, >> [11.057s][info][gc,stats ] Flush SATB 204 us >> [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us >> [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us >> [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x >> [11.057s][info][gc,stats ] Propagate GC state 2 us >> [11.057s][info][gc,stats ] Update Region States 12 us >> [11.057s][info][gc,stats ] Choose Collection Set 25 us >> [11.057s][info][gc,stats ] Rebuild Free Set 29 us >> [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x >> [11.057s][info][gc,stats ] CWRF: 17 us >> [11.057s][info][gc,... > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Use ShenandoahTimingsTracker instead of ShenandoahGCPhase to track the timings of gc state propagation I am good with this, thanks. Only one minor nit remains. 
src/hotspot/share/gc/shenandoah/shenandoahPhaseTimings.hpp line 60: > 58: f(init_transfer_satb, " Transfer Old From SATB") \ > 59: f(init_update_region_states, " Update Region States") \ > 60: f(init_propagate_gc_state, " Propagate GC state") \ Sorry, one last thing. Note how all our counters are Capitalized, so all these should be `Propagate GC State`. ------------- Marked as reviewed by shade (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23759#pullrequestreview-2646073039 PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1972497261 From xpeng at openjdk.org Wed Feb 26 22:23:24 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 26 Feb 2025 22:23:24 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v7] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 22:14:44 GMT, Aleksey Shipilev wrote: >> Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: >> >> Use ShenandoahTimingsTracker instead of ShenandoahGCPhase to track the timings of gc state propagation > > src/hotspot/share/gc/shenandoah/shenandoahPhaseTimings.hpp line 60: > >> 58: f(init_transfer_satb, " Transfer Old From SATB") \ >> 59: f(init_update_region_states, " Update Region States") \ >> 60: f(init_propagate_gc_state, " Propagate GC state") \ > > Sorry, one last thing. Note how all our counters are Capitalized, so all these should be `Propagate GC State`. Fixed, thanks! Sorry, I should have noticed the naming convention here.
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23759#discussion_r1972502610 From xpeng at openjdk.org Wed Feb 26 22:23:24 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 26 Feb 2025 22:23:24 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v8] In-Reply-To: References: Message-ID: > The change is to improve the observability of Shenandoah GC, basically there are three changes for Shenandoah GC timings in this PR: > > 1. Net GC pause timings include the time to propagate GC state to Java threads > 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs > 3. Removal of the call of `propagate_gc_state_to_all_threads` from "init_update_refs", which handles gc state in handshake already. > > With the change, the new GC timing log will be like: > > [11.056s][info][gc,stats ] Concurrent Reset 89 us > [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us > [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us > [11.056s][info][gc,stats ] Update Region States 3 us > [11.056s][info][gc,stats ] Propagate GC state 1 us > [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x > [11.056s][info][gc,stats ] CMR: 456 us > [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, > [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x > [11.057s][info][gc,stats ] CM: 3043 us > [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, > [11.057s][info][gc,stats ] Flush SATB 204 us > [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us > [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us > [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x > [11.057s][info][gc,stats ] Propagate GC 
state 2 us > [11.057s][info][gc,stats ] Update Region States 12 us > [11.057s][info][gc,stats ] Choose Collection Set 25 us > [11.057s][info][gc,stats ] Rebuild Free Set 29 us > [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x > [11.057s][info][gc,stats ] CWRF: 17 us > [11.057s][info][gc,stats ] CWRF: Weak References 17 us, workers (... Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: Fix name of counter ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23759/files - new: https://git.openjdk.org/jdk/pull/23759/files/858ebc39..caf9d4c3 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23759&range=07 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23759&range=06-07 Stats: 6 lines in 1 file changed: 0 ins; 0 del; 6 mod Patch: https://git.openjdk.org/jdk/pull/23759.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23759/head:pull/23759 PR: https://git.openjdk.org/jdk/pull/23759 From shade at openjdk.org Wed Feb 26 22:25:59 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 26 Feb 2025 22:25:59 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v8] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 22:23:24 GMT, Xiaolong Peng wrote: >> The change is to improve the observability of Shenandoah GC, basically there are three changes for Shenandoah GC timings in this PR: >> >> 1. Net GC pause timings include the time to propagate GC state to Java threads >> 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs >> 3. Removal of the call of `propagate_gc_state_to_all_threads` from "init_update_refs", which handles gc state in handshake already. 
>> >> With the change, the new GC timing log will be like: >> >> [11.056s][info][gc,stats ] Concurrent Reset 89 us >> [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us >> [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us >> [11.056s][info][gc,stats ] Update Region States 3 us >> [11.056s][info][gc,stats ] Propagate GC state 1 us >> [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x >> [11.056s][info][gc,stats ] CMR: 456 us >> [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, >> [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x >> [11.057s][info][gc,stats ] CM: 3043 us >> [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, >> [11.057s][info][gc,stats ] Flush SATB 204 us >> [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us >> [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us >> [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x >> [11.057s][info][gc,stats ] Propagate GC state 2 us >> [11.057s][info][gc,stats ] Update Region States 12 us >> [11.057s][info][gc,stats ] Choose Collection Set 25 us >> [11.057s][info][gc,stats ] Rebuild Free Set 29 us >> [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x >> [11.057s][info][gc,stats ] CWRF: 17 us >> [11.057s][info][gc,... > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Fix name of counter Marked as reviewed by shade (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23759#pullrequestreview-2646089074 From ysr at openjdk.org Thu Feb 27 00:03:54 2025 From: ysr at openjdk.org (Y. 
Srinivas Ramakrishna) Date: Thu, 27 Feb 2025 00:03:54 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v8] In-Reply-To: References: Message-ID: <6KQyDTODPGMJk75Fyygvo36xQp1Z_WdnRuHUx1Wt9Uw=.84f67082-639c-45a0-898d-496b54aa2bae@github.com> On Wed, 26 Feb 2025 22:23:24 GMT, Xiaolong Peng wrote: >> The change is to improve the observability of Shenandoah GC, basically there are three changes for Shenandoah GC timings in this PR: >> >> 1. Net GC pause timings include the time to propagate GC state to Java threads >> 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs >> 3. Removal of the call of `propagate_gc_state_to_all_threads` from "init_update_refs", which handles gc state in handshake already. >> >> With the change, the new GC timing log will be like: >> >> [11.056s][info][gc,stats ] Concurrent Reset 89 us >> [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us >> [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us >> [11.056s][info][gc,stats ] Update Region States 3 us >> [11.056s][info][gc,stats ] Propagate GC state 1 us >> [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x >> [11.056s][info][gc,stats ] CMR: 456 us >> [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, >> [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x >> [11.057s][info][gc,stats ] CM: 3043 us >> [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, >> [11.057s][info][gc,stats ] Flush SATB 204 us >> [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us >> [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us >> [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x >> [11.057s][info][gc,stats ] 
Propagate GC state 2 us >> [11.057s][info][gc,stats ] Update Region States 12 us >> [11.057s][info][gc,stats ] Choose Collection Set 25 us >> [11.057s][info][gc,stats ] Rebuild Free Set 29 us >> [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x >> [11.057s][info][gc,stats ] CWRF: 17 us >> [11.057s][info][gc,... > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Fix name of counter Marked as reviewed by ysr (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23759#pullrequestreview-2646224512 From xpeng at openjdk.org Thu Feb 27 00:34:02 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Thu, 27 Feb 2025 00:34:02 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v8] In-Reply-To: References: Message-ID: <3HzYErJUBCxonwTuiqe8GJRNv_U1PVaOGedzk1Wsn8s=.430da6d6-1054-4495-8877-0c343347c394@github.com> On Wed, 26 Feb 2025 22:23:24 GMT, Xiaolong Peng wrote: >> The change is to improve the observability of Shenandoah GC, basically there are three changes for Shenandoah GC timings in this PR: >> >> 1. Net GC pause timings include the time to propagate GC state to Java threads >> 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs >> 3. Removal of the call of `propagate_gc_state_to_all_threads` from "init_update_refs", which handles gc state in handshake already. 
>> >> With the change, the new GC timing log will be like: >> >> [11.056s][info][gc,stats ] Concurrent Reset 89 us >> [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us >> [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us >> [11.056s][info][gc,stats ] Update Region States 3 us >> [11.056s][info][gc,stats ] Propagate GC state 1 us >> [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x >> [11.056s][info][gc,stats ] CMR: 456 us >> [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, >> [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x >> [11.057s][info][gc,stats ] CM: 3043 us >> [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, >> [11.057s][info][gc,stats ] Flush SATB 204 us >> [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us >> [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us >> [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x >> [11.057s][info][gc,stats ] Propagate GC state 2 us >> [11.057s][info][gc,stats ] Update Region States 12 us >> [11.057s][info][gc,stats ] Choose Collection Set 25 us >> [11.057s][info][gc,stats ] Rebuild Free Set 29 us >> [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x >> [11.057s][info][gc,stats ] CWRF: 17 us >> [11.057s][info][gc,... > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Fix name of counter Thanks all for the reviews! 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23759#issuecomment-2686513145 From duke at openjdk.org Thu Feb 27 00:34:03 2025 From: duke at openjdk.org (duke) Date: Thu, 27 Feb 2025 00:34:03 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v8] In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 22:23:24 GMT, Xiaolong Peng wrote: >> The change is to improve the observability of Shenandoah GC, basically there are three changes for Shenandoah GC timings in this PR: >> >> 1. Net GC pause timings include the time to propagate GC state to Java threads >> 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs >> 3. Removal of the call of `propagate_gc_state_to_all_threads` from "init_update_refs", which handles gc state in handshake already. >> >> With the change, the new GC timing log will be like: >> >> [11.056s][info][gc,stats ] Concurrent Reset 89 us >> [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us >> [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us >> [11.056s][info][gc,stats ] Update Region States 3 us >> [11.056s][info][gc,stats ] Propagate GC state 1 us >> [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x >> [11.056s][info][gc,stats ] CMR: 456 us >> [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, >> [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, >> [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x >> [11.057s][info][gc,stats ] CM: 3043 us >> [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, >> [11.057s][info][gc,stats ] Flush SATB 204 us >> [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us >> [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us >> [11.057s][info][gc,stats ] Finish Mark 129 
us, parallelism: 0.01x >> [11.057s][info][gc,stats ] Propagate GC state 2 us >> [11.057s][info][gc,stats ] Update Region States 12 us >> [11.057s][info][gc,stats ] Choose Collection Set 25 us >> [11.057s][info][gc,stats ] Rebuild Free Set 29 us >> [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x >> [11.057s][info][gc,stats ] CWRF: 17 us >> [11.057s][info][gc,... > > Xiaolong Peng has updated the pull request incrementally with one additional commit since the last revision: > > Fix name of counter @pengxiaolong Your change (at version caf9d4c35a5712b4fa4c234aedb092abbdcc826e) is now ready to be sponsored by a Committer. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23759#issuecomment-2686513700 From wkemper at openjdk.org Thu Feb 27 01:14:29 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 27 Feb 2025 01:14:29 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v16] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). 
> > ## Testing > * jtreg hotspot_gc_shenandoah > * dacapo, extremem, diluvian, specjbb2015, specjvm2018, heapothesys William Kemper has updated the pull request incrementally with two additional commits since the last revision: - Add assertions about old gen state when resuming old cycles - Remove duplicated field pointer for old generation ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/fb7819d0..d2e90dde Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=15 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=14-15 Stats: 9 lines in 2 files changed: 2 ins; 2 del; 5 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From duke at openjdk.org Thu Feb 27 01:40:58 2025 From: duke at openjdk.org (duke) Date: Thu, 27 Feb 2025 01:40:58 GMT Subject: RFR: 8314840: 3 gc/epsilon tests ignore external vm options In-Reply-To: References: Message-ID: On Mon, 24 Feb 2025 15:27:46 GMT, Ramkumar Sunderbabu wrote: > These tests do not pass Java/JVM test command line options (flags) to the child process. More details in JBS. > Tiers 1 to 3 tested. Along with various flag combinations. @rsunderbabu Your change (at version a83a03bc0fbb4895726d1b1316f4486a69ff475b) is now ready to be sponsored by a Committer. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23751#issuecomment-2686588196 From rsunderbabu at openjdk.org Thu Feb 27 01:40:58 2025 From: rsunderbabu at openjdk.org (Ramkumar Sunderbabu) Date: Thu, 27 Feb 2025 01:40:58 GMT Subject: RFR: 8314840: 3 gc/epsilon tests ignore external vm options In-Reply-To: References: Message-ID: On Mon, 24 Feb 2025 15:27:46 GMT, Ramkumar Sunderbabu wrote: > These tests do not pass Java/JVM test command line options (flags) to the child process. More details in JBS. > Tiers 1 to 3 tested. 
Along with various flag combinations. Please sponsor. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23751#issuecomment-2686588474 From ysr at openjdk.org Thu Feb 27 02:42:09 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Thu, 27 Feb 2025 02:42:09 GMT Subject: RFR: 8350314: Shenandoah: Capture thread state sync times in GC timings [v3] In-Reply-To: References: <7HdJx1s4PkFH9L1AdYjiQMjR4nfl0RM4FDHtdlLDge4=.cf536d80-1679-44f8-b36c-9aab738f7cd8@github.com> Message-ID: On Wed, 26 Feb 2025 18:56:06 GMT, Xiaolong Peng wrote: > I'm not sure how parallelism is calculated, but I think it is caused by the test I was running, the test is very simple and there are only a small number of live objects after mark. Yes, that explains it, thanks. (PS: Parallelism (or really parallel speed-up) is calculated as the ratio of total thread virtual time to the wall-clock (elapsed) time, IIRC. As you noted, work in this specific workload is too small for the parallel task overhead, as indicated by the micro-seconds worth of time.) ------------- PR Comment: https://git.openjdk.org/jdk/pull/23759#issuecomment-2686707985 From tschatzl at openjdk.org Thu Feb 27 08:47:30 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 27 Feb 2025 08:47:30 GMT Subject: RFR: 8350758: G1: Use actual last prediction in accumulated survivor rate prediction too Message-ID: Hi all, please review this fix to recent JDK-8349906 to use the current survivor rate in the accumulated survivor rate for initializing newly allocated entries as well.
Testing: gha Thanks, Thomas ------------- Commit messages: - 8350758 Changes: https://git.openjdk.org/jdk/pull/23795/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23795&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8350758 Stats: 16 lines in 1 file changed: 5 ins; 8 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/23795.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23795/head:pull/23795 PR: https://git.openjdk.org/jdk/pull/23795 From xpeng at openjdk.org Thu Feb 27 09:52:22 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Thu, 27 Feb 2025 09:52:22 GMT Subject: Integrated: 8350314: Shenandoah: Capture thread state sync times in GC timings In-Reply-To: References: Message-ID: On Tue, 25 Feb 2025 01:20:35 GMT, Xiaolong Peng wrote: > The change is to improve the observability of Shenandoah GC, basically there are three changes for Shenandoah GC timings in this PR: > > 1. Net GC pause timings include the time to propagate GC state to Java threads > 2. Add new timing "Propagate GC state" in Shenandoah GC timing logs > 3. Removal of the call of `propagate_gc_state_to_all_threads` from "init_update_refs", which handles gc state in handshake already. 
> > With the change, the new GC timing log will be like: > > [11.056s][info][gc,stats ] Concurrent Reset 89 us > [11.056s][info][gc,stats ] Pause Init Mark (G) 257 us > [11.056s][info][gc,stats ] Pause Init Mark (N) 17 us > [11.056s][info][gc,stats ] Update Region States 3 us > [11.056s][info][gc,stats ] Propagate GC state 1 us > [11.056s][info][gc,stats ] Concurrent Mark Roots 232 us, parallelism: 1.96x > [11.056s][info][gc,stats ] CMR: 456 us > [11.056s][info][gc,stats ] CMR: Thread Roots 429 us, workers (us): 139, 148, 142, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: VM Strong Roots 11 us, workers (us): 8, 3, 0, ---, ---, ---, > [11.057s][info][gc,stats ] CMR: CLDG Roots 16 us, workers (us): 16, ---, ---, ---, ---, ---, > [11.057s][info][gc,stats ] Concurrent Marking 1304 us, parallelism: 2.33x > [11.057s][info][gc,stats ] CM: 3043 us > [11.057s][info][gc,stats ] CM: Parallel Mark 3043 us, workers (us): 1023, 1017, 1003, ---, ---, ---, > [11.057s][info][gc,stats ] Flush SATB 204 us > [11.057s][info][gc,stats ] Pause Final Mark (G) 865 us > [11.057s][info][gc,stats ] Pause Final Mark (N) 234 us > [11.057s][info][gc,stats ] Finish Mark 129 us, parallelism: 0.01x > [11.057s][info][gc,stats ] Propagate GC state 2 us > [11.057s][info][gc,stats ] Update Region States 12 us > [11.057s][info][gc,stats ] Choose Collection Set 25 us > [11.057s][info][gc,stats ] Rebuild Free Set 29 us > [11.057s][info][gc,stats ] Concurrent Weak References 67 us, parallelism: 0.25x > [11.057s][info][gc,stats ] CWRF: 17 us > [11.057s][info][gc,stats ] CWRF: Weak References 17 us, workers (... This pull request has now been integrated. 
Changeset: 01bd7e41 Author: Xiaolong Peng URL: https://git.openjdk.org/jdk/commit/01bd7e417ee3d39067370e616660b7f5c723dc26 Stats: 47 lines in 6 files changed: 40 ins; 7 del; 0 mod 8350314: Shenandoah: Capture thread state sync times in GC timings Reviewed-by: ysr, shade, wkemper ------------- PR: https://git.openjdk.org/jdk/pull/23759 From ayang at openjdk.org Thu Feb 27 10:59:52 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 27 Feb 2025 10:59:52 GMT Subject: RFR: 8350758: G1: Use actual last prediction in accumulated survivor rate prediction too In-Reply-To: References: Message-ID: <_8V7e9yAXzH5KceaLFtRmBe-_HGePHWuM0m5WmXSFKI=.2cda4958-d53c-4713-ad00-155a0cbb4b9e@github.com> On Wed, 26 Feb 2025 11:51:31 GMT, Thomas Schatzl wrote: > Hi all, > > please review this fix to recent JDK-8349906 to use the current > survivor rate in the accumulated survivor rate for initializing newly allocated entries as well. > > Testing: gha > > Thanks, > Thomas src/hotspot/share/gc/g1/g1SurvRateGroup.cpp line 76: > 74: if (i == 0) { > 75: _surv_rate_predictors[i]->add(InitialSurvivorRate); > 76: _accum_surv_rate_pred[i] = 0.0; Should this be updated as well? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23795#discussion_r1973350393 From aboldtch at openjdk.org Thu Feb 27 11:20:11 2025 From: aboldtch at openjdk.org (Axel Boldt-Christmas) Date: Thu, 27 Feb 2025 11:20:11 GMT Subject: RFR: 8350572: ZGC: Enhance z_verify_safepoints_are_blocked interactions with VMError Message-ID: If VMError reporting is triggered from a disallowed thread state `z_verify_safepoints_are_blocked` will cause reentrant assertions to be triggered, for example when loading the thread oop as part of thread printing. This extends the verification to be ignored if triggered from the thread doing the error reporting. In most cases performing the load barriers from disallowed thread states during error reporting will work.
Testing: - tier 1 Oracle supported platforms - GHA ------------- Commit messages: - 8350572: ZGC: Enhance z_verify_safepoints_are_blocked interactions with VMError Changes: https://git.openjdk.org/jdk/pull/23820/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23820&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8350572 Stats: 8 lines in 1 file changed: 8 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23820.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23820/head:pull/23820 PR: https://git.openjdk.org/jdk/pull/23820 From tschatzl at openjdk.org Thu Feb 27 12:22:33 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 27 Feb 2025 12:22:33 GMT Subject: RFR: 8350758: G1: Use actual last prediction in accumulated survivor rate prediction too [v2] In-Reply-To: References: Message-ID: <6NAHP_4D-gWM_Sx1wmWg_nxDvoIPGpyN6E2xJR8Deag=.0f8d8872-cb8f-47a4-8bbf-fa6e62101db6@github.com> > Hi all, > > please review this fix to recent JDK-8349906 to use the current > survivor rate in the accumulated survivor rate for initializing newly allocated entries as well. 
> > Testing: gha > > Thanks, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: * ayang review ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23795/files - new: https://git.openjdk.org/jdk/pull/23795/files/5ab28451..8bfe673a Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23795&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23795&range=00-01 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/23795.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23795/head:pull/23795 PR: https://git.openjdk.org/jdk/pull/23795 From ayang at openjdk.org Thu Feb 27 12:22:33 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 27 Feb 2025 12:22:33 GMT Subject: RFR: 8350758: G1: Use actual last prediction in accumulated survivor rate prediction too [v2] In-Reply-To: <6NAHP_4D-gWM_Sx1wmWg_nxDvoIPGpyN6E2xJR8Deag=.0f8d8872-cb8f-47a4-8bbf-fa6e62101db6@github.com> References: <6NAHP_4D-gWM_Sx1wmWg_nxDvoIPGpyN6E2xJR8Deag=.0f8d8872-cb8f-47a4-8bbf-fa6e62101db6@github.com> Message-ID: <5Egj5zzjGbKW9VO_-PDpnhgdsVhO4PFh_9szRmndrnA=.e6a3102f-2aec-4a90-a980-1314b0fc0816@github.com> On Thu, 27 Feb 2025 12:19:16 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this fix to recent JDK-8349906 to use the current >> survivor rate in the accumulated survivor rate for initializing newly allocated entries as well. >> >> Testing: gha >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > * ayang review Marked as reviewed by ayang (Reviewer). 
------------- PR Review: https://git.openjdk.org/jdk/pull/23795#pullrequestreview-2647588439 From eosterlund at openjdk.org Thu Feb 27 12:48:57 2025 From: eosterlund at openjdk.org (Erik Österlund) Date: Thu, 27 Feb 2025 12:48:57 GMT Subject: RFR: 8350572: ZGC: Enhance z_verify_safepoints_are_blocked interactions with VMError In-Reply-To: References: Message-ID: On Thu, 27 Feb 2025 11:15:52 GMT, Axel Boldt-Christmas wrote: > If VMError reporting is triggered from a disallowed thread state `z_verify_safepoints_are_blocked` will cause reentrant assertions to be triggered, for example when loading the thread oop as part of thread printing. This extends the verification to be ignored if triggered from the thread doing the error reporting. In most cases performing the load barriers from disallowed thread states during error reporting will work. > > Testing: > - tier 1 Oracle supported platforms > - GHA Looks good. ------------- Marked as reviewed by eosterlund (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23820#pullrequestreview-2647669760 From aboldtch at openjdk.org Thu Feb 27 12:50:30 2025 From: aboldtch at openjdk.org (Axel Boldt-Christmas) Date: Thu, 27 Feb 2025 12:50:30 GMT Subject: RFR: 8350851: ZGC: Reduce size of ZAddressOffsetMax scaling data structures Message-ID: ZAddressOffsetMax is used to scale a few of our BitMap and GranuleMap data structures. ZAddressOffsetMax is initialised to an upper limit, prior to reserving the virtual address space for the heap. After the reservation, the largest address offset that can be encountered may be much lower. I propose we scale ZAddressOffsetMax down after our heap reservation is complete, to the actual max value a zoffset_end is allowed to be. Doing this gives us two benefits. Firstly the assertions and type checks will be stricter, and will exercise code paths that otherwise only occur when using a 16TB heap.
Secondly we can reduce the size of the data structures which scale with ZAddressOffsetMax. (For most OSs the extra memory of these data structures does not really matter, as they are not paged in. But they are accounted for both on the OS, allocator and NMT layers). The page table uses ZIndexDistributor to iterate and distribute indices. The different strategies have different requirements on the alignment of the size of the range it distributes across. My proposed implementation simply aligns up the page table size to this alignment requirement, as it is the least intrusive change, at the cost of a somewhat larger data structure than strictly required. The alternative would be to extend ZIndexDistributor with support for any alignment on the range, or condition the use of the distributed indices based on whether they are less than the size. The data structures can also be larger than required if we fail to reserve the heap starting at our heap base. However this is a very rare occurrence, and while it would be nice to extend our asserts to check for a "ZAddressOffsetMin", I'll leave that for a future enhancement.
Testing: * ZGC specific tasks, tier 1 through tier 8 on Oracle Supported platforms * with `ZIndexDistributorStrategy=0`, and * with `ZIndexDistributorStrategy=1` * GHA ------------- Commit messages: - 8350851: ZGC: Reduce size of ZAddressOffsetMax scaling data structures Changes: https://git.openjdk.org/jdk/pull/23822/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23822&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8350851 Stats: 82 lines in 10 files changed: 73 ins; 1 del; 8 mod Patch: https://git.openjdk.org/jdk/pull/23822.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23822/head:pull/23822 PR: https://git.openjdk.org/jdk/pull/23822 From rsunderbabu at openjdk.org Thu Feb 27 13:01:11 2025 From: rsunderbabu at openjdk.org (Ramkumar Sunderbabu) Date: Thu, 27 Feb 2025 13:01:11 GMT Subject: Integrated: 8314840: 3 gc/epsilon tests ignore external vm options In-Reply-To: References: Message-ID: On Mon, 24 Feb 2025 15:27:46 GMT, Ramkumar Sunderbabu wrote: > These tests do not pass Java/JVM test command line options (flags) to the child process. More details in JBS. > Tiers 1 to 3 tested. Along with various flag combinations. This pull request has now been integrated. 
Changeset: 799ac528 Author: Ramkumar Sunderbabu Committer: SendaoYan URL: https://git.openjdk.org/jdk/commit/799ac5288efbbb89e21319cd45657c8f817ad680 Stats: 9 lines in 3 files changed: 0 ins; 0 del; 9 mod 8314840: 3 gc/epsilon tests ignore external vm options Reviewed-by: tschatzl ------------- PR: https://git.openjdk.org/jdk/pull/23751 From iwalulya at openjdk.org Thu Feb 27 13:02:06 2025 From: iwalulya at openjdk.org (Ivan Walulya) Date: Thu, 27 Feb 2025 13:02:06 GMT Subject: RFR: 8350758: G1: Use actual last prediction in accumulated survivor rate prediction too [v2] In-Reply-To: <6NAHP_4D-gWM_Sx1wmWg_nxDvoIPGpyN6E2xJR8Deag=.0f8d8872-cb8f-47a4-8bbf-fa6e62101db6@github.com> References: <6NAHP_4D-gWM_Sx1wmWg_nxDvoIPGpyN6E2xJR8Deag=.0f8d8872-cb8f-47a4-8bbf-fa6e62101db6@github.com> Message-ID: <19YTnvDZIxpkuA9etPTn0Hb5bGcgJkKJDGHiR3ER6j8=.45ba6b3a-be0e-4cf2-bd24-c0b5633fcee2@github.com> On Thu, 27 Feb 2025 12:22:33 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this fix to recent JDK-8349906 to use the current >> survivor rate in the accumulated survivor rate for initializing newly allocated entries as well. >> >> Testing: gha >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > * ayang review Marked as reviewed by iwalulya (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23795#pullrequestreview-2647702251 From rsunderbabu at openjdk.org Thu Feb 27 13:43:57 2025 From: rsunderbabu at openjdk.org (Ramkumar Sunderbabu) Date: Thu, 27 Feb 2025 13:43:57 GMT Subject: RFR: 8314840: 3 gc/epsilon tests ignore external vm options In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 09:05:04 GMT, Thomas Schatzl wrote: >> These tests do not pass Java/JVM test command line options (flags) to the child process. More details in JBS. >> Tiers 1 to 3 tested. Along with various flag combinations. > > lgtm. 
Thanks @tschatzl and @sendaoYan ------------- PR Comment: https://git.openjdk.org/jdk/pull/23751#issuecomment-2688000537 From eosterlund at openjdk.org Thu Feb 27 16:03:04 2025 From: eosterlund at openjdk.org (Erik Österlund) Date: Thu, 27 Feb 2025 16:03:04 GMT Subject: RFR: 8350851: ZGC: Reduce size of ZAddressOffsetMax scaling data structures In-Reply-To: References: Message-ID: On Thu, 27 Feb 2025 12:45:36 GMT, Axel Boldt-Christmas wrote: > ZAddressOffsetMax is used to scale a few of our BitMap and GranuleMap data structures. ZAddressOffsetMax is initialised to an upper limit, prior to reserving the virtual address space for the heap. After the reservation, the largest address offset that can be encountered may be much lower. > > I propose we scale ZAddressOffsetMax down after our heap reservation is complete, to the actual max value a zoffset_end is allowed to be. > > Doing this gives us two benefits. Firstly the assertions and type checks will be stricter, and will exercise code paths that otherwise only occur when using a 16TB heap. Secondly we can reduce the size of the data structures which scale with ZAddressOffsetMax. (For most OSs the extra memory of these data structures does not really matter, as they are not paged in. But they are accounted for both on the OS, allocator and NMT layers). > > The page table uses ZIndexDistributor to iterate and distribute indices. The different strategies have different requirements on the alignment of the size of the range it distributes across. My proposed implementation simply aligns up the page table size to this alignment requirement, as it is the least intrusive change, at the cost of a somewhat larger data structure than strictly required. The alternative would be to extend ZIndexDistributor with support for any alignment on the range, or condition the use of the distributed indices based on whether they are less than the size.
> > The data structures can also be larger than required if we fail to reserve the heap starting at our heap base. However this is a very rare occurrence, and while it would be nice to extend our asserts to check for a "ZAddressOffsetMin", I'll leave that for a future enhancement. > > Testing: > * ZGC specific tasks, tier 1 through tier 8 on Oracle Supported platforms > * with `ZIndexDistributorStrategy=0`, and > * with `ZIndexDistributorStrategy=1` > * GHA Nice. Looks good. ------------- Marked as reviewed by eosterlund (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23822#pullrequestreview-2648286555 From kdnilsen at openjdk.org Thu Feb 27 17:10:43 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 27 Feb 2025 17:10:43 GMT Subject: RFR: 8350889: GenShen: Break out of infinite loop of old GC cycles Message-ID: Add log message when heuristic triggers because previous trigger is pending. Cancel the pending-trigger condition for old-generation at the start of old-generation GC. ------------- Commit messages: - Fix white space - Cancel pending GC trigger at start of old gc - Merge branch 'openjdk:master' into master - Merge branch 'openjdk:master' into master - Merge branch 'openjdk:master' into master - Merge branch 'openjdk:master' into master - Merge branch 'openjdk:master' into master - Merge branch 'openjdk:master' into master - Merge branch 'openjdk:master' into master - Merge branch 'openjdk:master' into master - ...
and 20 more: https://git.openjdk.org/jdk/compare/3c9d64eb...85319914 Changes: https://git.openjdk.org/jdk/pull/23827/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23827&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8350889 Stats: 2 lines in 2 files changed: 2 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23827.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23827/head:pull/23827 PR: https://git.openjdk.org/jdk/pull/23827 From ysr at openjdk.org Thu Feb 27 17:35:02 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Thu, 27 Feb 2025 17:35:02 GMT Subject: RFR: 8350889: GenShen: Break out of infinite loop of old GC cycles In-Reply-To: References: Message-ID: On Thu, 27 Feb 2025 16:58:49 GMT, Kelvin Nilsen wrote: > Add log message when heuristic triggers because previous trigger is pending. Cancel the pending-trigger condition for old-generation at the start of old-generation GC. Changes look good. A somewhat tangential comment (possibly a nit) on old (existing) code. src/hotspot/share/gc/shenandoah/heuristics/shenandoahAdaptiveHeuristics.cpp line 246: > 244: log_trigger("GC start is already pending"); > 245: return true; > 246: } Should this happen lower down? Otherwise you miss the sampling of allocation rate at this time, down at line 249. Not sure if that can be an issue. Perhaps not and you just interpolate an average over the duration since the last sample albeit missing the occasional spike or dip, with perhaps the resulting low-pass filtering here ok? It would be OK if there is not an expectation of roughly synchronous sampling. Depends on how the code at line 286 works, I guess. ------------- Marked as reviewed by ysr (Reviewer).
PR Review: https://git.openjdk.org/jdk/pull/23827#pullrequestreview-2648522862 PR Review Comment: https://git.openjdk.org/jdk/pull/23827#discussion_r1974023918 From wkemper at openjdk.org Thu Feb 27 17:35:01 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 27 Feb 2025 17:35:01 GMT Subject: RFR: 8350889: GenShen: Break out of infinite loop of old GC cycles In-Reply-To: References: Message-ID: On Thu, 27 Feb 2025 16:58:49 GMT, Kelvin Nilsen wrote: > Add log message when heuristic triggers because previous trigger is pending. Cancel the pending-trigger condition for old-generation at the start of old-generation GC. LGTM ------------- Marked as reviewed by wkemper (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23827#pullrequestreview-2648558352 From ysr at openjdk.org Thu Feb 27 17:35:03 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Thu, 27 Feb 2025 17:35:03 GMT Subject: RFR: 8350889: GenShen: Break out of infinite loop of old GC cycles In-Reply-To: References: Message-ID: <7YIfJj5yEf_pjdRuHh2IXyiF88Un6h1MQ-TPbQzrkIY=.5ac56e7f-faa7-479f-b16d-4ec3ff552bbc@github.com> On Thu, 27 Feb 2025 17:18:28 GMT, Y. Srinivas Ramakrishna wrote: >> src/hotspot/share/gc/shenandoah/heuristics/shenandoahAdaptiveHeuristics.cpp line 246: >> >>> 244: log_trigger("GC start is already pending"); >>> 245: return true; >>> 246: } >> >> Should this happen lower down? Otherwise you miss the sampling of allocation rate at this time, down at line 249. Not sure if that can be an issue. Perhaps not and you just interpolate an average over the duration since the last sample albeit missing the occasional spike or dip, with perhaps the resulting low-pass filtering here ok? It would be OK if there is not an expectation of roughly synchronous sampling. Depends on how the code at line 286 works, I guess.
PS: the lvalue rate at line 249 isn't used anywhere. Does the compiler complain if you `sample` without assigning the value? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23827#discussion_r1974045814 From ysr at openjdk.org Thu Feb 27 17:35:03 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Thu, 27 Feb 2025 17:35:03 GMT Subject: RFR: 8350889: GenShen: Break out of infinite loop of old GC cycles In-Reply-To: <7YIfJj5yEf_pjdRuHh2IXyiF88Un6h1MQ-TPbQzrkIY=.5ac56e7f-faa7-479f-b16d-4ec3ff552bbc@github.com> Message-ID: On Thu, 27 Feb 2025 17:28:07 GMT, Y. Srinivas Ramakrishna wrote: >> src/hotspot/share/gc/shenandoah/heuristics/shenandoahAdaptiveHeuristics.cpp line 246: >> >>> 244: log_trigger("GC start is already pending"); >>> 245: return true; >>> 246: } >> >> Should this happen lower down? Otherwise you miss the sampling of allocation rate at this time, down at line 249. Not sure if that can be an issue. Perhaps not and you just interpolate an average over the duration since the last sample albeit missing the occasional spike or dip, with perhaps the resulting low-pass filtering here ok? It would be OK if there is not an expectation of roughly synchronous sampling. Depends on how the code at line 286 works, I guess. > > PS: the lvalue rate at line 249 isn't used anywhere. Does the compiler complain if you `sample` without assigning the value? Looked at the code for `ShenandoahAllocationRate`, I guess this may have the effect of making the sampling a bit more asynchronous, but the resulting smoothing over the interval could underestimate the volatility of the rate which is used in an `upper_bound` calculation below. Maybe that is harmless?
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23827#discussion_r1974048690 From wkemper at openjdk.org Thu Feb 27 18:27:00 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 27 Feb 2025 18:27:00 GMT Subject: RFR: 8347804: GenShen: Crash with small GCCardSizeInBytes and small Java heap In-Reply-To: References: Message-ID: On Thu, 30 Jan 2025 18:55:53 GMT, Kelvin Nilsen wrote: > Original implementation was not robust to overriding of CardSizeInBytes, especially to smaller values. This fixes that issue. Minor nit. src/hotspot/share/gc/shenandoah/shenandoahArguments.cpp line 192: > 190: > 191: if (GCCardSizeInBytes < ShenandoahMinCardSizeInBytes) { > 192: char buf[512]; It looks like using `err_msg` here is more idiomatic than `os::snprintf` for errors during initialization. src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 950: > 948: ShenandoahHeap* const heap = ShenandoahHeap::heap(); > 949: assert(heap->is_concurrent_weak_root_in_progress(), "Only during this phase"); > 950: { This looks like it came from https://github.com/openjdk/jdk/pull/23604. Did you cherry-pick that into this branch? Not sure why it shows in the diff here. ------------- Changes requested by wkemper (Reviewer).
PR Review: https://git.openjdk.org/jdk/pull/23373#pullrequestreview-2648607661 PR Review Comment: https://git.openjdk.org/jdk/pull/23373#discussion_r1974075963 PR Review Comment: https://git.openjdk.org/jdk/pull/23373#discussion_r1974082900 From ayang at openjdk.org Thu Feb 27 18:34:14 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 27 Feb 2025 18:34:14 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v2] In-Reply-To: References: Message-ID: <3zmj-DeeRyPMHc32YnvfqACN0xJxLQ6jZZ7sd-Baa3w=.672912f6-e4a3-4679-b8a3-b7f6ad51589d@github.com> On Tue, 25 Feb 2025 15:13:43 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. >> >> The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. >> >> ### Current situation >> >> With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. >> >> The main reason for the current barrier is how g1 implements concurrent refinement: >> * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. >> * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, >> * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. 
>> >> These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: >> >> >> // Filtering >> if (region(@x.a) == region(y)) goto done; // same region check >> if (y == null) goto done; // null value check >> if (card(@x.a) == young_card) goto done; // write to young gen check >> StoreLoad; // synchronize >> if (card(@x.a) == dirty_card) goto done; >> >> *card(@x.a) = dirty >> >> // Card tracking >> enqueue(card-address(@x.a)) into thread-local-dcq; >> if (thread-local-dcq is not full) goto done; >> >> call runtime to move thread-local-dcq into dcqs >> >> done: >> >> >> Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. >> >> The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. >> >> There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). >> >> The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching c... > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > * remove unnecessarily added logging src/hotspot/share/gc/g1/g1BarrierSet.hpp line 54: > 52: // them, keeping the write barrier simple. > 53: // > 54: // The refinement threads mark cards in the the current collection set specially on the "the the" typo. src/hotspot/share/gc/g1/g1CardTable.inline.hpp line 47: > 45: > 46: // Returns bits from a where mask is 0, and bits from b where mask is 1. > 47: inline size_t blend(size_t a, size_t b, size_t mask) { Can you provide some input/output examples in the doc? 
src/hotspot/share/gc/g1/g1CardTableClaimTable.cpp line 45: > 43: } > 44: > 45: void G1CardTableClaimTable::initialize(size_t max_reserved_regions) { Should the arg be `uint`? src/hotspot/share/gc/g1/g1ConcurrentRefine.cpp line 280: > 278: assert_state(State::SweepRT); > 279: > 280: set_state_start_time(); This method is called in a loop; would that skew the state-starting time? src/hotspot/share/gc/g1/g1ConcurrentRefine.cpp line 344: > 342: size_t _num_clean; > 343: size_t _num_dirty; > 344: size_t _num_to_cset; Seem never read. src/hotspot/share/gc/g1/g1ConcurrentRefine.cpp line 349: > 347: > 348: bool do_heap_region(G1HeapRegion* r) override { > 349: if (!r->is_free()) { I am a bit lost on this closure; the intention seems to set unclaimed to all non-free regions, why can't this be done in one go, instead of first setting all regions to claimed (`reset_all_claims_to_claimed`), then set non-free ones unclaimed? src/hotspot/share/gc/g1/g1ConcurrentRefine.hpp line 116: > 114: > 115: // Current heap snapshot. > 116: G1CardTableClaimTable* _sweep_state; Since this is a table, I wonder if we can name it "x_table" instead of "x_state". src/hotspot/share/gc/g1/g1RemSet.cpp line 147: > 145: if (_contains[region]) { > 146: return; > 147: } Indentation seems broken. src/hotspot/share/gc/g1/g1RemSet.cpp line 830: > 828: size_t const start_idx = region_card_base_idx + claim.value(); > 829: > 830: size_t* card_cur_card = (size_t*)card_table->byte_for_index(start_idx); This var name should end with "_word", instead of "_card". src/hotspot/share/gc/g1/g1RemSet.cpp line 1252: > 1250: G1ConcurrentRefineWorkState::snapshot_heap_into(&constructed); > 1251: claim = &constructed; > 1252: } It's not super obvious to me why the "has_sweep_claims" checking needs to be on this level. Can `G1ConcurrentRefineWorkState` return a valid `G1CardTableClaimTable*` directly? 
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1974124792 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1971426039 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1973435950 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1974083760 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1973447654 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1973452168 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1974056492 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1973423400 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1974108760 PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1974134441 From kdnilsen at openjdk.org Thu Feb 27 18:35:57 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 27 Feb 2025 18:35:57 GMT Subject: RFR: 8350889: GenShen: Break out of infinite loop of old GC cycles In-Reply-To: References: <7YIfJj5yEf_pjdRuHh2IXyiF88Un6h1MQ-TPbQzrkIY=.5ac56e7f-faa7-479f-b16d-4ec3ff552bbc@github.com> Message-ID: <4lsZ9HWnjBflQG5tz0R5gaQf3eHsn21jzvbVTBUGv2c=.a412baa0-2c8d-486e-8cda-880e2c6cc52b@github.com> On Thu, 27 Feb 2025 17:30:08 GMT, Y. Srinivas Ramakrishna wrote: >> PS: the lvalue rate at line 249 isn't used anywhere. Does the compiler complain if you `sample` without assigning the value? > > Looked at the code for `ShenandoahAllocationRate`, I guess this may have the effect of making the sampling a bit more asynchronous, but the resulting smoothing over the interval could underestimate the volatility of the rate which is used in an `upper_bound` calculation below. May be that is harmless? Yes. If we've already decided to trigger, there is no urgency to re-evaluate the allocation rate. We won't be checking for should_start_gc() again until the end of the current gc cycle. 
At that time, we will update the allocation rate based on how many allocations were realized during the GC cycle. The initial allocation rate estimate will be refined as should_start_gc() is again sampled every ms or so. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23827#discussion_r1974136474 From jsikstro at openjdk.org Thu Feb 27 18:39:01 2025 From: jsikstro at openjdk.org (Joel Sikström) Date: Thu, 27 Feb 2025 18:39:01 GMT Subject: RFR: 8350851: ZGC: Reduce size of ZAddressOffsetMax scaling data structures In-Reply-To: References: Message-ID: On Thu, 27 Feb 2025 12:45:36 GMT, Axel Boldt-Christmas wrote: > ZAddressOffsetMax is used to scale a few of our BitMap and GranuleMap data structures. ZAddressOffsetMax is initialised to an upper limit, prior to reserving the virtual address space for the heap. After the reservation, the largest address offset that can be encountered may be much lower. > > I propose we scale ZAddressOffsetMax down after our heap reservation is complete, to the actual max value a zoffset_end is allowed to be. > > Doing this gives us two benefits. Firstly the assertions and type checks will be stricter, and will exercise code paths that otherwise only occur when using a 16TB heap. Secondly we can reduce the size of the data structures which scale with ZAddressOffsetMax. (For most OSs the extra memory of these data structures does not really matter as they are not paged in. But they are accounted for on the OS, allocator and NMT layers). > > The page table uses ZIndexDistributor to iterate and distribute indices. The different strategies have different requirements on the alignment of the size of the range it distributes across. My proposed implementation simply aligns up the page table size to this alignment requirement. As it is the least intrusive change, at the cost of some larger data structure than strictly required.
The alternative would be to extend ZIndexDistributor with support for any alignment on the range, or condition the use of the distributed indices based on if they are less than the size. > > The data structures can also be larger than required if we fail to reserve the heap starting at our heap base. However this is a very rare occurrence, and while it would be nice to extend our asserts to check for a "ZAddressOffsetMin", I'll leave that for a future enhancement. > > Testing: > * ZGC specific tasks, tier 1 through tier 8 on Oracle Supported platforms > * with `ZIndexDistributorStrategy=0`, and > * with `ZIndexDistributorStrategy=1` > * GHA Looks good! ------------- Marked as reviewed by jsikstro (Committer). PR Review: https://git.openjdk.org/jdk/pull/23822#pullrequestreview-2648726308 From wkemper at openjdk.org Thu Feb 27 18:39:59 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 27 Feb 2025 18:39:59 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc [v5] In-Reply-To: References: Message-ID: On Tue, 18 Feb 2025 19:28:28 GMT, Kelvin Nilsen wrote: >> In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. > > Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains seven commits: > > - Merge branch 'master' of https://git.openjdk.org/jdk into defer-generational-full-gc > - Merge master > - Fix typo in merge conflict resolution > - 8348595: GenShen: Fix generational free-memory no-progress check > > Reviewed-by: phh, xpeng > - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) > > Reviewed-by: shade > - Merge tag 'jdk-25+10' into defer-generational-full-gc > > Added tag jdk-25+10 for changeset a637ccf2 > - Be less eager to upgrade degen to full gc Marked as reviewed by wkemper (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23552#pullrequestreview-2648733128 From kdnilsen at openjdk.org Thu Feb 27 18:43:02 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 27 Feb 2025 18:43:02 GMT Subject: Integrated: 8349766: GenShen: Bad progress after degen does not always need full gc In-Reply-To: References: Message-ID: <0sDKADu6y5hHS3qy_R2vjlKRi0ttb2Q4XAKqdtnm23k=.872c061e-7409-424d-acd1-9436ed187866@github.com> On Tue, 11 Feb 2025 03:31:51 GMT, Kelvin Nilsen wrote: > In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. This pull request has now been integrated. 
Changeset: 3ae80bfb Author: Kelvin Nilsen URL: https://git.openjdk.org/jdk/commit/3ae80bfb6085e1a6bcb551c7b0be8f27b6f9fde9 Stats: 20 lines in 2 files changed: 17 ins; 0 del; 3 mod 8349766: GenShen: Bad progress after degen does not always need full gc Reviewed-by: wkemper ------------- PR: https://git.openjdk.org/jdk/pull/23552 From wkemper at openjdk.org Thu Feb 27 18:47:24 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 27 Feb 2025 18:47:24 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v17] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). 
> > ## Testing > * jtreg hotspot_gc_shenandoah > * dacapo, extremem, diluvian, specjbb2015, specjvm2018, heapothesys William Kemper has updated the pull request incrementally with one additional commit since the last revision: Don't check for shutdown in control thread loop condition It may cause the thread to exit before it is requested to stop ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/d2e90dde..150cb798 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=16 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=15-16 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From kdnilsen at openjdk.org Thu Feb 27 18:56:06 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 27 Feb 2025 18:56:06 GMT Subject: RFR: 8350889: GenShen: Break out of infinite loop of old GC cycles In-Reply-To: <4lsZ9HWnjBflQG5tz0R5gaQf3eHsn21jzvbVTBUGv2c=.a412baa0-2c8d-486e-8cda-880e2c6cc52b@github.com> References: <7YIfJj5yEf_pjdRuHh2IXyiF88Un6h1MQ-TPbQzrkIY=.5ac56e7f-faa7-479f-b16d-4ec3ff552bbc@github.com> <4lsZ9HWnjBflQG5tz0R5gaQf3eHsn21jzvbVTBUGv2c=.a412baa0-2c8d-486e-8cda-880e2c6cc52b@github.com> Message-ID: On Thu, 27 Feb 2025 18:32:55 GMT, Kelvin Nilsen wrote: >> Looked at the code for `ShenandoahAllocationRate`, I guess this may have the effect of making the sampling a bit more asynchronous, but the resulting smoothing over the interval could underestimate the volatility of the rate which is used in an `upper_bound` calculation below. May be that is harmless? > > Yes. If we've already decided to trigger, there is no urgency to re-evaluate the allocation rate. We won't be checking for should_start_gc() again until the end of the current gc cycle. 
At that time, we will update the allocation rate based on how many allocation were realized during the GC cycle. The initial allocation rate estimate will be refined as should_start_gc() is again sampled every ms or so. I think rate is used below as an argument to a log_trigger() invocation. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23827#discussion_r1974166806 From kdnilsen at openjdk.org Thu Feb 27 19:21:32 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 27 Feb 2025 19:21:32 GMT Subject: RFR: 8347804: GenShen: Crash with small GCCardSizeInBytes and small Java heap [v2] In-Reply-To: References: Message-ID: > Original implementation was not robust to overriding of CardSizeInBytes, especially to smaller values. This fixes that issue. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Respond to reviewer feedback ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23373/files - new: https://git.openjdk.org/jdk/pull/23373/files/7120cdf3..cf06bf0d Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23373&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23373&range=00-01 Stats: 4 lines in 1 file changed: 0 ins; 2 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/23373.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23373/head:pull/23373 PR: https://git.openjdk.org/jdk/pull/23373 From kdnilsen at openjdk.org Thu Feb 27 19:26:56 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 27 Feb 2025 19:26:56 GMT Subject: RFR: 8347804: GenShen: Crash with small GCCardSizeInBytes and small Java heap [v2] In-Reply-To: References: Message-ID: On Thu, 27 Feb 2025 17:49:16 GMT, William Kemper wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Respond to reviewer feedback > > src/hotspot/share/gc/shenandoah/shenandoahArguments.cpp line 192: > >> 190: >> 191: if 
(GCCardSizeInBytes < ShenandoahMinCardSizeInBytes) { >> 192: char buf[512]; > It looks like using `err_msg` here is more idiomatic than `os::snprintf` for errors during initialization. Thanks. I've made this change. > src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 950: > >> 948: ShenandoahHeap* const heap = ShenandoahHeap::heap(); >> 949: assert(heap->is_concurrent_weak_root_in_progress(), "Only during this phase"); >> 950: { > > This looks like it came from https://github.com/openjdk/jdk/pull/23604. Did you cherry-pick that into this branch? Not sure why it shows in the diff here. Looks like I accidentally did cherry-pick into my gitfarm variant of this branch, and then merged into the github version of the branch. The merge history must have gotten confused.
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23373#discussion_r1974209324 From wkemper at openjdk.org Thu Feb 27 19:30:52 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 27 Feb 2025 19:30:52 GMT Subject: RFR: 8347804: GenShen: Crash with small GCCardSizeInBytes and small Java heap [v2] In-Reply-To: References: Message-ID: On Thu, 27 Feb 2025 19:23:49 GMT, Kelvin Nilsen wrote: >> Looks like I accidentally did cherry-pick into my gitfarm variant of this branch, and then merged into the github version of the branch. The merge history must have gotten confused. > > would it help if I cherry-pick the same commit directly into github? I think you could try this on your `fix-small-card-size` branch for this PR: % git revert 7120cdf36 ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23373#discussion_r1974216887 From kdnilsen at openjdk.org Thu Feb 27 19:43:47 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 27 Feb 2025 19:43:47 GMT Subject: RFR: 8347804: GenShen: Crash with small GCCardSizeInBytes and small Java heap [v3] In-Reply-To: References: Message-ID: > Original implementation was not robust to overriding of CardSizeInBytes, especially to smaller values. This fixes that issue. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Revert "8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710)" This reverts commit 7120cdf36c1657a250fd3e60136e7b615fc7b538. 
------------- Changes: - all: https://git.openjdk.org/jdk/pull/23373/files - new: https://git.openjdk.org/jdk/pull/23373/files/cf06bf0d..1fcdc869 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23373&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23373&range=01-02 Stats: 19 lines in 1 file changed: 0 ins; 14 del; 5 mod Patch: https://git.openjdk.org/jdk/pull/23373.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23373/head:pull/23373 PR: https://git.openjdk.org/jdk/pull/23373 From kdnilsen at openjdk.org Thu Feb 27 19:43:48 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 27 Feb 2025 19:43:48 GMT Subject: RFR: 8347804: GenShen: Crash with small GCCardSizeInBytes and small Java heap [v3] In-Reply-To: References: Message-ID: On Thu, 27 Feb 2025 19:28:05 GMT, William Kemper wrote: >> would it help if I cherry-pick the same commit directly into github? > > I think you could try this on your `fix-small-card-size` branch for this PR: > > % git revert 7120cdf36 Thanks. Done. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23373#discussion_r1974233880 From wkemper at openjdk.org Thu Feb 27 19:56:17 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 27 Feb 2025 19:56:17 GMT Subject: RFR: 8350898: Shenandoah: Eliminate final roots safepoint Message-ID: This PR converts the final roots safepoint operation into a handshake. The safepoint operation still exists, but is only executed when `ShenandoahVerify` is enabled. In addition to this change, this PR also improves the logging for the concurrent preparation for update references from [PR 22688](https://github.com/openjdk/jdk/pull/22688). 
------------- Commit messages: - Fix comments - Add whitespace at end of file - More detail for init update refs event message - Use timing tracker for timing verification - Merge remote-tracking branch 'jdk/master' into eliminate-final-roots - WIP: Fix up phase timings for newly concurrent final roots and init update refs - WIP: Combine satb transfer with state propagation, restore phase timing data - WIP: Transfer pointers out of SATB with a handshake - WIP: Clear weak roots flag concurrently Changes: https://git.openjdk.org/jdk/pull/23830/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23830&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8350898 Stats: 291 lines in 14 files changed: 194 ins; 47 del; 50 mod Patch: https://git.openjdk.org/jdk/pull/23830.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23830/head:pull/23830 PR: https://git.openjdk.org/jdk/pull/23830 From wkemper at openjdk.org Thu Feb 27 21:07:00 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 27 Feb 2025 21:07:00 GMT Subject: RFR: 8347804: GenShen: Crash with small GCCardSizeInBytes and small Java heap [v3] In-Reply-To: References: Message-ID: On Thu, 27 Feb 2025 19:43:47 GMT, Kelvin Nilsen wrote: >> Original implementation was not robust to overriding of CardSizeInBytes, especially to smaller values. This fixes that issue. > > Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: > > Revert "8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710)" > > This reverts commit 7120cdf36c1657a250fd3e60136e7b615fc7b538. Marked as reviewed by wkemper (Reviewer). 
------------- PR Review: https://git.openjdk.org/jdk/pull/23373#pullrequestreview-2649079470 From kdnilsen at openjdk.org Thu Feb 27 23:13:07 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 27 Feb 2025 23:13:07 GMT Subject: Integrated: 8347804: GenShen: Crash with small GCCardSizeInBytes and small Java heap In-Reply-To: References: Message-ID: <0JEkZIRntf6CSn4JnUeSOLPvbp4GaY_Ci624kY5wRms=.d33ea15d-3e2c-4af8-ba92-a782a5c8ef37@github.com> On Thu, 30 Jan 2025 18:55:53 GMT, Kelvin Nilsen wrote: > Original implementation was not robust to overriding of CardSizeInBytes, especially to smaller values. This fixes that issue. This pull request has now been integrated. Changeset: 0a4c5a8a Author: Kelvin Nilsen URL: https://git.openjdk.org/jdk/commit/0a4c5a8a483b23ec8c534054187c44f986d137bb Stats: 86 lines in 5 files changed: 63 ins; 4 del; 19 mod 8347804: GenShen: Crash with small GCCardSizeInBytes and small Java heap Reviewed-by: wkemper ------------- PR: https://git.openjdk.org/jdk/pull/23373 From kdnilsen at openjdk.org Thu Feb 27 23:26:57 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 27 Feb 2025 23:26:57 GMT Subject: Integrated: 8350889: GenShen: Break out of infinite loop of old GC cycles In-Reply-To: References: Message-ID: On Thu, 27 Feb 2025 16:58:49 GMT, Kelvin Nilsen wrote: > Add log message when heuristic trigger because previous trigger is pending. Cancel the pending-trigger condition for old-generation at the start of old-generation GC. This pull request has now been integrated. 
Changeset: ab4b0ef9 Author: Kelvin Nilsen URL: https://git.openjdk.org/jdk/commit/ab4b0ef9242a4cd964fbcf2d1f3d370234c09408 Stats: 2 lines in 2 files changed: 2 ins; 0 del; 0 mod 8350889: GenShen: Break out of infinite loop of old GC cycles Reviewed-by: wkemper, ysr ------------- PR: https://git.openjdk.org/jdk/pull/23827 From xpeng at openjdk.org Fri Feb 28 00:08:22 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Fri, 28 Feb 2025 00:08:22 GMT Subject: RFR: 8338737: Shenandoah: Reset marking bitmaps after the cycle [v9] In-Reply-To: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com> References: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com> Message-ID: > Reset marking bitmaps after the collection cycle; for GenShen, only do this for the young generation. Also choose not to do this for Degen and Full GC, since both run at a safepoint and we should leave the safepoint ASAP. > > I have run the same workload for 30s with Shenandoah in generational mode and classic mode; the average time of concurrent reset dropped significantly, since in most cases the bitmap for the young gen should already have been reset after the previous concurrent cycle finishes if there is no need to preserve bitmap states.
> > GenShen: > Before: > > [33.342s][info][gc,stats ] Concurrent Reset = 0.023 s (a = 1921 us) (n = 12) (lvls, us = 133, 385, 1191, 1836, 8878) > > > After: > > [33.597s][info][gc,stats ] Concurrent Reset = 0.004 s (a = 317 us) (n = 13) (lvls, us = 58, 119, 217, 410, 670) > [33.597s][info][gc,stats ] Concurrent Reset After Collect = 0.018 s (a = 1365 us) (n = 13) (lvls, us = 91, 186, 818, 1836, 3872) > > > Shenandoah: > Before: > > [33.144s][info][gc,stats ] Concurrent Reset = 0.014 s (a = 1067 us) (n = 13) (lvls, us = 139, 277, 898, 1328, 2118) > > After: > > [33.128s][info][gc,stats ] Concurrent Reset = 0.003 s (a = 225 us) (n = 13) (lvls, us = 32, 92, 137, 295, 542) > [33.128s][info][gc,stats ] Concurrent Reset After Collect = 0.009 s (a = 661 us) (n = 13) (lvls, us = 92, 160, 594, 896, 1661) > > > Additional changes: > * Remove `ShenandoahResetBitmapClosure` and `ShenandoahPrepareForMarkClosure`, merge the code with `ShenandoahResetBitmapClosure`, saving one iteration over all the regions. > * Use API `ShenandoahGeneration::parallel_heap_region_iterate_free` to iterate the regions, two benefits from this: > - Underneath it calls `ShenandoahHeap::parallel_heap_region_iterate`, which is faster for very light tasks, see https://bugs.openjdk.org/browse/JDK-8337154 > - `ShenandoahGeneration::parallel_heap_region_iterate_free` decorate the closure with `ShenandoahExcludeRegionClosure`, which simplifies the code in closure. > * When `_do_old_gc_bootstrap is true`, instead of reset mark bitmap for old gen separately, simply reset the global generations, so we don't need walk the all regions twice. > * Clean up FullGC code, remove duplicate code. > > Additional tests: > - [x] CONF=macosx-aarch64-server-fastdebug make test T... Xiaolong Peng has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. 
The pull request contains 25 additional commits since the last revision: - Merge branch 'openjdk:master' into reset-bitmap - Merge branch 'openjdk:master' into reset-bitmap - Merge branch 'openjdk:master' into reset-bitmap - Merge branch 'openjdk:master' into reset-bitmap - Merge branch 'openjdk:master' into reset-bitmap - Adding condition "!_do_old_gc_bootstrap && !heap->is_concurrent_old_mark_in_progress()" back and address some PR comments - Remove entry_reset_after_collect from ShenandoahOldGC - Remove condition check !_do_old_gc_bootstrap && !heap->is_concurrent_old_mark_in_progress() from op_reset_after_collect - Merge branch 'openjdk:master' into reset-bitmap - Address review comments - ... and 15 more: https://git.openjdk.org/jdk/compare/c65af1fb...7eea9556 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/22778/files - new: https://git.openjdk.org/jdk/pull/22778/files/c7e9bff3..7eea9556 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=22778&range=08 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=22778&range=07-08 Stats: 20235 lines in 598 files changed: 11195 ins; 6911 del; 2129 mod Patch: https://git.openjdk.org/jdk/pull/22778.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22778/head:pull/22778 PR: https://git.openjdk.org/jdk/pull/22778 From tschatzl at openjdk.org Fri Feb 28 10:35:03 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 28 Feb 2025 10:35:03 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v2] In-Reply-To: <3zmj-DeeRyPMHc32YnvfqACN0xJxLQ6jZZ7sd-Baa3w=.672912f6-e4a3-4679-b8a3-b7f6ad51589d@github.com> References: <3zmj-DeeRyPMHc32YnvfqACN0xJxLQ6jZZ7sd-Baa3w=.672912f6-e4a3-4679-b8a3-b7f6ad51589d@github.com> Message-ID: On Thu, 27 Feb 2025 18:24:15 GMT, Albert Mingkun Yang wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> * remove unnecessarily added 
logging > > src/hotspot/share/gc/g1/g1BarrierSet.hpp line 54: > >> 52: // them, keeping the write barrier simple. >> 53: // >> 54: // The refinement threads mark cards in the the current collection set specially on the > > "the the" typo. I fixed one more occurrence in files changed in this CR. There are like 10 more of these duplications in our code, I will fix separately. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1975186407 From tschatzl at openjdk.org Fri Feb 28 11:25:53 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 28 Feb 2025 11:25:53 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v2] In-Reply-To: <3zmj-DeeRyPMHc32YnvfqACN0xJxLQ6jZZ7sd-Baa3w=.672912f6-e4a3-4679-b8a3-b7f6ad51589d@github.com> References: <3zmj-DeeRyPMHc32YnvfqACN0xJxLQ6jZZ7sd-Baa3w=.672912f6-e4a3-4679-b8a3-b7f6ad51589d@github.com> Message-ID: <9tS5E1tteGutSNX7rZh5WYLdZoF7Vgl_4_pjuAdT4WU=.c8c73c45-7abb-48a9-b623-769d3c1679ca@github.com> On Thu, 27 Feb 2025 12:07:29 GMT, Albert Mingkun Yang wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> * remove unnecessarily added logging > > src/hotspot/share/gc/g1/g1ConcurrentRefine.cpp line 349: > >> 347: >> 348: bool do_heap_region(G1HeapRegion* r) override { >> 349: if (!r->is_free()) { > > I am a bit lost on this closure; the intention seems to set unclaimed to all non-free regions, why can't this be done in one go, instead of first setting all regions to claimed (`reset_all_claims_to_claimed`), then set non-free ones unclaimed? `do_heap_region()` only visits committed regions in this case. I wanted to avoid the additional check in the iteration code. If you still think it is more clear to filter those out later, please tell me. I'll add a comment for now. 
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1975250646 From tschatzl at openjdk.org Fri Feb 28 12:14:01 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 28 Feb 2025 12:14:01 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v2] In-Reply-To: <3zmj-DeeRyPMHc32YnvfqACN0xJxLQ6jZZ7sd-Baa3w=.672912f6-e4a3-4679-b8a3-b7f6ad51589d@github.com> References: <3zmj-DeeRyPMHc32YnvfqACN0xJxLQ6jZZ7sd-Baa3w=.672912f6-e4a3-4679-b8a3-b7f6ad51589d@github.com> Message-ID: <87L5pcyGAgyDsXTwlSdAFLyIAOcUl1ZdYXK-nwzLrUQ=.c3db7522-b3e6-46e0-b268-e457c3d2bdc2@github.com> On Thu, 27 Feb 2025 18:31:16 GMT, Albert Mingkun Yang wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> * remove unnecessarily added logging > > src/hotspot/share/gc/g1/g1RemSet.cpp line 1252: > >> 1250: G1ConcurrentRefineWorkState::snapshot_heap_into(&constructed); >> 1251: claim = &constructed; >> 1252: } > > It's not super obvious to me why the "has_sweep_claims" checking needs to be on this level. Can `G1ConcurrentRefineWorkState` return a valid `G1CardTableClaimTable*` directly? I agree. I remember having similar thoughts as well, but then did not do anything about this. Will fix. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23739#discussion_r1975311607 From tschatzl at openjdk.org Fri Feb 28 13:43:24 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 28 Feb 2025 13:43:24 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v3] In-Reply-To: References: Message-ID: > Hi all, > > please review this change that implements (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. 
> > The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight but we would like to have this ready by JDK 25. > > ### Current situation > > With this change, G1 will reduce the post write barrier to much more resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to larger barrier. > > The main reason for the current barrier is how g1 implements concurrent refinement: > * g1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. > * For correctness dirty card updates requires fine-grained synchronization between mutator and refinement threads, > * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. > > These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudo code: > > > // Filtering > if (region(@x.a) == region(y)) goto done; // same region check > if (y == null) goto done; // null value check > if (card(@x.a) == young_card) goto done; // write to young gen check > StoreLoad; // synchronize > if (card(@x.a) == dirty_card) goto done; > > *card(@x.a) = dirty > > // Card tracking > enqueue(card-address(@x.a)) into thread-local-dcq; > if (thread-local-dcq is not full) goto done; > > call runtime to move thread-local-dcq into dcqs > > done: > > > Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for parallel and serial gc. > > The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. 
> > There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). > > The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse grained based on atomically switching card tables. Mutators only work on the "primary" card table, refinement threads on a se... Thomas Schatzl has updated the pull request incrementally with two additional commits since the last revision: - * ayang review 1 (ctd) * split up sweep-rt state into "start" (to be called once) and "step" (to be called repeatedly) phases * move building the snapshot our of g1remset - * ayang review 1 * use uint for number of reserved regions consistently * rename *sweep_state to *sweep_table * improved comments ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23739/files - new: https://git.openjdk.org/jdk/pull/23739/files/9ef9c5f4..7d361fc1 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23739&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23739&range=01-02 Stats: 108 lines in 8 files changed: 40 ins; 24 del; 44 mod Patch: https://git.openjdk.org/jdk/pull/23739.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23739/head:pull/23739 PR: https://git.openjdk.org/jdk/pull/23739 From tschatzl at openjdk.org Fri Feb 28 14:20:19 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 28 Feb 2025 14:20:19 GMT Subject: Integrated: 8350758: G1: Use actual last prediction in accumulated survivor rate prediction too In-Reply-To: References: Message-ID: On Wed, 26 Feb 2025 11:51:31 GMT, Thomas Schatzl wrote: > Hi all, > > please review this fix to recent JDK-8349906 to use the current > survivor rate in the accumulated survivor rate for initializing newly allocated entries as well. > > Testing: gha > > Thanks, > Thomas This pull request has now been integrated. 
Changeset: d6c4be67 Author: Thomas Schatzl URL: https://git.openjdk.org/jdk/commit/d6c4be672f6348f8ed985416ed90d0447f5d5bb3 Stats: 17 lines in 1 file changed: 5 ins; 8 del; 4 mod 8350758: G1: Use actual last prediction in accumulated survivor rate prediction too Reviewed-by: ayang, iwalulya ------------- PR: https://git.openjdk.org/jdk/pull/23795 From tschatzl at openjdk.org Fri Feb 28 14:20:18 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 28 Feb 2025 14:20:18 GMT Subject: RFR: 8350758: G1: Use actual last prediction in accumulated survivor rate prediction too [v2] In-Reply-To: <19YTnvDZIxpkuA9etPTn0Hb5bGcgJkKJDGHiR3ER6j8=.45ba6b3a-be0e-4cf2-bd24-c0b5633fcee2@github.com> References: <6NAHP_4D-gWM_Sx1wmWg_nxDvoIPGpyN6E2xJR8Deag=.0f8d8872-cb8f-47a4-8bbf-fa6e62101db6@github.com> <19YTnvDZIxpkuA9etPTn0Hb5bGcgJkKJDGHiR3ER6j8=.45ba6b3a-be0e-4cf2-bd24-c0b5633fcee2@github.com> Message-ID: On Thu, 27 Feb 2025 12:59:14 GMT, Ivan Walulya wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> * ayang review > > Marked as reviewed by iwalulya (Reviewer). Thanks @walulyai @albertnetymk for your reviews. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23795#issuecomment-2690754693 From kdnilsen at openjdk.org Fri Feb 28 15:58:00 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 28 Feb 2025 15:58:00 GMT Subject: RFR: 8350605: assert(!heap->is_uncommit_in_progress()) failed: Cannot uncommit bitmaps while resetting them In-Reply-To: References: Message-ID: On Tue, 25 Feb 2025 01:38:14 GMT, William Kemper wrote: > The protocol which is meant to prevent regions from being uncommitted while their bitmaps are being reset may fail. This happens when the control thread attempts to wait for the uncommit thread to finish, but the uncommit thread has not yet indicated that it has started. 
> > ## Testing > GHA, Dacapo, Extremem, Heapothesys, Diluvian, SpecJBB2015, SpecJVM2008 (with and without stress flags, asserts). Also have run the JTREG test that failed this assertion over 10K times (and counting). Looks good. Most of our pipeline tests are running with a fixed-size heap and pretouch. Do we have good test coverage of scattered uncommitted regions? src/hotspot/share/gc/shenandoah/shenandoahUncommitThread.cpp line 146: > 144: log_info(gc, start)("%s", msg); > 145: > 146: const size_t uncommitted_region_count = do_uncommit_work(shrink_before, shrink_until); A comment here might also be helpful. I think this is what was newly uncommitted by the current call; there may be other preexisting uncommitted regions? src/hotspot/share/gc/shenandoah/shenandoahUncommitThread.hpp line 63: > 61: bool is_uncommit_allowed() const; > 62: > 63: size_t do_uncommit_work(double shrink_before, size_t shrink_until) const; A comment describing arguments and return value would be nice. Mention premature termination if a GC is requested. ------------- Marked as reviewed by kdnilsen (Committer). PR Review: https://git.openjdk.org/jdk/pull/23760#pullrequestreview-2651140350 PR Review Comment: https://git.openjdk.org/jdk/pull/23760#discussion_r1975639431 PR Review Comment: https://git.openjdk.org/jdk/pull/23760#discussion_r1975635358 From wkemper at openjdk.org Fri Feb 28 17:17:17 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 28 Feb 2025 17:17:17 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v18] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. 
> * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). > > ## Testing > * jtreg hotspot_gc_shenandoah > * dacapo, extremem, diluvian, specjbb2015, specjvm2018, heapothesys William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 37 commits: - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads - Don't check for shutdown in control thread loop condition It may cause the thread to exit before it is requested to stop - Add assertions about old gen state when resuming old cycles - Remove duplicated field pointer for old generation - Improve names and comments - Merge tag 'jdk-25+11' into fix-control-regulator-threads Added tag jdk-25+11 for changeset 0131c1bf - Address review feedback (better comments, better names) - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads - Old gen bootstrap cycle must make it to init mark - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads - ... 
and 27 more: https://git.openjdk.org/jdk/compare/e98df71d...37e445d6 ------------- Changes: https://git.openjdk.org/jdk/pull/23475/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=17 Stats: 963 lines in 18 files changed: 327 ins; 294 del; 342 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From wkemper at openjdk.org Fri Feb 28 17:37:16 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 28 Feb 2025 17:37:16 GMT Subject: RFR: 8350605: assert(!heap->is_uncommit_in_progress()) failed: Cannot uncommit bitmaps while resetting them [v2] In-Reply-To: References: Message-ID: > The protocol which is meant to prevent regions from being uncommitted while their bitmaps are being reset may fail. This happens when the control thread attempts to wait for the uncommit thread to finish, but the uncommit thread has not yet indicated that it has started. > > ## Testing > GHA, Dacapo, Extremem, Heapothesys, Diluvian, SpecJBB2015, SpecJVM2008 (with and without stress flags, asserts). Also have run the JTREG test that failed this assertion over 10K times (and counting). 
William Kemper has updated the pull request incrementally with one additional commit since the last revision: Improve comments ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23760/files - new: https://git.openjdk.org/jdk/pull/23760/files/ec13274c..b194db8f Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23760&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23760&range=00-01 Stats: 2 lines in 2 files changed: 2 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23760.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23760/head:pull/23760 PR: https://git.openjdk.org/jdk/pull/23760 From wkemper at openjdk.org Fri Feb 28 17:44:36 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 28 Feb 2025 17:44:36 GMT Subject: RFR: 8350605: assert(!heap->is_uncommit_in_progress()) failed: Cannot uncommit bitmaps while resetting them [v3] In-Reply-To: References: Message-ID: > The protocol which is meant to prevent regions from being uncommitted while their bitmaps are being reset may fail. This happens when the control thread attempts to wait for the uncommit thread to finish, but the uncommit thread has not yet indicated that it has started. > > ## Testing > GHA, Dacapo, Extremem, Heapothesys, Diluvian, SpecJBB2015, SpecJVM2008 (with and without stress flags, asserts). Also have run the JTREG test that failed this assertion over 10K times (and counting). 
William Kemper has updated the pull request incrementally with one additional commit since the last revision: Comment tweak ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23760/files - new: https://git.openjdk.org/jdk/pull/23760/files/b194db8f..1c32c0e3 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23760&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23760&range=01-02 Stats: 1 line in 1 file changed: 1 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23760.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23760/head:pull/23760 PR: https://git.openjdk.org/jdk/pull/23760 From wkemper at openjdk.org Fri Feb 28 17:47:53 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 28 Feb 2025 17:47:53 GMT Subject: RFR: 8350605: assert(!heap->is_uncommit_in_progress()) failed: Cannot uncommit bitmaps while resetting them [v3] In-Reply-To: References: Message-ID: On Fri, 28 Feb 2025 17:44:36 GMT, William Kemper wrote: >> The protocol which is meant to prevent regions from being uncommitted while their bitmaps are being reset may fail. This happens when the control thread attempts to wait for the uncommit thread to finish, but the uncommit thread has not yet indicated that it has started. >> >> ## Testing >> GHA, Dacapo, Extremem, Heapothesys, Diluvian, SpecJBB2015, SpecJVM2008 (with and without stress flags, asserts). Also have run the JTREG test that failed this assertion over 10K times (and counting). > > William Kemper has updated the pull request incrementally with one additional commit since the last revision: > > Comment tweak That's a good point. I created a branch that enables uncommit for the test pipelines when I made this original change. I'll resurrect that branch and run that configuration again. Thanks. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/23760#issuecomment-2691218679 From tschatzl at openjdk.org Fri Feb 28 17:52:56 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 28 Feb 2025 17:52:56 GMT Subject: RFR: 8342382: Implementation of JEP G1: Improve Application Throughput with a More Efficient Write-Barrier [v4] In-Reply-To: References: Message-ID: > Hi all, > > please review this change that implements the (currently Draft) JEP: G1: Improve Application Throughput with a More Efficient Write-Barrier. > > The reason for posting this early is that this is a large change, and the JEP process is already taking very long with no end in sight, but we would like to have this ready by JDK 25. > > ### Current situation > > With this change, G1 will reduce the post-write barrier to much more closely resemble Parallel GC's as described in the JEP. The reason is that G1 lacks in throughput compared to Parallel/Serial GC due to its larger barrier. > > The main reason for the current barrier is how G1 implements concurrent refinement: > * G1 tracks dirtied cards using sets (dirty card queue set - dcqs) of buffers (dirty card queues - dcq) containing the location of dirtied cards. Refinement threads pick up their contents to re-refine. The barrier needs to enqueue card locations. > * For correctness, dirty card updates require fine-grained synchronization between mutator and refinement threads. > * Finally there is generic code to avoid dirtying cards altogether (filters), to avoid executing the synchronization and the enqueuing as much as possible. 
> > These tasks require the current barrier to look as follows for an assignment `x.a = y` in pseudocode: > > > // Filtering > if (region(@x.a) == region(y)) goto done; // same region check > if (y == null) goto done; // null value check > if (card(@x.a) == young_card) goto done; // write to young gen check > StoreLoad; // synchronize > if (card(@x.a) == dirty_card) goto done; > > *card(@x.a) = dirty > > // Card tracking > enqueue(card-address(@x.a)) into thread-local-dcq; > if (thread-local-dcq is not full) goto done; > > call runtime to move thread-local-dcq into dcqs > > done: > > > Overall this post-write barrier alone is in the range of 40-50 total instructions, compared to three or four(!) for Parallel and Serial GC. > > The large size of the inlined barrier not only has a large code footprint, but also prevents some compiler optimizations like loop unrolling or inlining. > > There are several papers showing that this barrier alone can decrease throughput by 10-20% ([Yang12](https://dl.acm.org/doi/10.1145/2426642.2259004)), which is corroborated by some benchmarks (see links). > > The main idea for this change is to not use fine-grained synchronization between refinement and mutator threads, but coarse-grained synchronization based on atomically switching card tables. Mutators only work on the "primary" card table, refinement threads on a se... 
Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: * fix assert ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23739/files - new: https://git.openjdk.org/jdk/pull/23739/files/7d361fc1..d87935a0 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23739&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23739&range=02-03 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/23739.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23739/head:pull/23739 PR: https://git.openjdk.org/jdk/pull/23739 From duke at openjdk.org Fri Feb 28 18:11:59 2025 From: duke at openjdk.org (Abdelhak Zaaim) Date: Fri, 28 Feb 2025 18:11:59 GMT Subject: RFR: 8350851: ZGC: Reduce size of ZAddressOffsetMax scaling data structures In-Reply-To: References: Message-ID: On Thu, 27 Feb 2025 12:45:36 GMT, Axel Boldt-Christmas wrote: > ZAddressOffsetMax is used to scale a few of our BitMap and GranuleMap data structures. ZAddressOffsetMax is initialised to an upper limit, prior to reserving the virtual address space for the heap. After the reservation, the largest address offset that can be encountered may be much lower. > > I propose we scale ZAddressOffsetMax down after our heap reservation is complete, to the actual max value a zoffset_end is allowed to be. > > Doing this gives us two benefits. Firstly the assertions and type checks will be stricter, and will exercise code paths that otherwise only occur when using a 16TB heap. Secondly we can reduce the size of the data structures which scale with ZAddressOffsetMax. (For most OSs the extra memory of these data structures does not really matter as they are not paged in. But they are accounted for both on the OS, allocator and NMT layers). > > The page table uses ZIndexDistributor to iterate and distribute indices. 
The different strategies have different requirements on the alignment of the size of the range they distribute across. My proposed implementation simply aligns up the page table size to this alignment requirement, as it is the least intrusive change, at the cost of a somewhat larger data structure than strictly required. The alternative would be to extend ZIndexDistributor with support for any alignment on the range, or condition the use of the distributed indices based on whether they are less than the size. > > The data structures can also be larger than required if we fail to reserve the heap starting at our heap base. However this is a very rare occurrence, and while it would be nice to extend our asserts to check for a "ZAddressOffsetMin", I'll leave that for a future enhancement. > > Testing: > * ZGC-specific tasks, tier 1 through tier 8 on Oracle Supported platforms > * with `ZIndexDistributorStrategy=0`, and > * with `ZIndexDistributorStrategy=1` > * GHA Marked as reviewed by abdelhak-zaaim at github.com (no known OpenJDK username). ------------- PR Review: https://git.openjdk.org/jdk/pull/23822#pullrequestreview-2651480088 From wkemper at openjdk.org Fri Feb 28 23:59:11 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 28 Feb 2025 23:59:11 GMT Subject: RFR: 8338737: Shenandoah: Reset marking bitmaps after the cycle [v9] In-Reply-To: References: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com> Message-ID: On Fri, 28 Feb 2025 00:08:22 GMT, Xiaolong Peng wrote: >> Reset marking bitmaps after the collection cycle; for GenShen only do this for the young generation, also choose not to do this for Degen and full GC since both are running at a safepoint, and we should leave the safepoint ASAP. 
>> >> I have run the same workload for 30s with Shenandoah in generational mode and classic mode, and the average time of concurrent reset dropped significantly, since in most cases the bitmap for young gen should have been reset after the previous concurrent cycle finishes if there is no need to preserve bitmap states. >> >> GenShen: >> Before: >> >> [33.342s][info][gc,stats ] Concurrent Reset = 0.023 s (a = 1921 us) (n = 12) (lvls, us = 133, 385, 1191, 1836, 8878) >> >> >> After: >> >> [33.597s][info][gc,stats ] Concurrent Reset = 0.004 s (a = 317 us) (n = 13) (lvls, us = 58, 119, 217, 410, 670) >> [33.597s][info][gc,stats ] Concurrent Reset After Collect = 0.018 s (a = 1365 us) (n = 13) (lvls, us = 91, 186, 818, 1836, 3872) >> >> >> Shenandoah: >> Before: >> >> [33.144s][info][gc,stats ] Concurrent Reset = 0.014 s (a = 1067 us) (n = 13) (lvls, us = 139, 277, 898, 1328, 2118) >> >> After: >> >> [33.128s][info][gc,stats ] Concurrent Reset = 0.003 s (a = 225 us) (n = 13) (lvls, us = 32, 92, 137, 295, 542) >> [33.128s][info][gc,stats ] Concurrent Reset After Collect = 0.009 s (a = 661 us) (n = 13) (lvls, us = 92, 160, 594, 896, 1661) >> >> >> Additional changes: >> * Remove `ShenandoahResetBitmapClosure` and `ShenandoahPrepareForMarkClosure`, merge the code with `ShenandoahResetBitmapClosure`, saving one iteration over all the regions. >> * Use API `ShenandoahGeneration::parallel_heap_region_iterate_free` to iterate the regions, two benefits from this: >> - Underneath it calls `ShenandoahHeap::parallel_heap_region_iterate`, which is faster for very light tasks, see https://bugs.openjdk.org/browse/JDK-8337154 >> - `ShenandoahGeneration::parallel_heap_region_iterate_free` decorates the closure with `ShenandoahExcludeRegionClosure`, which simplifies the code in the closure. >> * When `_do_old_gc_bootstrap` is true, instead of resetting the mark bitmap for old gen separately, simply reset the global generations, so we don't need to walk all the regions twice. 
>> * Clean up FullGC code, remove duplicate code. >> >> ... > > Xiaolong Peng has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 25 additional commits since the last revision: > > - Merge branch 'openjdk:master' into reset-bitmap > - Merge branch 'openjdk:master' into reset-bitmap > - Merge branch 'openjdk:master' into reset-bitmap > - Merge branch 'openjdk:master' into reset-bitmap > - Merge branch 'openjdk:master' into reset-bitmap > - Adding condition "!_do_old_gc_bootstrap && !heap->is_concurrent_old_mark_in_progress()" back and address some PR comments > - Remove entry_reset_after_collect from ShenandoahOldGC > - Remove condition check !_do_old_gc_bootstrap && !heap->is_concurrent_old_mark_in_progress() from op_reset_after_collect > - Merge branch 'openjdk:master' into reset-bitmap > - Address review comments > - ... and 15 more: https://git.openjdk.org/jdk/compare/ad10cf10...7eea9556 Okay, these changes look good. We don't yet understand why we cannot reset young bitmaps during an old cycle, but we will follow up that investigation separately. ------------- Marked as reviewed by wkemper (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/22778#pullrequestreview-2651973439