From zgu at openjdk.org Sat Feb 1 16:42:50 2025 From: zgu at openjdk.org (Zhengyu Gu) Date: Sat, 1 Feb 2025 16:42:50 GMT Subject: RFR: 8348171: Refactor GenerationCounters and its subclasses [v4] In-Reply-To: References:

Message-ID: On Fri, 31 Jan 2025 12:30:30 GMT, Albert Mingkun Yang wrote: >> Simple refactoring of removing the use of `virtual` method and use concrete subclasses when needed. >> >> Test: tier1-5 > > Albert Mingkun Yang has updated the pull request incrementally with two additional commits since the last revision: > > - review > - * some more refactoring src/hotspot/share/gc/shenandoah/shenandoahMonitoringSupport.cpp line 39: > 37: GenerationCounters("Young", 0, 0, 0, (size_t)0, (size_t)0) {}; > 38: > 39: void update_all() { Shenandoah looks a bit odd now. @kdnilsen @wkemper and @ysramakrishna may want to take a look? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23209#discussion_r1938303987 From zgu at openjdk.org Sat Feb 1 16:47:52 2025 From: zgu at openjdk.org (Zhengyu Gu) Date: Sat, 1 Feb 2025 16:47:52 GMT Subject: RFR: 8348171: Refactor GenerationCounters and its subclasses [v4] In-Reply-To: References:

Message-ID: On Fri, 31 Jan 2025 12:30:30 GMT, Albert Mingkun Yang wrote: >> Simple refactoring of removing the use of `virtual` method and use concrete subclasses when needed. >> >> Test: tier1-5 > > Albert Mingkun Yang has updated the pull request incrementally with two additional commits since the last revision: > > - review > - * some more refactoring `Shenandoah` code no longer aligns to others. Other than that, LGTM. ------------- Marked as reviewed by zgu (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23209#pullrequestreview-2588364961 From tschatzl at openjdk.org Mon Feb 3 09:25:50 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 3 Feb 2025 09:25:50 GMT Subject: RFR: 8348171: Refactor GenerationCounters and its subclasses [v4] In-Reply-To: References:

Message-ID: <7k73_VjUmBq7-G2reVDOlB7-vSazUekr8Q3Ez3houa0=.61d2baf5-72c6-41cd-aa74-c49a5b5e9ce1@github.com> On Fri, 31 Jan 2025 12:30:30 GMT, Albert Mingkun Yang wrote: >> Simple refactoring of removing the use of `virtual` method and use concrete subclasses when needed. >> >> Test: tier1-5 > > Albert Mingkun Yang has updated the pull request incrementally with two additional commits since the last revision: > > - review > - * some more refactoring Lgtm. Related to @zhengyu123 's comment, not sure right now what is meant with "looking odd" here as the previous code did not update the counters either, but it might be useful to wait on Shenandoah team's input anyway. ------------- Marked as reviewed by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23209#pullrequestreview-2589340090 From wkemper at openjdk.org Mon Feb 3 20:36:09 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 3 Feb 2025 20:36:09 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 Message-ID: Non-java threads were not having their gc-state configured when they attach. If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. ------------- Commit messages: - Set gc state for all attached threads (not just java threads). Changes: https://git.openjdk.org/jdk/pull/23428/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23428&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8348268 Stats: 4 lines in 1 file changed: 3 ins; 1 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23428.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23428/head:pull/23428 PR: https://git.openjdk.org/jdk/pull/23428 From wkemper at openjdk.org Mon Feb 3 22:34:13 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 3 Feb 2025 22:34:13 GMT Subject: [jdk24] RFR: 8349002: GenShen: Deadlock during shutdown Message-ID: <6a_6G1g93RUACyYcHG5B9HtqFBaqdETRRdhvFWwrfi8=.e88c6252-f6c4-4e3f-972a-2c4495d27127@github.com> Clean backport. Fixes bug introduced by [JDK-8345970](https://bugs.openjdk.org/browse/JDK-8345970). ------------- Commit messages: - Backport 06ebb170bac3879dc1e378b48b1c7ef006070c86 Changes: https://git.openjdk.org/jdk/pull/23429/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23429&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8349002 Stats: 5 lines in 2 files changed: 4 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/23429.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23429/head:pull/23429 PR: https://git.openjdk.org/jdk/pull/23429 From wkemper at openjdk.org Mon Feb 3 22:34:26 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 3 Feb 2025 22:34:26 GMT Subject: RFR: 8349002: GenShen: Deadlock during shutdown Message-ID: Clean backport. Fixes bug introduced by [JDK-8345970](https://bugs.openjdk.org/browse/JDK-8345970). ------------- Commit messages: - Backport 06ebb170bac3879dc1e378b48b1c7ef006070c86 Changes: https://git.openjdk.org/shenandoah-jdk21u/pull/153/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=153&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8349002 Stats: 5 lines in 2 files changed: 4 ins; 0 del; 1 mod Patch: https://git.openjdk.org/shenandoah-jdk21u/pull/153.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk21u.git pull/153/head:pull/153 PR: https://git.openjdk.org/shenandoah-jdk21u/pull/153 From kvn at openjdk.org Mon Feb 3 23:51:17 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Mon, 3 Feb 2025 23:51:17 GMT Subject: [jdk24] RFR: 8349002: GenShen: Deadlock during shutdown In-Reply-To: <6a_6G1g93RUACyYcHG5B9HtqFBaqdETRRdhvFWwrfi8=.e88c6252-f6c4-4e3f-972a-2c4495d27127@github.com> References: <6a_6G1g93RUACyYcHG5B9HtqFBaqdETRRdhvFWwrfi8=.e88c6252-f6c4-4e3f-972a-2c4495d27127@github.com> Message-ID: On Mon, 3 Feb 2025 22:27:44 GMT, William Kemper wrote: > Clean backport. Fixes bug introduced by [JDK-8345970](https://bugs.openjdk.org/browse/JDK-8345970). We are in RDP2 phase of JDK 24 release. Only P1 and P2 are allowed to be pushed with approval: https://openjdk.org/jeps/3#rdp-2 Consider backporting the fix into JDK 24 Update release. ------------- Changes requested by kvn (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23429#pullrequestreview-2591384756 From phh at openjdk.org Tue Feb 4 15:23:17 2025 From: phh at openjdk.org (Paul Hohensee) Date: Tue, 4 Feb 2025 15:23:17 GMT Subject: RFR: 8348610: GenShen: TestShenandoahEvacuationInformationEvent failed with setRegions >= regionsFreed: expected 1 >= 57 In-Reply-To: References: Message-ID: On Thu, 30 Jan 2025 04:43:59 GMT, Satyen Subramaniam wrote: > Renaming `ShenandoahEvacuationInformation.freedRegions` to `ShenandoahEvacuationInformation.freeRegions` for clarity, and fixing incorrect assertion in TestShenandoahEvacuationInformationEvent.cpp > > Tested with tier 1, tier 2, and tier 3 tests. Marked as reviewed by phh (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23362#pullrequestreview-2593210592 From phh at openjdk.org Tue Feb 4 15:53:13 2025 From: phh at openjdk.org (Paul Hohensee) Date: Tue, 4 Feb 2025 15:53:13 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check In-Reply-To: References: Message-ID: On Fri, 24 Jan 2025 18:30:02 GMT, Kelvin Nilsen wrote: > At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. > > For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. > > This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. src/hotspot/share/gc/shenandoah/shenandoahMetrics.cpp line 52: > 50: size_t free_actual = free_set->available(); > 51: // The sum of free_set->capacity() and ->reserved represents capacity of young in generational, heap in non-generational. > 52: size_t free_expected = ((free_set->capacity() + free_set->reserved()) / 100) * ShenandoahCriticalFreeThreshold; As an outsider, the units involved and what exactly is being calculated is pretty opaque. Why would we divide by 100 to compute free_expected and not do the same for free_actual? Do we care about integer division truncation? The default value of ShenandoahCriticalFreeThreshold is 1, so multiplying by it is a nop by default, which seems strange. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1941436272 From shade at openjdk.org Tue Feb 4 16:05:10 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Tue, 4 Feb 2025 16:05:10 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 20:28:58 GMT, William Kemper wrote: > Non-java threads were not having their gc-state configured when they attach. If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. It looks generally okay, but I am confused how this fixes a bad state in `C2 CompilerThread1`, since compiler threads are Java threads? https://github.com/openjdk/jdk/blob/beb43e2633900bb9ab3c975376fe5860b6d054e0/src/hotspot/share/compiler/compilerThread.hpp#L42 ------------- PR Comment: https://git.openjdk.org/jdk/pull/23428#issuecomment-2634411610 From phh at openjdk.org Tue Feb 4 16:19:20 2025 From: phh at openjdk.org (Paul Hohensee) Date: Tue, 4 Feb 2025 16:19:20 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v2] In-Reply-To: References:

Message-ID: On Mon, 27 Jan 2025 02:05:02 GMT, Kelvin Nilsen wrote: >> Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. >> >> We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. >> >> As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catchup. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. >> >> This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. > > Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: > > Respond to reviewer feedback src/hotspot/share/gc/shenandoah/heuristics/shenandoahAdaptiveHeuristics.cpp line 318: > 316: > 317: if (ShenandoahHeuristics::should_start_gc()) { > 318: _start_gc_is_pending = true; I assume there's no race here, i.e., only one thread reads/writes _start_gc_is_pending. If there's a race, make sure it's benign. In either case, _start_gc_is_pending is made "sticky" by this code. src/hotspot/share/gc/shenandoah/heuristics/shenandoahHeuristics.cpp line 261: > 259: > 260: void ShenandoahHeuristics::record_success_concurrent() { > 261: _start_gc_is_pending = false; The name _start_gc_is_pending implies that it should be set false as soon as a gc cycle starts, not when it finishes. Maybe _gc_pending? Or maybe setting it false at the end of a gc cycle is a bug? :) src/hotspot/share/gc/shenandoah/heuristics/shenandoahHeuristics.hpp line 87: > 85: size_t _declined_trigger_count; // This counts how many times since previous GC finished that this > 86: // heuristic has answered false to should_start_gc(). > 87: size_t _previous_trigger_declinations; // This represents the value of _declined_trigger_count as captured at the Maybe the name should be _most_recent_declined_trigger_count, which relates it directly to _declined_trigger_count. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23305#discussion_r1941486248 PR Review Comment: https://git.openjdk.org/jdk/pull/23305#discussion_r1941462312 PR Review Comment: https://git.openjdk.org/jdk/pull/23305#discussion_r1941468695 From duke at openjdk.org Tue Feb 4 16:52:10 2025 From: duke at openjdk.org (duke) Date: Tue, 4 Feb 2025 16:52:10 GMT Subject: RFR: 8348610: GenShen: TestShenandoahEvacuationInformationEvent failed with setRegions >= regionsFreed: expected 1 >= 57 In-Reply-To: References: Message-ID: On Thu, 30 Jan 2025 04:43:59 GMT, Satyen Subramaniam wrote: > Renaming `ShenandoahEvacuationInformation.freedRegions` to `ShenandoahEvacuationInformation.freeRegions` for clarity, and fixing incorrect assertion in TestShenandoahEvacuationInformationEvent.cpp > > Tested with tier 1, tier 2, and tier 3 tests. @satyenme Your change (at version 923a29d84a06315bfde7d3d1d8b48ff27fef8f9e) is now ready to be sponsored by a Committer. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23362#issuecomment-2634531314 From wkemper at openjdk.org Tue Feb 4 17:25:20 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 4 Feb 2025 17:25:20 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 In-Reply-To: References:

Message-ID: On Tue, 4 Feb 2025 16:03:02 GMT, Aleksey Shipilev wrote: >> Non-java threads were not having their gc-state configured when they attach. If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. > > It looks generally okay, but I am confused how this fixes a bad state in `C2 CompilerThread1`, since compiler threads are Java threads? https://github.com/openjdk/jdk/blob/beb43e2633900bb9ab3c975376fe5860b6d054e0/src/hotspot/share/compiler/compilerThread.hpp#L42 @shipilev , that is a good point. Will take a closer look. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23428#issuecomment-2634609075 From ssubramaniam at openjdk.org Tue Feb 4 17:22:25 2025 From: ssubramaniam at openjdk.org (Satyen Subramaniam) Date: Tue, 4 Feb 2025 17:22:25 GMT Subject: Integrated: 8348610: GenShen: TestShenandoahEvacuationInformationEvent failed with setRegions >= regionsFreed: expected 1 >= 57 In-Reply-To: References: Message-ID: On Thu, 30 Jan 2025 04:43:59 GMT, Satyen Subramaniam wrote: > Renaming `ShenandoahEvacuationInformation.freedRegions` to `ShenandoahEvacuationInformation.freeRegions` for clarity, and fixing incorrect assertion in TestShenandoahEvacuationInformationEvent.cpp > > Tested with tier 1, tier 2, and tier 3 tests. This pull request has now been integrated. Changeset: bad39b6d Author: Satyen Subramaniam Committer: Paul Hohensee URL: https://git.openjdk.org/jdk/commit/bad39b6d8892ba9b86bc81bf01108a1df617defb Stats: 15 lines in 5 files changed: 2 ins; 1 del; 12 mod 8348610: GenShen: TestShenandoahEvacuationInformationEvent failed with setRegions >= regionsFreed: expected 1 >= 57 Reviewed-by: wkemper, phh ------------- PR: https://git.openjdk.org/jdk/pull/23362 From wkemper at openjdk.org Wed Feb 5 19:06:20 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 5 Feb 2025 19:06:20 GMT Subject: [jdk24] Withdrawn: 8349002: GenShen: Deadlock during shutdown In-Reply-To: <6a_6G1g93RUACyYcHG5B9HtqFBaqdETRRdhvFWwrfi8=.e88c6252-f6c4-4e3f-972a-2c4495d27127@github.com> References: <6a_6G1g93RUACyYcHG5B9HtqFBaqdETRRdhvFWwrfi8=.e88c6252-f6c4-4e3f-972a-2c4495d27127@github.com> Message-ID: On Mon, 3 Feb 2025 22:27:44 GMT, William Kemper wrote: > Clean backport. Fixes bug introduced by [JDK-8345970](https://bugs.openjdk.org/browse/JDK-8345970). This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.org/jdk/pull/23429 From wkemper at openjdk.org Wed Feb 5 19:06:20 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 5 Feb 2025 19:06:20 GMT Subject: [jdk24] RFR: 8349002: GenShen: Deadlock during shutdown In-Reply-To: <6a_6G1g93RUACyYcHG5B9HtqFBaqdETRRdhvFWwrfi8=.e88c6252-f6c4-4e3f-972a-2c4495d27127@github.com> References: <6a_6G1g93RUACyYcHG5B9HtqFBaqdETRRdhvFWwrfi8=.e88c6252-f6c4-4e3f-972a-2c4495d27127@github.com> Message-ID: On Mon, 3 Feb 2025 22:27:44 GMT, William Kemper wrote: > Clean backport. Fixes bug introduced by [JDK-8345970](https://bugs.openjdk.org/browse/JDK-8345970). Understood. Will target JDK24 update release. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23429#issuecomment-2637789977 From wkemper at openjdk.org Wed Feb 5 19:09:59 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 5 Feb 2025 19:09:59 GMT Subject: Integrated: 8349002: GenShen: Deadlock during shutdown In-Reply-To: References: Message-ID: On Mon, 3 Feb 2025 22:27:54 GMT, William Kemper wrote: > Clean backport. Fixes bug introduced by [JDK-8345970](https://bugs.openjdk.org/browse/JDK-8345970). This pull request has now been integrated. Changeset: ceaeb7b4 Author: William Kemper URL: https://git.openjdk.org/shenandoah-jdk21u/commit/ceaeb7b4271f52af9e73e97a5d19bd441fbcd96a Stats: 5 lines in 2 files changed: 4 ins; 0 del; 1 mod 8349002: GenShen: Deadlock during shutdown Backport-of: 06ebb170bac3879dc1e378b48b1c7ef006070c86 ------------- PR: https://git.openjdk.org/shenandoah-jdk21u/pull/153 From wkemper at openjdk.org Wed Feb 5 22:35:21 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 5 Feb 2025 22:35:21 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions Message-ID: There are several changes to the operation of Shenandoah's control threads here. * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. * The cancellation handling is driven entirely by the cancellation cause * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed * The shutdown sequence is simpler * The generational control thread uses a lock to coordinate updates to the requested cause and generation * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles * The control thread doesn't loop on its own (unless the pacer is enabled). ------------- Commit messages: - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads - Simplify shControlThread - Revert unnecessary changes - Fix interrupted old cycle handling - Restore reporting allocations to pacer - Better names, better comments - WIP: Simplify shutdown protocol - WIP: Don't need request.mode anymore - WIP: Simplify degenerated cycle handling - WIP: Passes tier1, mostly passes tier2 - ... and 4 more: https://git.openjdk.org/jdk/compare/b499c827...f97f257b Changes: https://git.openjdk.org/jdk/pull/23475/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8349094 Stats: 817 lines in 14 files changed: 241 ins; 286 del; 290 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From wkemper at openjdk.org Thu Feb 6 14:25:15 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 6 Feb 2025 14:25:15 GMT Subject: RFR: Merge openjdk/jdk21u:master Message-ID: Merges tag jdk-21.0.7+1 ------------- Commit messages: - 8345468: test/jdk/javax/swing/JScrollBar/4865918/bug4865918.java fails in ubuntu22.04 - 8348625: [21u, 17u] Revert JDK-8185862 to restore old java.awt.headless behavior on Windows - 8347427: JTabbedPane/8134116/Bug8134116.java has no license header - 8332494: java/util/zip/EntryCount64k.java failing with java.lang.RuntimeException: '\\A\\Z' missing from stderr - 8343378: Exceptions in javax/management DeadLockTest.java do not cause test failure - 8327986: ASAN reports use-after-free in DirectivesParserTest.empty_object_vm - 8328387: Convert java/awt/Frame/FrameStateTest/FrameStateTest.html applet test to main - 8327098: GTest needs larger combination limit - 8315486: vmTestbase/nsk/jdwp/ThreadReference/ForceEarlyReturn/forceEarlyReturn002/forceEarlyReturn002.java timed out - 8347129: cpuset cgroups controller is required for no good reason - ... and 157 more: https://git.openjdk.org/shenandoah-jdk21u/compare/7069f193...d2cbada0 The webrev contains the conflicts with master: - merge conflicts: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=154&range=00.conflicts Changes: https://git.openjdk.org/shenandoah-jdk21u/pull/154/files Stats: 61790 lines in 579 files changed: 15357 ins; 6745 del; 39688 mod Patch: https://git.openjdk.org/shenandoah-jdk21u/pull/154.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk21u.git pull/154/head:pull/154 PR: https://git.openjdk.org/shenandoah-jdk21u/pull/154 From wkemper at openjdk.org Thu Feb 6 17:20:52 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 6 Feb 2025 17:20:52 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v2] In-Reply-To: References: Message-ID: <9vqH905wEy_k3MoOq-wmpzFWuniRKpiDAu6en7bOSr4=.a8fee870-a8fc-4532-acc7-c37975e8a948@github.com> > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). William Kemper has updated the pull request incrementally with one additional commit since the last revision: Remove invalid assert, alloc waiters wait until allocation failure is clear ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/f97f257b..a7a6eea1 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=00-01 Stats: 5 lines in 2 files changed: 2 ins; 1 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From kdnilsen at openjdk.org Thu Feb 6 23:34:15 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 6 Feb 2025 23:34:15 GMT Subject: RFR: 8343468: GenShen: Enable relocation of remembered set card tables [v3] In-Reply-To: <6_AoWQhldJttOIEOL1T7HSapPzE4Qn2j4WN7E-bI3rM=.2685d3d8-e47c-42a6-845b-b68f50cc568e@github.com> References: <6_AoWQhldJttOIEOL1T7HSapPzE4Qn2j4WN7E-bI3rM=.2685d3d8-e47c-42a6-845b-b68f50cc568e@github.com> Message-ID: On Thu, 23 Jan 2025 05:45:43 GMT, Cesar Soares Lucas wrote: >> In the current Generational Shenandoah implementation, the pointers to the read and write card tables are established at JVM launch time and fixed during the whole of the application execution. Because they are considered constants, they are embedded as such in JIT-compiled code. >> >> The cleaning of dirty cards in the read card table is performed during the `init-mark` pause, and our experiments show that it represents a sizable portion of that phase's duration. This pull request makes the addresses of the read and write card tables dynamic, with the end goal of reducing the duration of the `init-mark` pause by moving the cleaning of the dirty cards in the read card table to the `reset` concurrent phase. >> >> The idea is quite simple. Instead of using distinct read and write card tables for the entire duration of the JVM execution, we alternate which card table serves as the read/write table during each GC cycle. In the `reset` phase we concurrently clean the cards in the the current _read_ table so that when the cycle reaches the next `init-mark` phase we have a version of the card table totally clear. In the next `init-mark` pause we swap the pointers to the base of the read and write tables. When the `init-mark` finishes the mutator threads will operate on the table just cleaned in the `reset` phase; the GC will operate on the table that just turned the new _read_ table. >> >> Most of the changes in the patch account for the fact that the write card table is no longer at a fixed address. >> >> The primary benefit of this change is that it eliminates the need to copy and zero the remembered set during the init-mark Safepoint. A secondary benefit is that it allows us to replace the init-mark Safepoint with an `init-mark` handshake?something we plan to work on after this PR is merged. >> >> Our internal performance testing showed a significant reduction in the duration of `init-mark` pauses and no statistically significant regression due to the dynamic loading of the card table address in JIT-compiled code. >> >> Functional testing was performed on Linux, macOS, Windows running on x64, AArch64, and their respective 32-bit versions. I?d appreciate it if someone with access to RISC-V (@luhenry ?) and PowerPC (@TheRealMDoerr ?) platforms could review and test the changes for those platforms, as I have limited access to running tests on them. > > Cesar Soares Lucas has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains three commits: > > - Merge master > - Addressing PR comments: some refactorings, ppc fix, off-by-one fix. > - Relocation of Card Tables Thanks for pulling this together. Looks great. ------------- Marked as reviewed by kdnilsen (Author). PR Review: https://git.openjdk.org/jdk/pull/23170#pullrequestreview-2600251495 From wkemper at openjdk.org Fri Feb 7 02:04:29 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 7 Feb 2025 02:04:29 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v3] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). William Kemper has updated the pull request incrementally with three additional commits since the last revision: - Resuming an old cycle should not preempt a young cycle - Use logging tag 'thread' to help control debug volume - Do not stomp on pending requests when running a degenerated cycle ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/a7a6eea1..ae207480 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=01-02 Stats: 64 lines in 8 files changed: 35 ins; 12 del; 17 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From wkemper at openjdk.org Fri Feb 7 22:21:45 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 7 Feb 2025 22:21:45 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v4] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). William Kemper has updated the pull request incrementally with two additional commits since the last revision: - Simplify locking protocol - Make shutdown more robust, make better use of request lock ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/ae207480..a6513bcb Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=02-03 Stats: 133 lines in 5 files changed: 54 ins; 39 del; 40 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From wkemper at openjdk.org Fri Feb 7 22:28:25 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 7 Feb 2025 22:28:25 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v5] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). William Kemper has updated the pull request incrementally with one additional commit since the last revision: Fix includes ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/a6513bcb..d16f6fd0 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=03-04 Stats: 3 lines in 1 file changed: 1 ins; 2 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From wkemper at openjdk.org Fri Feb 7 22:46:08 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 7 Feb 2025 22:46:08 GMT Subject: RFR: Merge openjdk/jdk21u:master [v2] In-Reply-To: References: Message-ID: > Merges tag jdk-21.0.7+1 William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 168 commits: - Merge branch 'master' into merge-jdk-21.0.7+1 - 8345468: test/jdk/javax/swing/JScrollBar/4865918/bug4865918.java fails in ubuntu22.04 Backport-of: 6f307623568efe4d90942cd22ec9a26b2e1ca1b1 - 8348625: [21u, 17u] Revert JDK-8185862 to restore old java.awt.headless behavior on Windows Reviewed-by: sgehwolf - 8347427: JTabbedPane/8134116/Bug8134116.java has no license header Backport-of: f67b703625afa2e049c572978d29ac00d8c956d3 - 8332494: java/util/zip/EntryCount64k.java failing with java.lang.RuntimeException: '\\A\\Z' missing from stderr Backport-of: f5ab7dff402a3152f5d5736cc6521b4be617eccf - 8343378: Exceptions in javax/management DeadLockTest.java do not cause test failure Backport-of: 4a70c83bd0c563185123ce9d8a34e006c62db7cc - 8327986: ASAN reports use-after-free in DirectivesParserTest.empty_object_vm Reviewed-by: rrich Backport-of: 47f33a59eaaffc74881fcc9e29d13ff9b2538c2a - 8328387: Convert java/awt/Frame/FrameStateTest/FrameStateTest.html applet test to main Backport-of: 269163d509ec3c80983f55c5b47f472fa76be26c - 8327098: GTest needs larger combination limit Backport-of: c901da48e30d53cb8e4e3c1f0584c5f2d3d095f1 - 8315486: vmTestbase/nsk/jdwp/ThreadReference/ForceEarlyReturn/forceEarlyReturn002/forceEarlyReturn002.java timed out Backport-of: 041510dc21df36d9860f4f0048241c2cabb55ee7 - ... and 158 more: https://git.openjdk.org/shenandoah-jdk21u/compare/ceaeb7b4...c2a2d6ce ------------- Changes: https://git.openjdk.org/shenandoah-jdk21u/pull/154/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=154&range=01 Stats: 61769 lines in 574 files changed: 15344 ins; 6744 del; 39681 mod Patch: https://git.openjdk.org/shenandoah-jdk21u/pull/154.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk21u.git pull/154/head:pull/154 PR: https://git.openjdk.org/shenandoah-jdk21u/pull/154 From wkemper at openjdk.org Fri Feb 7 22:46:10 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 7 Feb 2025 22:46:10 GMT Subject: Integrated: Merge openjdk/jdk21u:master In-Reply-To: References: Message-ID: On Thu, 6 Feb 2025 14:18:40 GMT, William Kemper wrote: > Merges tag jdk-21.0.7+1 This pull request has now been integrated. Changeset: c9889dbe Author: William Kemper URL: https://git.openjdk.org/shenandoah-jdk21u/commit/c9889dbe139cf78b77a72122b65905eb389294ed Stats: 61769 lines in 574 files changed: 15344 ins; 6744 del; 39681 mod Merge ------------- PR: https://git.openjdk.org/shenandoah-jdk21u/pull/154 From kdnilsen at openjdk.org Fri Feb 7 23:59:52 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 7 Feb 2025 23:59:52 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References: Message-ID: > At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. > > For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. > > This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Respond to reviewer feedback In testing suggested refinements, I discovered a bug in original implementation. ShenandoahFreeSet::capacity() does not represent the size of young generation. It represents the total size of the young regions that had available memory at the time we most recently rebuilt the ShenandoahFreeSet. I am rerunning the performance tests following this suggested change. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23306/files - new: https://git.openjdk.org/jdk/pull/23306/files/a850e484..7969515d Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=00-01 Stats: 13 lines in 5 files changed: 4 ins; 0 del; 9 mod Patch: https://git.openjdk.org/jdk/pull/23306.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23306/head:pull/23306 PR: https://git.openjdk.org/jdk/pull/23306 From kdnilsen at openjdk.org Fri Feb 7 23:59:52 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 7 Feb 2025 23:59:52 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References:

Message-ID: On Fri, 31 Jan 2025 01:15:01 GMT, Xiaolong Peng wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Respond to reviewer feedback >> >> In testing suggested refinements, I discovered a bug in original >> implementation. ShenandoahFreeSet::capacity() does not represent the >> size of young generation. It represents the total size of the young >> regions that had available memory at the time we most recently rebuilt >> the ShenandoahFreeSet. >> >> I am rerunning the performance tests following this suggested change. > > src/hotspot/share/gc/shenandoah/shenandoahMetrics.cpp line 52: > >> 50: size_t free_actual = free_set->available(); >> 51: // The sum of free_set->capacity() and ->reserved represents capacity of young in generational, heap in non-generational. >> 52: size_t free_expected = ((free_set->capacity() + free_set->reserved()) / 100) * ShenandoahCriticalFreeThreshold; > > We may pass ShenandoahGeneration as parameter to `is_good_progress` to simplify the calculation of free_expected, it should be like: > ` > generation->max_capacity() / 100 * ShenandoahCriticalFreeThreshold > ` > Good part is, free_expected might be more accurate in Full GC/Degen for global cycle, e.g. Full GC collects memory for global, `free_expected` should be calculated using the metrics from global generation. But either way, `free_expected` is not clearly defined in generational mode now, current code also works. Thanks for this suggestion. I've made change. It turns out there was actually a bug in the original implementation, so I am retesting the performance results. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1947334711 From kdnilsen at openjdk.org Fri Feb 7 23:59:52 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 7 Feb 2025 23:59:52 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References:

Message-ID: On Tue, 4 Feb 2025 15:50:59 GMT, Paul Hohensee wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Respond to reviewer feedback >> >> In testing suggested refinements, I discovered a bug in original >> implementation. ShenandoahFreeSet::capacity() does not represent the >> size of young generation. It represents the total size of the young >> regions that had available memory at the time we most recently rebuilt >> the ShenandoahFreeSet. >> >> I am rerunning the performance tests following this suggested change. > > src/hotspot/share/gc/shenandoah/shenandoahMetrics.cpp line 52: > >> 50: size_t free_actual = free_set->available(); >> 51: // The sum of free_set->capacity() and ->reserved represents capacity of young in generational, heap in non-generational. >> 52: size_t free_expected = ((free_set->capacity() + free_set->reserved()) / 100) * ShenandoahCriticalFreeThreshold; > > As an outsider, the units involved and what exactly is being calculated is pretty opaque. Why would we divide by 100 to compute free_expected and not do the same for free_actual? Do we care about integer division truncation? The default value of ShenandoahCriticalFreeThreshold is 1, so multiplying by it is a nop by default, which seems strange. ShenandoahCriticalFreeThreshold represents a percentage of the "total size". To calculate N% of the young generation size, we divide the generation size by 100 and then multiply by ShenandoahCriticalFreeThreshold. This code is a bit different in the most recent revision. Do you think it needs a comment? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1947335561 From xpeng at openjdk.org Sat Feb 8 02:11:21 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Sat, 8 Feb 2025 02:11:21 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References:

Message-ID: On Fri, 7 Feb 2025 23:54:46 GMT, Kelvin Nilsen wrote: >> src/hotspot/share/gc/shenandoah/shenandoahMetrics.cpp line 52: >> >>> 50: size_t free_actual = free_set->available(); >>> 51: // The sum of free_set->capacity() and ->reserved represents capacity of young in generational, heap in non-generational. >>> 52: size_t free_expected = ((free_set->capacity() + free_set->reserved()) / 100) * ShenandoahCriticalFreeThreshold; >> >> We may pass ShenandoahGeneration as parameter to `is_good_progress` to simplify the calculation of free_expected, it should be like: >> ` >> generation->max_capacity() / 100 * ShenandoahCriticalFreeThreshold >> ` >> Good part is, free_expected might be more accurate in Full GC/Degen for global cycle, e.g. Full GC collects memory for global, `free_expected` should be calculated using the metrics from global generation. But either way, `free_expected` is not clearly defined in generational mode now, current code also works. > > Thanks for this suggestion. I've made change. It turns out there was actually a bug in the original implementation, so I am retesting the performance results. Thanks, honest I didn't understand that why `(free_set->capacity() + free_set->reserved()` represents capacity of young in generational, is it the bug you found? `free_set->capacity()` excludes the regions doesn't have enough capacity(it is calculated when rebuild free set) I thought a bit more, it makes more sense to calculate free_expected in `snap_before`, max_capacity of generations may change after collection, the free_expected should be calculated before the cycle. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1947405475 From duke at openjdk.org Sat Feb 8 16:03:19 2025 From: duke at openjdk.org (duke) Date: Sat, 8 Feb 2025 16:03:19 GMT Subject: Withdrawn: 8344880: AArch64: Add compile time check for class offsets In-Reply-To: References: Message-ID: On Fri, 6 Dec 2024 23:57:41 GMT, Chad Rakoczy wrote: > [JDK-8344880](https://bugs.openjdk.org/browse/JDK-8344880) > > Adds compile time checks for str/ldr instructions to verify that the immediate offset will fit. This adds static_assert for constant offsets that are checked at compile time. The macro offset_of is not constexpr so instead the class size is checked. If the size of a class fits into a memory instructions then any offset in it will fit. This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.org/jdk/pull/22623 From wkemper at openjdk.org Mon Feb 10 17:52:20 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 10 Feb 2025 17:52:20 GMT Subject: RFR: 8343468: GenShen: Enable relocation of remembered set card tables [v3] In-Reply-To: <6_AoWQhldJttOIEOL1T7HSapPzE4Qn2j4WN7E-bI3rM=.2685d3d8-e47c-42a6-845b-b68f50cc568e@github.com> References: <6_AoWQhldJttOIEOL1T7HSapPzE4Qn2j4WN7E-bI3rM=.2685d3d8-e47c-42a6-845b-b68f50cc568e@github.com> Message-ID: On Thu, 23 Jan 2025 05:45:43 GMT, Cesar Soares Lucas wrote: >> In the current Generational Shenandoah implementation, the pointers to the read and write card tables are established at JVM launch time and fixed during the whole of the application execution. Because they are considered constants, they are embedded as such in JIT-compiled code. >> >> The cleaning of dirty cards in the read card table is performed during the `init-mark` pause, and our experiments show that it represents a sizable portion of that phase's duration. This pull request makes the addresses of the read and write card tables dynamic, with the end goal of reducing the duration of the `init-mark` pause by moving the cleaning of the dirty cards in the read card table to the `reset` concurrent phase. >> >> The idea is quite simple. Instead of using distinct read and write card tables for the entire duration of the JVM execution, we alternate which card table serves as the read/write table during each GC cycle. In the `reset` phase we concurrently clean the cards in the the current _read_ table so that when the cycle reaches the next `init-mark` phase we have a version of the card table totally clear. In the next `init-mark` pause we swap the pointers to the base of the read and write tables. When the `init-mark` finishes the mutator threads will operate on the table just cleaned in the `reset` phase; the GC will operate on the table that just turned the new _read_ table. >> >> Most of the changes in the patch account for the fact that the write card table is no longer at a fixed address. >> >> The primary benefit of this change is that it eliminates the need to copy and zero the remembered set during the init-mark Safepoint. A secondary benefit is that it allows us to replace the init-mark Safepoint with an `init-mark` handshake?something we plan to work on after this PR is merged. >> >> Our internal performance testing showed a significant reduction in the duration of `init-mark` pauses and no statistically significant regression due to the dynamic loading of the card table address in JIT-compiled code. >> >> Functional testing was performed on Linux, macOS, Windows running on x64, AArch64, and their respective 32-bit versions. I?d appreciate it if someone with access to RISC-V (@luhenry ?) and PowerPC (@TheRealMDoerr ?) platforms could review and test the changes for those platforms, as I have limited access to running tests on them. > > Cesar Soares Lucas has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains three commits: > > - Merge master > - Addressing PR comments: some refactorings, ppc fix, off-by-one fix. > - Relocation of Card Tables Marked as reviewed by wkemper (Committer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23170#pullrequestreview-2606716056 From phh at openjdk.org Mon Feb 10 18:44:13 2025 From: phh at openjdk.org (Paul Hohensee) Date: Mon, 10 Feb 2025 18:44:13 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References:

Message-ID: On Fri, 7 Feb 2025 23:56:56 GMT, Kelvin Nilsen wrote: >> src/hotspot/share/gc/shenandoah/shenandoahMetrics.cpp line 52: >> >>> 50: size_t free_actual = free_set->available(); >>> 51: // The sum of free_set->capacity() and ->reserved represents capacity of young in generational, heap in non-generational. >>> 52: size_t free_expected = ((free_set->capacity() + free_set->reserved()) / 100) * ShenandoahCriticalFreeThreshold; >> >> As an outsider, the units involved and what exactly is being calculated is pretty opaque. Why would we divide by 100 to compute free_expected and not do the same for free_actual? Do we care about integer division truncation? The default value of ShenandoahCriticalFreeThreshold is 1, so multiplying by it is a nop by default, which seems strange. > > ShenandoahCriticalFreeThreshold represents a percentage of the "total size". To calculate N% of the young generation size, we divide the generation size by 100 and then multiply by ShenandoahCriticalFreeThreshold. This code is a bit different in the most recent revision. Do you think it needs a comment? Yes :) ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1949689186 From shade at openjdk.org Mon Feb 10 18:53:12 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 10 Feb 2025 18:53:12 GMT Subject: RFR: 8343468: GenShen: Enable relocation of remembered set card tables [v3] In-Reply-To: <6_AoWQhldJttOIEOL1T7HSapPzE4Qn2j4WN7E-bI3rM=.2685d3d8-e47c-42a6-845b-b68f50cc568e@github.com> References: <6_AoWQhldJttOIEOL1T7HSapPzE4Qn2j4WN7E-bI3rM=.2685d3d8-e47c-42a6-845b-b68f50cc568e@github.com> Message-ID: <5GD87O6WaG7QG9PLlH7ssfGtp1szWUjmosVSl8-TAok=.d04789f7-7f87-4b44-bc76-80676f0f4fc8@github.com> On Thu, 23 Jan 2025 05:45:43 GMT, Cesar Soares Lucas wrote: >> In the current Generational Shenandoah implementation, the pointers to the read and write card tables are established at JVM launch time and fixed during the whole of the application execution. Because they are considered constants, they are embedded as such in JIT-compiled code. >> >> The cleaning of dirty cards in the read card table is performed during the `init-mark` pause, and our experiments show that it represents a sizable portion of that phase's duration. This pull request makes the addresses of the read and write card tables dynamic, with the end goal of reducing the duration of the `init-mark` pause by moving the cleaning of the dirty cards in the read card table to the `reset` concurrent phase. >> >> The idea is quite simple. Instead of using distinct read and write card tables for the entire duration of the JVM execution, we alternate which card table serves as the read/write table during each GC cycle. In the `reset` phase we concurrently clean the cards in the the current _read_ table so that when the cycle reaches the next `init-mark` phase we have a version of the card table totally clear. In the next `init-mark` pause we swap the pointers to the base of the read and write tables. When the `init-mark` finishes the mutator threads will operate on the table just cleaned in the `reset` phase; the GC will operate on the table that just turned the new _read_ table. >> >> Most of the changes in the patch account for the fact that the write card table is no longer at a fixed address. >> >> The primary benefit of this change is that it eliminates the need to copy and zero the remembered set during the init-mark Safepoint. A secondary benefit is that it allows us to replace the init-mark Safepoint with an `init-mark` handshake?something we plan to work on after this PR is merged. >> >> Our internal performance testing showed a significant reduction in the duration of `init-mark` pauses and no statistically significant regression due to the dynamic loading of the card table address in JIT-compiled code. >> >> Functional testing was performed on Linux, macOS, Windows running on x64, AArch64, and their respective 32-bit versions. I?d appreciate it if someone with access to RISC-V (@luhenry ?) and PowerPC (@TheRealMDoerr ?) platforms could review and test the changes for those platforms, as I have limited access to running tests on them. > > Cesar Soares Lucas has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains three commits: > > - Merge master > - Addressing PR comments: some refactorings, ppc fix, off-by-one fix. > - Relocation of Card Tables I'll take a look at this tomorrow, thanks. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23170#issuecomment-2648942403 From kdnilsen at openjdk.org Mon Feb 10 19:55:12 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 19:55:12 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v2] In-Reply-To: References:

Message-ID: <68DeNcSBaX3EJo0OuQI7800ywqaQjhcCMpIjFqwdoao=.0da72a64-afa1-43bc-83bb-d4caf0d62514@github.com> On Tue, 4 Feb 2025 16:08:02 GMT, Paul Hohensee wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Respond to reviewer feedback > > src/hotspot/share/gc/shenandoah/heuristics/shenandoahHeuristics.hpp line 87: > >> 85: size_t _declined_trigger_count; // This counts how many times since previous GC finished that this >> 86: // heuristic has answered false to should_start_gc(). >> 87: size_t _previous_trigger_declinations; // This represents the value of _declined_trigger_count as captured at the > > Maybe the name should be _most_recent_declined_trigger_count, which relates it directly to _declined_trigger_count. Thanks for suggestion. I'm making this change. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23305#discussion_r1949788716 From kdnilsen at openjdk.org Mon Feb 10 20:02:14 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 20:02:14 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v2] In-Reply-To: References:

Message-ID: On Tue, 4 Feb 2025 16:04:34 GMT, Paul Hohensee wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Respond to reviewer feedback > > src/hotspot/share/gc/shenandoah/heuristics/shenandoahHeuristics.cpp line 261: > >> 259: >> 260: void ShenandoahHeuristics::record_success_concurrent() { >> 261: _start_gc_is_pending = false; > > The name _start_gc_is_pending implies that it should be set false as soon as a gc cycle starts, not when it finishes. Maybe _gc_pending? Or maybe setting it false at the end of a gc cycle is a bug? :) You make a good point. I'll change the control flow to cancel the trigger as soon as we start up the GC. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23305#discussion_r1949798178 From kdnilsen at openjdk.org Mon Feb 10 20:28:54 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 20:28:54 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v3] In-Reply-To: References: Message-ID: <2v0axonBAvZDKo779TX8POWEXGeMCA5xaKV3KQBQo14=.fbd1e6bc-0e12-4a0c-a9f7-ba1d3c5f728d@github.com> > Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. > > We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. > > As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catchup. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. > > This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. Kelvin Nilsen has updated the pull request incrementally with two additional commits since the last revision: - Respond to reviewer feedback - Use generation size to determine expected free ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23305/files - new: https://git.openjdk.org/jdk/pull/23305/files/ee3cdacc..8a9e4c5e Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=01-02 Stats: 27 lines in 8 files changed: 13 ins; 3 del; 11 mod Patch: https://git.openjdk.org/jdk/pull/23305.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23305/head:pull/23305 PR: https://git.openjdk.org/jdk/pull/23305 From kdnilsen at openjdk.org Mon Feb 10 20:28:54 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 20:28:54 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v2] In-Reply-To: References:

Message-ID: On Tue, 4 Feb 2025 16:14:49 GMT, Paul Hohensee wrote: >> Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: >> >> Respond to reviewer feedback > > src/hotspot/share/gc/shenandoah/heuristics/shenandoahAdaptiveHeuristics.cpp line 318: > >> 316: >> 317: if (ShenandoahHeuristics::should_start_gc()) { >> 318: _start_gc_is_pending = true; > > I assume there's no race here, i.e., only one thread reads/writes _start_gc_is_pending. If there's a race, make sure it's benign. In either case, _start_gc_is_pending is made "sticky" by this code. There is no race. A single control thread queries should_start-gc() and that is the same thread that initiates the GC. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23305#discussion_r1949828557 From kdnilsen at openjdk.org Mon Feb 10 20:41:10 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 20:41:10 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References:

Message-ID: On Sat, 8 Feb 2025 02:06:13 GMT, Xiaolong Peng wrote: >> Thanks for this suggestion. I've made change. It turns out there was actually a bug in the original implementation, so I am retesting the performance results. > > Thanks, honest I didn't understand that why `(free_set->capacity() + free_set->reserved()` represents capacity of young in generational, is it the bug you found? `free_set->capacity()` is the capacity of all mutator regions which also excludes the regions doesn't have capacity for new object alloc(it is calculated when rebuild free set) > > I thought a bit more, it makes more sense to calculate free_expected in `snap_before`, max_capacity of generations may change after collection, the free_expected should be calculated before the cycle. Interesting thoughts. So young-generation size will change under these circumstances: 1. There's a lot of young-gen memory to be promoted, or we choose to promote some young-gen regions in place (by relabeling the regions as OLD without evacuating their data). In both of these cases, we may shrink young in order to expand old. 2. The GC cycle is mixed, so it has the side effect of reclaiming some old regions. These reclaimed old regions will typically be granted back to young, until such time as we need to expand old in order to hold results of promotion. While it makes sense for expected to be computed based on "original size" of young generation, the question of how much free remaining in young represents "good progress" should probably be based on the current size of young. Ultimately, we are trying to figure out if there's enough memory in young to make it worthwhile to attempt another concurrent GC cycle. I realize this thinking is a bit "fuzzy". The heuristic was originally designed for non-generational use. I'm inclined to keep as is currently implemented, but should probably add a comment to explain why. What do you think? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1949847579 From wkemper at openjdk.org Mon Feb 10 21:26:59 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 10 Feb 2025 21:26:59 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v6] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). William Kemper has updated the pull request incrementally with two additional commits since the last revision: - Add event for control thread state changes - Fix shutdown livelock error ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/d16f6fd0..f11584d5 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=05 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=04-05 Stats: 13 lines in 1 file changed: 6 ins; 3 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From wkemper at openjdk.org Mon Feb 10 21:54:51 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 10 Feb 2025 21:54:51 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 [v2] In-Reply-To: References: Message-ID: <73lkaeIWH7aBWahNyU_czTYSnSmCMOURWYDv55-zc4Y=.39398a24-6904-465c-8e47-7bfe32efc9db@github.com> > Non-java threads were not having their gc-state configured when they attach. If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. William Kemper has updated the pull request incrementally with one additional commit since the last revision: Hold the thread lock when concurrently changing gc state ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23428/files - new: https://git.openjdk.org/jdk/pull/23428/files/f402628e..1a4e3bb1 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23428&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23428&range=00-01 Stats: 13 lines in 1 file changed: 8 ins; 0 del; 5 mod Patch: https://git.openjdk.org/jdk/pull/23428.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23428/head:pull/23428 PR: https://git.openjdk.org/jdk/pull/23428 From xpeng at openjdk.org Mon Feb 10 22:27:12 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Mon, 10 Feb 2025 22:27:12 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References:

Message-ID: <7B6iyKusHAIUGeaVJVEiCWQTIe64ZPJKImH-YYUB3K0=.8d7612e4-9e08-4b90-9c60-0f68d3e7c4ad@github.com> On Mon, 10 Feb 2025 20:38:35 GMT, Kelvin Nilsen wrote: >> Thanks, honest I didn't understand that why `(free_set->capacity() + free_set->reserved()` represents capacity of young in generational, is it the bug you found? `free_set->capacity()` is the capacity of all mutator regions which also excludes the regions doesn't have capacity for new object alloc(it is calculated when rebuild free set) >> >> I thought a bit more, it makes more sense to calculate free_expected in `snap_before`, max_capacity of generations may change after collection, the free_expected should be calculated before the cycle. > > Interesting thoughts. So young-generation size will change under these circumstances: > > 1. There's a lot of young-gen memory to be promoted, or we choose to promote some young-gen regions in place (by relabeling the regions as OLD without evacuating their data). In both of these cases, we may shrink young in order to expand old. > 2. The GC cycle is mixed, so it has the side effect of reclaiming some old regions. These reclaimed old regions will typically be granted back to young, until such time as we need to expand old in order to hold results of promotion. > > While it makes sense for expected to be computed based on "original size" of young generation, the question of how much free remaining in young represents "good progress" should probably be based on the current size of young. Ultimately, we are trying to figure out if there's enough memory in young to make it worthwhile to attempt another concurrent GC cycle. > > I realize this thinking is a bit "fuzzy". The heuristic was originally designed for non-generational use. > > I'm inclined to keep as is currently implemented, but should probably add a comment to explain why. What do you think? Thanks for the explanation, I agree with it is bit "fuzzy". I'm not sure we should consider following case: Degen cycle doesn't reclaim any memory, but promoted some young regions resulting in young capacity to shrink, in this case we may treat it as "good progress" but actually it is not. A "good progress" could be `free_actual_after > free_actual_before && free_actual_after > free_expected`, what do you think? I am not sure all cases triggering degen cycle, this might be a false case that never happens. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1949985042 From kdnilsen at openjdk.org Mon Feb 10 23:19:27 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 23:19:27 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v4] In-Reply-To: References: Message-ID: > Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. > > We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. > > As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catchup. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. > > This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Revert "Use generation size to determine expected free" This reverts commit 94a32ebfe5fefcc0e899e09e6fbfc0585c62b4e0. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23305/files - new: https://git.openjdk.org/jdk/pull/23305/files/8a9e4c5e..ee7fe689 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=02-03 Stats: 5 lines in 4 files changed: 0 ins; 0 del; 5 mod Patch: https://git.openjdk.org/jdk/pull/23305.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23305/head:pull/23305 PR: https://git.openjdk.org/jdk/pull/23305 From kdnilsen at openjdk.org Mon Feb 10 23:32:09 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 23:32:09 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: <7B6iyKusHAIUGeaVJVEiCWQTIe64ZPJKImH-YYUB3K0=.8d7612e4-9e08-4b90-9c60-0f68d3e7c4ad@github.com> References:

<7B6iyKusHAIUGeaVJVEiCWQTIe64ZPJKImH-YYUB3K0=.8d7612e4-9e08-4b90-9c60-0f68d3e7c4ad@github.com> Message-ID: <16WXn9LEVXGdRSeJ98OxomG66UfnruLxo9nnfY52ZJo=.f4acdbb1-c99b-4be8-807b-bdbf9504af81@github.com> On Mon, 10 Feb 2025 22:24:35 GMT, Xiaolong Peng wrote: >> Interesting thoughts. So young-generation size will change under these circumstances: >> >> 1. There's a lot of young-gen memory to be promoted, or we choose to promote some young-gen regions in place (by relabeling the regions as OLD without evacuating their data). In both of these cases, we may shrink young in order to expand old. >> 2. The GC cycle is mixed, so it has the side effect of reclaiming some old regions. These reclaimed old regions will typically be granted back to young, until such time as we need to expand old in order to hold results of promotion. >> >> While it makes sense for expected to be computed based on "original size" of young generation, the question of how much free remaining in young represents "good progress" should probably be based on the current size of young. Ultimately, we are trying to figure out if there's enough memory in young to make it worthwhile to attempt another concurrent GC cycle. >> >> I realize this thinking is a bit "fuzzy". The heuristic was originally designed for non-generational use. >> >> I'm inclined to keep as is currently implemented, but should probably add a comment to explain why. What do you think? > > Thanks for the explanation, I agree with it is bit "fuzzy". > I'm not sure we should consider following case: > > Degen cycle doesn't reclaim any memory, but promoted some young regions resulting in young capacity to shrink, in this case we may treat it as "good progress" but actually it is not. > > A "good progress" could be `free_actual_after > free_actual_before && free_actual_after > free_expected`, what do you think? I am not sure all cases triggering degen cycle, this might be a false case that never happens. If we manage to pass the test "free_actual_after > free_expected" following the degen, even if young has shrunk, I think it is reasonable to pursue concurrent GC. Passing this exact test at the end of the next GC (assuming no further adjustments to generation sizes) would qualify us to continue with concurrent GC on the next cycle. In general, it is very rare that "full gc" is the right thing to do. we're in the process of deprecating it entirely. I will add a comment to clarify the thinking here. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1950040394 From kdnilsen at openjdk.org Mon Feb 10 23:43:11 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 10 Feb 2025 23:43:11 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v2] In-Reply-To: References:

Message-ID: On Fri, 7 Feb 2025 23:59:52 GMT, Kelvin Nilsen wrote: >> At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. >> >> For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. >> >> This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. > > Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: > > Respond to reviewer feedback > > In testing suggested refinements, I discovered a bug in original > implementation. ShenandoahFreeSet::capacity() does not represent the > size of young generation. It represents the total size of the young > regions that had available memory at the time we most recently rebuilt > the ShenandoahFreeSet. > > I am rerunning the performance tests following this suggested change. Thank for the comprehensive tests and explanations, my approve doesn't count though:) ------------- Marked as reviewed by xpeng (Author). PR Review: https://git.openjdk.org/jdk/pull/23306#pullrequestreview-2607419434 From wkemper at openjdk.org Tue Feb 11 00:54:36 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 11 Feb 2025 00:54:36 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v7] In-Reply-To: References: Message-ID: <7vsmPKQNSOx9PxGp2C1yjC5IeEtB2ZWPRybQQ-s4YNE=.1b8ffa7e-cc6d-4885-a9c4-16a503d9d8d9@github.com> > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). William Kemper has updated the pull request incrementally with two additional commits since the last revision: - Do not accept requests if control thread is terminating - Notify waiters when control thread terminates ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/f11584d5..861ed699 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=06 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=05-06 Stats: 26 lines in 3 files changed: 24 ins; 2 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From kdnilsen at openjdk.org Tue Feb 11 03:39:41 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 11 Feb 2025 03:39:41 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc Message-ID: In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. ------------- Commit messages: - Be less eager to upgrade degen to full gc Changes: https://git.openjdk.org/jdk/pull/23552/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23552&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8349766 Stats: 20 lines in 2 files changed: 17 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/23552.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23552/head:pull/23552 PR: https://git.openjdk.org/jdk/pull/23552 From kdnilsen at openjdk.org Tue Feb 11 03:39:41 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 11 Feb 2025 03:39:41 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc In-Reply-To: References: Message-ID: On Tue, 11 Feb 2025 03:31:51 GMT, Kelvin Nilsen wrote: > In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. Some detailed results running the workload mentioned in JBS ticket on tip: ![Screenshot 2025-02-10 at 7 10 18?PM](https://github.com/user-attachments/assets/c06606a6-ec21-4e40-b117-915ddfc0d1f6) These are results running the same workload with the changes of this PR: ![Screenshot 2025-02-10 at 7 35 47?PM](https://github.com/user-attachments/assets/432c227e-9bf4-4f21-8099-1b39b5af364a) ------------- PR Comment: https://git.openjdk.org/jdk/pull/23552#issuecomment-2649732684 PR Comment: https://git.openjdk.org/jdk/pull/23552#issuecomment-2649733471 From kdnilsen at openjdk.org Tue Feb 11 03:53:10 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 11 Feb 2025 03:53:10 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc In-Reply-To: References: Message-ID: On Tue, 11 Feb 2025 03:31:51 GMT, Kelvin Nilsen wrote: > In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. Green represents improvement compared to tip. For runs 3-5, the new code is notably better. Run 2 is significantly worse in p50, but about equal to tip average at p99.999 and above. Run 1 is close to averages of tip at p50, but up to 56% above max at higher percentiles. Most noteworthy is that we were able to significantly reduce the number of Full GCs without causing a crash or OOM. On this workload, full GCs are known to require approximately 3 s of pause time. The average degen cycle required 1.4s (102 out of cycle, 142 at roots, 149 at mark, 1 at evac, 28 at update refs). Note that an upgraded Full GC results in a pause that is the sum of a Full GC plus a degenerated GC. While there is reason to be concerned about trial two results on the PR code, I expect that unlucky scenario, whatever it was, will be much less likely in the context of in-flight PRs to advance triggering of GC when allocation rates are accelerating and to surge GC workers whenever there is increased risk of degenerated cycles. Perhaps, we should wait until those other PRs are integrated and then retest. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23552#issuecomment-2649744182 From kdnilsen at openjdk.org Tue Feb 11 04:08:48 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 11 Feb 2025 04:08:48 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v3] In-Reply-To: References: Message-ID: <0bbrstoX8nDMn2Ku_WwSYn_NYSSLi3yXkWdg28imCHo=.ab1661a4-1ea5-4c57-9fde-0ee63ebac027@github.com> > At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. > > For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. > > This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Add comments suggested by reviewers ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23306/files - new: https://git.openjdk.org/jdk/pull/23306/files/7969515d..8f644cdb Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=01-02 Stats: 15 lines in 1 file changed: 14 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/23306.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23306/head:pull/23306 PR: https://git.openjdk.org/jdk/pull/23306 From xpeng at openjdk.org Tue Feb 11 05:54:11 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Tue, 11 Feb 2025 05:54:11 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v3] In-Reply-To: <0bbrstoX8nDMn2Ku_WwSYn_NYSSLi3yXkWdg28imCHo=.ab1661a4-1ea5-4c57-9fde-0ee63ebac027@github.com> References: <0bbrstoX8nDMn2Ku_WwSYn_NYSSLi3yXkWdg28imCHo=.ab1661a4-1ea5-4c57-9fde-0ee63ebac027@github.com> Message-ID: On Tue, 11 Feb 2025 04:08:48 GMT, Kelvin Nilsen wrote: >> At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. >> >> For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. >> >> This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. > > Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: > > Add comments suggested by reviewers Marked as reviewed by xpeng (Author). ------------- PR Review: https://git.openjdk.org/jdk/pull/23306#pullrequestreview-2607738788 From shade at openjdk.org Tue Feb 11 08:50:11 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Tue, 11 Feb 2025 08:50:11 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 [v2] In-Reply-To: <73lkaeIWH7aBWahNyU_czTYSnSmCMOURWYDv55-zc4Y=.39398a24-6904-465c-8e47-7bfe32efc9db@github.com> References: <73lkaeIWH7aBWahNyU_czTYSnSmCMOURWYDv55-zc4Y=.39398a24-6904-465c-8e47-7bfe32efc9db@github.com> Message-ID: On Mon, 10 Feb 2025 21:54:51 GMT, William Kemper wrote: >> Non-java threads were not having their gc-state configured when they attach. If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. > > William Kemper has updated the pull request incrementally with one additional commit since the last revision: > > Hold the thread lock when concurrently changing gc state Great find. So that means we cannot safely do `ShenandoahHeap::set_gc_state_concurrent`, unless we hold `Threads_lock` and do a handshake afterwards? I think a part of comment that you have near `MutexLocker` can go to `ShenandoahHeap::set_gc_state_concurrent` with the `assert(Threads_lock->is_locked(), ...`. ------------- PR Review: https://git.openjdk.org/jdk/pull/23428#pullrequestreview-2608045755 From phh at openjdk.org Tue Feb 11 14:20:12 2025 From: phh at openjdk.org (Paul Hohensee) Date: Tue, 11 Feb 2025 14:20:12 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v3] In-Reply-To: <0bbrstoX8nDMn2Ku_WwSYn_NYSSLi3yXkWdg28imCHo=.ab1661a4-1ea5-4c57-9fde-0ee63ebac027@github.com> References: <0bbrstoX8nDMn2Ku_WwSYn_NYSSLi3yXkWdg28imCHo=.ab1661a4-1ea5-4c57-9fde-0ee63ebac027@github.com> Message-ID: <2XLAHIk0VEr8Xae-jNqjMZjBtPTrHqm8nl7tn_rigS8=.155e8a5a-193a-49b8-a773-b8e60b4dc3f5@github.com> On Tue, 11 Feb 2025 04:08:48 GMT, Kelvin Nilsen wrote: >> At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. >> >> For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. >> >> This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. > > Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: > > Add comments suggested by reviewers Looks good. ------------- Marked as reviewed by phh (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23306#pullrequestreview-2608901033 From kdnilsen at openjdk.org Tue Feb 11 14:20:13 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 11 Feb 2025 14:20:13 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v3] In-Reply-To: References:

Message-ID: On Mon, 10 Feb 2025 18:41:27 GMT, Paul Hohensee wrote: >> ShenandoahCriticalFreeThreshold represents a percentage of the "total size". To calculate N% of the young generation size, we divide the generation size by 100 and then multiply by ShenandoahCriticalFreeThreshold. This code is a bit different in the most recent revision. Do you think it needs a comment? > > Yes :) I've added a comment here. Thanks. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23306#discussion_r1950933308 From ayang at openjdk.org Tue Feb 11 15:28:25 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 11 Feb 2025 15:28:25 GMT Subject: RFR: 8348171: Refactor GenerationCounters and its subclasses [v5] In-Reply-To: References: Message-ID: <7otkT63ENoyKzZ29CbYpycLLwL89ARajYg36Mstz4tQ=.fd3c7dcf-5a8b-44be-9205-09e3d160d54d@github.com> > Simple refactoring of removing the use of `virtual` method and use concrete subclasses when needed. > > Test: tier1-5 Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains seven commits: - Merge branch 'master' into gen-counter - review - * some more refactoring - review - Merge branch 'master' into gen-counter - merge - gen-counter ------------- Changes: https://git.openjdk.org/jdk/pull/23209/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23209&range=04 Stats: 202 lines in 17 files changed: 6 ins; 160 del; 36 mod Patch: https://git.openjdk.org/jdk/pull/23209.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23209/head:pull/23209 PR: https://git.openjdk.org/jdk/pull/23209 From kdnilsen at openjdk.org Tue Feb 11 18:15:58 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 11 Feb 2025 18:15:58 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v4] In-Reply-To: References: Message-ID: > At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. > > For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. > > This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains five additional commits since the last revision: - Merge tag 'jdk-25+9' into fix-generational-no-progress-check Added tag jdk-25+9 for changeset 30f71622 - Add comments suggested by reviewers - Respond to reviewer feedback In testing suggested refinements, I discovered a bug in original implementation. ShenandoahFreeSet::capacity() does not represent the size of young generation. It represents the total size of the young regions that had available memory at the time we most recently rebuilt the ShenandoahFreeSet. I am rerunning the performance tests following this suggested change. - Use freeset to determine goodness of progress As previously implemented, we used the heap size to measure goodness of progress. However, heap size is only appropriate for non-generational Shenandoah. Freeset abstraction works for both. - Use size-of young generation to assess progress Previously, we were using size of heap to asses progress of generational degenerated cycle. But that is not appropriate, because the collection set is chosen based on the size of young generation. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23306/files - new: https://git.openjdk.org/jdk/pull/23306/files/8f644cdb..8c610136 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=02-03 Stats: 43531 lines in 2988 files changed: 18658 ins; 14204 del; 10669 mod Patch: https://git.openjdk.org/jdk/pull/23306.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23306/head:pull/23306 PR: https://git.openjdk.org/jdk/pull/23306 From kdnilsen at openjdk.org Tue Feb 11 18:21:09 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 11 Feb 2025 18:21:09 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v5] In-Reply-To: References: Message-ID: > Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. > > We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. > > As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catchup. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. > > This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains nine additional commits since the last revision: - Merge tag 'jdk-25+9' into eliminate-no-fault-degen-penalties Added tag jdk-25+9 for changeset 30f71622 - Revert "Use generation size to determine expected free" This reverts commit 94a32ebfe5fefcc0e899e09e6fbfc0585c62b4e0. - Respond to reviewer feedback - Use generation size to determine expected free - Respond to reviewer feedback - Fix white space - Remove debug instrumentation - Only penalize heuristic if heuristic responsible If we degenerate through no fault of "late triggering", then do not penalize the heuristic. - Eliminate no-fault degen penalties As originally implemented, we apply penalties to the triggering heuristic every time we experience a degenerated cycle. This has the effect of forcing GC triggers to spiral out of control. This commit changes the penalty mechanism. When a degen happens through no fault of the heuristic triggering mechanism, we do not pile on additional penalties. Specifically, we consider that heuristic triggering is not responsible for a degenerated cycle that is associated with a GC that began immediately following the end of the previous GC cycle. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23305/files - new: https://git.openjdk.org/jdk/pull/23305/files/ee7fe689..3aabd4db Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=03-04 Stats: 43531 lines in 2988 files changed: 18658 ins; 14204 del; 10669 mod Patch: https://git.openjdk.org/jdk/pull/23305.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23305/head:pull/23305 PR: https://git.openjdk.org/jdk/pull/23305 From phh at openjdk.org Tue Feb 11 18:51:16 2025 From: phh at openjdk.org (Paul Hohensee) Date: Tue, 11 Feb 2025 18:51:16 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v5] In-Reply-To: References:

Message-ID: <8Gt2wkVtRhYtPwLWfkuH8fWrboud7gjBRpCfzT2GeLw=.9e580aa0-34b7-4b7e-9ab7-f49cec2d3a6a@github.com> On Tue, 11 Feb 2025 18:21:09 GMT, Kelvin Nilsen wrote: >> Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. >> >> We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. >> >> As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catchup. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. >> >> This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. > > Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains nine additional commits since the last revision: > > - Merge tag 'jdk-25+9' into eliminate-no-fault-degen-penalties > > Added tag jdk-25+9 for changeset 30f71622 > - Revert "Use generation size to determine expected free" > > This reverts commit 94a32ebfe5fefcc0e899e09e6fbfc0585c62b4e0. > - Respond to reviewer feedback > - Use generation size to determine expected free > - Respond to reviewer feedback > - Fix white space > - Remove debug instrumentation > - Only penalize heuristic if heuristic responsible > > If we degenerate through no fault of "late triggering", then do not > penalize the heuristic. > - Eliminate no-fault degen penalties > > As originally implemented, we apply penalties to the triggering > heuristic every time we experience a degenerated cycle. This has the > effect of forcing GC triggers to spiral out of control. This commit > changes the penalty mechanism. When a degen happens through no fault of > the heuristic triggering mechanism, we do not pile on additional > penalties. Specifically, we consider that heuristic triggering is not > responsible for a degenerated cycle that is associated with a GC that > began immediately following the end of the previous GC cycle. Marked as reviewed by phh (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23305#pullrequestreview-2609684439 From wkemper at openjdk.org Tue Feb 11 19:31:09 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 11 Feb 2025 19:31:09 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 [v2] In-Reply-To: <73lkaeIWH7aBWahNyU_czTYSnSmCMOURWYDv55-zc4Y=.39398a24-6904-465c-8e47-7bfe32efc9db@github.com> References: <73lkaeIWH7aBWahNyU_czTYSnSmCMOURWYDv55-zc4Y=.39398a24-6904-465c-8e47-7bfe32efc9db@github.com> Message-ID: <91rj68CdahMzjrRCIMEH0mR6CxmDQayALIYHXBykJ5c=.a4164dc3-79db-43e8-9a8a-c6216e826f5b@github.com> On Mon, 10 Feb 2025 21:54:51 GMT, William Kemper wrote: >> Non-java threads were not having their gc-state configured when they attach. If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. > > William Kemper has updated the pull request incrementally with one additional commit since the last revision: > > Hold the thread lock when concurrently changing gc state That's right. The `on_thread_attach` callback and the thread being added to the list of threads _does_ happen under the `Thread_lock`, by the handshake mechanism (and the java thread iterator) do _not_ take the thread lock. In this particular assertion violation, the thread received a stale `gc_state` when it attached (before the control thread even entered `concurrent_prepare_for_update_refs`), however, the control thread executed the handshake _before_ the recently attached thread was actually added to the java thread list. I will update the comment and add an assert in `set_gc_state_concurrent`. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23428#issuecomment-2651866466 From wkemper at openjdk.org Tue Feb 11 19:39:25 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 11 Feb 2025 19:39:25 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 [v3] In-Reply-To: References: Message-ID: <-v8uprH0cQK06apB7HGbrHNO31cCmzOXxMiZB8ipWx4=.7ce5e5da-bcc8-4101-9bf8-23fb899d06c2@github.com> > Non-java threads were not having their gc-state configured when they attach. If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. William Kemper has updated the pull request incrementally with one additional commit since the last revision: Update comments, add an assertion ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23428/files - new: https://git.openjdk.org/jdk/pull/23428/files/1a4e3bb1..c57bf8a0 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23428&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23428&range=01-02 Stats: 11 lines in 1 file changed: 6 ins; 1 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/23428.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23428/head:pull/23428 PR: https://git.openjdk.org/jdk/pull/23428 From shade at openjdk.org Tue Feb 11 20:11:10 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Tue, 11 Feb 2025 20:11:10 GMT Subject: RFR: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 [v3] In-Reply-To: <-v8uprH0cQK06apB7HGbrHNO31cCmzOXxMiZB8ipWx4=.7ce5e5da-bcc8-4101-9bf8-23fb899d06c2@github.com> References: <-v8uprH0cQK06apB7HGbrHNO31cCmzOXxMiZB8ipWx4=.7ce5e5da-bcc8-4101-9bf8-23fb899d06c2@github.com> Message-ID: On Tue, 11 Feb 2025 19:39:25 GMT, William Kemper wrote: >> Non-java threads were not having their gc-state configured when they attach. If they were created before the verifier's safepoint, but after the iteration over non-java threads, they would not have the correct state. > > William Kemper has updated the pull request incrementally with one additional commit since the last revision: > > Update comments, add an assertion Marked as reviewed by shade (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23428#pullrequestreview-2609916144 From wkemper at openjdk.org Tue Feb 11 20:23:14 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 11 Feb 2025 20:23:14 GMT Subject: Integrated: 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 In-Reply-To: References: Message-ID: <9AZBJik8xf6tZdYSesYFvrlDs6Z8tDbEZkXvQz7Cm6s=.cb15a767-4988-40df-b87e-2e1868a15752@github.com> On Mon, 3 Feb 2025 20:28:58 GMT, William Kemper wrote: > Non-java threads were not having their gc-state configured when they attach. Additionally, we need to hold the `Threads_lock` when concurrently changing the gc state to make sure that any stale gc state observed when the thread `attaches` is fixed by the handshake when the thread list is iterated. This pull request has now been integrated. Changeset: 8c09d40d Author: William Kemper URL: https://git.openjdk.org/jdk/commit/8c09d40d6c345fda9fc7b358a53cae3b5965580b Stats: 22 lines in 2 files changed: 16 ins; 1 del; 5 mod 8348268: Test gc/shenandoah/TestResizeTLAB.java#compact: fatal error: Before Updating References: Thread C2 CompilerThread1: expected gc-state 9, actual 21 Reviewed-by: shade ------------- PR: https://git.openjdk.org/jdk/pull/23428 From wkemper at openjdk.org Tue Feb 11 23:01:58 2025 From: wkemper at openjdk.org (William Kemper) Date: Tue, 11 Feb 2025 23:01:58 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v8] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). William Kemper has updated the pull request incrementally with one additional commit since the last revision: Make shutdown safer for threads requesting (or expecting) gc ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/861ed699..047d6ffa Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=07 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=06-07 Stats: 35 lines in 5 files changed: 9 ins; 18 del; 8 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From wkemper at openjdk.org Wed Feb 12 21:12:39 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 12 Feb 2025 21:12:39 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v9] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). William Kemper has updated the pull request incrementally with one additional commit since the last revision: Improve message for assertion ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/047d6ffa..779492c6 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=08 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=07-08 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From wkemper at openjdk.org Thu Feb 13 00:20:40 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 13 Feb 2025 00:20:40 GMT Subject: RFR: 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) Message-ID: Restore weak roots rendezvous handshake. This is necessary to have mutators complete the LRB before the concurrent GC invalidates any oop handles that may exist in native stacks. ------------- Commit messages: - Restore weak roots rendezvous handshake Changes: https://git.openjdk.org/jdk/pull/23604/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23604&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8348092 Stats: 19 lines in 1 file changed: 14 ins; 0 del; 5 mod Patch: https://git.openjdk.org/jdk/pull/23604.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23604/head:pull/23604 PR: https://git.openjdk.org/jdk/pull/23604 From shade at openjdk.org Thu Feb 13 08:34:14 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Thu, 13 Feb 2025 08:34:14 GMT Subject: RFR: 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) In-Reply-To: References: Message-ID: On Thu, 13 Feb 2025 00:15:43 GMT, William Kemper wrote: > Restore weak roots rendezvous handshake. This is necessary to have mutators complete the LRB before the concurrent GC invalidates any oop handles that may exist in native stacks. Marked as reviewed by shade (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23604#pullrequestreview-2614219213 From ayang at openjdk.org Thu Feb 13 09:23:27 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 13 Feb 2025 09:23:27 GMT Subject: RFR: 8348171: Refactor GenerationCounters and its subclasses [v5] In-Reply-To: <7otkT63ENoyKzZ29CbYpycLLwL89ARajYg36Mstz4tQ=.fd3c7dcf-5a8b-44be-9205-09e3d160d54d@github.com> References: <7otkT63ENoyKzZ29CbYpycLLwL89ARajYg36Mstz4tQ=.fd3c7dcf-5a8b-44be-9205-09e3d160d54d@github.com> Message-ID: On Tue, 11 Feb 2025 15:28:25 GMT, Albert Mingkun Yang wrote: >> Simple refactoring of removing the use of `virtual` method and use concrete subclasses when needed. >> >> Test: tier1-5 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains seven commits: > > - Merge branch 'master' into gen-counter > - review > - * some more refactoring > - review > - Merge branch 'master' into gen-counter > - merge > - gen-counter Any suggestions/comments/objections from Shenandoah team? I'd like to merge this patch, if none. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23209#issuecomment-2655989267 From wkemper at openjdk.org Thu Feb 13 14:23:52 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 13 Feb 2025 14:23:52 GMT Subject: RFR: Merge openjdk/jdk21u:master Message-ID: Merges tag jdk-21.0.7+2 ------------- Commit messages: - 8345368: java/io/File/createTempFile/SpecialTempFile.java fails on Windows Server 2025 - 8346671: java/nio/file/Files/probeContentType/Basic.java fails on Windows 2025 - 8349603: [21u, 17u, 11u] Update GHA JDKs after Jan/25 updates - 8348562: ZGC: segmentation fault due to missing node type check in barrier elision analysis - 8340411: open source several 2D imaging tests - 8330702: Update failure handler to don't generate Error message if cores actions are empty - 8334371: [AIX] Beginning with AIX 7.3 TL1 mmap() supports 64K memory pages - 8347911: Limit the length of inflated text chunks - 8338571: [TestBug] DefaultCloseOperation.java test not working as expected wrt instruction after JDK-8325851 fix - 8343491: javax/management/remote/mandatory/connection/DeadLockTest.java failing with NoSuchObjectException: no such object in table - ... and 7 more: https://git.openjdk.org/shenandoah-jdk21u/compare/d2cbada0...3e556491 The merge commit only contains trivial merges, so no merge-specific webrevs have been generated. Changes: https://git.openjdk.org/shenandoah-jdk21u/pull/155/files Stats: 1472 lines in 28 files changed: 1150 ins; 229 del; 93 mod Patch: https://git.openjdk.org/shenandoah-jdk21u/pull/155.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk21u.git pull/155/head:pull/155 PR: https://git.openjdk.org/shenandoah-jdk21u/pull/155 From wkemper at openjdk.org Thu Feb 13 16:37:21 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 13 Feb 2025 16:37:21 GMT Subject: Integrated: 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) In-Reply-To: References: Message-ID: On Thu, 13 Feb 2025 00:15:43 GMT, William Kemper wrote: > Restore weak roots rendezvous handshake. This is necessary to have mutators complete the LRB before the concurrent GC invalidates any oop handles that may exist in native stacks. This pull request has now been integrated. Changeset: 28e744dc Author: William Kemper URL: https://git.openjdk.org/jdk/commit/28e744dc642db8ebe376403f28630438a5ee3f44 Stats: 19 lines in 1 file changed: 14 ins; 0 del; 5 mod 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) Reviewed-by: shade ------------- PR: https://git.openjdk.org/jdk/pull/23604 From andrew at openjdk.org Thu Feb 13 16:46:47 2025 From: andrew at openjdk.org (Andrew John Hughes) Date: Thu, 13 Feb 2025 16:46:47 GMT Subject: RFR: Merge jdk8u:master Message-ID: Merge jdk8u332-b06 ------------- Commit messages: - Merge jdk8u332-b06 - 8277488: Add expiry exception for Digicert (geotrustglobalca) expiring in May 2022 - Added tag jdk8u332-b05 for changeset 2a92df021686 The merge commit only contains trivial merges, so no merge-specific webrevs have been generated. Changes: https://git.openjdk.org/shenandoah-jdk8u/pull/11/files Stats: 4 lines in 2 files changed: 3 ins; 0 del; 1 mod Patch: https://git.openjdk.org/shenandoah-jdk8u/pull/11.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk8u.git pull/11/head:pull/11 PR: https://git.openjdk.org/shenandoah-jdk8u/pull/11 From wkemper at openjdk.org Thu Feb 13 16:59:08 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 13 Feb 2025 16:59:08 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v10] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). William Kemper has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 28 additional commits since the last revision: - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads - Improve message for assertion - Make shutdown safer for threads requesting (or expecting) gc - Do not accept requests if control thread is terminating - Notify waiters when control thread terminates - Add event for control thread state changes - Fix shutdown livelock error - Fix includes - Simplify locking protocol - Make shutdown more robust, make better use of request lock - ... and 18 more: https://git.openjdk.org/jdk/compare/06ea83a4...51d09207 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/779492c6..51d09207 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=09 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=08-09 Stats: 12604 lines in 600 files changed: 8568 ins; 1551 del; 2485 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From andrew at openjdk.org Thu Feb 13 17:27:25 2025 From: andrew at openjdk.org (Andrew John Hughes) Date: Thu, 13 Feb 2025 17:27:25 GMT Subject: RFR: Merge jdk8u:master In-Reply-To: References: Message-ID: On Thu, 13 Feb 2025 16:42:13 GMT, Andrew John Hughes wrote: > Merge jdk8u332-b06 GHA builds will not work until [JDK-8284622](https://bugs.openjdk.org/browse/JDK-8284622) is merged in 8u362-b03 ------------- PR Comment: https://git.openjdk.org/shenandoah-jdk8u/pull/11#issuecomment-2657279534 From andrew at openjdk.org Thu Feb 13 22:12:58 2025 From: andrew at openjdk.org (Andrew John Hughes) Date: Thu, 13 Feb 2025 22:12:58 GMT Subject: git: openjdk/shenandoah-jdk8u: master: 3 new changesets Message-ID: Changeset: 0183285d Branch: master Author: Andrew John Hughes Date: 2022-03-21 20:56:59 +0000 URL: https://git.openjdk.org/shenandoah-jdk8u/commit/0183285d00171de3cfb6791ee5eabe9127932a43 Added tag jdk8u332-b05 for changeset 2a92df021686 ! .hgtags Changeset: 9a303aef Branch: master Author: Andrew John Hughes Date: 2022-03-24 03:03:42 +0000 URL: https://git.openjdk.org/shenandoah-jdk8u/commit/9a303aef21f8db21cf6acc9dc91b6ca33819eb01 8277488: Add expiry exception for Digicert (geotrustglobalca) expiring in May 2022 Reviewed-by: weijun, sgehwolf ! jdk/test/sun/security/lib/cacerts/VerifyCACerts.java Changeset: 5ca34513 Branch: master Author: Andrew John Hughes Date: 2025-02-12 20:18:34 +0000 URL: https://git.openjdk.org/shenandoah-jdk8u/commit/5ca34513542f68d6adfe670e500e294b636fc6c2 Merge jdk8u332-b06 From andrew at openjdk.org Thu Feb 13 22:13:07 2025 From: andrew at openjdk.org (Andrew John Hughes) Date: Thu, 13 Feb 2025 22:13:07 GMT Subject: git: openjdk/shenandoah-jdk8u: Added tag jdk8u332-b06 for changeset 9a303aef Message-ID: <2bbee72d-097f-4ec2-b7f3-50e029a05129@openjdk.org> Tagged by: Andrew John Hughes Date: 2022-03-29 03:33:24 +0000 Changeset: 9a303aef Author: Andrew John Hughes Date: 2022-03-24 03:03:42 +0000 URL: https://git.openjdk.org/shenandoah-jdk8u/commit/9a303aef21f8db21cf6acc9dc91b6ca33819eb01 From andrew at openjdk.org Thu Feb 13 22:13:10 2025 From: andrew at openjdk.org (Andrew John Hughes) Date: Thu, 13 Feb 2025 22:13:10 GMT Subject: git: openjdk/shenandoah-jdk8u: Added tag shenandoah8u332-b06 for changeset 5ca34513 Message-ID: <9e464909-ca3e-47f2-b0c0-31f8f0129991@openjdk.org> Tagged by: Andrew John Hughes Date: 2025-02-13 16:39:09 +0000 Added tag shenandoah8u332-b06 for changeset 5ca34513542 Changeset: 5ca34513 Author: Andrew John Hughes Date: 2025-02-12 20:18:34 +0000 URL: https://git.openjdk.org/shenandoah-jdk8u/commit/5ca34513542f68d6adfe670e500e294b636fc6c2 From andrew at openjdk.org Thu Feb 13 22:14:51 2025 From: andrew at openjdk.org (Andrew John Hughes) Date: Thu, 13 Feb 2025 22:14:51 GMT Subject: RFR: Merge jdk8u:master [v2] In-Reply-To: References: Message-ID: <_v1DLgUyUWw1kzgJFpZtHoaMPUeOX6WMKuin7iLlz5Q=.3540a226-d66f-4cf9-9ee7-33bd46cec962@github.com> > Merge jdk8u332-b06 Andrew John Hughes has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. ------------- Changes: - all: https://git.openjdk.org/shenandoah-jdk8u/pull/11/files - new: https://git.openjdk.org/shenandoah-jdk8u/pull/11/files/5ca34513..5ca34513 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah-jdk8u&pr=11&range=01 - incr: https://webrevs.openjdk.org/?repo=shenandoah-jdk8u&pr=11&range=00-01 Stats: 0 lines in 0 files changed: 0 ins; 0 del; 0 mod Patch: https://git.openjdk.org/shenandoah-jdk8u/pull/11.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk8u.git pull/11/head:pull/11 PR: https://git.openjdk.org/shenandoah-jdk8u/pull/11 From iris at openjdk.org Thu Feb 13 22:14:51 2025 From: iris at openjdk.org (Iris Clark) Date: Thu, 13 Feb 2025 22:14:51 GMT Subject: Withdrawn: Merge jdk8u:master In-Reply-To: References: Message-ID: On Thu, 13 Feb 2025 16:42:13 GMT, Andrew John Hughes wrote: > Merge jdk8u332-b06 This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.org/shenandoah-jdk8u/pull/11 From wkemper at openjdk.org Thu Feb 13 22:39:08 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 13 Feb 2025 22:39:08 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc In-Reply-To: References: Message-ID: On Tue, 11 Feb 2025 03:31:51 GMT, Kelvin Nilsen wrote: > In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. Looks good to me. ------------- Marked as reviewed by wkemper (Committer). PR Review: https://git.openjdk.org/jdk/pull/23552#pullrequestreview-2616371490 From wkemper at openjdk.org Thu Feb 13 22:40:00 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 13 Feb 2025 22:40:00 GMT Subject: RFR: Merge openjdk/jdk21u:master [v2] In-Reply-To: References: Message-ID: <6xL6YwMmQPLuc9PSWMJ5qMgtcBOQzAzbDozzrvBqhf8=.772326dd-2742-4aef-bb20-a7496d354811@github.com> > Merges tag jdk-21.0.7+2 William Kemper has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. ------------- Changes: - all: https://git.openjdk.org/shenandoah-jdk21u/pull/155/files - new: https://git.openjdk.org/shenandoah-jdk21u/pull/155/files/3e556491..3e556491 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=155&range=01 - incr: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=155&range=00-01 Stats: 0 lines in 0 files changed: 0 ins; 0 del; 0 mod Patch: https://git.openjdk.org/shenandoah-jdk21u/pull/155.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk21u.git pull/155/head:pull/155 PR: https://git.openjdk.org/shenandoah-jdk21u/pull/155 From wkemper at openjdk.org Thu Feb 13 22:40:02 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 13 Feb 2025 22:40:02 GMT Subject: Integrated: Merge openjdk/jdk21u:master In-Reply-To: References: Message-ID: On Thu, 13 Feb 2025 14:19:54 GMT, William Kemper wrote: > Merges tag jdk-21.0.7+2 This pull request has now been integrated. Changeset: 1fc48d59 Author: William Kemper URL: https://git.openjdk.org/shenandoah-jdk21u/commit/1fc48d598e1a7c3995dceaf0556d03ba7e313c34 Stats: 1472 lines in 28 files changed: 1150 ins; 229 del; 93 mod Merge ------------- PR: https://git.openjdk.org/shenandoah-jdk21u/pull/155 From kdnilsen at openjdk.org Thu Feb 13 23:27:48 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 13 Feb 2025 23:27:48 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc [v2] In-Reply-To: References: Message-ID: > In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision: - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) Reviewed-by: shade - Merge tag 'jdk-25+10' into defer-generational-full-gc Added tag jdk-25+10 for changeset a637ccf2 - Be less eager to upgrade degen to full gc ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23552/files - new: https://git.openjdk.org/jdk/pull/23552/files/17e5e919..8d662e10 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23552&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23552&range=00-01 Stats: 9730 lines in 593 files changed: 6779 ins; 1405 del; 1546 mod Patch: https://git.openjdk.org/jdk/pull/23552.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23552/head:pull/23552 PR: https://git.openjdk.org/jdk/pull/23552 From kdnilsen at openjdk.org Fri Feb 14 01:18:01 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 14 Feb 2025 01:18:01 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v5] In-Reply-To: References: Message-ID: > At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. > > For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. > > This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains seven additional commits since the last revision: - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) Reviewed-by: shade - Merge tag 'jdk-25+10' into fix-generational-no-progress-check Added tag jdk-25+10 for changeset a637ccf2 - Merge tag 'jdk-25+9' into fix-generational-no-progress-check Added tag jdk-25+9 for changeset 30f71622 - Add comments suggested by reviewers - Respond to reviewer feedback In testing suggested refinements, I discovered a bug in original implementation. ShenandoahFreeSet::capacity() does not represent the size of young generation. It represents the total size of the young regions that had available memory at the time we most recently rebuilt the ShenandoahFreeSet. I am rerunning the performance tests following this suggested change. - Use freeset to determine goodness of progress As previously implemented, we used the heap size to measure goodness of progress. However, heap size is only appropriate for non-generational Shenandoah. Freeset abstraction works for both. - Use size-of young generation to assess progress Previously, we were using size of heap to asses progress of generational degenerated cycle. But that is not appropriate, because the collection set is chosen based on the size of young generation. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23306/files - new: https://git.openjdk.org/jdk/pull/23306/files/8c610136..0e86c5bd Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23306&range=03-04 Stats: 12378 lines in 689 files changed: 8313 ins; 1890 del; 2175 mod Patch: https://git.openjdk.org/jdk/pull/23306.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23306/head:pull/23306 PR: https://git.openjdk.org/jdk/pull/23306 From kdnilsen at openjdk.org Fri Feb 14 01:35:51 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 14 Feb 2025 01:35:51 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v6] In-Reply-To: References: Message-ID: > Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. > > We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. > > As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catchup. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. > > This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 11 additional commits since the last revision: - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) Reviewed-by: shade - Merge tag 'jdk-25+10' into eliminate-no-fault-degen-penalties Added tag jdk-25+10 for changeset a637ccf2 - Merge tag 'jdk-25+9' into eliminate-no-fault-degen-penalties Added tag jdk-25+9 for changeset 30f71622 - Revert "Use generation size to determine expected free" This reverts commit 94a32ebfe5fefcc0e899e09e6fbfc0585c62b4e0. - Respond to reviewer feedback - Use generation size to determine expected free - Respond to reviewer feedback - Fix white space - Remove debug instrumentation - Only penalize heuristic if heuristic responsible If we degenerate through no fault of "late triggering", then do not penalize the heuristic. - ... and 1 more: https://git.openjdk.org/jdk/compare/961a87d9...0d85e341 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23305/files - new: https://git.openjdk.org/jdk/pull/23305/files/3aabd4db..0d85e341 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=05 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23305&range=04-05 Stats: 12378 lines in 689 files changed: 8313 ins; 1890 del; 2175 mod Patch: https://git.openjdk.org/jdk/pull/23305.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23305/head:pull/23305 PR: https://git.openjdk.org/jdk/pull/23305 From wkemper at openjdk.org Fri Feb 14 01:48:58 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 14 Feb 2025 01:48:58 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v11] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). > > ## Testing > * jtreg hotspot_gc_shenandoah > * dacapo, extremem, diluvian, specjbb2015, specjvm2018, heapothesys William Kemper has updated the pull request incrementally with one additional commit since the last revision: Old gen bootstrap cycle must make it to init mark ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23475/files - new: https://git.openjdk.org/jdk/pull/23475/files/51d09207..82f96090 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=10 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=09-10 Stats: 5 lines in 1 file changed: 0 ins; 5 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From xpeng at openjdk.org Fri Feb 14 06:58:37 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Fri, 14 Feb 2025 06:58:37 GMT Subject: RFR: 8338737: Shenandoah: Reset marking bitmaps after the cycle [v8] In-Reply-To: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com> References: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com> Message-ID: > Reset marking bitmaps after collection cycle; for GenShen only do this for young generation, also choose not do this for Degen and full GC since both are running at safepoint, we should leave safepoint as ASAP. > > I have run same workload for 30s with Shenandoah in generational mode and classic mode, average average time of concurrent reset dropped significantly since in most case bitmap for young gen should have been reset after pervious concurrent cycle finishes if there is no need to preserve bitmap states. > > GenShen: > Before: > > [33.342s][info][gc,stats ] Concurrent Reset = 0.023 s (a = 1921 us) (n = 12) (lvls, us = 133, 385, 1191, 1836, 8878) > > > After: > > [33.597s][info][gc,stats ] Concurrent Reset = 0.004 s (a = 317 us) (n = 13) (lvls, us = 58, 119, 217, 410, 670) > [33.597s][info][gc,stats ] Concurrent Reset After Collect = 0.018 s (a = 1365 us) (n = 13) (lvls, us = 91, 186, 818, 1836, 3872) > > > Shenandoah: > Before: > > [33.144s][info][gc,stats ] Concurrent Reset = 0.014 s (a = 1067 us) (n = 13) (lvls, us = 139, 277, 898, 1328, 2118) > > After: > > [33.128s][info][gc,stats ] Concurrent Reset = 0.003 s (a = 225 us) (n = 13) (lvls, us = 32, 92, 137, 295, 542) > [33.128s][info][gc,stats ] Concurrent Reset After Collect = 0.009 s (a = 661 us) (n = 13) (lvls, us = 92, 160, 594, 896, 1661) > > > Additional changes: > * Remove `ShenandoahResetBitmapClosure` and `ShenandoahPrepareForMarkClosure`, merge the code with `ShenandoahResetBitmapClosure`, saving one iteration over all the regions. > * Use API `ShenandoahGeneration::parallel_heap_region_iterate_free` to iterate the regions, two benefits from this: > - Underneath it calls `ShenandoahHeap::parallel_heap_region_iterate`, which is faster for very light tasks, see https://bugs.openjdk.org/browse/JDK-8337154 > - `ShenandoahGeneration::parallel_heap_region_iterate_free` decorate the closure with `ShenandoahExcludeRegionClosure`, which simplifies the code in closure. > * When `_do_old_gc_bootstrap is true`, instead of reset mark bitmap for old gen separately, simply reset the global generations, so we don't need walk the all regions twice. > * Clean up FullGC code, remove duplicate code. > > Additional tests: > - [x] CONF=macosx-aarch64-server-fastdebug make test T... Xiaolong Peng has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 24 additional commits since the last revision: - Merge branch 'openjdk:master' into reset-bitmap - Merge branch 'openjdk:master' into reset-bitmap - Merge branch 'openjdk:master' into reset-bitmap - Merge branch 'openjdk:master' into reset-bitmap - Adding condition "!_do_old_gc_bootstrap && !heap->is_concurrent_old_mark_in_progress()" back and address some PR comments - Remove entry_reset_after_collect from ShenandoahOldGC - Remove condition check !_do_old_gc_bootstrap && !heap->is_concurrent_old_mark_in_progress() from op_reset_after_collect - Merge branch 'openjdk:master' into reset-bitmap - Address review comments - Merge branch 'openjdk:master' into reset-bitmap - ... and 14 more: https://git.openjdk.org/jdk/compare/a90afca6...c7e9bff3 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/22778/files - new: https://git.openjdk.org/jdk/pull/22778/files/92c63159..c7e9bff3 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=22778&range=07 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=22778&range=06-07 Stats: 66728 lines in 3845 files changed: 34690 ins; 16712 del; 15326 mod Patch: https://git.openjdk.org/jdk/pull/22778.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22778/head:pull/22778 PR: https://git.openjdk.org/jdk/pull/22778 From phh at openjdk.org Fri Feb 14 10:41:13 2025 From: phh at openjdk.org (Paul Hohensee) Date: Fri, 14 Feb 2025 10:41:13 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v6] In-Reply-To: References:

Message-ID: On Fri, 14 Feb 2025 01:18:01 GMT, Kelvin Nilsen wrote: >> At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. >> >> For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. >> >> This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. > > Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains seven additional commits since the last revision: > > - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) > > Reviewed-by: shade > - Merge tag 'jdk-25+10' into fix-generational-no-progress-check > > Added tag jdk-25+10 for changeset a637ccf2 > - Merge tag 'jdk-25+9' into fix-generational-no-progress-check > > Added tag jdk-25+9 for changeset 30f71622 > - Add comments suggested by reviewers > - Respond to reviewer feedback > > In testing suggested refinements, I discovered a bug in original > implementation. ShenandoahFreeSet::capacity() does not represent the > size of young generation. It represents the total size of the young > regions that had available memory at the time we most recently rebuilt > the ShenandoahFreeSet. > > I am rerunning the performance tests following this suggested change. > - Use freeset to determine goodness of progress > > As previously implemented, we used the heap size to measure goodness of > progress. However, heap size is only appropriate for non-generational > Shenandoah. Freeset abstraction works for both. > - Use size-of young generation to assess progress > > Previously, we were using size of heap to asses progress of generational > degenerated cycle. But that is not appropriate, because the collection > set is chosen based on the size of young generation. Marked as reviewed by phh (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23306#pullrequestreview-2617414395 From duke at openjdk.org Fri Feb 14 15:14:15 2025 From: duke at openjdk.org (duke) Date: Fri, 14 Feb 2025 15:14:15 GMT Subject: RFR: 8348595: GenShen: Fix generational free-memory no-progress check [v5] In-Reply-To: References:

Message-ID: On Fri, 14 Feb 2025 01:18:01 GMT, Kelvin Nilsen wrote: >> At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. >> >> For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. >> >> This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. > > Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains seven additional commits since the last revision: > > - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) > > Reviewed-by: shade > - Merge tag 'jdk-25+10' into fix-generational-no-progress-check > > Added tag jdk-25+10 for changeset a637ccf2 > - Merge tag 'jdk-25+9' into fix-generational-no-progress-check > > Added tag jdk-25+9 for changeset 30f71622 > - Add comments suggested by reviewers > - Respond to reviewer feedback > > In testing suggested refinements, I discovered a bug in original > implementation. ShenandoahFreeSet::capacity() does not represent the > size of young generation. It represents the total size of the young > regions that had available memory at the time we most recently rebuilt > the ShenandoahFreeSet. > > I am rerunning the performance tests following this suggested change. > - Use freeset to determine goodness of progress > > As previously implemented, we used the heap size to measure goodness of > progress. However, heap size is only appropriate for non-generational > Shenandoah. Freeset abstraction works for both. > - Use size-of young generation to assess progress > > Previously, we were using size of heap to asses progress of generational > degenerated cycle. But that is not appropriate, because the collection > set is chosen based on the size of young generation. @kdnilsen Your change (at version 0e86c5bd1ae330522daa9652f7843342fef9f83e) is now ready to be sponsored by a Committer. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23306#issuecomment-2659585115 From duke at openjdk.org Fri Feb 14 15:16:25 2025 From: duke at openjdk.org (duke) Date: Fri, 14 Feb 2025 15:16:25 GMT Subject: RFR: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic [v6] In-Reply-To: References:

Message-ID: On Fri, 14 Feb 2025 01:35:51 GMT, Kelvin Nilsen wrote: >> Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. >> >> We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. >> >> As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catchup. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. >> >> This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. > > Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 11 additional commits since the last revision: > > - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) > > Reviewed-by: shade > - Merge tag 'jdk-25+10' into eliminate-no-fault-degen-penalties > > Added tag jdk-25+10 for changeset a637ccf2 > - Merge tag 'jdk-25+9' into eliminate-no-fault-degen-penalties > > Added tag jdk-25+9 for changeset 30f71622 > - Revert "Use generation size to determine expected free" > > This reverts commit 94a32ebfe5fefcc0e899e09e6fbfc0585c62b4e0. > - Respond to reviewer feedback > - Use generation size to determine expected free > - Respond to reviewer feedback > - Fix white space > - Remove debug instrumentation > - Only penalize heuristic if heuristic responsible > > If we degenerate through no fault of "late triggering", then do not > penalize the heuristic. > - ... and 1 more: https://git.openjdk.org/jdk/compare/a1bdb2da...0d85e341 @kdnilsen Your change (at version 0d85e34107d74e471a791e0523cabc403e02178c) is now ready to be sponsored by a Committer. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23305#issuecomment-2659590224 From kdnilsen at openjdk.org Fri Feb 14 16:43:17 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 14 Feb 2025 16:43:17 GMT Subject: Integrated: 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic In-Reply-To: References: Message-ID: On Fri, 24 Jan 2025 18:18:25 GMT, Kelvin Nilsen wrote: > Shenandoah heuristics use a penalty mechanism to cause earlier GC triggers when recent concurrent GC cycles degenerate. Degeneration is a stop-the-world remediation that allows GC to catch up when mutator allocations fail during concurrent GC. The fact that we needed to degenerate indicates that we were overly optimistic in delaying the trigger that starts concurrent GC. > > We have observed that it is common for degenerated GC cycles to cascade upon each other. The condition that caused an initial degenerated cycle is often not fully resolved by the end of that degenerated cycle. For example, the application may be experiencing a phase change and the GC heuristics are not yet attuned to the new behavior. Furthermore, a degenerated GC may exacerbate the problem condition. During the stop-the-world pause imposed by the first degenerated GC, work continues to accumulate in the form of new client requests that are buffered in network sockets until the end of that degenerated GC. > > As originally implemented, each degeneration would "pile on" additional penalties. These penalties cause the GC frequency to continue to increase. And the expanding CPU load of GC makes it increasingly difficult for mutator threads to catchup. The large penalties accumulated while we are trying to resolve the problem linger long after the problem condition has been resolved. > > This change does not add further to the degeneration penalties if a new degenerated cycle occurs through no fault of the triggering mechanism. We only add the degeneration penalty if the reason we are now degenerating can be attributed to a consciously late trigger by the heuristic. This pull request has now been integrated. Changeset: 38322407 Author: Kelvin Nilsen URL: https://git.openjdk.org/jdk/commit/38322407cd1664115e975c7fd9cb61e40d9557b5 Stats: 82 lines in 12 files changed: 78 ins; 0 del; 4 mod 8348594: Shenandoah: Do not penalize for degeneration when not the fault of triggering heuristic Reviewed-by: phh, wkemper ------------- PR: https://git.openjdk.org/jdk/pull/23305 From kdnilsen at openjdk.org Fri Feb 14 16:44:16 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 14 Feb 2025 16:44:16 GMT Subject: Integrated: 8348595: GenShen: Fix generational free-memory no-progress check In-Reply-To: References: Message-ID: On Fri, 24 Jan 2025 18:30:02 GMT, Kelvin Nilsen wrote: > At the end of a degenerated GC, we check whether sufficient progress has been made in replenishing the memory available to the mutator. The test for good progress is implemented as a ratio of free memory against the total heap size. > > For generational Shenandoah, the ratio should be computed against the size of the young generation. Note that the size of the generational collection set is based on young generation size rather than total heap size. > > This issue first identified in GenShen GC logs, where a large number of degenerated cycles were upgrading to full GC because the free-set progress was short of desired by 10-25%. This pull request has now been integrated. Changeset: ba6c9659 Author: Kelvin Nilsen URL: https://git.openjdk.org/jdk/commit/ba6c96599aac1a6c08cb66c611474f83bbc9b260 Stats: 27 lines in 5 files changed: 21 ins; 0 del; 6 mod 8348595: GenShen: Fix generational free-memory no-progress check Reviewed-by: phh, xpeng ------------- PR: https://git.openjdk.org/jdk/pull/23306 From wkemper at openjdk.org Fri Feb 14 17:43:48 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 14 Feb 2025 17:43:48 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References: Message-ID: > There are several changes to the operation of Shenandoah's control threads here. > * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. > * The cancellation handling is driven entirely by the cancellation cause > * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed > * The shutdown sequence is simpler > * The generational control thread uses a lock to coordinate updates to the requested cause and generation > * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance > * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles > * The control thread doesn't loop on its own (unless the pacer is enabled). > > ## Testing > * jtreg hotspot_gc_shenandoah > * dacapo, extremem, diluvian, specjbb2015, specjvm2018, heapothesys William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 30 commits: - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads - Old gen bootstrap cycle must make it to init mark - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads - Improve message for assertion - Make shutdown safer for threads requesting (or expecting) gc - Do not accept requests if control thread is terminating - Notify waiters when control thread terminates - Add event for control thread state changes - Fix shutdown livelock error - Fix includes - ... and 20 more: https://git.openjdk.org/jdk/compare/ba6c9659...915ffbda ------------- Changes: https://git.openjdk.org/jdk/pull/23475/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23475&range=11 Stats: 892 lines in 18 files changed: 285 ins; 281 del; 326 mod Patch: https://git.openjdk.org/jdk/pull/23475.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23475/head:pull/23475 PR: https://git.openjdk.org/jdk/pull/23475 From kdnilsen at openjdk.org Fri Feb 14 18:37:54 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 14 Feb 2025 18:37:54 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc [v3] In-Reply-To: References: Message-ID: > In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. Kelvin Nilsen has updated the pull request incrementally with two additional commits since the last revision: - Fix typo in merge conflict resolution - 8348595: GenShen: Fix generational free-memory no-progress check Reviewed-by: phh, xpeng ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23552/files - new: https://git.openjdk.org/jdk/pull/23552/files/8d662e10..0f5051a1 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23552&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23552&range=01-02 Stats: 27 lines in 5 files changed: 21 ins; 0 del; 6 mod Patch: https://git.openjdk.org/jdk/pull/23552.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23552/head:pull/23552 PR: https://git.openjdk.org/jdk/pull/23552 From kdnilsen at openjdk.org Fri Feb 14 18:51:31 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 14 Feb 2025 18:51:31 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc [v4] In-Reply-To: References: Message-ID: > In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains six commits: - Merge master - Fix typo in merge conflict resolution - 8348595: GenShen: Fix generational free-memory no-progress check Reviewed-by: phh, xpeng - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) Reviewed-by: shade - Merge tag 'jdk-25+10' into defer-generational-full-gc Added tag jdk-25+10 for changeset a637ccf2 - Be less eager to upgrade degen to full gc ------------- Changes: https://git.openjdk.org/jdk/pull/23552/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23552&range=03 Stats: 20 lines in 2 files changed: 17 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/23552.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23552/head:pull/23552 PR: https://git.openjdk.org/jdk/pull/23552 From kdnilsen at openjdk.org Fri Feb 14 19:41:16 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 14 Feb 2025 19:41:16 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References:

Message-ID: On Fri, 14 Feb 2025 17:43:48 GMT, William Kemper wrote: >> There are several changes to the operation of Shenandoah's control threads here. >> * The reason for cancellation is now recorded in `ShenandoahHeap::_cancelled_gc` as a `GCCause`, instead of various member variables in the control thread. >> * The cancellation handling is driven entirely by the cancellation cause >> * The graceful shutdown, alloc failure, humongous alloc failure and preemption requested flags are all removed >> * The shutdown sequence is simpler >> * The generational control thread uses a lock to coordinate updates to the requested cause and generation >> * APIs have been simplified to avoid converting between the generation `type` and the actual generation instance >> * The old heuristic, rather than the control thread itself, is now responsible for resuming old generation cycles >> * The control thread doesn't loop on its own (unless the pacer is enabled). >> >> ## Testing >> * jtreg hotspot_gc_shenandoah >> * dacapo, extremem, diluvian, specjbb2015, specjvm2018, heapothesys > > William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 30 commits: > > - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads > - Old gen bootstrap cycle must make it to init mark > - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads > - Improve message for assertion > - Make shutdown safer for threads requesting (or expecting) gc > - Do not accept requests if control thread is terminating > - Notify waiters when control thread terminates > - Add event for control thread state changes > - Fix shutdown livelock error > - Fix includes > - ... and 20 more: https://git.openjdk.org/jdk/compare/ba6c9659...915ffbda Flushing the comments at EOD; will complete review later. src/hotspot/share/gc/shenandoah/heuristics/shenandoahOldHeuristics.hpp line 188: > 186: > 187: bool should_start_gc() override; > 188: bool resume_old_cycle(); Documentation comment please, especially explaining the return value. For things that may return `false` and not do anything, it's better to use `try_` prefix. In fact, the method doesn't actually resume the cycle, but checks if we are in a state such that we should resume it. So, I'd name it `should_resume_old_cycle()`, consistent with the name `should_start_gc()` for the previous method. src/hotspot/share/gc/shenandoah/shenandoahCollectorPolicy.hpp line 101: > 99: || cause == GCCause::_shenandoah_allocation_failure_evac > 100: || cause == GCCause::_shenandoah_humongous_allocation_failure; > 101: } Would it make sense to move this implementation also to the .cpp file like the other static `is_...` methods below? src/hotspot/share/gc/shenandoah/shenandoahController.hpp line 42: > 40: volatile size_t _allocs_seen; > 41: shenandoah_padding(1); > 42: volatile size_t _gc_id; // A monotonically increasing GC count. src/hotspot/share/gc/shenandoah/shenandoahController.hpp line 66: > 64: > 65: // This cancels the collection cycle and has an option to block > 66: // until another cycle runs and clears the alloc failure gc flag. But "the alloc failure gc flag" is gone above. The comment should be updated as well. A public API's description should avoid talking about its internal implementation details here. It's OK to talk about implementation details in the implementation of the method, not in the header spec here. src/hotspot/share/gc/shenandoah/shenandoahController.hpp line 87: > 85: // Returns the internal gc count used by the control thread. Probably > 86: // doesn't need to be exposed. > 87: size_t get_gc_id(); As far as I can tell, there's a single non-internal (public) use of this, and it's from `ShenandoahOldGeneration::handle_failed_promotion()` where it's being used for reducing logging data. If we do need to expose this through a public API, I'd elide the "Probably doesn't need to be exposed", and update the comment to: // Return the value of a monotonic increasing GC count, maintained by the control thread. src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.cpp line 64: > 62: void ShenandoahGenerationalControlThread::run_service() { > 63: > 64: const int64_t wait_ms = ShenandoahPacing ? ShenandoahControlIntervalMin : 0; So we are supporting ShenandoahPacing with GenShenm(at least till we pull it in the future), but don't want to enable it by default, correct? src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.hpp line 64: > 62: private: > 63: // This lock is used to coordinate setting the _requested_gc_cause, _requested generation > 64: // and _gc_mode. It is important that these be changed together and have a consistent view. In that case, for ease of maintenance, I'd move the declaration of all of the 3 data members that this lock protects next to this lock, either immediately preceding or immediately succeeding its declaration in the body of this class. Are these data members always both read and written under this lock? If so, then `_gc_mode` below doesn't need to be defined `volatile`. src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.hpp line 88: > 86: uint _age_period; > 87: > 88: // The mode is read frequently by requesting threads and only ever written by the control thread. Do requesting threads lock the mutex when reading? I am trying to square yr comment that it's protected by the mutex with the field being declared `volatile`. src/hotspot/share/gc/shenandoah/shenandoahHeap.hpp line 450: > 448: > 449: void cancel_concurrent_mark(); > 450: bool cancel_gc(GCCause::Cause cause); // Returns true if and only if cancellation request was successfully communicated. ------------- PR Review: https://git.openjdk.org/jdk/pull/23475#pullrequestreview-2618968208 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956962731 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956965579 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956944585 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956918529 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956929734 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956981955 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956816268 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956824150 PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956952381 From ysr at openjdk.org Sat Feb 15 01:55:21 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Sat, 15 Feb 2025 01:55:21 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References:

Message-ID: On Sat, 15 Feb 2025 01:10:51 GMT, Y. Srinivas Ramakrishna wrote: >> William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 30 commits: >> >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Old gen bootstrap cycle must make it to init mark >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Improve message for assertion >> - Make shutdown safer for threads requesting (or expecting) gc >> - Do not accept requests if control thread is terminating >> - Notify waiters when control thread terminates >> - Add event for control thread state changes >> - Fix shutdown livelock error >> - Fix includes >> - ... and 20 more: https://git.openjdk.org/jdk/compare/ba6c9659...915ffbda > > src/hotspot/share/gc/shenandoah/shenandoahCollectorPolicy.hpp line 101: > >> 99: || cause == GCCause::_shenandoah_allocation_failure_evac >> 100: || cause == GCCause::_shenandoah_humongous_allocation_failure; >> 101: } > > Would it make sense to move this implementation also to the .cpp file like the other static `is_...` methods below? Or is this guaranteeing inlining into the caller's body, which you might prefer for the callers? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956968182 From ysr at openjdk.org Sat Feb 15 01:55:22 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Sat, 15 Feb 2025 01:55:22 GMT Subject: RFR: 8349094: GenShen: Race between control and regulator threads may violate assertions [v12] In-Reply-To: References:

Message-ID: On Fri, 14 Feb 2025 19:28:01 GMT, Kelvin Nilsen wrote: >> William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 30 commits: >> >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Old gen bootstrap cycle must make it to init mark >> - Merge remote-tracking branch 'jdk/master' into fix-control-regulator-threads >> - Improve message for assertion >> - Make shutdown safer for threads requesting (or expecting) gc >> - Do not accept requests if control thread is terminating >> - Notify waiters when control thread terminates >> - Add event for control thread state changes >> - Fix shutdown livelock error >> - Fix includes >> - ... and 20 more: https://git.openjdk.org/jdk/compare/ba6c9659...915ffbda > > src/hotspot/share/gc/shenandoah/shenandoahGenerationalControlThread.cpp line 98: > >> 96: } >> 97: >> 98: // In case any threads are waiting for a cycle to happen, let them know it isn't. > > maybe "it isn't happening", or "it won't happen". This is interesting. If GC is stopping prior to shutting down the VM, is there any point in notifying these waiting threads. Why not let them wait, and quietly shut things down? Are there JCK or other tests that would fail in that case? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23475#discussion_r1956979069 From ssubramaniam at openjdk.org Tue Feb 18 19:27:02 2025 From: ssubramaniam at openjdk.org (Satyen Subramaniam) Date: Tue, 18 Feb 2025 19:27:02 GMT Subject: Withdrawn: 8348610: GenShen: TestShenandoahEvacuationInformationEvent failed with setRegions >= regionsFreed: expected 1 >= 57 In-Reply-To: References: Message-ID: On Thu, 30 Jan 2025 00:07:28 GMT, Satyen Subramaniam wrote: > Renaming `ShenandoahEvacuationInformation.freedRegions` to `ShenandoahEvacuationInformation.freeRegions` for clarity, and fixing incorrect assertion in TestShenandoahEvacuationInformationEvent.cpp > > Tested with tier 1, tier 2, and tier 3 tests. This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.org/shenandoah/pull/558 From kdnilsen at openjdk.org Tue Feb 18 19:28:28 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 18 Feb 2025 19:28:28 GMT Subject: RFR: 8349766: GenShen: Bad progress after degen does not always need full gc [v5] In-Reply-To: References: Message-ID: > In generational mode, only upgrade to full GC from degenerated GC if we've done two degenerated cycles in a row and both indicated bad progress. Otherwise, start another concurrent GC, which will most likely degenerate also. But this degenerated cycle will reclaim floating garbage within the young generation much more quickly than a full GC would have done. Kelvin Nilsen has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains seven commits: - Merge branch 'master' of https://git.openjdk.org/jdk into defer-generational-full-gc - Merge master - Fix typo in merge conflict resolution - 8348595: GenShen: Fix generational free-memory no-progress check Reviewed-by: phh, xpeng - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) Reviewed-by: shade - Merge tag 'jdk-25+10' into defer-generational-full-gc Added tag jdk-25+10 for changeset a637ccf2 - Be less eager to upgrade degen to full gc ------------- Changes: https://git.openjdk.org/jdk/pull/23552/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23552&range=04 Stats: 20 lines in 2 files changed: 17 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/23552.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23552/head:pull/23552 PR: https://git.openjdk.org/jdk/pull/23552 From kdnilsen at openjdk.org Tue Feb 18 19:49:54 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 18 Feb 2025 19:49:54 GMT Subject: RFR: 8347804: GenShen: Crash with small GCCardSizeInBytes and small Java heap Message-ID: Original implementation was not robust to overriding of CardSizeInBytes, especially to smaller values. This fixes that issue. ------------- Commit messages: - 8348092: Shenandoah: assert(nk >= _lowest_valid_narrow_klass_id && nk <= _highest_valid_narrow_klass_id) failed: narrowKlass ID out of range (3131947710) - Merge tag 'jdk-25+10' into fix-small-card-size - Remove SIZE_FORMAT usage - Merge tag 'jdk-25+9' into fix-small-card-size - Remove debug instrumentation - Fix several bookkeeping errors - Revert "Remove debug instrumentation" - Remove debug instrumentation - Use snprintf instead of sprintf - Add a jtreg test for small card size - ... and 1 more: https://git.openjdk.org/jdk/compare/a637ccf2...7120cdf3 Changes: https://git.openjdk.org/jdk/pull/23373/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23373&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8347804 Stats: 107 lines in 6 files changed: 79 ins; 4 del; 24 mod Patch: https://git.openjdk.org/jdk/pull/23373.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23373/head:pull/23373 PR: https://git.openjdk.org/jdk/pull/23373 From kdnilsen at openjdk.org Tue Feb 18 19:49:54 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 18 Feb 2025 19:49:54 GMT Subject: RFR: 8347804: GenShen: Crash with small GCCardSizeInBytes and small Java heap In-Reply-To: References: Message-ID: On Thu, 30 Jan 2025 18:55:53 GMT, Kelvin Nilsen wrote: > Original implementation was not robust to overriding of CardSizeInBytes, especially to smaller values. This fixes that issue. Internal pipelines reveal a regression. Changing to draft mode while I chase this down. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23373#issuecomment-2625332964 From duke at openjdk.org Wed Feb 19 06:21:33 2025 From: duke at openjdk.org (sli-x) Date: Wed, 19 Feb 2025 06:21:33 GMT Subject: RFR: 8340434: Excessive Young GCs Triggered by CodeCache GC Threshold Message-ID: The trigger of _codecache_GC_threshold in CodeCache::gc_on_allocation is the key to this problem. if (used_ratio > threshold) { // After threshold is reached, scale it by free_ratio so that more aggressive // GC is triggered as we approach code cache exhaustion threshold *= free_ratio; } // If code cache has been allocated without any GC at all, let's make sure // it is eventually invoked to avoid trouble. if (allocated_since_last_ratio > threshold) { // In case the GC is concurrent, we make sure only one thread requests the GC. if (Atomic::cmpxchg(&_unloading_threshold_gc_requested, false, true) == false) { log_info(codecache)("Triggering threshold (%.3f%%) GC due to allocating %.3f%% since last unloading (%.3f%% used -> %.3f%% used)", threshold * 100.0, allocated_since_last_ratio * 100.0, last_used_ratio * 100.0, used_ratio * 100.0); Universe::heap()->collect(GCCause::_codecache_GC_threshold); } } Here with the limited codecache size, the free_ratio will get lower and lower (so as the threshold) if no methods can be swept and thus leads to a more and more frequent collection behavior. Since the collection happens in stw, the whole performance of gc will also be degraded. So a simple solution is to delete the scaling logic here. However, I think here lies some problems worth further exploring. There're two options to control a code cache sweeper, StartAggressiveSweepingAt and SweeperThreshold. StartAggressiveSweepingAt is a sweeper triggered for little space in codeCache and does little harm. However, SweeperThreshold, first introduced by [JDK-8244660](https://bugs.openjdk.org/browse/JDK-8244660), was designed for a regular sweep for codecache, when codeCache sweeper and heap collection are actually individual. After [JDK-8290025](https://bugs.openjdk.org/browse/JDK-8290025) and some patches related, the old mechanism of codeCache sweeper is merged into a concurrent heap collection. So the Code cache sweeper heuristics and the unloading behavior will be promised by the concurrent collection. There's no longer any "zombie" methods to be counted. Considering it will introduce lots of useless collection jobs, I think SweeperThreshold should be deleted now. ------------- Commit messages: - remove SweeperThreshold and set it to Obselete Changes: https://git.openjdk.org/jdk/pull/21084/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=21084&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8340434 Stats: 55 lines in 14 files changed: 1 ins; 53 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/21084.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21084/head:pull/21084 PR: https://git.openjdk.org/jdk/pull/21084 From tschatzl at openjdk.org Wed Feb 19 06:21:33 2025 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 19 Feb 2025 06:21:33 GMT Subject: RFR: 8340434: Excessive Young GCs Triggered by CodeCache GC Threshold In-Reply-To: References: Message-ID: On Thu, 19 Sep 2024 08:43:50 GMT, sli-x wrote: > Here with the limited codecache size, the free_ratio will get lower and lower (so as the threshold) if no methods can be swept and thus leads to a more and more frequent collection behavior. Since the collection happens in stw, the whole performance of gc will also be degraded. > >So a simple solution is to delete the scaling logic here. However, I think here lies some problems worth further exploring. > >There're two options to control a code cache sweeper, StartAggressiveSweepingAt and SweeperThreshold. StartAggressiveSweepingAt is a sweeper triggered for little space in codeCache and does little harm. However, SweeperThreshold, first introduced by [JDK-8244660](https://bugs.openjdk.org/browse/JDK-8244660), was designed for a regular sweep for codecache, when codeCache sweeper and heap collection are actually individual. After [JDK-8290025](https://bugs.openjdk.org/browse/JDK-8290025) and some patches related, the old mechanism of codeCache sweeper is merged into a concurrent heap collection. So the Code cache sweeper heuristics and the unloading behavior will be promised by the concurrent collection. There's no longer any "zombie" methods to be counted. Considering it will introduce lots of useless collection jobs, I think SweeperThreshold should be deleted now. I think the general concern presented out by the code > // After threshold is reached, scale it by free_ratio so that more aggressive > // GC is triggered as we approach code cache exhaustion is still valid. How this is implemented also makes somewhat sense: changes are the trigger for collections, allow larger changes before trying to clean out the code cache the emptier the code cache is. It tries to limit code cache memory usage by increasingly doing more frequent collections the more occupied the code cache becomes, i.e. some kind of backpressure on code cache usage. Your use case of limiting the code cache size (and setting initial == max) seems to be relatively unusual one to me, and apparently does not fit that model as it seems that you set code cache size close to actual max usage. Removing `SweepingThreshold` would affect the regular case as well in a significant way (allocate until bumping into the `StartAggressiveSweepingAt`) I do not think removing this part of the heuristic isn't good (or desired at all). Maybe an alternative could be only not doing this heuristic part in your case; and even then am not sure that waiting until hitting the `StartAggressiveSweepingAt` threshold is a good idea; it may be too late to avoid disabling the compiler at least temporarily. And even then, as long as the memory usage keeps being larger larger than the threshold, this will result in continuous code cache sweeps (_every time_ _any_ memory is allocated in the code cache). >From the [JDK-8244660](https://bugs.openjdk.org/browse/JDK-8244660) CR: > This is because users with different sized code caches might want different thresholds. (Otherwise there would be no way to control the sweepers intensity). Which means that one could just take that suggestion literally and not only change the initial/max code cache size but also that threshold in your use case. Stepping back a little, this situation very much resembles issues with G1's `InitiatingHeapOccupancyPercent` pre [JDK-8136677](https://bugs.openjdk.org/browse/JDK-8136677) where a one-size-fits-all value also did not work, and many many people tuned `InitiatingHeapOccupancyPercent` manually in the past. Maybe a similar mechanism at least taking actual code cache allocation rate into account ("when will the current watermark will be hit"?) would be preferable to replace both options (note that since I'm not an expert in code cache, there may be other reasons to clean out the code cache than just occupancy threshold)? Thomas ------------- PR Comment: https://git.openjdk.org/jdk/pull/21084#issuecomment-2383475220 From robilad at openjdk.org Wed Feb 19 06:21:33 2025 From: robilad at openjdk.org (Dalibor Topic) Date: Wed, 19 Feb 2025 06:21:33 GMT Subject: RFR: 8340434: Excessive Young GCs Triggered by CodeCache GC Threshold In-Reply-To: References: Message-ID: On Thu, 19 Sep 2024 08:43:50 GMT, sli-x wrote: > The trigger of _codecache_GC_threshold in CodeCache::gc_on_allocation is the key to this problem. > > if (used_ratio > threshold) { > // After threshold is reached, scale it by free_ratio so that more aggressive > // GC is triggered as we approach code cache exhaustion > threshold *= free_ratio; > } > // If code cache has been allocated without any GC at all, let's make sure > // it is eventually invoked to avoid trouble. > if (allocated_since_last_ratio > threshold) { > // In case the GC is concurrent, we make sure only one thread requests the GC. > if (Atomic::cmpxchg(&_unloading_threshold_gc_requested, false, true) == false) { > log_info(codecache)("Triggering threshold (%.3f%%) GC due to allocating %.3f%% since last unloading (%.3f%% used -> %.3f%% used)", > threshold * 100.0, allocated_since_last_ratio * 100.0, last_used_ratio * 100.0, used_ratio * 100.0); > Universe::heap()->collect(GCCause::_codecache_GC_threshold); > } > } > > Here with the limited codecache size, the free_ratio will get lower and lower (so as the threshold) if no methods can be swept and thus leads to a more and more frequent collection behavior. Since the collection happens in stw, the whole performance of gc will also be degraded. > > So a simple solution is to delete the scaling logic here. However, I think here lies some problems worth further exploring. > > There're two options to control a code cache sweeper, StartAggressiveSweepingAt and SweeperThreshold. StartAggressiveSweepingAt is a sweeper triggered for little space in codeCache and does little harm. However, SweeperThreshold, first introduced by [JDK-8244660](https://bugs.openjdk.org/browse/JDK-8244660), was designed for a regular sweep for codecache, when codeCache sweeper and heap collection are actually individual. After [JDK-8290025](https://bugs.openjdk.org/browse/JDK-8290025) and some patches related, the old mechanism of codeCache sweeper is merged into a concurrent heap collection. So the Code cache sweeper heuristics and the unloading behavior will be promised by the concurrent collection. There's no longer any "zombie" methods to be counted. Considering it will introduce lots of useless collection jobs, I think SweeperThreshold should be deleted now. Hi, please send an e-mail to dalibor.topic at oracle.com so that I can verify your account in Skara. ------------- PR Comment: https://git.openjdk.org/jdk/pull/21084#issuecomment-2427338142 From ayang at openjdk.org Wed Feb 19 14:18:05 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 19 Feb 2025 14:18:05 GMT Subject: RFR: 8348171: Refactor GenerationCounters and its subclasses [v5] In-Reply-To: <7otkT63ENoyKzZ29CbYpycLLwL89ARajYg36Mstz4tQ=.fd3c7dcf-5a8b-44be-9205-09e3d160d54d@github.com> References: <7otkT63ENoyKzZ29CbYpycLLwL89ARajYg36Mstz4tQ=.fd3c7dcf-5a8b-44be-9205-09e3d160d54d@github.com> Message-ID: On Tue, 11 Feb 2025 15:28:25 GMT, Albert Mingkun Yang wrote: >> Simple refactoring of removing the use of `virtual` method and use concrete subclasses when needed. >> >> Test: tier1-5 > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains seven commits: > > - Merge branch 'master' into gen-counter > - review > - * some more refactoring > - review > - Merge branch 'master' into gen-counter > - merge > - gen-counter Thanks for review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23209#issuecomment-2668780554 From ayang at openjdk.org Wed Feb 19 14:18:05 2025 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 19 Feb 2025 14:18:05 GMT Subject: Integrated: 8348171: Refactor GenerationCounters and its subclasses In-Reply-To: References: Message-ID: On Tue, 21 Jan 2025 09:53:07 GMT, Albert Mingkun Yang wrote: > Simple refactoring of removing the use of `virtual` method and use concrete subclasses when needed. > > Test: tier1-5 This pull request has now been integrated. Changeset: c6e47fd5 Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/c6e47fd5812997e3428249be1c77c60e7b05a5df Stats: 202 lines in 17 files changed: 6 ins; 160 del; 36 mod 8348171: Refactor GenerationCounters and its subclasses Co-authored-by: Thomas Schatzl Reviewed-by: gli, tschatzl, zgu ------------- PR: https://git.openjdk.org/jdk/pull/23209 From andrew at openjdk.org Wed Feb 19 15:49:28 2025 From: andrew at openjdk.org (Andrew John Hughes) Date: Wed, 19 Feb 2025 15:49:28 GMT Subject: RFR: Merge jdk8u:master Message-ID: <0-KY5icPKpiaB8p261b901PBCh1qzjGBN4xQFGDJPKU=.6f22bf19-fcc8-4aff-92d8-f21498a7d2bd@github.com> Merge jdk8u332-b07 ------------- Commit messages: - Merge jdk8u332-b07 - 8284548: Invalid XPath expression causes StringIndexOutOfBoundsException - 8281388: Change wrapping of EncryptedPrivateKeyInfo - 8282300: Throws NamingException instead of InvalidNameException after JDK-8278972 - 8278972: Improve URL supports - 8278805: Enhance BMP image loading - 8278449: Improve keychain support - 8282397: createTempFile method of java.io.File is failing when called with suffix of spaces character - 8278356: Improve file creation - 8278008: Improve Santuario processing - ... and 10 more: https://git.openjdk.org/shenandoah-jdk8u/compare/5ca34513...0c935e9c The merge commit only contains trivial merges, so no merge-specific webrevs have been generated. Changes: https://git.openjdk.org/shenandoah-jdk8u/pull/12/files Stats: 3516 lines in 62 files changed: 2391 ins; 729 del; 396 mod Patch: https://git.openjdk.org/shenandoah-jdk8u/pull/12.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk8u.git pull/12/head:pull/12 PR: https://git.openjdk.org/shenandoah-jdk8u/pull/12 From andrew at openjdk.org Wed Feb 19 15:49:28 2025 From: andrew at openjdk.org (Andrew John Hughes) Date: Wed, 19 Feb 2025 15:49:28 GMT Subject: RFR: Merge jdk8u:master In-Reply-To: <0-KY5icPKpiaB8p261b901PBCh1qzjGBN4xQFGDJPKU=.6f22bf19-fcc8-4aff-92d8-f21498a7d2bd@github.com> References: <0-KY5icPKpiaB8p261b901PBCh1qzjGBN4xQFGDJPKU=.6f22bf19-fcc8-4aff-92d8-f21498a7d2bd@github.com> Message-ID: On Wed, 19 Feb 2025 15:44:23 GMT, Andrew John Hughes wrote: > Merge jdk8u332-b07 GHA builds will not work until [JDK-8284622](https://bugs.openjdk.org/browse/JDK-8284622) is merged in 8u362-b03 ------------- PR Comment: https://git.openjdk.org/shenandoah-jdk8u/pull/12#issuecomment-2669042924 From andrew at openjdk.org Wed Feb 19 15:55:47 2025 From: andrew at openjdk.org (Andrew John Hughes) Date: Wed, 19 Feb 2025 15:55:47 GMT Subject: RFR: Merge jdk8u:master [v2] In-Reply-To: <0-KY5icPKpiaB8p261b901PBCh1qzjGBN4xQFGDJPKU=.6f22bf19-fcc8-4aff-92d8-f21498a7d2bd@github.com> References: <0-KY5icPKpiaB8p261b901PBCh1qzjGBN4xQFGDJPKU=.6f22bf19-fcc8-4aff-92d8-f21498a7d2bd@github.com> Message-ID: > Merge jdk8u332-b07 Andrew John Hughes has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. ------------- Changes: - all: https://git.openjdk.org/shenandoah-jdk8u/pull/12/files - new: https://git.openjdk.org/shenandoah-jdk8u/pull/12/files/0c935e9c..0c935e9c Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah-jdk8u&pr=12&range=01 - incr: https://webrevs.openjdk.org/?repo=shenandoah-jdk8u&pr=12&range=00-01 Stats: 0 lines in 0 files changed: 0 ins; 0 del; 0 mod Patch: https://git.openjdk.org/shenandoah-jdk8u/pull/12.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk8u.git pull/12/head:pull/12 PR: https://git.openjdk.org/shenandoah-jdk8u/pull/12 From andrew at openjdk.org Wed Feb 19 15:55:40 2025 From: andrew at openjdk.org (Andrew John Hughes) Date: Wed, 19 Feb 2025 15:55:40 GMT Subject: git: openjdk/shenandoah-jdk8u: master: 20 new changesets Message-ID: <73df6b88-1a42-4cab-a12e-3f5c42089dad@openjdk.org> Changeset: 703cd7c3 Branch: master Author: Andrew John Hughes Date: 2022-03-29 03:33:24 +0000 URL: https://git.openjdk.org/shenandoah-jdk8u/commit/703cd7c3247f4122b2f316838a99adbc28454015 Added tag jdk8u332-b06 for changeset 6d5c4e11830c ! .hgtags Changeset: c957789e Branch: master Author: Martin Balao Date: 2022-04-15 01:35:49 +0000 URL: https://git.openjdk.org/shenandoah-jdk8u/commit/c957789e68889bc05aeece367779cc427bf4780f 8269938: Enhance XML processing passes redux Reviewed-by: andrew ! jaxp/src/com/sun/org/apache/xerces/internal/parsers/AbstractDOMParser.java ! jaxp/src/com/sun/org/apache/xml/internal/serializer/ToHTMLStream.java ! jaxp/src/com/sun/org/apache/xml/internal/serializer/ToStream.java ! jaxp/src/com/sun/xml/internal/stream/events/EntityDeclarationImpl.java ! jaxp/src/com/sun/xml/internal/stream/events/NotationDeclarationImpl.java ! jaxp/src/jdk/xml/internal/JdkXmlUtils.java Changeset: 22ae2b4e Branch: master Author: Yuri Nesterenko Date: 2022-04-15 01:58:03 +0000 URL: https://git.openjdk.org/shenandoah-jdk8u/commit/22ae2b4eaf411bcedb292a851fb184a01ceabb9d 8270504: Better Xpath expression handling Reviewed-by: andrew ! jaxp/src/com/sun/java_cup/internal/runtime/lr_parser.java ! jaxp/src/com/sun/org/apache/xalan/internal/XalanConstants.java - jaxp/src/com/sun/org/apache/xalan/internal/utils/XMLSecurityManager.java ! jaxp/src/com/sun/org/apache/xalan/internal/xsltc/compiler/Parser.java ! jaxp/src/com/sun/org/apache/xalan/internal/xsltc/compiler/XPathParser.java ! jaxp/src/com/sun/org/apache/xalan/internal/xsltc/compiler/XSLTC.java ! jaxp/src/com/sun/org/apache/xalan/internal/xsltc/compiler/sym.java ! jaxp/src/com/sun/org/apache/xalan/internal/xsltc/compiler/util/ErrorMessages.java ! jaxp/src/com/sun/org/apache/xalan/internal/xsltc/compiler/util/ErrorMsg.java ! jaxp/src/com/sun/org/apache/xalan/internal/xsltc/trax/TransformerFactoryImpl.java ! jaxp/src/com/sun/org/apache/xalan/internal/xsltc/trax/TransformerImpl.java ! jaxp/src/com/sun/org/apache/xalan/internal/xsltc/trax/Util.java ! jaxp/src/com/sun/org/apache/xml/internal/utils/XMLReaderManager.java ! jaxp/src/com/sun/org/apache/xpath/internal/XPath.java ! jaxp/src/com/sun/org/apache/xpath/internal/compiler/Lexer.java + jaxp/src/com/sun/org/apache/xpath/internal/compiler/Token.java ! jaxp/src/com/sun/org/apache/xpath/internal/compiler/XPathParser.java ! jaxp/src/com/sun/org/apache/xpath/internal/jaxp/XPathFactoryImpl.java ! jaxp/src/com/sun/org/apache/xpath/internal/jaxp/XPathImpl.java ! jaxp/src/com/sun/org/apache/xpath/internal/res/XPATHErrorResources.java ! jaxp/src/jdk/xml/internal/JdkXmlUtils.java + jaxp/src/jdk/xml/internal/XMLLimitAnalyzer.java + jaxp/src/jdk/xml/internal/XMLSecurityManager.java Changeset: d5e4c821 Branch: master Author: David Alvarez Date: 2022-03-30 22:50:06 +0000 URL: https://git.openjdk.org/shenandoah-jdk8u/commit/d5e4c8211b2cc7c5d5e02f8d5f51b4d048c40836 8272255: Completely handle MIDI files Reviewed-by: andrew ! jdk/src/share/classes/com/sun/media/sound/AudioFileSoundbankReader.java Changeset: 0417c4b7 Branch: master Author: Oli Gillespie Committer: David Alvarez Date: 2022-02-04 18:41:32 +0000 URL: https://git.openjdk.org/shenandoah-jdk8u/commit/0417c4b79bb90197b92404b93900f65ccab4d453 8272261: Improve JFR recording file processing Reviewed-by: andrew ! jdk/src/share/classes/jdk/jfr/internal/tool/JSONWriter.java ! jdk/src/share/classes/jdk/jfr/internal/tool/XMLWriter.java Changeset: d4a2cb2f Branch: master Author: Martin Balao Date: 2022-04-15 02:17:55 +0000 URL: https://git.openjdk.org/shenandoah-jdk8u/commit/d4a2cb2fc593a23e8b55242c84210e217b9eefa7 8272594: Better record of recordings Reviewed-by: andrew ! jdk/src/share/classes/jdk/jfr/consumer/ParserFactory.java ! jdk/src/share/classes/jdk/jfr/internal/MetadataReader.java ! jdk/src/share/classes/jdk/jfr/internal/consumer/RecordingInput.java Changeset: b44052f1 Branch: master Author: David Alvarez Date: 2022-04-15 02:24:16 +0000 URL: https://git.openjdk.org/shenandoah-jdk8u/commit/b44052f15b5939c30ee910fd6f8d4c1ddef44a6b 8274221: More definite BER encodings Reviewed-by: andrew ! jdk/src/share/classes/sun/security/util/DerIndefLenConverter.java Changeset: 09520f3c Branch: master Author: Aleksei Voitylov Date: 2022-02-18 00:38:29 +0000 URL: https://git.openjdk.org/shenandoah-jdk8u/commit/09520f3c48e3656f1b225cda0d4057e3183daa3a 8275151: Improved Object Identification Reviewed-by: andrew ! jdk/src/share/classes/sun/security/util/ObjectIdentifier.java Changeset: 399ad9ed Branch: master Author: Aleksei Voitylov Date: 2022-02-18 00:39:20 +0000 URL: https://git.openjdk.org/shenandoah-jdk8u/commit/399ad9ed3f097966da84e02415371ed22406e906 8277227: Better identification of OIDs Reviewed-by: andrew ! jdk/src/share/classes/sun/security/util/ObjectIdentifier.java Changeset: 89d03b67 Branch: master Author: Aleksei Voitylov