From ysr at openjdk.org Wed Nov 1 01:10:45 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 1 Nov 2023 01:10:45 GMT Subject: RFR: 8318462: [GenShen] Prevent unsafe access to displaced mark-word [v3] In-Reply-To: <7WgrlAQun762-rhgugi6qSKkqbrJqAskJDBEDMJ8O6o=.9516f2e1-314a-4173-9b0d-d4e334884c2e@github.com> References:

<62iAP1xppfg6WDXm-ZdoT8cugt1L-2athpTtFc145M8=.9e763172-f272-4675-93a7-3733d63b335b@github.com> <3ZaUyCn3Fxysz9hN4CK8EHQmDiGW3VEK8eIiBhFM70k=.e9b27e34-c63c-419a-b6b8-4af376c28480@github.com> <7WgrlAQun762-rhgugi6qSKkqbrJqAskJDBEDMJ8O6o=.9516f2e1-314a-4173-9b0d-d4e334884c2e@github.com> Message-ID: On Tue, 31 Oct 2023 23:21:43 GMT, Kelvin Nilsen wrote: >> The add method of ShenandoahAgeCensus explicitly recognizes and uses the sentinel value, and explicitly skips sentinels that are used to signal that the age could not be read: >> >> >> CENSUS_NOISE(void ShenandoahAgeCensus::add(uint obj_age, uint region_age, uint region_youth, size_t size, uint worker_id) {) >> NO_CENSUS_NOISE(void ShenandoahAgeCensus::add(uint obj_age, uint region_age, size_t size, uint worker_id) {) >> if (obj_age <= markWord::max_age) { >> assert(obj_age < MAX_COHORTS && region_age < MAX_COHORTS, "Should have been tenured"); >> #ifdef SHENANDOAH_CENSUS_NOISE >> // Region ageing is stochastic and non-monotonic; this vitiates mortality >> // demographics in ways that might defeat our algorithms. Marking may be a >> // time when we might be able to correct this, but we currently do not do >> // this. Like skipped statistics further below, we want to track the >> // impact of this noise to see if this may be worthwhile. JDK-. >> uint age = obj_age; >> if (region_age > 0) { >> add_aged(size, worker_id); // this tracking is coarse for now >> age += region_age; >> if (age >= MAX_COHORTS) { >> age = (uint)(MAX_COHORTS - 1); // clamp >> add_clamped(size, worker_id); >> } >> } >> if (region_youth > 0) { // track object volume with retrograde age >> add_young(size, worker_id); >> } >> #else // SHENANDOAH_CENSUS_NOISE >> uint age = MIN2(obj_age + region_age, (uint)(MAX_COHORTS - 1)); // clamp >> #endif // SHENANDOAH_CENSUS_NOISE >> get_local_age_table(worker_id)->add(age, size); >> } else { >> // update skipped statistics >> CENSUS_NOISE(add_skipped(size, worker_id);) >> } >> } >> >> >> I think we should leave in place the original code, where all of the special handling of the sentinel values is relegated to the ShenandoahAgeCensus's add method, with no special handling needed at the caller. > > Thanks. That looks good. I concur. I realize though that your comment kind of applies to the evacuation tracker code that needs to be cognizant of the change in the semantics of `get_object_age` following Roman's change. This is the relevant call at: https://github.com/openjdk/shenandoah/blob/113a3d2bbec5679a9c25550fc825feb75a7b41e8/src/hotspot/share/gc/shenandoah/shenandoahHeap.inline.hpp#L491 which lands us at: https://github.com/openjdk/shenandoah/blob/113a3d2bbec5679a9c25550fc825feb75a7b41e8/src/hotspot/share/gc/shenandoah/shenandoahEvacTracker.cpp#L60 and thence at: https://github.com/openjdk/shenandoah/blob/113a3d2bbec5679a9c25550fc825feb75a7b41e8/src/hotspot/share/gc/shared/ageTable.hpp#L63 which might either assert or silently overflow past the end of the table. It would be useful to filter those records out in ShenandoahEvacuationStats::record_age() or at the caller. ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/343#discussion_r1378299556 From wkemper at openjdk.org Wed Nov 1 12:37:35 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 1 Nov 2023 12:37:35 GMT Subject: RFR: 8318462: [GenShen] Prevent unsafe access to displaced mark-word [v4] In-Reply-To: <7g3I1quU1AP4udb0EdB_s_8dK1coaiFR8wGTEUZ2ASw=.4102020d-3d17-4901-87da-7a47bb4106fd@github.com> References: <7g3I1quU1AP4udb0EdB_s_8dK1coaiFR8wGTEUZ2ASw=.4102020d-3d17-4901-87da-7a47bb4106fd@github.com> Message-ID: On Tue, 31 Oct 2023 09:51:29 GMT, Roman Kennke wrote: >> We must be very careful when we access (load or update) the object age concurrently because the age is stored in the object's header and that header can be 'displaced'. This means that the header is overloaded with a pointer (stack or ObjectMonitor*) and the original header is stored at that location. The header can also be in INFLATING state, which means that a stack-lock is currently in the process to be inflated to an ObjectMonitor. Let's consider the cases separately: >> >> Loading object age: >> - INFLATING: we can not access the header, and thus not the age. >> - Stack-locked: the thread which locks the object can unlock at any time, concurrently. It is not safe to access the header, and thus the age. >> - Monitor-locked: monitors don't go away without coordinating with Java threads and possibly GC threads. This coordination is done by handshaking the thread - a Java thread would be brought to its next safepoint, and GC threads, which are typically participating in handshakes by joining the SuspendibleThreadSet, can opt to handshake at a safe point by calling SuspendibleThreadSet::yield(). If 'our' thread participates in handshakes in one way or the other, it is safe to access an ObjectMonitor once we got a valid ObjectMonitor* from an object's header, but only until that thread handshakes. Long story short: it is typically safe to access ObjectMonitor :-) >> >> Updating object age: >> Updating object age only happens during evacuation, and only on a new copy of an object, before that copy has been published. Access to the object header is therefore fully thread-local and not problematic. What *is* problematic is when we have to deal with displaced headers, because displaced headers are *not* thread local, and must be considered a shared resource. A competing thread may succeed to evacuate the object and publish its copy before 'our' thread can do so. If 'our' thread would update the displaced header, it may over-write whatever the other thread has done. >> - INFLATING: We cannot access the header at all. However, that should not happen: a thread can only inflate once it has successfully evacuated an object, and >> - Stack-locked: the other thread may succeed evacuation and continue to unlock the object, at which point the stack-pointer would be 'dangling' and we may over-write random information on the foreign thread stack. >> - Monitor-locked: even though monitor deflation is coordinated (see above), updating the displaced header in a monitor might over-... > > Roman Kennke has updated the pull request incrementally with two additional commits since the last revision: > > - Treat age sentinel correctly > - Fix comment formatting Internal testing looks good. ------------- Marked as reviewed by wkemper (Committer). PR Review: https://git.openjdk.org/shenandoah/pull/343#pullrequestreview-1708129567 From zgu at openjdk.org Wed Nov 1 13:29:59 2023 From: zgu at openjdk.org (Zhengyu Gu) Date: Wed, 1 Nov 2023 13:29:59 GMT Subject: RFR: 8318462: [GenShen] Prevent unsafe access to displaced mark-word [v2] In-Reply-To: References: <2lwvFiHSE-N-kXfzhKYx4HhQASkwI01cjVO_tPJ_KZI=.5c873318-cb3f-44e6-8745-d6f5cbe3e52d@github.com>

Message-ID: On Tue, 31 Oct 2023 09:39:45 GMT, Roman Kennke wrote: > > Just FYI: Here is another place to update object age, probably you want to consolidate. > > https://github.com/openjdk/shenandoah/blob/master/src/hotspot/share/gc/shenandoah/shenandoahStringDedup.inline.hpp#L55 > > Right. The problem there is that this block of code does a sort of concurrent/atomic update of the object age that would require re-working and/or adding age helper methods. I'd rather not do that now. I think for GenShen, StringDeduplication should not update age, so code can just be simplified as following: bool ShenandoahStringDedup::is_candidate(oop obj) { if (!is_string_candidate(obj)) { return false; } const markWord mark = obj->mark(); // Having/had displaced header, too risky to deal with them, skip if ((LockingMode != LM_LIGHTWEIGHT && mark == markWord::INFLATING()) || mark.has_displaced_mark_helper()) { return false; } return StringDedup::is_threshold_age(new_mark.age()) && !dedup_requested(obj); } ------------- PR Comment: https://git.openjdk.org/shenandoah/pull/343#issuecomment-1788955550 From kdnilsen at openjdk.org Wed Nov 1 16:06:55 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 1 Nov 2023 16:06:55 GMT Subject: RFR: 8319198: GenShen: Old at end of Full GC must include newly promoted objects Message-ID: At end of Full GC, remember live bytes after last old-gen mark after promotions are completed rather than immediately after old marking is completed. This is a more accurate representation of old live. This reduces the likelihood that we will need to immediately trigger another old-gen GC following the Full GC. ------------- Commit messages: - Record old-gen live after promotion is done Changes: https://git.openjdk.org/shenandoah/pull/350/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=350&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8319198 Stats: 4 lines in 1 file changed: 3 ins; 1 del; 0 mod Patch: https://git.openjdk.org/shenandoah/pull/350.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/350/head:pull/350 PR: https://git.openjdk.org/shenandoah/pull/350 From wkemper at openjdk.org Wed Nov 1 16:20:36 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 1 Nov 2023 16:20:36 GMT Subject: RFR: 8319198: GenShen: Old at end of Full GC must include newly promoted objects In-Reply-To: References: Message-ID: On Wed, 1 Nov 2023 16:01:34 GMT, Kelvin Nilsen wrote: > At end of Full GC, remember live bytes after last old-gen mark after promotions are completed rather than immediately after old marking is completed. This is a more accurate representation of old live. This reduces the likelihood that we will need to immediately trigger another old-gen GC following the Full GC. Looks good to me. ------------- Marked as reviewed by wkemper (Committer). PR Review: https://git.openjdk.org/shenandoah/pull/350#pullrequestreview-1708565899 From kdnilsen at openjdk.org Wed Nov 1 17:38:09 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 1 Nov 2023 17:38:09 GMT Subject: Integrated: 8319198: GenShen: Old at end of Full GC must include newly promoted objects In-Reply-To: References: Message-ID: On Wed, 1 Nov 2023 16:01:34 GMT, Kelvin Nilsen wrote: > At end of Full GC, remember live bytes after last old-gen mark after promotions are completed rather than immediately after old marking is completed. This is a more accurate representation of old live. This reduces the likelihood that we will need to immediately trigger another old-gen GC following the Full GC. This pull request has now been integrated. Changeset: e4a8a438 Author: Kelvin Nilsen URL: https://git.openjdk.org/shenandoah/commit/e4a8a43806537303fbf18a062ac536f0c2729551 Stats: 4 lines in 1 file changed: 3 ins; 1 del; 0 mod 8319198: GenShen: Old at end of Full GC must include newly promoted objects Reviewed-by: wkemper ------------- PR: https://git.openjdk.org/shenandoah/pull/350 From wkemper at openjdk.org Wed Nov 1 17:48:46 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 1 Nov 2023 17:48:46 GMT Subject: RFR: Merge openjdk/jdk:master [v2] In-Reply-To: References:

Message-ID: On Fri, 20 Oct 2023 15:47:44 GMT, William Kemper wrote: >> Merges tag jdk-22+20 > > William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 128 commits: > > - Merge remote-tracking branch 'shenandoah/master' into merge-jdk-22+20 > - 8318363: Foreign benchmarks fail to build on some platforms > > Reviewed-by: jvernee, mcimadamore > - 8318183: C2: VM may crash after hitting node limit > > Reviewed-by: kvn, thartmann > - 8315974: Make fields final in 'com.sun.crypto.provider' package > > Reviewed-by: valeriep > - 8317886: Add @sealedGraph to ByteBuffer > > Reviewed-by: rriggs, iris > - 8318365: Test runtime/cds/appcds/sharedStrings/InternSharedString.java fails after JDK-8311538 > > Reviewed-by: cjplummer, lmesnik > - 8309966: Enhanced TLS connections > > Reviewed-by: mschoene, hchao, rhalade, jnimeh > - 8286503: Enhance security classes > > Reviewed-by: rhalade, mullan, skoivu, weijun > - 8297856: Improve handling of Bidi characters > > Reviewed-by: rhalade, mschoene, rriggs > - 8296581: Better system proxy support > > Reviewed-by: rhalade, dfuchs, michaelm, alanb > - ... and 118 more: https://git.openjdk.org/shenandoah/compare/ac30d485...de29536b Superseded by PR to merge jdk-22+21. ------------- PR Comment: https://git.openjdk.org/shenandoah/pull/346#issuecomment-1789389504 From wkemper at openjdk.org Wed Nov 1 17:48:46 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 1 Nov 2023 17:48:46 GMT Subject: Withdrawn: Merge openjdk/jdk:master In-Reply-To: References: Message-ID: On Thu, 19 Oct 2023 22:17:19 GMT, William Kemper wrote: > Merges tag jdk-22+20 This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.org/shenandoah/pull/346 From rkennke at openjdk.org Thu Nov 2 11:52:56 2023 From: rkennke at openjdk.org (Roman Kennke) Date: Thu, 2 Nov 2023 11:52:56 GMT Subject: RFR: 8318462: [GenShen] Prevent unsafe access to displaced mark-word [v5] In-Reply-To: References: Message-ID: > We must be very careful when we access (load or update) the object age concurrently because the age is stored in the object's header and that header can be 'displaced'. This means that the header is overloaded with a pointer (stack or ObjectMonitor*) and the original header is stored at that location. The header can also be in INFLATING state, which means that a stack-lock is currently in the process to be inflated to an ObjectMonitor. Let's consider the cases separately: > > Loading object age: > - INFLATING: we can not access the header, and thus not the age. > - Stack-locked: the thread which locks the object can unlock at any time, concurrently. It is not safe to access the header, and thus the age. > - Monitor-locked: monitors don't go away without coordinating with Java threads and possibly GC threads. This coordination is done by handshaking the thread - a Java thread would be brought to its next safepoint, and GC threads, which are typically participating in handshakes by joining the SuspendibleThreadSet, can opt to handshake at a safe point by calling SuspendibleThreadSet::yield(). If 'our' thread participates in handshakes in one way or the other, it is safe to access an ObjectMonitor once we got a valid ObjectMonitor* from an object's header, but only until that thread handshakes. Long story short: it is typically safe to access ObjectMonitor :-) > > Updating object age: > Updating object age only happens during evacuation, and only on a new copy of an object, before that copy has been published. Access to the object header is therefore fully thread-local and not problematic. What *is* problematic is when we have to deal with displaced headers, because displaced headers are *not* thread local, and must be considered a shared resource. A competing thread may succeed to evacuate the object and publish its copy before 'our' thread can do so. If 'our' thread would update the displaced header, it may over-write whatever the other thread has done. > - INFLATING: We cannot access the header at all. However, that should not happen: a thread can only inflate once it has successfully evacuated an object, and > - Stack-locked: the other thread may succeed evacuation and continue to unlock the object, at which point the stack-pointer would be 'dangling' and we may over-write random information on the foreign thread stack. > - Monitor-locked: even though monitor deflation is coordinated (see above), updating the displaced header in a monitor might over-write whatever a competi... Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: Simplify string-dedup age handling ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/343/files - new: https://git.openjdk.org/shenandoah/pull/343/files/113a3d2b..969e4650 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=343&range=04 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=343&range=03-04 Stats: 21 lines in 1 file changed: 3 ins; 14 del; 4 mod Patch: https://git.openjdk.org/shenandoah/pull/343.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/343/head:pull/343 PR: https://git.openjdk.org/shenandoah/pull/343 From wkemper at openjdk.org Thu Nov 2 14:15:58 2023 From: wkemper at openjdk.org (William Kemper) Date: Thu, 2 Nov 2023 14:15:58 GMT Subject: RFR: Merge openjdk/jdk:master Message-ID: Merges tag jdk-22+22 ------------- Commit messages: - 8318694: [JVMCI] disable can_call_java in most contexts for libjvmci compiler threads - 8319046: Execute tests in source/class-file order in JavadocTester - 8319139: Improve diagnosability of `JavadocTester` output - 8316025: Use testUI() method of PassFailJFrame.Builder in FileChooserSymLinkTest.java - 8317048: VerifyError with unnamed pattern variable and more than one components - 8318647: Serial: Refactor BlockOffsetTable - 8304939: os::win32::exit_process_or_thread should be marked noreturn - 8318016: Per-compilation memory ceiling - 8318683: compiler/c2/irTests/TestPhiDuplicatedConversion.java "Failed IR Rules (2) of Methods (2)" - 8306980: Generated docs should contain correct Legal Documents - ... and 66 more: https://git.openjdk.org/shenandoah/compare/d96f38b8...d354141a The webrev contains the conflicts with master: - merge conflicts: https://webrevs.openjdk.org/?repo=shenandoah&pr=351&range=00.conflicts Changes: https://git.openjdk.org/shenandoah/pull/351/files Stats: 27458 lines in 1170 files changed: 14959 ins; 4156 del; 8343 mod Patch: https://git.openjdk.org/shenandoah/pull/351.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/351/head:pull/351 PR: https://git.openjdk.org/shenandoah/pull/351 From wkemper at openjdk.org Thu Nov 2 16:29:38 2023 From: wkemper at openjdk.org (William Kemper) Date: Thu, 2 Nov 2023 16:29:38 GMT Subject: RFR: Merge openjdk/jdk:master [v2] In-Reply-To: References: Message-ID: <8uEDs5EvShRtlND-ytX70TgezC0-5w3OhvtSLVD-Rsw=.1d7eb8e7-bdff-4cb8-b03b-47f79f655829@github.com> > Merges tag jdk-22+22 William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 77 commits: - Merge branch 'shenandoah/master' into merge-jdk-22+22 - 8318694: [JVMCI] disable can_call_java in most contexts for libjvmci compiler threads Reviewed-by: dholmes, never - 8319046: Execute tests in source/class-file order in JavadocTester Reviewed-by: hannesw - 8319139: Improve diagnosability of `JavadocTester` output Reviewed-by: hannesw - 8316025: Use testUI() method of PassFailJFrame.Builder in FileChooserSymLinkTest.java Reviewed-by: azvegint - 8317048: VerifyError with unnamed pattern variable and more than one components Reviewed-by: jlahoda - 8318647: Serial: Refactor BlockOffsetTable Reviewed-by: tschatzl, iwalulya - 8304939: os::win32::exit_process_or_thread should be marked noreturn Reviewed-by: dholmes, kbarrett - 8318016: Per-compilation memory ceiling Reviewed-by: roland, thartmann - 8318683: compiler/c2/irTests/TestPhiDuplicatedConversion.java "Failed IR Rules (2) of Methods (2)" Reviewed-by: thartmann, kvn - ... and 67 more: https://git.openjdk.org/shenandoah/compare/e4a8a438...578915b0 ------------- Changes: https://git.openjdk.org/shenandoah/pull/351/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=351&range=01 Stats: 27452 lines in 1168 files changed: 14955 ins; 4156 del; 8341 mod Patch: https://git.openjdk.org/shenandoah/pull/351.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/351/head:pull/351 PR: https://git.openjdk.org/shenandoah/pull/351 From cslucas at openjdk.org Thu Nov 2 22:36:20 2023 From: cslucas at openjdk.org (Cesar Soares Lucas) Date: Thu, 2 Nov 2023 22:36:20 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission Message-ID: # Description Please review this PR with a patch to re-use the same C2_MacroAssembler object to emit all instructions in the same compilation unit. Overall, the change is pretty simple. However, due to the renaming of the variable to access C2_MacroAssembler, from `_masm.` to `masm->`, and also some method prototype changes, the patch became quite large. # Help Needed for Testing I don't have access to all platforms necessary to test this. I hope some other folks can help with testing on `S390`, `RISC-V` and `PPC`. ------------- Commit messages: - Reuse same C2_MacroAssembler object to emit instructions. Changes: https://git.openjdk.org/jdk/pull/16484/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16484&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8241503 Stats: 2665 lines in 60 files changed: 112 ins; 429 del; 2124 mod Patch: https://git.openjdk.org/jdk/pull/16484.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16484/head:pull/16484 PR: https://git.openjdk.org/jdk/pull/16484 From mdoerr at openjdk.org Thu Nov 2 23:24:05 2023 From: mdoerr at openjdk.org (Martin Doerr) Date: Thu, 2 Nov 2023 23:24:05 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission In-Reply-To: References: Message-ID: On Thu, 2 Nov 2023 22:17:43 GMT, Cesar Soares Lucas wrote: > # Description > > Please review this PR with a patch to re-use the same C2_MacroAssembler object to emit all instructions in the same compilation unit. > > Overall, the change is pretty simple. However, due to the renaming of the variable to access C2_MacroAssembler, from `_masm.` to `masm->`, and also some method prototype changes, the patch became quite large. > > # Help Needed for Testing > > I don't have access to all platforms necessary to test this. I hope some other folks can help with testing on `S390`, `RISC-V` and `PPC`. PPC64 runs into assert(masm->inst_mark() == nullptr) failed: should be. V [libjvm.so+0x1648528] PhaseOutput::fill_buffer(C2_MacroAssembler*, unsigned int*)+0x10c8 (output.cpp:1812) V [libjvm.so+0x164b35c] PhaseOutput::Output()+0xd5c (output.cpp:362) V [libjvm.so+0x958f9c] Compile::Code_Gen()+0x4ec (compile.cpp:2989) V [libjvm.so+0x95e484] Compile::Compile(ciEnv*, ciMethod*, int, Options, DirectiveSet*)+0x1a84 (compile.cpp:887) V [libjvm.so+0x718f58] C2Compiler::compile_method(ciEnv*, ciMethod*, int, bool, DirectiveSet*)+0x198 (c2compiler.cpp:119) ------------- PR Comment: https://git.openjdk.org/jdk/pull/16484#issuecomment-1791694904 From cslucas at openjdk.org Thu Nov 2 23:39:02 2023 From: cslucas at openjdk.org (Cesar Soares Lucas) Date: Thu, 2 Nov 2023 23:39:02 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission In-Reply-To: References:

Message-ID: <4LqTlmsaVA4gGTi5WQNjhn9KCN9SotYddp9SpdIpv-g=.294ccf81-e719-4621-8384-10242d2d7e95@github.com> On Thu, 2 Nov 2023 23:21:23 GMT, Martin Doerr wrote: >> # Description >> >> Please review this PR with a patch to re-use the same C2_MacroAssembler object to emit all instructions in the same compilation unit. >> >> Overall, the change is pretty simple. However, due to the renaming of the variable to access C2_MacroAssembler, from `_masm.` to `masm->`, and also some method prototype changes, the patch became quite large. >> >> # Help Needed for Testing >> >> I don't have access to all platforms necessary to test this. I hope some other folks can help with testing on `S390`, `RISC-V` and `PPC`. > > PPC64 runs into assert(masm->inst_mark() == nullptr) failed: should be. > V [libjvm.so+0x1648528] PhaseOutput::fill_buffer(C2_MacroAssembler*, unsigned int*)+0x10c8 (output.cpp:1812) > V [libjvm.so+0x164b35c] PhaseOutput::Output()+0xd5c (output.cpp:362) > V [libjvm.so+0x958f9c] Compile::Code_Gen()+0x4ec (compile.cpp:2989) > V [libjvm.so+0x95e484] Compile::Compile(ciEnv*, ciMethod*, int, Options, DirectiveSet*)+0x1a84 (compile.cpp:887) > V [libjvm.so+0x718f58] C2Compiler::compile_method(ciEnv*, ciMethod*, int, bool, DirectiveSet*)+0x198 (c2compiler.cpp:119) @TheRealMDoerr - this is likely because of some missing `clear_inst_mark` call on my part in the PPC ad, I'll take a look into it. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16484#issuecomment-1791705658 From ysr at openjdk.org Fri Nov 3 00:24:47 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Fri, 3 Nov 2023 00:24:47 GMT Subject: RFR: Merge openjdk/jdk:master [v2] In-Reply-To: <8uEDs5EvShRtlND-ytX70TgezC0-5w3OhvtSLVD-Rsw=.1d7eb8e7-bdff-4cb8-b03b-47f79f655829@github.com> References: <8uEDs5EvShRtlND-ytX70TgezC0-5w3OhvtSLVD-Rsw=.1d7eb8e7-bdff-4cb8-b03b-47f79f655829@github.com> Message-ID: On Thu, 2 Nov 2023 16:29:38 GMT, William Kemper wrote: >> Merges tag jdk-22+22 > > William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 77 commits: > > - Merge branch 'shenandoah/master' into merge-jdk-22+22 > - 8318694: [JVMCI] disable can_call_java in most contexts for libjvmci compiler threads > > Reviewed-by: dholmes, never > - 8319046: Execute tests in source/class-file order in JavadocTester > > Reviewed-by: hannesw > - 8319139: Improve diagnosability of `JavadocTester` output > > Reviewed-by: hannesw > - 8316025: Use testUI() method of PassFailJFrame.Builder in FileChooserSymLinkTest.java > > Reviewed-by: azvegint > - 8317048: VerifyError with unnamed pattern variable and more than one components > > Reviewed-by: jlahoda > - 8318647: Serial: Refactor BlockOffsetTable > > Reviewed-by: tschatzl, iwalulya > - 8304939: os::win32::exit_process_or_thread should be marked noreturn > > Reviewed-by: dholmes, kbarrett > - 8318016: Per-compilation memory ceiling > > Reviewed-by: roland, thartmann > - 8318683: compiler/c2/irTests/TestPhiDuplicatedConversion.java "Failed IR Rules (2) of Methods (2)" > > Reviewed-by: thartmann, kvn > - ... and 67 more: https://git.openjdk.org/shenandoah/compare/e4a8a438...578915b0 @earthling-amzn : it looks like a few shenandoah only tests e.g.: https://github.com/earthling-amzn/shenandoah/actions/runs/6735431147#user-content-gc_shenandoah_oom_testthreadfailure need to be changed to accommodate the changes to ProcessTools: https://github.com/earthling-amzn/shenandoah/blob/578915b06592d273a986a9d56a7631ea7f5995f8/test/lib/jdk/test/lib/process/ProcessTools.java#L505 ------------- PR Comment: https://git.openjdk.org/shenandoah/pull/351#issuecomment-1791736274 From dlong at openjdk.org Fri Nov 3 01:47:02 2023 From: dlong at openjdk.org (Dean Long) Date: Fri, 3 Nov 2023 01:47:02 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission In-Reply-To: References: Message-ID: On Thu, 2 Nov 2023 22:17:43 GMT, Cesar Soares Lucas wrote: > # Description > > Please review this PR with a patch to re-use the same C2_MacroAssembler object to emit all instructions in the same compilation unit. > > Overall, the change is pretty simple. However, due to the renaming of the variable to access C2_MacroAssembler, from `_masm.` to `masm->`, and also some method prototype changes, the patch became quite large. > > # Help Needed for Testing > > I don't have access to all platforms necessary to test this. I hope some other folks can help with testing on `S390`, `RISC-V` and `PPC`. src/hotspot/cpu/x86/gc/z/z_x86_64.ad line 37: > 35: #include "gc/z/zBarrierSetAssembler.hpp" > 36: > 37: static void z_color(MacroAssembler* masm, const MachNode* node, Register ref) { For files already using MacroAssembler& _masm, the only change needed is this at the top: undef __ #define __ _masm. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16484#discussion_r1381015121 From dlong at openjdk.org Fri Nov 3 02:07:10 2023 From: dlong at openjdk.org (Dean Long) Date: Fri, 3 Nov 2023 02:07:10 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission In-Reply-To: References:

Message-ID: On Fri, 3 Nov 2023 01:44:42 GMT, Dean Long wrote: >> # Description >> >> Please review this PR with a patch to re-use the same C2_MacroAssembler object to emit all instructions in the same compilation unit. >> >> Overall, the change is pretty simple. However, due to the renaming of the variable to access C2_MacroAssembler, from `_masm.` to `masm->`, and also some method prototype changes, the patch became quite large. >> >> # Help Needed for Testing >> >> I don't have access to all platforms necessary to test this. I hope some other folks can help with testing on `S390`, `RISC-V` and `PPC`. > > src/hotspot/cpu/x86/gc/z/z_x86_64.ad line 37: > >> 35: #include "gc/z/zBarrierSetAssembler.hpp" >> 36: >> 37: static void z_color(MacroAssembler* masm, const MachNode* node, Register ref) { > > For files already using MacroAssembler& _masm, the only change needed is this at the top: > > undef __ > #define __ _masm. I guess that doesn't work because different files are concatenated together, causing a conflict if some files expect MacroAssembler *masm. To reduce the number of changes, couldn't we use MacroAssembler& _masm everywhere? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16484#discussion_r1381098489 From fyang at openjdk.org Fri Nov 3 04:49:01 2023 From: fyang at openjdk.org (Fei Yang) Date: Fri, 3 Nov 2023 04:49:01 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission In-Reply-To: References: Message-ID: On Thu, 2 Nov 2023 22:17:43 GMT, Cesar Soares Lucas wrote: > # Description > > Please review this PR with a patch to re-use the same C2_MacroAssembler object to emit all instructions in the same compilation unit. > > Overall, the change is pretty simple. However, due to the renaming of the variable to access C2_MacroAssembler, from `_masm.` to `masm->`, and also some method prototype changes, the patch became quite large. > > # Help Needed for Testing > > I don't have access to all platforms necessary to test this. I hope some other folks can help with testing on `S390`, `RISC-V` and `PPC`. Hello, I guess you might want to merge latest jdk master and add more changes. I witnessed some build errors when building the latest jdk master with this patch on linux-riscv64: /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad: In member function 'virtual void UdivINode::emit(C2_MacroAssembler*, PhaseRegAlloc*) const': /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad:2412:30: error: 'cbuf' was not declared in this scope 2412 | C2_MacroAssembler _masm(&cbuf); | ^~~~ /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad: In member function 'virtual void UdivLNode::emit(C2_MacroAssembler*, PhaseRegAlloc*) const': /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad:2427:30: error: 'cbuf' was not declared in this scope 2427 | C2_MacroAssembler _masm(&cbuf); | ^~~~ /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad: In member function 'virtual void UmodINode::emit(C2_MacroAssembler*, PhaseRegAlloc*) const': /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad:2442:30: error: 'cbuf' was not declared in this scope 2442 | C2_MacroAssembler _masm(&cbuf); | ^~~~ /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad: In member function 'virtual void UmodLNode::emit(C2_MacroAssembler*, PhaseRegAlloc*) const': /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad:2457:30: error: 'cbuf' was not declared in this scope 2457 | C2_MacroAssembler _masm(&cbuf); ------------- PR Comment: https://git.openjdk.org/jdk/pull/16484#issuecomment-1791888196 From wkemper at openjdk.org Fri Nov 3 16:19:22 2023 From: wkemper at openjdk.org (William Kemper) Date: Fri, 3 Nov 2023 16:19:22 GMT Subject: RFR: Merge openjdk/jdk:master [v3] In-Reply-To: References: Message-ID: > Merges tag jdk-22+22 William Kemper has updated the pull request incrementally with one additional commit since the last revision: Fix merge error ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/351/files - new: https://git.openjdk.org/shenandoah/pull/351/files/578915b0..177886ac Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=351&range=02 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=351&range=01-02 Stats: 2 lines in 1 file changed: 0 ins; 0 del; 2 mod Patch: https://git.openjdk.org/shenandoah/pull/351.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/351/head:pull/351 PR: https://git.openjdk.org/shenandoah/pull/351 From cslucas at openjdk.org Fri Nov 3 16:27:06 2023 From: cslucas at openjdk.org (Cesar Soares Lucas) Date: Fri, 3 Nov 2023 16:27:06 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission In-Reply-To: References:

Message-ID: <_Qvj6Msy_P6hIWUc4IwGPLKXLK7391GMCEI4XlNYQrY=.1e03e561-071f-48ae-b322-d944c7b1ff36@github.com> On Fri, 3 Nov 2023 04:44:40 GMT, Fei Yang wrote: >> # Description >> >> Please review this PR with a patch to re-use the same C2_MacroAssembler object to emit all instructions in the same compilation unit. >> >> Overall, the change is pretty simple. However, due to the renaming of the variable to access C2_MacroAssembler, from `_masm.` to `masm->`, and also some method prototype changes, the patch became quite large. >> >> # Help Needed for Testing >> >> I don't have access to all platforms necessary to test this. I hope some other folks can help with testing on `S390`, `RISC-V` and `PPC`. > > Hello, I guess you might want to merge latest jdk master and add more changes. > I witnessed some build errors when building the latest jdk master with this patch on linux-riscv64: > > /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad: In member function 'virtual void UdivINode::emit(C2_MacroAssembler*, PhaseRegAlloc*) const': > /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad:2412:30: error: 'cbuf' was not declared in this scope > 2412 | C2_MacroAssembler _masm(&cbuf); > | ^~~~ > /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad: In member function 'virtual void UdivLNode::emit(C2_MacroAssembler*, PhaseRegAlloc*) const': > /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad:2427:30: error: 'cbuf' was not declared in this scope > 2427 | C2_MacroAssembler _masm(&cbuf); > | ^~~~ > /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad: In member function 'virtual void UmodINode::emit(C2_MacroAssembler*, PhaseRegAlloc*) const': > /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad:2442:30: error: 'cbuf' was not declared in this scope > 2442 | C2_MacroAssembler _masm(&cbuf); > | ^~~~ > /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad: In member function 'virtual void UmodLNode::emit(C2_MacroAssembler*, PhaseRegAlloc*) const': > /home/fyang/jdk/src/hotspot/cpu/riscv/riscv.ad:2457:30: error: 'cbuf' was not declared in this scope > 2457 | C2_MacroAssembler _masm(&cbuf); @RealFYang - Thanks for the note. I'll do that and update the PR. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16484#issuecomment-1792746837 From wkemper at openjdk.org Fri Nov 3 19:34:06 2023 From: wkemper at openjdk.org (William Kemper) Date: Fri, 3 Nov 2023 19:34:06 GMT Subject: RFR: Merge openjdk/jdk:master [v4] In-Reply-To: References: Message-ID: <2RWdMoUPRGXsUmRqzsR7WxFvlcozVfTVXtyzJrK0E5g=.82b77ffc-7501-47aa-8a74-62b7ae68cbe3@github.com> > Merges tag jdk-22+22 William Kemper has updated the pull request incrementally with one additional commit since the last revision: Fix merge errors, harder ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/351/files - new: https://git.openjdk.org/shenandoah/pull/351/files/177886ac..8a52f169 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=351&range=03 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=351&range=02-03 Stats: 2 lines in 2 files changed: 0 ins; 0 del; 2 mod Patch: https://git.openjdk.org/shenandoah/pull/351.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/351/head:pull/351 PR: https://git.openjdk.org/shenandoah/pull/351 From kdnilsen at openjdk.org Fri Nov 3 21:25:49 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 3 Nov 2023 21:25:49 GMT Subject: RFR: 8319342: GenShen: Reset the count of degenerated cycles in a row following Full GC Message-ID: After a Full GC completes, reset the count on consecutive degenerated cycles to zero. This prevents automatic triggering of back-to-back Full GC. ------------- Commit messages: - Do not increment degen count if degen upgrades to full - Count degen cycles in a row within ShenandoahCollectorPolicy Changes: https://git.openjdk.org/shenandoah/pull/352/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=352&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8319342 Stats: 53 lines in 9 files changed: 16 ins; 32 del; 5 mod Patch: https://git.openjdk.org/shenandoah/pull/352.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/352/head:pull/352 PR: https://git.openjdk.org/shenandoah/pull/352 From kbarrett at openjdk.org Sat Nov 4 11:31:20 2023 From: kbarrett at openjdk.org (Kim Barrett) Date: Sat, 4 Nov 2023 11:31:20 GMT Subject: RFR: 8319439: Move BufferNode from PtrQueue files to new files Message-ID: Please review this simple (trivial?) change that moves the BufferNode class from share/gc/shared/ptrQueue.[ch]pp to new files share/gc/shared/bufferNode.[ch]pp. The code (as opposed to the surrounding boiler plate and #include's) was moved unchanged. This led to adding #include's of the new header file to a bunch of other files (include what you use), in some cases eliminating the need to #include ptrQueue.hpp. Also tidied up the #include's for ptrQueue.[ch]pp to take advantage of the split. Also, the gtest gc/shared/test_ptrQueueBufferAllocator.cpp is renamed to gc/shared/test_bufferNodeAllocator.cpp, with some adjustments to includes and test naming. Testing: mach5 tier1 ------------- Commit messages: - fixup gtest after rename - move test - move BufferNode to new files Changes: https://git.openjdk.org/jdk/pull/16506/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16506&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8319439 Stats: 383 lines in 17 files changed: 223 ins; 152 del; 8 mod Patch: https://git.openjdk.org/jdk/pull/16506.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16506/head:pull/16506 PR: https://git.openjdk.org/jdk/pull/16506 From rkennke at openjdk.org Mon Nov 6 12:38:46 2023 From: rkennke at openjdk.org (Roman Kennke) Date: Mon, 6 Nov 2023 12:38:46 GMT Subject: RFR: 8318462: [GenShen] Prevent unsafe access to displaced mark-word [v6] In-Reply-To: References: Message-ID: > We must be very careful when we access (load or update) the object age concurrently because the age is stored in the object's header and that header can be 'displaced'. This means that the header is overloaded with a pointer (stack or ObjectMonitor*) and the original header is stored at that location. The header can also be in INFLATING state, which means that a stack-lock is currently in the process to be inflated to an ObjectMonitor. Let's consider the cases separately: > > Loading object age: > - INFLATING: we can not access the header, and thus not the age. > - Stack-locked: the thread which locks the object can unlock at any time, concurrently. It is not safe to access the header, and thus the age. > - Monitor-locked: monitors don't go away without coordinating with Java threads and possibly GC threads. This coordination is done by handshaking the thread - a Java thread would be brought to its next safepoint, and GC threads, which are typically participating in handshakes by joining the SuspendibleThreadSet, can opt to handshake at a safe point by calling SuspendibleThreadSet::yield(). If 'our' thread participates in handshakes in one way or the other, it is safe to access an ObjectMonitor once we got a valid ObjectMonitor* from an object's header, but only until that thread handshakes. Long story short: it is typically safe to access ObjectMonitor :-) > > Updating object age: > Updating object age only happens during evacuation, and only on a new copy of an object, before that copy has been published. Access to the object header is therefore fully thread-local and not problematic. What *is* problematic is when we have to deal with displaced headers, because displaced headers are *not* thread local, and must be considered a shared resource. A competing thread may succeed to evacuate the object and publish its copy before 'our' thread can do so. If 'our' thread would update the displaced header, it may over-write whatever the other thread has done. > - INFLATING: We cannot access the header at all. However, that should not happen: a thread can only inflate once it has successfully evacuated an object, and > - Stack-locked: the other thread may succeed evacuation and continue to unlock the object, at which point the stack-pointer would be 'dangling' and we may over-write random information on the foreign thread stack. > - Monitor-locked: even though monitor deflation is coordinated (see above), updating the displaced header in a monitor might over-write whatever a competi... Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: Fix handling of age sentinel ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/343/files - new: https://git.openjdk.org/shenandoah/pull/343/files/969e4650..3c982491 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=343&range=05 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=343&range=04-05 Stats: 4 lines in 2 files changed: 2 ins; 1 del; 1 mod Patch: https://git.openjdk.org/shenandoah/pull/343.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/343/head:pull/343 PR: https://git.openjdk.org/shenandoah/pull/343 From cslucas at openjdk.org Mon Nov 6 22:22:30 2023 From: cslucas at openjdk.org (Cesar Soares Lucas) Date: Mon, 6 Nov 2023 22:22:30 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission In-Reply-To: References:

Message-ID: <_1LI3cbeE9XR-7E7gD22M7xNueLRhlAtHWopiTccb1Y=.a26dde00-d6b8-4c98-a84d-143d65ef251c@github.com> On Fri, 3 Nov 2023 02:04:13 GMT, Dean Long wrote: >> src/hotspot/cpu/x86/gc/z/z_x86_64.ad line 37: >> >>> 35: #include "gc/z/zBarrierSetAssembler.hpp" >>> 36: >>> 37: static void z_color(MacroAssembler* masm, const MachNode* node, Register ref) { >> >> For files already using MacroAssembler& _masm, the only change needed is this at the top: >> >> undef __ >> #define __ _masm. > > I guess that doesn't work because different files are concatenated together, causing a conflict if some files expect MacroAssembler *masm. To reduce the number of changes, couldn't we use MacroAssembler& _masm everywhere? Because some places were using `if (cbuf)` I ended up opting to make the parameter also a pointer instead of a reference. > For files already using MacroAssembler& _masm, the only change needed is this at the top: My opinion would be to use Reference or Pointer everywhere and not mix the two - to prevent confusion. But if you folks think it's best to go that way, I'm fine with it. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16484#discussion_r1384095916 From wkemper at openjdk.org Mon Nov 6 23:11:51 2023 From: wkemper at openjdk.org (William Kemper) Date: Mon, 6 Nov 2023 23:11:51 GMT Subject: RFR: 8319342: GenShen: Reset the count of degenerated cycles in a row following Full GC In-Reply-To: References: Message-ID: On Fri, 3 Nov 2023 21:17:42 GMT, Kelvin Nilsen wrote: > After a Full GC completes, reset the count on consecutive degenerated cycles to zero. This prevents automatic triggering of back-to-back Full GC. Changes requested by wkemper (Committer). src/hotspot/share/gc/shenandoah/heuristics/shenandoahPassiveHeuristics.cpp line 49: > 47: bool ShenandoahPassiveHeuristics::should_degenerate_cycle() { > 48: // Always fail to Degenerated GC, if enabled > 49: return ShenandoahDegeneratedGC; Are we changing how the passive heuristic behaves now? With this change `ShenandoahCollectorPolicy::should_degenerate_cycle` would return `false` sometimes when the passive heuristic would have "degenerated". src/hotspot/share/gc/shenandoah/shenandoahCollectorPolicy.hpp line 64: > 62: ShenandoahTracer* _tracer; > 63: > 64: uint _degenerated_cycles_in_a_row; Nit, but we could call this `_consecutive_degenerated_gcs` for consistency with other fields in this class? (Also, possibly rename `_abbreviated_cycles` to `_abbreviated_gcs` for the same reason. ------------- PR Review: https://git.openjdk.org/shenandoah/pull/352#pullrequestreview-1716475112 PR Review Comment: https://git.openjdk.org/shenandoah/pull/352#discussion_r1384151138 PR Review Comment: https://git.openjdk.org/shenandoah/pull/352#discussion_r1384144872 From ysr at openjdk.org Tue Nov 7 00:29:25 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Tue, 7 Nov 2023 00:29:25 GMT Subject: RFR: 8319342: GenShen: Reset the count of degenerated cycles in a row following Full GC In-Reply-To: References:

Message-ID: <7V6Xb-M5kmJPyGNmcewQ0wrI8P1JrzSKnaWG63DXJmM=.cfbf51b4-dd3d-412a-b8fe-f9c39688a64e@github.com> On Mon, 6 Nov 2023 22:59:31 GMT, William Kemper wrote: >> After a Full GC completes, reset the count on consecutive degenerated cycles to zero. This prevents automatic triggering of back-to-back Full GC. > > src/hotspot/share/gc/shenandoah/heuristics/shenandoahPassiveHeuristics.cpp line 49: > >> 47: bool ShenandoahPassiveHeuristics::should_degenerate_cycle() { >> 48: // Always fail to Degenerated GC, if enabled >> 49: return ShenandoahDegeneratedGC; > > Are we changing how the passive heuristic behaves now? With this change `ShenandoahCollectorPolicy::should_degenerate_cycle` would return `false` sometimes when the passive heuristic would have "degenerated". I agee with William... You can leave all state (counters) and its manipulation entirely in the heuristics object, and let the policy object query the heuristics like you do today, and inform it when it does do a full gc, etc. so relevant state may be updated. Of course, the policy should not independently reach in and change the state maintained in the heuristics, just query that state, and inform it of events that might entail changes to that state. We can clean things up by asking what the role of heuristics is vs what the role of policy is. It seems as if policy is broad and heuristics is more specific within that broad policy. It is also possible to keep the state in the policy, be mutated only by policy in response to events, and have the heuristics be completely state-free, but call-back to policy to use any state it would use for its decisions. Clarifying this structure and their relationship, and sticking to a consistent (documented) protocol here would avoid confusion and errors. ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/352#discussion_r1384200774 From wkemper at openjdk.org Tue Nov 7 01:03:37 2023 From: wkemper at openjdk.org (William Kemper) Date: Tue, 7 Nov 2023 01:03:37 GMT Subject: RFR: Merge openjdk/jdk:master [v5] In-Reply-To: References: Message-ID: > Merges tag jdk-22+22 William Kemper has updated the pull request incrementally with one additional commit since the last revision: Increase no progress threshold for generation mode ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/351/files - new: https://git.openjdk.org/shenandoah/pull/351/files/8a52f169..14d3f617 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=351&range=04 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=351&range=03-04 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/shenandoah/pull/351.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/351/head:pull/351 PR: https://git.openjdk.org/shenandoah/pull/351 From tschatzl at openjdk.org Tue Nov 7 11:21:27 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 7 Nov 2023 11:21:27 GMT Subject: RFR: 8319439: Move BufferNode from PtrQueue files to new files In-Reply-To: References: Message-ID: On Sat, 4 Nov 2023 11:25:05 GMT, Kim Barrett wrote: > Please review this simple (trivial?) change that moves the BufferNode class > from share/gc/shared/ptrQueue.[ch]pp to new files > share/gc/shared/bufferNode.[ch]pp. The code (as opposed to the surrounding > boiler plate and #include's) was moved unchanged. > > This led to adding #include's of the new header file to a bunch of other files > (include what you use), in some cases eliminating the need to #include > ptrQueue.hpp. Also tidied up the #include's for ptrQueue.[ch]pp to take > advantage of the split. > > Also, the gtest gc/shared/test_ptrQueueBufferAllocator.cpp is renamed to > gc/shared/test_bufferNodeAllocator.cpp, with some adjustments to includes and > test naming. > > Testing: > mach5 tier1 lgtm ------------- Marked as reviewed by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16506#pullrequestreview-1717445517 From ysr at openjdk.org Tue Nov 7 19:10:33 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Tue, 7 Nov 2023 19:10:33 GMT Subject: RFR: Merge openjdk/jdk:master [v5] In-Reply-To: References:

Message-ID: On Tue, 7 Nov 2023 01:03:37 GMT, William Kemper wrote: >> Merges tag jdk-22+22 > > William Kemper has updated the pull request incrementally with one additional commit since the last revision: > > Increase no progress threshold for generation mode Marked as reviewed by ysr (Committer). ------------- PR Review: https://git.openjdk.org/shenandoah/pull/351#pullrequestreview-1718498719 From kdnilsen at openjdk.org Tue Nov 7 21:22:41 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 7 Nov 2023 21:22:41 GMT Subject: RFR: 8319342: GenShen: Reset the count of degenerated cycles in a row following Full GC In-Reply-To: References:

Message-ID: On Mon, 6 Nov 2023 22:49:57 GMT, William Kemper wrote: >> After a Full GC completes, reset the count on consecutive degenerated cycles to zero. This prevents automatic triggering of back-to-back Full GC. > > src/hotspot/share/gc/shenandoah/shenandoahCollectorPolicy.hpp line 64: > >> 62: ShenandoahTracer* _tracer; >> 63: >> 64: uint _degenerated_cycles_in_a_row; > > Nit, but we could call this `_consecutive_degenerated_gcs` for consistency with other fields in this class? (Also, possibly rename `_abbreviated_cycles` to `_abbreviated_gcs` for the same reason. Thanks. Making this change. ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/352#discussion_r1385601350 From kdnilsen at openjdk.org Tue Nov 7 21:42:27 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 7 Nov 2023 21:42:27 GMT Subject: RFR: 8319342: GenShen: Reset the count of degenerated cycles in a row following Full GC In-Reply-To: <7V6Xb-M5kmJPyGNmcewQ0wrI8P1JrzSKnaWG63DXJmM=.cfbf51b4-dd3d-412a-b8fe-f9c39688a64e@github.com> References:

<7V6Xb-M5kmJPyGNmcewQ0wrI8P1JrzSKnaWG63DXJmM=.cfbf51b4-dd3d-412a-b8fe-f9c39688a64e@github.com> Message-ID: On Tue, 7 Nov 2023 00:26:39 GMT, Y. Srinivas Ramakrishna wrote: >> src/hotspot/share/gc/shenandoah/heuristics/shenandoahPassiveHeuristics.cpp line 49: >> >>> 47: bool ShenandoahPassiveHeuristics::should_degenerate_cycle() { >>> 48: // Always fail to Degenerated GC, if enabled >>> 49: return ShenandoahDegeneratedGC; >> >> Are we changing how the passive heuristic behaves now? With this change `ShenandoahCollectorPolicy::should_degenerate_cycle` would return `false` sometimes when the passive heuristic would have "degenerated". > > I agee with William... You can leave all state (counters) and its manipulation entirely in the heuristics object, and let the policy object query the heuristics like you do today, and inform it when it does do a full gc, etc. so relevant state may be updated. > > Of course, the policy should not independently reach in and change the state maintained in the heuristics, just query that state, and inform it of events that might entail changes to that state. > > We can clean things up by asking what the role of heuristics is vs what the role of policy is. It seems as if policy is broad and heuristics is more specific within that broad policy. It is also possible to keep the state in the policy, be mutated only by policy in response to events, and have the heuristics be completely state-free, but call-back to policy to use any state it would use for its decisions. Clarifying this structure and their relationship, and sticking to a consistent (documented) protocol here would avoid confusion and errors. Good catch. That was an unintended consequence. I will return the methods to the Heuristics hierarchy, but will query the consecutive_degenerated_gcs value from collector policy. ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/352#discussion_r1385618089 From kdnilsen at openjdk.org Tue Nov 7 21:58:07 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 7 Nov 2023 21:58:07 GMT Subject: RFR: 8319342: GenShen: Reset the count of degenerated cycles in a row following Full GC [v2] In-Reply-To: References: Message-ID: <0yq9fTGZoWjjhbleGS8GRDOZ3kAIp92nZRet9gX7plM=.7247cee6-db8c-4355-8a11-257af3d06b35@github.com> > After a Full GC completes, reset the count on consecutive degenerated cycles to zero. This prevents automatic triggering of back-to-back Full GC. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Respond to reviewer feedback Restore the original ShenandoahHeuristics::should_degenerate_cycle() so that ShenandoahPassiveHeuristics can differentate from other implementations. Improve instance variable names. ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/352/files - new: https://git.openjdk.org/shenandoah/pull/352/files/3b3a1672..8bd0b992 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=352&range=01 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=352&range=00-01 Stats: 39 lines in 7 files changed: 19 ins; 10 del; 10 mod Patch: https://git.openjdk.org/shenandoah/pull/352.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/352/head:pull/352 PR: https://git.openjdk.org/shenandoah/pull/352 From kdnilsen at openjdk.org Tue Nov 7 22:02:51 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 7 Nov 2023 22:02:51 GMT Subject: RFR: 8319342: GenShen: Reset the count of degenerated cycles in a row following Full GC [v3] In-Reply-To: References: Message-ID: > After a Full GC completes, reset the count on consecutive degenerated cycles to zero. This prevents automatic triggering of back-to-back Full GC. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Fix typo ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/352/files - new: https://git.openjdk.org/shenandoah/pull/352/files/8bd0b992..b18f7041 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=352&range=02 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=352&range=01-02 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/shenandoah/pull/352.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/352/head:pull/352 PR: https://git.openjdk.org/shenandoah/pull/352 From wkemper at openjdk.org Tue Nov 7 22:24:53 2023 From: wkemper at openjdk.org (William Kemper) Date: Tue, 7 Nov 2023 22:24:53 GMT Subject: Integrated: Merge openjdk/jdk:master In-Reply-To: References: Message-ID: On Thu, 2 Nov 2023 14:10:06 GMT, William Kemper wrote: > Merges tag jdk-22+22 This pull request has now been integrated. Changeset: 20172ba8 Author: William Kemper URL: https://git.openjdk.org/shenandoah/commit/20172ba83a5d0348dba93bfbfce5bfd84d088258 Stats: 27457 lines in 1169 files changed: 14955 ins; 4156 del; 8346 mod Merge Reviewed-by: ysr ------------- PR: https://git.openjdk.org/shenandoah/pull/351 From wkemper at openjdk.org Wed Nov 8 01:10:45 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 8 Nov 2023 01:10:45 GMT Subject: RFR: 8319671 GenShen: Old marking may encounter invalid pointers in SATB buffers Message-ID: Weak handles which are null'd out after final marking of young, but during marking of old, will have their original referent placed into a SATB buffer. These entries need to be purged before old marking resumes. In the case of abbreviated young cycles, these entries would not be purged. ------------- Commit messages: - Purge SATB buffers if necessary at the end of an abbreviated cycle Changes: https://git.openjdk.org/shenandoah/pull/353/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=353&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8319671 Stats: 7 lines in 1 file changed: 6 ins; 0 del; 1 mod Patch: https://git.openjdk.org/shenandoah/pull/353.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/353/head:pull/353 PR: https://git.openjdk.org/shenandoah/pull/353 From amitkumar at openjdk.org Wed Nov 8 04:36:53 2023 From: amitkumar at openjdk.org (Amit Kumar) Date: Wed, 8 Nov 2023 04:36:53 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission In-Reply-To: References: Message-ID: On Thu, 2 Nov 2023 22:17:43 GMT, Cesar Soares Lucas wrote: > # Description > > Please review this PR with a patch to re-use the same C2_MacroAssembler object to emit all instructions in the same compilation unit. > > Overall, the change is pretty simple. However, due to the renaming of the variable to access C2_MacroAssembler, from `_masm.` to `masm->`, and also some method prototype changes, the patch became quite large. > > # Help Needed for Testing > > I don't have access to all platforms necessary to test this. I hope some other folks can help with testing on `S390`, `RISC-V` and `PPC`. `s390x` also run into assert failure: `assert(masm->inst_mark() == nullptr) failed: should be.` V [libjvm.so+0xfb0938] PhaseOutput::fill_buffer(C2_MacroAssembler*, unsigned int*)+0x2370 (output.cpp:1812) V [libjvm.so+0xfb21ce] PhaseOutput::Output()+0xcae (output.cpp:362) V [libjvm.so+0x6a90a8] Compile::Code_Gen()+0x460 (compile.cpp:2989) V [libjvm.so+0x6ad848] Compile::Compile(ciEnv*, ciMethod*, int, Options, DirectiveSet*)+0x1738 (compile.cpp:887) V [libjvm.so+0x4fb932] C2Compiler::compile_method(ciEnv*, ciMethod*, int, bool, DirectiveSet*)+0x14a (c2compiler.cpp:119) V [libjvm.so+0x6b81a2] CompileBroker::invoke_compiler_on_method(CompileTask*)+0xd9a (compileBroker.cpp:2282) V [libjvm.so+0x6b8eaa] CompileBroker::compiler_thread_loop()+0x5a2 (compileBroker.cpp:1943) ------------- PR Comment: https://git.openjdk.org/jdk/pull/16484#issuecomment-1801070622 From ysr at openjdk.org Wed Nov 8 05:40:43 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 8 Nov 2023 05:40:43 GMT Subject: RFR: 8319671 GenShen: Old marking may encounter invalid pointers in SATB buffers In-Reply-To: References: Message-ID: On Wed, 8 Nov 2023 01:05:24 GMT, William Kemper wrote: > Weak handles which are null'd out after final marking of young, but during marking of old, will have their original referent placed into a SATB buffer. These entries need to be purged before old marking resumes. In the case of abbreviated young cycles, these entries would not be purged. This seems safe, but we should separately explore as a follow-up a more discriminatory SATB barrier for old vs young marking so a to avoid these wrinkles and the extra work. ------------- Marked as reviewed by ysr (Committer). PR Review: https://git.openjdk.org/shenandoah/pull/353#pullrequestreview-1719406785 From ysr at openjdk.org Wed Nov 8 09:19:40 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 8 Nov 2023 09:19:40 GMT Subject: RFR: 8318462: [GenShen] Prevent unsafe access to displaced mark-word [v6] In-Reply-To: References:

Message-ID: On Thu, 2 Nov 2023 23:21:23 GMT, Martin Doerr wrote: >> # Description >> >> Please review this PR with a patch to re-use the same C2_MacroAssembler object to emit all instructions in the same compilation unit. >> >> Overall, the change is pretty simple. However, due to the renaming of the variable to access C2_MacroAssembler, from `_masm.` to `masm->`, and also some method prototype changes, the patch became quite large. >> >> # Help Needed for Testing >> >> I don't have access to all platforms necessary to test this. I hope some other folks can help with testing on `S390`, `RISC-V` and `PPC`. > > PPC64 runs into assert(masm->inst_mark() == nullptr) failed: should be. > V [libjvm.so+0x1648528] PhaseOutput::fill_buffer(C2_MacroAssembler*, unsigned int*)+0x10c8 (output.cpp:1812) > V [libjvm.so+0x164b35c] PhaseOutput::Output()+0xd5c (output.cpp:362) > V [libjvm.so+0x958f9c] Compile::Code_Gen()+0x4ec (compile.cpp:2989) > V [libjvm.so+0x95e484] Compile::Compile(ciEnv*, ciMethod*, int, Options, DirectiveSet*)+0x1a84 (compile.cpp:887) > V [libjvm.so+0x718f58] C2Compiler::compile_method(ciEnv*, ciMethod*, int, bool, DirectiveSet*)+0x198 (c2compiler.cpp:119) Thank you for helping with test @TheRealMDoerr @offamitkumar @RealFYang . I working on an update and I'll push it today or soon after. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16484#issuecomment-1802277112 From wkemper at openjdk.org Wed Nov 8 17:54:37 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 8 Nov 2023 17:54:37 GMT Subject: Integrated: 8319671 GenShen: Old marking may encounter invalid pointers in SATB buffers In-Reply-To: References: Message-ID: On Wed, 8 Nov 2023 01:05:24 GMT, William Kemper wrote: > Weak handles which are null'd out after final marking of young, but during marking of old, will have their original referent placed into a SATB buffer. These entries need to be purged before old marking resumes. In the case of abbreviated young cycles, these entries would not be purged. This pull request has now been integrated. Changeset: a71c7587 Author: William Kemper URL: https://git.openjdk.org/shenandoah/commit/a71c7587e4ddcc0b63d3e3acbc468dd9f19e0b60 Stats: 7 lines in 1 file changed: 6 ins; 0 del; 1 mod 8319671: GenShen: Old marking may encounter invalid pointers in SATB buffers Reviewed-by: ysr ------------- PR: https://git.openjdk.org/shenandoah/pull/353 From rkennke at openjdk.org Thu Nov 9 13:25:40 2023 From: rkennke at openjdk.org (Roman Kennke) Date: Thu, 9 Nov 2023 13:25:40 GMT Subject: RFR: 8318462: [GenShen] Prevent unsafe access to displaced mark-word [v6] In-Reply-To: References:

Message-ID: On Mon, 6 Nov 2023 12:38:46 GMT, Roman Kennke wrote: >> We must be very careful when we access (load or update) the object age concurrently because the age is stored in the object's header and that header can be 'displaced'. This means that the header is overloaded with a pointer (stack or ObjectMonitor*) and the original header is stored at that location. The header can also be in INFLATING state, which means that a stack-lock is currently in the process to be inflated to an ObjectMonitor. Let's consider the cases separately: >> >> Loading object age: >> - INFLATING: we can not access the header, and thus not the age. >> - Stack-locked: the thread which locks the object can unlock at any time, concurrently. It is not safe to access the header, and thus the age. >> - Monitor-locked: monitors don't go away without coordinating with Java threads and possibly GC threads. This coordination is done by handshaking the thread - a Java thread would be brought to its next safepoint, and GC threads, which are typically participating in handshakes by joining the SuspendibleThreadSet, can opt to handshake at a safe point by calling SuspendibleThreadSet::yield(). If 'our' thread participates in handshakes in one way or the other, it is safe to access an ObjectMonitor once we got a valid ObjectMonitor* from an object's header, but only until that thread handshakes. Long story short: it is typically safe to access ObjectMonitor :-) >> >> Updating object age: >> Updating object age only happens during evacuation, and only on a new copy of an object, before that copy has been published. Access to the object header is therefore fully thread-local and not problematic. What *is* problematic is when we have to deal with displaced headers, because displaced headers are *not* thread local, and must be considered a shared resource. A competing thread may succeed to evacuate the object and publish its copy before 'our' thread can do so. If 'our' thread would update the displaced header, it may over-write whatever the other thread has done. >> - INFLATING: We cannot access the header at all. However, that should not happen: a thread can only inflate once it has successfully evacuated an object, and >> - Stack-locked: the other thread may succeed evacuation and continue to unlock the object, at which point the stack-pointer would be 'dangling' and we may over-write random information on the foreign thread stack. >> - Monitor-locked: even though monitor deflation is coordinated (see above), updating the displaced header in a monitor might over-... > > Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: > > Fix handling of age sentinel Thanks! ------------- PR Comment: https://git.openjdk.org/shenandoah/pull/343#issuecomment-1803825424 From rkennke at openjdk.org Thu Nov 9 13:43:07 2023 From: rkennke at openjdk.org (Roman Kennke) Date: Thu, 9 Nov 2023 13:43:07 GMT Subject: Integrated: 8318462: [GenShen] Prevent unsafe access to displaced mark-word In-Reply-To: References: Message-ID: On Wed, 18 Oct 2023 17:23:15 GMT, Roman Kennke wrote: > We must be very careful when we access (load or update) the object age concurrently because the age is stored in the object's header and that header can be 'displaced'. This means that the header is overloaded with a pointer (stack or ObjectMonitor*) and the original header is stored at that location. The header can also be in INFLATING state, which means that a stack-lock is currently in the process to be inflated to an ObjectMonitor. Let's consider the cases separately: > > Loading object age: > - INFLATING: we can not access the header, and thus not the age. > - Stack-locked: the thread which locks the object can unlock at any time, concurrently. It is not safe to access the header, and thus the age. > - Monitor-locked: monitors don't go away without coordinating with Java threads and possibly GC threads. This coordination is done by handshaking the thread - a Java thread would be brought to its next safepoint, and GC threads, which are typically participating in handshakes by joining the SuspendibleThreadSet, can opt to handshake at a safe point by calling SuspendibleThreadSet::yield(). If 'our' thread participates in handshakes in one way or the other, it is safe to access an ObjectMonitor once we got a valid ObjectMonitor* from an object's header, but only until that thread handshakes. Long story short: it is typically safe to access ObjectMonitor :-) > > Updating object age: > Updating object age only happens during evacuation, and only on a new copy of an object, before that copy has been published. Access to the object header is therefore fully thread-local and not problematic. What *is* problematic is when we have to deal with displaced headers, because displaced headers are *not* thread local, and must be considered a shared resource. A competing thread may succeed to evacuate the object and publish its copy before 'our' thread can do so. If 'our' thread would update the displaced header, it may over-write whatever the other thread has done. > - INFLATING: We cannot access the header at all. However, that should not happen: a thread can only inflate once it has successfully evacuated an object, and > - Stack-locked: the other thread may succeed evacuation and continue to unlock the object, at which point the stack-pointer would be 'dangling' and we may over-write random information on the foreign thread stack. > - Monitor-locked: even though monitor deflation is coordinated (see above), updating the displaced header in a monitor might over-write whatever a competi... This pull request has now been integrated. Changeset: 48293049 Author: Roman Kennke URL: https://git.openjdk.org/shenandoah/commit/48293049b6f79fc197fdfc14b23253fa91a20344 Stats: 69 lines in 5 files changed: 28 ins; 26 del; 15 mod 8318462: [GenShen] Prevent unsafe access to displaced mark-word Reviewed-by: kdnilsen, wkemper, ysr ------------- PR: https://git.openjdk.org/shenandoah/pull/343 From wkemper at openjdk.org Thu Nov 9 14:15:37 2023 From: wkemper at openjdk.org (William Kemper) Date: Thu, 9 Nov 2023 14:15:37 GMT Subject: RFR: Merge openjdk/jdk:master Message-ID: Merges tag jdk-22+23 ------------- Commit messages: - 8319318: bufferedStream fixed case can be removed - 8319556: Harmonize interface formatting in the FFM API - 8319378: Spec for j.util.Timer::purge and j.util.Timer::cancel could be improved - 8315680: java/lang/ref/ReachabilityFenceTest.java should run with -Xbatch - 8305814: Update Xalan Java to 2.7.3 - 8319573: Change to Visual Studio 17.6.5 for building on Windows at Oracle - 8319338: tools/jpackage/share/RuntimeImageTest.java fails with -XX:+UseZGC - 8319436: Proxy.newProxyInstance throws NPE if loader is null and interface not visible from class loader - 8314891: Additional Zip64 extra header validation - 8318594: NMT: VM.native_memory crashes on assert if functionality isn't supported by OS - ... and 69 more: https://git.openjdk.org/shenandoah/compare/d354141a...8555e0f6 The merge commit only contains trivial merges, so no merge-specific webrevs have been generated. Changes: https://git.openjdk.org/shenandoah/pull/354/files Stats: 12726 lines in 343 files changed: 6049 ins; 3383 del; 3294 mod Patch: https://git.openjdk.org/shenandoah/pull/354.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/354/head:pull/354 PR: https://git.openjdk.org/shenandoah/pull/354 From wkemper at openjdk.org Thu Nov 9 18:50:05 2023 From: wkemper at openjdk.org (William Kemper) Date: Thu, 9 Nov 2023 18:50:05 GMT Subject: RFR: Merge openjdk/jdk:master [v2] In-Reply-To: References: Message-ID: > Merges tag jdk-22+23 William Kemper has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/354/files - new: https://git.openjdk.org/shenandoah/pull/354/files/8555e0f6..8555e0f6 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=354&range=01 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=354&range=00-01 Stats: 0 lines in 0 files changed: 0 ins; 0 del; 0 mod Patch: https://git.openjdk.org/shenandoah/pull/354.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/354/head:pull/354 PR: https://git.openjdk.org/shenandoah/pull/354 From wkemper at openjdk.org Thu Nov 9 18:50:07 2023 From: wkemper at openjdk.org (William Kemper) Date: Thu, 9 Nov 2023 18:50:07 GMT Subject: Integrated: Merge openjdk/jdk:master In-Reply-To: References: Message-ID: On Thu, 9 Nov 2023 14:09:32 GMT, William Kemper wrote: > Merges tag jdk-22+23 This pull request has now been integrated. Changeset: a9718c13 Author: William Kemper URL: https://git.openjdk.org/shenandoah/commit/a9718c135cde2e7402a4051340e5fa91f08313ee Stats: 12726 lines in 343 files changed: 6049 ins; 3383 del; 3294 mod Merge ------------- PR: https://git.openjdk.org/shenandoah/pull/354 From rkennke at openjdk.org Thu Nov 9 20:05:39 2023 From: rkennke at openjdk.org (Roman Kennke) Date: Thu, 9 Nov 2023 20:05:39 GMT Subject: RFR: 8305896: Alternative full GC forwarding [v55] In-Reply-To: References: Message-ID: > Currently, the full-GC modes of Serial, Shenandoah and G1 GCs are forwarding objects by over-writing the object header with the new object location. Unfortunately, for compact object headers ([JDK-8294992](https://bugs.openjdk.org/browse/JDK-8294992)) this would not work, because the crucial class information is also stored in the header, and we could no longer iterate over objects until the headers would be restored. Also, the preserved-headers tables would grow quite large. > > I propose to use an alternative algorithm for full-GC (sliding-GC) forwarding that uses a special encoding so that the forwarding information fits into the lowest 32 bits of the header. > > It exploits the insight that, with sliding GCs, objects from one region will only ever be forwarded to one of two possible target regions. For this to work, we need to divide the heap into equal-sized regions. This is already the case for Shenandoah and G1, and can easily be overlaid for Serial GC, by assuming either the whole heap as a single region (if it fits) or by using SpaceAlignment-sized virtual regions. > > We also build and maintain a table that has N elements, where N is the number of regions. Each entry is two addresses, which are the start-address of the possible target regions for each source region. > > With this, forwarding information would be encoded like this: > - Bits 0 and 1: same as before, we put in '11' to indicate that the object is forwarded. > - Bit 2: Used for 'fallback'-forwarding (see below) > - Bit 3: Selects the target region 0 or 1. Look up the base address in the table (see above) > - Bits 4..31 The number of heap words from the target base address > > This works well for all sliding GCs in Serial, G1 and Shenandoah. The exception is in G1, there is a special mode called 'serial compaction' which acts as a last-last-ditch effort to squeeze more space out of the heap by re-forwarding the tails of the compaction chains. Unfortunately, this breaks the assumption of the sliding-forwarding-table. When that happens, we initialize a fallback table, which is a simple open hash-table, and set the Bit 2 in the forwarding to indicate that we shall look up the forwardee in the fallback-table. > > All the table accesses can be done unsynchronized because: > - Serial GC is single-threaded anyway > - In G1 and Shenandoah, GC worker threads divide up the work such that each worker does disjoint sets of regions. > - G1 serial compaction is single-threaded > > The change introduces a new (experimental) flag -... Roman Kennke has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 122 commits: - Merge remote-tracking branch 'upstream/master' into JDK-8305896 - Fix build - Merge remote-tracking branch 'upstream/master' into JDK-8305896 - Merge remote-tracking branch 'upstream/master' into JDK-8305896 - Merge branch 'master' into JDK-8305896 - Various cleanups - Some @shipilev comments from downstream review - Further templatize Serial GC's adjust_pointers() - Merge branch 'master' into JDK-8305896 - Specialize full-GC loops to get UseAltGCForwarding flag check out of hot paths - ... and 112 more: https://git.openjdk.org/jdk/compare/38745eca...bdc8c823 ------------- Changes: https://git.openjdk.org/jdk/pull/13582/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=13582&range=54 Stats: 1203 lines in 39 files changed: 1079 ins; 10 del; 114 mod Patch: https://git.openjdk.org/jdk/pull/13582.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/13582/head:pull/13582 PR: https://git.openjdk.org/jdk/pull/13582 From wkemper at openjdk.org Thu Nov 9 21:40:53 2023 From: wkemper at openjdk.org (William Kemper) Date: Thu, 9 Nov 2023 21:40:53 GMT Subject: RFR: 8319867: GenShen: Make old regions parseable at end of concurrent cycles Message-ID: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> ### Background When Shenandoah is performing a concurrent mark of the old generation, it is not safe for the young generation's remembered set scan to use the old mark bitmap. For this reason, any old regions that were not included in a collection set (either mixed or a global collection) must be made _parseable_ for the remembered set scan (we call this "coalescing and filling" in the code). ### Description Before this change, Shenandoah has deferred making these regions parseable until it begins the young generation bootstrap cycle. This is fine, except that we have also recently made old generation collections subordinate to young collections. In other words, an old collection cannot start unless Shenandoah wants to first start a young collection. Unfortunately, the additional time to concurrently coalesce and fill old regions before resetting the old mark bitmaps is not accounted for in the heuristics and this increases the likelihood that the heuristic has started the cycles too late and that the cycle will degenerate into a stop the world pause. After this change, Shenandoah will make old regions parseable immediately following the last mixed collection (or following a global collection). At this point, Shenandoah has just finished reclaiming memory and we expect less urgency to coalesce and fill old regions. It also means that young bootstrap cycles will not take (much) longer than conventional young generation collections. ------------- Commit messages: - Fix up documentation - Do not change class unloading configuration for degenerated cycles - Merge remote-tracking branch 'shenandoah/master' into make-old-parseable-after-evac - Purge SATB buffers if necessary at the end of an abbreviated cycle - Log if remembered set scan is using mark bitmap or not - Don't allow generation to change after we've decided on class unloading - Do not attempt to unload classes when resuming old gc operations - WIP: Reset old at start of bootstrap cycle - WIP: Remove unused parameters - WIP: Fix simple issue - ... and 2 more: https://git.openjdk.org/shenandoah/compare/20172ba8...012068ac Changes: https://git.openjdk.org/shenandoah/pull/355/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=355&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8319867 Stats: 261 lines in 17 files changed: 78 ins; 93 del; 90 mod Patch: https://git.openjdk.org/shenandoah/pull/355.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/355/head:pull/355 PR: https://git.openjdk.org/shenandoah/pull/355 From kdnilsen at openjdk.org Thu Nov 9 22:04:35 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 9 Nov 2023 22:04:35 GMT Subject: RFR: 8319867: GenShen: Make old regions parseable at end of concurrent cycles In-Reply-To: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> References: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> Message-ID: On Thu, 9 Nov 2023 21:35:40 GMT, William Kemper wrote: > ### Background > When Shenandoah is performing a concurrent mark of the old generation, it is not safe for the young generation's remembered set scan to use the old mark bitmap. For this reason, any old regions that were not included in a collection set (either mixed or a global collection) must be made _parseable_ for the remembered set scan (we call this "coalescing and filling" in the code). > > ### Description > Before this change, Shenandoah has deferred making these regions parseable until it begins the young generation bootstrap cycle. This is fine, except that we have also recently made old generation collections subordinate to young collections. In other words, an old collection cannot start unless Shenandoah wants to first start a young collection. > > Unfortunately, the additional time to concurrently coalesce and fill old regions before resetting the old mark bitmaps is not accounted for in the heuristics and this increases the likelihood that the heuristic has started the cycles too late and that the cycle will degenerate into a stop the world pause. > > After this change, Shenandoah will make old regions parseable immediately following the last mixed collection (or following a global collection). At this point, Shenandoah has just finished reclaiming memory and we expect less urgency to coalesce and fill old regions. It also means that young bootstrap cycles will not take (much) longer than conventional young generation collections. Thank you. I believe this will improve latency for several of our workloads. ------------- Marked as reviewed by kdnilsen (Committer). PR Review: https://git.openjdk.org/shenandoah/pull/355#pullrequestreview-1723575156 From ysr at openjdk.org Sat Nov 11 00:22:33 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Sat, 11 Nov 2023 00:22:33 GMT Subject: RFR: 8319867: GenShen: Make old regions parseable at end of concurrent cycles In-Reply-To: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> References: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> Message-ID: On Thu, 9 Nov 2023 21:35:40 GMT, William Kemper wrote: > ### Background > When Shenandoah is performing a concurrent mark of the old generation, it is not safe for the young generation's remembered set scan to use the old mark bitmap. For this reason, any old regions that were not included in a collection set (either mixed or a global collection) must be made _parseable_ for the remembered set scan (we call this "coalescing and filling" in the code). > > ### Description > Before this change, Shenandoah has deferred making these regions parseable until it begins the young generation bootstrap cycle. This is fine, except that we have also recently made old generation collections subordinate to young collections. In other words, an old collection cannot start unless Shenandoah wants to first start a young collection. > > Unfortunately, the additional time to concurrently coalesce and fill old regions before resetting the old mark bitmaps is not accounted for in the heuristics and this increases the likelihood that the heuristic has started the cycles too late and that the cycle will degenerate into a stop the world pause. > > After this change, Shenandoah will make old regions parseable immediately following the last mixed collection (or following a global collection). At this point, Shenandoah has just finished reclaiming memory and we expect less urgency to coalesce and fill old regions. It also means that young bootstrap cycles will not take (much) longer than conventional young generation collections. Some remarks, not all of them need to be immediately actioned (if the remarks are correct). Was there a test case that revealed this? If so, can we add that to the JBS ticket and mentioned as part of the testing for the PR? Marked as reviewed by ysr (Committer). src/hotspot/share/gc/shenandoah/shenandoahMmuTracker.hpp line 93: > 91: // both record_full() and record_degenerated() with the same value of gcid. record_full() is called first and the log > 92: // reports such a cycle as a FULL cycle. > 93: void record_young(ShenandoahGeneration* generation, size_t gcid); Thanks for cleaning up the unused ShenandoahGeneration* and other unused parms. src/hotspot/share/gc/shenandoah/shenandoahOldGeneration.cpp line 259: > 257: } > 258: > 259: // Make the old generation regions parseable, so they can be safely Nothing to do now, but my ex-colleague Peter Kessler told me many years ago that the correct spelling is "parsable" (without the "e" between the s and the a :-) One can do a cleanup of spelling in a separate PR in the fullness of time. https://word.tips/spelling/parsable-vs-parseable/ src/hotspot/share/gc/shenandoah/shenandoahOldGeneration.cpp line 428: > 426: switch (new_state) { > 427: case FILLING: > 428: assert(heap->is_old_bitmap_stable(), "Cannot begin filling without first completing marking, state is '%s'", state_name(_state)); Do you also want to say: assert(_state > FILLING && _state != WAITING_FOR_BOOTSTRAP, "Illegal state transition"): src/hotspot/share/gc/shenandoah/shenandoahOldGeneration.cpp line 432: > 430: break; > 431: case WAITING_FOR_BOOTSTRAP: > 432: // GC cancellation can send us back to IDLE from any state. Since there isn't now a state named "IDLE", it might be less confusing to just say "FILLING" or "WAITING_FOR_BOOTSTRAP"? Should we assert here that there are no `coalesce_and_fill_candidates` or is that implied by perhaps stronger checks below? (Wouldn't hurt even if so.) src/hotspot/share/gc/shenandoah/shenandoahOldGeneration.cpp line 448: > 446: break; > 447: case EVACUATING: > 448: assert(_state == WAITING_FOR_BOOTSTRAP || _state == MARKING, "Cannot have old collection candidates without first marking, state is '%s'", state_name(_state)); FWIW, the ASCII art state-transition-diagram doesn't show the transition from WAITING_FOR_BOOTSTRAP directly into EVACUATING, without passing through an intermediate MARKING. ------------- PR Review: https://git.openjdk.org/shenandoah/pull/355#pullrequestreview-1725807183 PR Review: https://git.openjdk.org/shenandoah/pull/355#pullrequestreview-1725836333 PR Review Comment: https://git.openjdk.org/shenandoah/pull/355#discussion_r1390033021 PR Review Comment: https://git.openjdk.org/shenandoah/pull/355#discussion_r1390037239 PR Review Comment: https://git.openjdk.org/shenandoah/pull/355#discussion_r1390050334 PR Review Comment: https://git.openjdk.org/shenandoah/pull/355#discussion_r1390047203 PR Review Comment: https://git.openjdk.org/shenandoah/pull/355#discussion_r1390053016 From ysr at openjdk.org Sat Nov 11 00:22:33 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Sat, 11 Nov 2023 00:22:33 GMT Subject: RFR: 8319867: GenShen: Make old regions parseable at end of concurrent cycles In-Reply-To: References: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> Message-ID: On Sat, 11 Nov 2023 00:02:38 GMT, Y. Srinivas Ramakrishna wrote: >> ### Background >> When Shenandoah is performing a concurrent mark of the old generation, it is not safe for the young generation's remembered set scan to use the old mark bitmap. For this reason, any old regions that were not included in a collection set (either mixed or a global collection) must be made _parseable_ for the remembered set scan (we call this "coalescing and filling" in the code). >> >> ### Description >> Before this change, Shenandoah has deferred making these regions parseable until it begins the young generation bootstrap cycle. This is fine, except that we have also recently made old generation collections subordinate to young collections. In other words, an old collection cannot start unless Shenandoah wants to first start a young collection. >> >> Unfortunately, the additional time to concurrently coalesce and fill old regions before resetting the old mark bitmaps is not accounted for in the heuristics and this increases the likelihood that the heuristic has started the cycles too late and that the cycle will degenerate into a stop the world pause. >> >> After this change, Shenandoah will make old regions parseable immediately following the last mixed collection (or following a global collection). At this point, Shenandoah has just finished reclaiming memory and we expect less urgency to coalesce and fill old regions. It also means that young bootstrap cycles will not take (much) longer than conventional young generation collections. > > src/hotspot/share/gc/shenandoah/shenandoahOldGeneration.cpp line 432: > >> 430: break; >> 431: case WAITING_FOR_BOOTSTRAP: >> 432: // GC cancellation can send us back to IDLE from any state. > > Since there isn't now a state named "IDLE", it might be less confusing to just say "FILLING" or "WAITING_FOR_BOOTSTRAP"? Should we assert here that there are no `coalesce_and_fill_candidates` or is that implied by perhaps stronger checks below? (Wouldn't hurt even if so.) assert(_state != BOOTSTRAP && _state != WAITING_FOR_BOOTSTRAP, "Illegal transition"); ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/355#discussion_r1390050871 From ysr at openjdk.org Sat Nov 11 00:22:33 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Sat, 11 Nov 2023 00:22:33 GMT Subject: RFR: 8319867: GenShen: Make old regions parseable at end of concurrent cycles In-Reply-To: References: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com>

Message-ID: <7qBpCiLCAU6ecsabjcPsvjYH7YxVbnv17ohBd17BtO8=.f2c49bfd-e161-491e-a7c4-a96cb5abdb0a@github.com> On Sat, 11 Nov 2023 00:07:26 GMT, Y. Srinivas Ramakrishna wrote: >> src/hotspot/share/gc/shenandoah/shenandoahOldGeneration.cpp line 432: >> >>> 430: break; >>> 431: case WAITING_FOR_BOOTSTRAP: >>> 432: // GC cancellation can send us back to IDLE from any state. >> >> Since there isn't now a state named "IDLE", it might be less confusing to just say "FILLING" or "WAITING_FOR_BOOTSTRAP"? Should we assert here that there are no `coalesce_and_fill_candidates` or is that implied by perhaps stronger checks below? (Wouldn't hurt even if so.) > > assert(_state != BOOTSTRAP && _state != WAITING_FOR_BOOTSTRAP, "Illegal transition"); I am thinking that the most parsimonious way of covering all bases is by associating a pre-condition with entering each state and a post-condition with exiting each state, and verifying those conditions in the transition validation methods. But may be that can be done as a separate PR if it makes sense. ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/355#discussion_r1390052540 From wkemper at openjdk.org Sat Nov 11 00:36:30 2023 From: wkemper at openjdk.org (William Kemper) Date: Sat, 11 Nov 2023 00:36:30 GMT Subject: RFR: 8319867: GenShen: Make old regions parseable at end of concurrent cycles In-Reply-To: References: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> Message-ID: <-8QhfBfs3uf6x7Dlv_IuAvFycOx4BAv1mmeD2gO5jVk=.1712f20a-bd56-4f9e-a7f9-52a12770fea6@github.com> On Fri, 10 Nov 2023 23:47:23 GMT, Y. Srinivas Ramakrishna wrote: >> ### Background >> When Shenandoah is performing a concurrent mark of the old generation, it is not safe for the young generation's remembered set scan to use the old mark bitmap. For this reason, any old regions that were not included in a collection set (either mixed or a global collection) must be made _parseable_ for the remembered set scan (we call this "coalescing and filling" in the code). >> >> ### Description >> Before this change, Shenandoah has deferred making these regions parseable until it begins the young generation bootstrap cycle. This is fine, except that we have also recently made old generation collections subordinate to young collections. In other words, an old collection cannot start unless Shenandoah wants to first start a young collection. >> >> Unfortunately, the additional time to concurrently coalesce and fill old regions before resetting the old mark bitmaps is not accounted for in the heuristics and this increases the likelihood that the heuristic has started the cycles too late and that the cycle will degenerate into a stop the world pause. >> >> After this change, Shenandoah will make old regions parseable immediately following the last mixed collection (or following a global collection). At this point, Shenandoah has just finished reclaiming memory and we expect less urgency to coalesce and fill old regions. It also means that young bootstrap cycles will not take (much) longer than conventional young generation collections. > > src/hotspot/share/gc/shenandoah/shenandoahOldGeneration.cpp line 259: > >> 257: } >> 258: >> 259: // Make the old generation regions parseable, so they can be safely > > Nothing to do now, but my ex-colleague Peter Kessler told me many years ago that the correct spelling is "parsable" (without the "e" between the s and the a :-) One can do a cleanup of spelling in a separate PR in the fullness of time. > > https://word.tips/spelling/parsable-vs-parseable/ [0] % grep -irI parsable src/ | wc -l 352 [0] % grep -irI parseable src/ | wc -l 84 Indeed. I'll fix up all the misspellings in `gc/shenandoah` in a separate PR. Thanks for the tip! ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/355#discussion_r1390061992 From wkemper at openjdk.org Sat Nov 11 00:47:32 2023 From: wkemper at openjdk.org (William Kemper) Date: Sat, 11 Nov 2023 00:47:32 GMT Subject: RFR: 8319867: GenShen: Make old regions parseable at end of concurrent cycles In-Reply-To: References: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> Message-ID: On Sat, 11 Nov 2023 00:05:24 GMT, Y. Srinivas Ramakrishna wrote: >> ### Background >> When Shenandoah is performing a concurrent mark of the old generation, it is not safe for the young generation's remembered set scan to use the old mark bitmap. For this reason, any old regions that were not included in a collection set (either mixed or a global collection) must be made _parseable_ for the remembered set scan (we call this "coalescing and filling" in the code). >> >> ### Description >> Before this change, Shenandoah has deferred making these regions parseable until it begins the young generation bootstrap cycle. This is fine, except that we have also recently made old generation collections subordinate to young collections. In other words, an old collection cannot start unless Shenandoah wants to first start a young collection. >> >> Unfortunately, the additional time to concurrently coalesce and fill old regions before resetting the old mark bitmaps is not accounted for in the heuristics and this increases the likelihood that the heuristic has started the cycles too late and that the cycle will degenerate into a stop the world pause. >> >> After this change, Shenandoah will make old regions parseable immediately following the last mixed collection (or following a global collection). At this point, Shenandoah has just finished reclaiming memory and we expect less urgency to coalesce and fill old regions. It also means that young bootstrap cycles will not take (much) longer than conventional young generation collections. > > src/hotspot/share/gc/shenandoah/shenandoahOldGeneration.cpp line 428: > >> 426: switch (new_state) { >> 427: case FILLING: >> 428: assert(heap->is_old_bitmap_stable(), "Cannot begin filling without first completing marking, state is '%s'", state_name(_state)); > > Do you also want to say: > > > assert(_state > FILLING && _state != WAITING_FOR_BOOTSTRAP, "Illegal state transition"): No - at the end of a _GLOBAL_ collection, it may put the old generation in this state without progressing through the other states. > src/hotspot/share/gc/shenandoah/shenandoahOldGeneration.cpp line 448: > >> 446: break; >> 447: case EVACUATING: >> 448: assert(_state == WAITING_FOR_BOOTSTRAP || _state == MARKING, "Cannot have old collection candidates without first marking, state is '%s'", state_name(_state)); > > FWIW, the ASCII art state-transition-diagram doesn't show the transition from WAITING_FOR_BOOTSTRAP directly into EVACUATING, without passing through an intermediate MARKING. Yes, that can only happen at the end of a _GLOBAL_ collection. This is explained in the comment above the diagram. I will add it to the diagram. ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/355#discussion_r1390064709 PR Review Comment: https://git.openjdk.org/shenandoah/pull/355#discussion_r1390066019 From wkemper at openjdk.org Sat Nov 11 00:47:32 2023 From: wkemper at openjdk.org (William Kemper) Date: Sat, 11 Nov 2023 00:47:32 GMT Subject: RFR: 8319867: GenShen: Make old regions parseable at end of concurrent cycles In-Reply-To: <7qBpCiLCAU6ecsabjcPsvjYH7YxVbnv17ohBd17BtO8=.f2c49bfd-e161-491e-a7c4-a96cb5abdb0a@github.com> References: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com>

<7qBpCiLCAU6ecsabjcPsvjYH7YxVbnv17ohBd17BtO8=.f2c49bfd-e161-491e-a7c4-a96cb5abdb0a@github.com> Message-ID: On Sat, 11 Nov 2023 00:11:32 GMT, Y. Srinivas Ramakrishna wrote: >> assert(_state != BOOTSTRAP && _state != WAITING_FOR_BOOTSTRAP, "Illegal transition"); > > I am thinking that the most parsimonious way of covering all bases is by associating a pre-condition with entering each state and a post-condition with exiting each state, and verifying those conditions in the transition validation methods. But may be that can be done as a separate PR if it makes sense. Hmm - I thought that's what this code is doing. Or rather, it is asserting what conditions must hold to _be_ in this state (checking pre/post conditions simultaneously). ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/355#discussion_r1390064292 From wkemper at openjdk.org Sat Nov 11 00:56:02 2023 From: wkemper at openjdk.org (William Kemper) Date: Sat, 11 Nov 2023 00:56:02 GMT Subject: RFR: 8319867: GenShen: Make old regions parseable at end of concurrent cycles [v2] In-Reply-To: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> References: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> Message-ID: <94gpkJJtcuzv66G823Fdv3fl_0AVVXHqDt5-7umYq-E=.f49bfd59-8375-475f-a255-d6206a42e516@github.com> > ### Background > When Shenandoah is performing a concurrent mark of the old generation, it is not safe for the young generation's remembered set scan to use the old mark bitmap. For this reason, any old regions that were not included in a collection set (either mixed or a global collection) must be made _parseable_ for the remembered set scan (we call this "coalescing and filling" in the code). > > ### Description > Before this change, Shenandoah has deferred making these regions parseable until it begins the young generation bootstrap cycle. This is fine, except that we have also recently made old generation collections subordinate to young collections. In other words, an old collection cannot start unless Shenandoah wants to first start a young collection. > > Unfortunately, the additional time to concurrently coalesce and fill old regions before resetting the old mark bitmaps is not accounted for in the heuristics and this increases the likelihood that the heuristic has started the cycles too late and that the cycle will degenerate into a stop the world pause. > > After this change, Shenandoah will make old regions parseable immediately following the last mixed collection (or following a global collection). At this point, Shenandoah has just finished reclaiming memory and we expect less urgency to coalesce and fill old regions. It also means that young bootstrap cycles will not take (much) longer than conventional young generation collections. William Kemper has updated the pull request incrementally with one additional commit since the last revision: Update ascii diagram, fix spelling errors ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/355/files - new: https://git.openjdk.org/shenandoah/pull/355/files/012068ac..65bec3b2 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=355&range=01 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=355&range=00-01 Stats: 38 lines in 1 file changed: 1 ins; 0 del; 37 mod Patch: https://git.openjdk.org/shenandoah/pull/355.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/355/head:pull/355 PR: https://git.openjdk.org/shenandoah/pull/355 From wkemper at openjdk.org Sat Nov 11 00:56:02 2023 From: wkemper at openjdk.org (William Kemper) Date: Sat, 11 Nov 2023 00:56:02 GMT Subject: RFR: 8319867: GenShen: Make old regions parseable at end of concurrent cycles In-Reply-To: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> References: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> Message-ID: On Thu, 9 Nov 2023 21:35:40 GMT, William Kemper wrote: > ### Background > When Shenandoah is performing a concurrent mark of the old generation, it is not safe for the young generation's remembered set scan to use the old mark bitmap. For this reason, any old regions that were not included in a collection set (either mixed or a global collection) must be made _parseable_ for the remembered set scan (we call this "coalescing and filling" in the code). > > ### Description > Before this change, Shenandoah has deferred making these regions parseable until it begins the young generation bootstrap cycle. This is fine, except that we have also recently made old generation collections subordinate to young collections. In other words, an old collection cannot start unless Shenandoah wants to first start a young collection. > > Unfortunately, the additional time to concurrently coalesce and fill old regions before resetting the old mark bitmaps is not accounted for in the heuristics and this increases the likelihood that the heuristic has started the cycles too late and that the cycle will degenerate into a stop the world pause. > > After this change, Shenandoah will make old regions parseable immediately following the last mixed collection (or following a global collection). At this point, Shenandoah has just finished reclaiming memory and we expect less urgency to coalesce and fill old regions. It also means that young bootstrap cycles will not take (much) longer than conventional young generation collections. We don't have a test that shows this, just thought experiments. ------------- PR Comment: https://git.openjdk.org/shenandoah/pull/355#issuecomment-1806606414 From dholmes at openjdk.org Mon Nov 13 04:44:09 2023 From: dholmes at openjdk.org (David Holmes) Date: Mon, 13 Nov 2023 04:44:09 GMT Subject: RFR: 8318776: Require supports_cx8 to always be true Message-ID: As discussed in JBS all platforms (some tweaks to Zero are in progress) actually do support `cx8` i.e. 64-bit compare-and-exchange, so we can strip out the locked-based alternatives to using it and just add a guarantee that it is true at runtime. And all platforms except some ARM variants set `SUPPORTS_NATIVE_CX8`, so we can greatly simplify things. Summary of changes: - `_supports_cx8` field is only needed when `SUPPORTS_NATIVE_CX8` is not defined - Assertions for `supports_cx8()` are removed - Access backend is greatly simplified without the need for lock-based alternative - `java.util.concurrent.AtomicLongFieldUpdater` is simplified without the need for a lock-based alternative I did consider moving all the ARM `kuser_helper` related code to be only defined when `SUPPORTS_NATIVE_CX8` is not defined, but there was a theoretical risk this could change the behaviour if ARMv7 binaries were run on other ARM CPU's. I added a note to that effect in vm_version_linux_arm32.cpp so the ARM port maintainers could clean this up further if desired. Testing: - All Oracle tiers 1-5 builds (which includes an ARMv7 build) - GHA builds/tests - Oracle tiers 1-3 sanity testing Zero changes coming in via [JDK-8319777](https://bugs.openjdk.org/browse/JDK-8319777) will be merged when they arrive. Thanks. ------------- Commit messages: - 8318776: Require supports_cx8 to always be true Changes: https://git.openjdk.org/jdk/pull/16625/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16625&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8318776 Stats: 386 lines in 35 files changed: 14 ins; 359 del; 13 mod Patch: https://git.openjdk.org/jdk/pull/16625.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16625/head:pull/16625 PR: https://git.openjdk.org/jdk/pull/16625 From dholmes at openjdk.org Mon Nov 13 04:50:12 2023 From: dholmes at openjdk.org (David Holmes) Date: Mon, 13 Nov 2023 04:50:12 GMT Subject: RFR: 8318776: Require supports_cx8 to always be true [v2] In-Reply-To: References: Message-ID: > As discussed in JBS all platforms (some tweaks to Zero are in progress) actually do support `cx8` i.e. 64-bit compare-and-exchange, so we can strip out the locked-based alternatives to using it and just add a guarantee that it is true at runtime. And all platforms except some ARM variants set `SUPPORTS_NATIVE_CX8`, so we can greatly simplify things. Summary of changes: > - `_supports_cx8` field is only needed when `SUPPORTS_NATIVE_CX8` is not defined > - Assertions for `supports_cx8()` are removed > - Compiler predicates requiring `supports_cx8()` are removed > - Access backend is greatly simplified without the need for lock-based alternative > - `java.util.concurrent.AtomicLongFieldUpdater` is simplified without the need for a lock-based alternative > > I did consider moving all the ARM `kuser_helper` related code to be only defined when `SUPPORTS_NATIVE_CX8` is not defined, but there was a theoretical risk this could change the behaviour if ARMv7 binaries were run on other ARM CPU's. I added a note to that effect in vm_version_linux_arm32.cpp so the ARM port maintainers could clean this up further if desired. > > Testing: > - All Oracle tiers 1-5 builds (which includes an ARMv7 build) > - GHA builds/tests > - Oracle tiers 1-3 sanity testing > > Zero changes coming in via [JDK-8319777](https://bugs.openjdk.org/browse/JDK-8319777) will be merged when they arrive. > > Thanks. David Holmes has updated the pull request incrementally with one additional commit since the last revision: Remove test for VMSupportsCX8 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/16625/files - new: https://git.openjdk.org/jdk/pull/16625/files/3f2ec66f..b6dea4b6 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=16625&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=16625&range=00-01 Stats: 53 lines in 1 file changed: 0 ins; 53 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/16625.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16625/head:pull/16625 PR: https://git.openjdk.org/jdk/pull/16625 From dholmes at openjdk.org Mon Nov 13 05:10:17 2023 From: dholmes at openjdk.org (David Holmes) Date: Mon, 13 Nov 2023 05:10:17 GMT Subject: RFR: 8318776: Require supports_cx8 to always be true [v3] In-Reply-To: References: Message-ID: > As discussed in JBS all platforms (some tweaks to Zero are in progress) actually do support `cx8` i.e. 64-bit compare-and-exchange, so we can strip out the locked-based alternatives to using it and just add a guarantee that it is true at runtime. And all platforms except some ARM variants set `SUPPORTS_NATIVE_CX8`, so we can greatly simplify things. Summary of changes: > - `_supports_cx8` field is only needed when `SUPPORTS_NATIVE_CX8` is not defined > - Assertions for `supports_cx8()` are removed > - Compiler predicates requiring `supports_cx8()` are removed > - Access backend is greatly simplified without the need for lock-based alternative > - `java.util.concurrent.AtomicLongFieldUpdater` is simplified without the need for a lock-based alternative > > I did consider moving all the ARM `kuser_helper` related code to be only defined when `SUPPORTS_NATIVE_CX8` is not defined, but there was a theoretical risk this could change the behaviour if ARMv7 binaries were run on other ARM CPU's. I added a note to that effect in vm_version_linux_arm32.cpp so the ARM port maintainers could clean this up further if desired. > > Testing: > - All Oracle tiers 1-5 builds (which includes an ARMv7 build) > - GHA builds/tests > - Oracle tiers 1-3 sanity testing > > Zero changes coming in via [JDK-8319777](https://bugs.openjdk.org/browse/JDK-8319777) will be merged when they arrive. > > Thanks. David Holmes has updated the pull request incrementally with two additional commits since the last revision: - Remove cx8 comment as no longer relevant (the spinlock is used regardless of cx8) - Remove suports_cx8() checks from gtest ------------- Changes: - all: https://git.openjdk.org/jdk/pull/16625/files - new: https://git.openjdk.org/jdk/pull/16625/files/b6dea4b6..65871144 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=16625&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=16625&range=01-02 Stats: 14 lines in 2 files changed: 0 ins; 14 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/16625.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16625/head:pull/16625 PR: https://git.openjdk.org/jdk/pull/16625 From ysr at openjdk.org Mon Nov 13 15:55:05 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Mon, 13 Nov 2023 15:55:05 GMT Subject: RFR: 8319867: GenShen: Make old regions parseable at end of concurrent cycles [v2] In-Reply-To: <94gpkJJtcuzv66G823Fdv3fl_0AVVXHqDt5-7umYq-E=.f49bfd59-8375-475f-a255-d6206a42e516@github.com> References: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> <94gpkJJtcuzv66G823Fdv3fl_0AVVXHqDt5-7umYq-E=.f49bfd59-8375-475f-a255-d6206a42e516@github.com> Message-ID: <3x_japJ76Ze-FbVdFv2WZOwApjvKvA2oHr1VkaQhhi4=.635138d7-f16b-4ed8-b5f7-6819bdb170a6@github.com> On Sat, 11 Nov 2023 00:56:02 GMT, William Kemper wrote: >> ### Background >> When Shenandoah is performing a concurrent mark of the old generation, it is not safe for the young generation's remembered set scan to use the old mark bitmap. For this reason, any old regions that were not included in a collection set (either mixed or a global collection) must be made _parseable_ for the remembered set scan (we call this "coalescing and filling" in the code). >> >> ### Description >> Before this change, Shenandoah has deferred making these regions parseable until it begins the young generation bootstrap cycle. This is fine, except that we have also recently made old generation collections subordinate to young collections. In other words, an old collection cannot start unless Shenandoah wants to first start a young collection. >> >> Unfortunately, the additional time to concurrently coalesce and fill old regions before resetting the old mark bitmaps is not accounted for in the heuristics and this increases the likelihood that the heuristic has started the cycles too late and that the cycle will degenerate into a stop the world pause. >> >> After this change, Shenandoah will make old regions parseable immediately following the last mixed collection (or following a global collection). At this point, Shenandoah has just finished reclaiming memory and we expect less urgency to coalesce and fill old regions. It also means that young bootstrap cycles will not take (much) longer than conventional young generation collections. > > William Kemper has updated the pull request incrementally with one additional commit since the last revision: > > Update ascii diagram, fix spelling errors LGTM; reviewed! ? ------------- Marked as reviewed by ysr (Committer). PR Review: https://git.openjdk.org/shenandoah/pull/355#pullrequestreview-1727623009 From ysr at openjdk.org Mon Nov 13 15:55:08 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Mon, 13 Nov 2023 15:55:08 GMT Subject: RFR: 8319867: GenShen: Make old regions parseable at end of concurrent cycles [v2] In-Reply-To: References: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com>

Message-ID: On Sat, 11 Nov 2023 00:44:35 GMT, William Kemper wrote: >> src/hotspot/share/gc/shenandoah/shenandoahOldGeneration.cpp line 448: >> >>> 446: break; >>> 447: case EVACUATING: >>> 448: assert(_state == WAITING_FOR_BOOTSTRAP || _state == MARKING, "Cannot have old collection candidates without first marking, state is '%s'", state_name(_state)); >> >> FWIW, the ASCII art state-transition-diagram doesn't show the transition from WAITING_FOR_BOOTSTRAP directly into EVACUATING, without passing through an intermediate MARKING. > > Yes, that can only happen at the end of a _GLOBAL_ collection. This is explained in the comment above the diagram. I will add it to the diagram. Got it. One way to manage this and still get stronger checking during all, but in particular non-global, collections, might be to organize the state transition checks per collection type, and make assertions specific to the kind of collection in which the transition is happening/being checked. But perhaps that is overkill, and in any case not necessary in this PR. ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/355#discussion_r1391304232 From wkemper at openjdk.org Mon Nov 13 17:03:15 2023 From: wkemper at openjdk.org (William Kemper) Date: Mon, 13 Nov 2023 17:03:15 GMT Subject: RFR: 8319867: GenShen: Make old regions parsable at end of concurrent cycles [v3] In-Reply-To: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> References: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> Message-ID: > ### Background > When Shenandoah is performing a concurrent mark of the old generation, it is not safe for the young generation's remembered set scan to use the old mark bitmap. For this reason, any old regions that were not included in a collection set (either mixed or a global collection) must be made _parsable_ for the remembered set scan (we call this "coalescing and filling" in the code). > > ### Description > Before this change, Shenandoah has deferred making these regions parsable until it begins the young generation bootstrap cycle. This is fine, except that we have also recently made old generation collections subordinate to young collections. In other words, an old collection cannot start unless Shenandoah wants to first start a young collection. > > Unfortunately, the additional time to concurrently coalesce and fill old regions before resetting the old mark bitmaps is not accounted for in the heuristics and this increases the likelihood that the heuristic has started the cycles too late and that the cycle will degenerate into a stop the world pause. > > After this change, Shenandoah will make old regions parsable immediately following the last mixed collection (or following a global collection). At this point, Shenandoah has just finished reclaiming memory and we expect less urgency to coalesce and fill old regions. It also means that young bootstrap cycles will not take (much) longer than conventional young generation collections. William Kemper has updated the pull request incrementally with one additional commit since the last revision: Fix whitespace errors ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/355/files - new: https://git.openjdk.org/shenandoah/pull/355/files/65bec3b2..07d93f95 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=355&range=02 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=355&range=01-02 Stats: 2 lines in 1 file changed: 0 ins; 0 del; 2 mod Patch: https://git.openjdk.org/shenandoah/pull/355.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/355/head:pull/355 PR: https://git.openjdk.org/shenandoah/pull/355 From wkemper at openjdk.org Mon Nov 13 17:08:39 2023 From: wkemper at openjdk.org (William Kemper) Date: Mon, 13 Nov 2023 17:08:39 GMT Subject: RFR: 8319342: GenShen: Reset the count of degenerated cycles in a row following Full GC [v3] In-Reply-To: References:

Message-ID: On Tue, 7 Nov 2023 22:02:51 GMT, Kelvin Nilsen wrote: >> After a Full GC completes, reset the count on consecutive degenerated cycles to zero. This prevents automatic triggering of back-to-back Full GC. > > Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: > > Fix typo Marked as reviewed by wkemper (Committer). ------------- PR Review: https://git.openjdk.org/shenandoah/pull/352#pullrequestreview-1727795317 From wkemper at openjdk.org Mon Nov 13 17:13:54 2023 From: wkemper at openjdk.org (William Kemper) Date: Mon, 13 Nov 2023 17:13:54 GMT Subject: RFR: 8319931: GenShen: Increase no progress threshold for TestThreadFailure Message-ID: The test walks to close to the edge of an out of memory cliff. Intermittent failures are caused by [JDK-8316632](https://bugs.openjdk.org/browse/JDK-8316632). Further investigation is needed for Genshen. The intention of this PR is just to increase the setting `ShenandoahNoProgressThreshold` for this test to pass. ------------- Commit messages: - Increase no progress threshold for generational test Changes: https://git.openjdk.org/shenandoah/pull/356/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=356&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8319931 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/shenandoah/pull/356.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/356/head:pull/356 PR: https://git.openjdk.org/shenandoah/pull/356 From kdnilsen at openjdk.org Mon Nov 13 17:54:37 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 13 Nov 2023 17:54:37 GMT Subject: RFR: 8319931: GenShen: Increase no progress threshold for TestThreadFailure In-Reply-To: References: Message-ID: On Mon, 13 Nov 2023 17:09:04 GMT, William Kemper wrote: > The test walks to close to the edge of an out of memory cliff. Intermittent failures are caused by [JDK-8316632](https://bugs.openjdk.org/browse/JDK-8316632). Further investigation is needed for Genshen. The intention of this PR is just to increase the setting `ShenandoahNoProgressThreshold` for this test to pass. Marked as reviewed by kdnilsen (Committer). ------------- PR Review: https://git.openjdk.org/shenandoah/pull/356#pullrequestreview-1727890128 From wkemper at openjdk.org Mon Nov 13 18:06:35 2023 From: wkemper at openjdk.org (William Kemper) Date: Mon, 13 Nov 2023 18:06:35 GMT Subject: Integrated: 8319867: GenShen: Make old regions parsable at end of concurrent cycles In-Reply-To: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> References: <9-BBrOKn0PqKcJcZgRRbDgl8QK04QuC62Jd8CKOwZdo=.15ad81a4-0f10-451e-8921-00b705c5d48d@github.com> Message-ID: On Thu, 9 Nov 2023 21:35:40 GMT, William Kemper wrote: > ### Background > When Shenandoah is performing a concurrent mark of the old generation, it is not safe for the young generation's remembered set scan to use the old mark bitmap. For this reason, any old regions that were not included in a collection set (either mixed or a global collection) must be made _parsable_ for the remembered set scan (we call this "coalescing and filling" in the code). > > ### Description > Before this change, Shenandoah has deferred making these regions parsable until it begins the young generation bootstrap cycle. This is fine, except that we have also recently made old generation collections subordinate to young collections. In other words, an old collection cannot start unless Shenandoah wants to first start a young collection. > > Unfortunately, the additional time to concurrently coalesce and fill old regions before resetting the old mark bitmaps is not accounted for in the heuristics and this increases the likelihood that the heuristic has started the cycles too late and that the cycle will degenerate into a stop the world pause. > > After this change, Shenandoah will make old regions parsable immediately following the last mixed collection (or following a global collection). At this point, Shenandoah has just finished reclaiming memory and we expect less urgency to coalesce and fill old regions. It also means that young bootstrap cycles will not take (much) longer than conventional young generation collections. This pull request has now been integrated. Changeset: e8602e7b Author: William Kemper URL: https://git.openjdk.org/shenandoah/commit/e8602e7bf3fb4fb4539ee00683551fadcc739e65 Stats: 257 lines in 17 files changed: 73 ins; 93 del; 91 mod 8319867: GenShen: Make old regions parsable at end of concurrent cycles Reviewed-by: kdnilsen, ysr ------------- PR: https://git.openjdk.org/shenandoah/pull/355 From ysr at openjdk.org Mon Nov 13 18:06:39 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Mon, 13 Nov 2023 18:06:39 GMT Subject: RFR: 8319931: GenShen: Increase no progress threshold for TestThreadFailure In-Reply-To: References: Message-ID: On Mon, 13 Nov 2023 17:09:04 GMT, William Kemper wrote: > The test walks to close to the edge of an out of memory cliff. Intermittent failures are caused by [JDK-8316632](https://bugs.openjdk.org/browse/JDK-8316632). Further investigation is needed for Genshen. The intention of this PR is just to increase the setting `ShenandoahNoProgressThreshold` for this test to pass. Marked as reviewed by ysr (Committer). ------------- PR Review: https://git.openjdk.org/shenandoah/pull/356#pullrequestreview-1727917830 From wkemper at openjdk.org Mon Nov 13 18:06:39 2023 From: wkemper at openjdk.org (William Kemper) Date: Mon, 13 Nov 2023 18:06:39 GMT Subject: Integrated: 8319931: GenShen: Increase no progress threshold for TestThreadFailure In-Reply-To: References: Message-ID: <15ePUmSj_yCo6rtxRvDcL8LcCc48uF9tsFi1bSW1UZE=.22122d78-0bb3-447b-b4e2-d22735b16201@github.com> On Mon, 13 Nov 2023 17:09:04 GMT, William Kemper wrote: > The test walks to close to the edge of an out of memory cliff. Intermittent failures are caused by [JDK-8316632](https://bugs.openjdk.org/browse/JDK-8316632). Further investigation is needed for Genshen. The intention of this PR is just to increase the setting `ShenandoahNoProgressThreshold` for this test to pass. This pull request has now been integrated. Changeset: 2db0157a Author: William Kemper URL: https://git.openjdk.org/shenandoah/commit/2db0157ac91368b3f575d74b2e0ac9d58dc59139 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod 8319931: GenShen: Increase no progress threshold for TestThreadFailure Reviewed-by: kdnilsen ------------- PR: https://git.openjdk.org/shenandoah/pull/356 From ysr at openjdk.org Mon Nov 13 18:25:36 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Mon, 13 Nov 2023 18:25:36 GMT Subject: RFR: 8319342: GenShen: Reset the count of degenerated cycles in a row following Full GC [v3] In-Reply-To: References:

Message-ID: <0IkvH5QVsb7MAnYvEUnoPISj8kpd1jmCfFvTZdAIlH0=.4962b741-7f00-4631-ad6d-2ffb50f25396@github.com> On Tue, 7 Nov 2023 22:02:51 GMT, Kelvin Nilsen wrote: >> After a Full GC completes, reset the count on consecutive degenerated cycles to zero. This prevents automatic triggering of back-to-back Full GC. > > Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: > > Fix typo Marked as reviewed by ysr (Committer). ------------- PR Review: https://git.openjdk.org/shenandoah/pull/352#pullrequestreview-1727949011 From kdnilsen at openjdk.org Mon Nov 13 21:26:14 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 13 Nov 2023 21:26:14 GMT Subject: Integrated: 8319342: GenShen: Reset the count of degenerated cycles in a row following Full GC In-Reply-To: References: Message-ID: On Fri, 3 Nov 2023 21:17:42 GMT, Kelvin Nilsen wrote: > After a Full GC completes, reset the count on consecutive degenerated cycles to zero. This prevents automatic triggering of back-to-back Full GC. This pull request has now been integrated. Changeset: 81de88c5 Author: Kelvin Nilsen URL: https://git.openjdk.org/shenandoah/commit/81de88c59bbbd3e3ea2afbf8f563febc82797a5d Stats: 41 lines in 7 files changed: 13 ins; 20 del; 8 mod 8319342: GenShen: Reset the count of degenerated cycles in a row following Full GC Reviewed-by: wkemper, ysr ------------- PR: https://git.openjdk.org/shenandoah/pull/352 From iwalulya at openjdk.org Tue Nov 14 19:57:28 2023 From: iwalulya at openjdk.org (Ivan Walulya) Date: Tue, 14 Nov 2023 19:57:28 GMT Subject: RFR: 8319439: Move BufferNode from PtrQueue files to new files In-Reply-To: References: Message-ID: On Sat, 4 Nov 2023 11:25:05 GMT, Kim Barrett wrote: > Please review this simple (trivial?) change that moves the BufferNode class > from share/gc/shared/ptrQueue.[ch]pp to new files > share/gc/shared/bufferNode.[ch]pp. The code (as opposed to the surrounding > boiler plate and #include's) was moved unchanged. > > This led to adding #include's of the new header file to a bunch of other files > (include what you use), in some cases eliminating the need to #include > ptrQueue.hpp. Also tidied up the #include's for ptrQueue.[ch]pp to take > advantage of the split. > > Also, the gtest gc/shared/test_ptrQueueBufferAllocator.cpp is renamed to > gc/shared/test_bufferNodeAllocator.cpp, with some adjustments to includes and > test naming. > > Testing: > mach5 tier1 Marked as reviewed by iwalulya (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/16506#pullrequestreview-1730648862 From wkemper at openjdk.org Tue Nov 14 23:08:44 2023 From: wkemper at openjdk.org (William Kemper) Date: Tue, 14 Nov 2023 23:08:44 GMT Subject: RFR: 8320112 GenShen: Improve end of process report Message-ID: When Shenandoah is shutdown, it logs a summary of its activity over the life of the process. This change adds percentages to the breakdown of the types and counts of cycles that executed. It also simplifies some of the logic around recording the various outcomes. ------------- Commit messages: - Cleanup, count abbreviated concurrent separately from abbreviated degen - WIP: It builds (after merging) - Merge remote-tracking branch 'shenandoah/master' into improve-end-of-process-report - WIP: Add percentages, stop creating gc session when filling old regions Changes: https://git.openjdk.org/shenandoah/pull/357/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=357&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8320112 Stats: 173 lines in 14 files changed: 41 ins; 54 del; 78 mod Patch: https://git.openjdk.org/shenandoah/pull/357.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/357/head:pull/357 PR: https://git.openjdk.org/shenandoah/pull/357 From wkemper at openjdk.org Tue Nov 14 23:14:48 2023 From: wkemper at openjdk.org (William Kemper) Date: Tue, 14 Nov 2023 23:14:48 GMT Subject: RFR: 8320112 GenShen: Improve end of process report [v2] In-Reply-To: References: Message-ID: > When Shenandoah is shutdown, it logs a summary of its activity over the life of the process. This change adds percentages to the breakdown of the types and counts of cycles that executed. It also simplifies some of the logic around recording the various outcomes. William Kemper has updated the pull request incrementally with two additional commits since the last revision: - Reduce visibility of new methods - Fix accidental whitespace change ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/357/files - new: https://git.openjdk.org/shenandoah/pull/357/files/3ea5e980..410c731d Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=357&range=01 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=357&range=00-01 Stats: 24 lines in 4 files changed: 1 ins; 5 del; 18 mod Patch: https://git.openjdk.org/shenandoah/pull/357.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/357/head:pull/357 PR: https://git.openjdk.org/shenandoah/pull/357 From wkemper at openjdk.org Wed Nov 15 00:22:29 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 15 Nov 2023 00:22:29 GMT Subject: RFR: 8320119: GenShen: Correct misspellings of parsable Message-ID: There should be no 'e' in the word that describes something that is able to be parsed. ------------- Commit messages: - Correct misspellings of parsable Changes: https://git.openjdk.org/shenandoah/pull/358/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=358&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8320119 Stats: 7 lines in 6 files changed: 0 ins; 0 del; 7 mod Patch: https://git.openjdk.org/shenandoah/pull/358.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/358/head:pull/358 PR: https://git.openjdk.org/shenandoah/pull/358 From kbarrett at openjdk.org Wed Nov 15 01:04:17 2023 From: kbarrett at openjdk.org (Kim Barrett) Date: Wed, 15 Nov 2023 01:04:17 GMT Subject: RFR: 8319439: Move BufferNode from PtrQueue files to new files [v2] In-Reply-To: References: Message-ID: <5K288rm4MtykYCyHrEPdanQsctdzzPeFeTA8EvHUAUI=.c8bfc38a-f926-408f-a158-084b07864170@github.com> > Please review this simple (trivial?) change that moves the BufferNode class > from share/gc/shared/ptrQueue.[ch]pp to new files > share/gc/shared/bufferNode.[ch]pp. The code (as opposed to the surrounding > boiler plate and #include's) was moved unchanged. > > This led to adding #include's of the new header file to a bunch of other files > (include what you use), in some cases eliminating the need to #include > ptrQueue.hpp. Also tidied up the #include's for ptrQueue.[ch]pp to take > advantage of the split. > > Also, the gtest gc/shared/test_ptrQueueBufferAllocator.cpp is renamed to > gc/shared/test_bufferNodeAllocator.cpp, with some adjustments to includes and > test naming. > > Testing: > mach5 tier1 Kim Barrett has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains four commits: - Merge branch 'master' into buffer-node - fixup gtest after rename - move test - move BufferNode to new files ------------- Changes: https://git.openjdk.org/jdk/pull/16506/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16506&range=01 Stats: 383 lines in 17 files changed: 223 ins; 152 del; 8 mod Patch: https://git.openjdk.org/jdk/pull/16506.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16506/head:pull/16506 PR: https://git.openjdk.org/jdk/pull/16506 From kbarrett at openjdk.org Wed Nov 15 01:11:49 2023 From: kbarrett at openjdk.org (Kim Barrett) Date: Wed, 15 Nov 2023 01:11:49 GMT Subject: RFR: 8319439: Move BufferNode from PtrQueue files to new files [v2] In-Reply-To: References:

Message-ID: On Tue, 7 Nov 2023 11:19:03 GMT, Thomas Schatzl wrote: >> Kim Barrett has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains four commits: >> >> - Merge branch 'master' into buffer-node >> - fixup gtest after rename >> - move test >> - move BufferNode to new files > > lgtm Thanks @tschatzl and @walulyai for reviews. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16506#issuecomment-1811638624 From kbarrett at openjdk.org Wed Nov 15 01:11:52 2023 From: kbarrett at openjdk.org (Kim Barrett) Date: Wed, 15 Nov 2023 01:11:52 GMT Subject: Integrated: 8319439: Move BufferNode from PtrQueue files to new files In-Reply-To: References: Message-ID: <5I5wDXCF5lDItoDQ594oZX8VrEEk1WDX-LvvkhJQphk=.d55c81a9-841b-49b7-8b05-51b8f18945f5@github.com> On Sat, 4 Nov 2023 11:25:05 GMT, Kim Barrett wrote: > Please review this simple (trivial?) change that moves the BufferNode class > from share/gc/shared/ptrQueue.[ch]pp to new files > share/gc/shared/bufferNode.[ch]pp. The code (as opposed to the surrounding > boiler plate and #include's) was moved unchanged. > > This led to adding #include's of the new header file to a bunch of other files > (include what you use), in some cases eliminating the need to #include > ptrQueue.hpp. Also tidied up the #include's for ptrQueue.[ch]pp to take > advantage of the split. > > Also, the gtest gc/shared/test_ptrQueueBufferAllocator.cpp is renamed to > gc/shared/test_bufferNodeAllocator.cpp, with some adjustments to includes and > test naming. > > Testing: > mach5 tier1 This pull request has now been integrated. Changeset: 1e76ba0c Author: Kim Barrett URL: https://git.openjdk.org/jdk/commit/1e76ba0cd0de38da6fc2b8147627496ee2f2837b Stats: 383 lines in 17 files changed: 223 ins; 152 del; 8 mod 8319439: Move BufferNode from PtrQueue files to new files Reviewed-by: tschatzl, iwalulya ------------- PR: https://git.openjdk.org/jdk/pull/16506 From kdnilsen at openjdk.org Wed Nov 15 01:50:05 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 15 Nov 2023 01:50:05 GMT Subject: RFR: 8320119: GenShen: Correct misspellings of parsable In-Reply-To: References: Message-ID: <8lMaQT-bcUax53Rd9P66LkAlXDee2IhXbUjB32eMjus=.ed897f82-e240-4588-aaca-010b3c249b2f@github.com> On Wed, 15 Nov 2023 00:15:43 GMT, William Kemper wrote: > There should be no 'e' in the word that describes something that is able to be parsed. Marked as reviewed by kdnilsen (Committer). Not sure it is necessary to change spelling: https://en.wiktionary.org/wiki/parseable ------------- PR Review: https://git.openjdk.org/shenandoah/pull/358#pullrequestreview-1731097975 PR Comment: https://git.openjdk.org/shenandoah/pull/358#issuecomment-1811670146 From kdnilsen at openjdk.org Wed Nov 15 01:59:11 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 15 Nov 2023 01:59:11 GMT Subject: RFR: 8320112 GenShen: Improve end of process report [v2] In-Reply-To: References:

Message-ID: On Tue, 14 Nov 2023 23:14:48 GMT, William Kemper wrote: >> When Shenandoah is shutdown, it logs a summary of its activity over the life of the process. This change adds percentages to the breakdown of the types and counts of cycles that executed. It also simplifies some of the logic around recording the various outcomes. > > William Kemper has updated the pull request incrementally with two additional commits since the last revision: > > - Reduce visibility of new methods > - Fix accidental whitespace change Thanks for clarifying my confusion, and thanks for these improvements. ------------- Marked as reviewed by kdnilsen (Committer). PR Review: https://git.openjdk.org/shenandoah/pull/357#pullrequestreview-1737556582 From wkemper at openjdk.org Fri Nov 17 22:35:12 2023 From: wkemper at openjdk.org (William Kemper) Date: Fri, 17 Nov 2023 22:35:12 GMT Subject: RFR: JDK-8320336: GenShen: Reduce proactive humongous defragmentation efforts In-Reply-To: References: Message-ID: On Fri, 17 Nov 2023 17:15:32 GMT, Kelvin Nilsen wrote: > Only trigger defragmentation of old-gen memory if JVM is configured to reserve a certain amount of the heap to support humongous objects. Even when the JVM is so configured, be slightly less aggressive in triggering defragmentation of old-gen memory. Looks good to me. ------------- Marked as reviewed by wkemper (Committer). PR Review: https://git.openjdk.org/shenandoah/pull/361#pullrequestreview-1738041538 From wkemper at openjdk.org Fri Nov 17 22:38:04 2023 From: wkemper at openjdk.org (William Kemper) Date: Fri, 17 Nov 2023 22:38:04 GMT Subject: Integrated: 8320112 GenShen: Improve end of process report In-Reply-To: References: Message-ID: On Tue, 14 Nov 2023 23:03:09 GMT, William Kemper wrote: > When Shenandoah is shutdown, it logs a summary of its activity over the life of the process. This change adds percentages to the breakdown of the types and counts of cycles that executed. It also simplifies some of the logic around recording the various outcomes. This pull request has now been integrated. Changeset: d80a17d5 Author: William Kemper URL: https://git.openjdk.org/shenandoah/commit/d80a17d506836acc677ae4e8105d60eb7f6f6fa0 Stats: 159 lines in 14 files changed: 41 ins; 58 del; 60 mod 8320112: GenShen: Improve end of process report Reviewed-by: kdnilsen ------------- PR: https://git.openjdk.org/shenandoah/pull/357 From wkemper at openjdk.org Fri Nov 17 22:43:02 2023 From: wkemper at openjdk.org (William Kemper) Date: Fri, 17 Nov 2023 22:43:02 GMT Subject: RFR: Merge openjdk/jdk:master [v2] In-Reply-To: References: Message-ID: > Merges tag jdk-22+24 William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 90 commits: - Merge remote-tracking branch 'shenandoah/master' into merge-jdk-22+24 - 8301997: Move method resolution information out of the cpCache Co-authored-by: Gui Cao Co-authored-by: Fei Yang Co-authored-by: Martin Doerr Co-authored-by: Amit Kumar Reviewed-by: coleenp, adinn, fparain - 8319986: Invalid/inconsistent description and example for DateFormat Reviewed-by: joehw, rriggs, jlu, iris, lancea - 8320130: Problemlist 2 vmTestbase/nsk/jdi/StepRequest/addClassFilter_rt tests with Xcomp Reviewed-by: cjplummer, dcubed - 8319187: Add three eMudhra emSign roots Reviewed-by: mullan - 8318219: RISC-V: C2 ExpandBits Reviewed-by: fyang - 8320053: GHA: Cross-compile gtest code Reviewed-by: ihse, stuefe - 8319966: AIX: expected [[0:i4]] but found [[0:I4]] after JDK-8319882 Reviewed-by: mdoerr - 8318671: Potential uninitialized uintx value after JDK-8317683 Reviewed-by: thartmann, shade - 8319781: RISC-V: Refactor UseRVV related checks Reviewed-by: rehn, fyang - ... and 80 more: https://git.openjdk.org/shenandoah/compare/d80a17d5...2db62747 ------------- Changes: https://git.openjdk.org/shenandoah/pull/360/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=360&range=01 Stats: 619015 lines in 974 files changed: 82836 ins; 475201 del; 60978 mod Patch: https://git.openjdk.org/shenandoah/pull/360.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/360/head:pull/360 PR: https://git.openjdk.org/shenandoah/pull/360 From wkemper at openjdk.org Mon Nov 20 18:21:58 2023 From: wkemper at openjdk.org (William Kemper) Date: Mon, 20 Nov 2023 18:21:58 GMT Subject: Integrated: Merge openjdk/jdk:master In-Reply-To: References: Message-ID: On Thu, 16 Nov 2023 14:09:58 GMT, William Kemper wrote: > Merges tag jdk-22+24 This pull request has now been integrated. Changeset: 22280c12 Author: William Kemper URL: https://git.openjdk.org/shenandoah/commit/22280c1228c51d10db19a89979bdd66001d34336 Stats: 619015 lines in 974 files changed: 82836 ins; 475201 del; 60978 mod Merge ------------- PR: https://git.openjdk.org/shenandoah/pull/360 From ysr at openjdk.org Mon Nov 20 23:39:48 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Mon, 20 Nov 2023 23:39:48 GMT Subject: RFR: JDK-8320336: GenShen: Reduce proactive humongous defragmentation efforts In-Reply-To: References: Message-ID: On Fri, 17 Nov 2023 17:15:32 GMT, Kelvin Nilsen wrote: > Only trigger defragmentation of old-gen memory if JVM is configured to reserve a certain amount of the heap to support humongous objects. Even when the JVM is so configured, be slightly less aggressive in triggering defragmentation of old-gen memory. Sorry for the delay. LGTM. One question relating to the performance changes stemming from this fix: - Effectively, it seems that the density/span based triggering is now disabled by default (because of the change at line 3129 and the default setting of humongous reserve at 0) - Does this effectively win back the difference caused by the original change? - However, were you to enable it, say by an appropriate positive value for the humongous reserve, does the less aggressive triggering result in a performance improvement? - Are there any (rough) performance numbers, by chance, to share for the two changes separately? ------------- Marked as reviewed by ysr (Committer). PR Review: https://git.openjdk.org/shenandoah/pull/361#pullrequestreview-1740872293 From kdnilsen at openjdk.org Tue Nov 21 00:51:04 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 21 Nov 2023 00:51:04 GMT Subject: Integrated: JDK-8320336: GenShen: Reduce proactive humongous defragmentation efforts In-Reply-To: References: Message-ID: <2_Gmtt8rQYNaKaDRvA1kP7vG863MMAkug11Hhxw7_eQ=.995a8b46-31c2-402a-8054-652a26b27a83@github.com> On Fri, 17 Nov 2023 17:15:32 GMT, Kelvin Nilsen wrote: > Only trigger defragmentation of old-gen memory if JVM is configured to reserve a certain amount of the heap to support humongous objects. Even when the JVM is so configured, be slightly less aggressive in triggering defragmentation of old-gen memory. This pull request has now been integrated. Changeset: ff8d5313 Author: Kelvin Nilsen URL: https://git.openjdk.org/shenandoah/commit/ff8d5313b065e99176cd3b8b6ca247352dd22200 Stats: 12 lines in 1 file changed: 4 ins; 1 del; 7 mod 8320336: GenShen: Reduce proactive humongous defragmentation efforts Reviewed-by: wkemper, ysr ------------- PR: https://git.openjdk.org/shenandoah/pull/361 From kdnilsen at openjdk.org Tue Nov 21 00:51:02 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 21 Nov 2023 00:51:02 GMT Subject: RFR: JDK-8320336: GenShen: Reduce proactive humongous defragmentation efforts In-Reply-To: References: Message-ID: <03WswFyuF3J8zUqQiKQzMOhWCMnkP3qPy3P1cY-gcZ8=.5d368a13-83e8-46a1-bede-5fe357cdf54d@github.com> On Fri, 17 Nov 2023 17:15:32 GMT, Kelvin Nilsen wrote: > Only trigger defragmentation of old-gen memory if JVM is configured to reserve a certain amount of the heap to support humongous objects. Even when the JVM is so configured, be slightly less aggressive in triggering defragmentation of old-gen memory. The trigger is disabled by default. Humongous fragmentation is one of the issues that results in very high variability in performance on some workloads. Humongous memory gets fragmented when humongous objects happen to be allocated in "the middle" of the heap instead of getting allocated at the start of the heap. It is difficult to predict or even contrive programs that will cause this to happen. But the typical scenario is: 1. We are operating near the "cliff's edge" so young gen consumes all of the available heap. 2. Some number of young-gen regions residing near the bottom of heap which are not necessarily contiguous are promoted in place rather than getting compacted. 3. Free regions are now scattered between these promoted-in-place heap regions, and we have no sufficiently long run of consecutive free regions to satisfy an allocation request. With this change: We will still endeavor to defragment the old-gen memory whenever we perform a concurrent marking effort. Concurrent marking efforts are now triggered by guaranteed interval or by growth since previous old mark. If a service desires to keep memory defragmented because they expect frequent allocation of humongous objects, they would be wise to set -XX:ShenandoahGenerationalHumongousReserve=5 (or any value grater than 0). ------------- PR Comment: https://git.openjdk.org/shenandoah/pull/361#issuecomment-1820037408 From dholmes at openjdk.org Tue Nov 21 05:24:33 2023 From: dholmes at openjdk.org (David Holmes) Date: Tue, 21 Nov 2023 05:24:33 GMT Subject: RFR: 8318776: Require supports_cx8 to always be true [v4] In-Reply-To: References: Message-ID: > As discussed in JBS all platforms (some tweaks to Zero are in progress) actually do support `cx8` i.e. 64-bit compare-and-exchange, so we can strip out the locked-based alternatives to using it and just add a guarantee that it is true at runtime. And all platforms except some ARM variants set `SUPPORTS_NATIVE_CX8`, so we can greatly simplify things. Summary of changes: > - `_supports_cx8` field is only needed when `SUPPORTS_NATIVE_CX8` is not defined > - Assertions for `supports_cx8()` are removed > - Compiler predicates requiring `supports_cx8()` are removed > - Access backend is greatly simplified without the need for lock-based alternative > - `java.util.concurrent.AtomicLongFieldUpdater` is simplified without the need for a lock-based alternative > > I did consider moving all the ARM `kuser_helper` related code to be only defined when `SUPPORTS_NATIVE_CX8` is not defined, but there was a theoretical risk this could change the behaviour if ARMv7 binaries were run on other ARM CPU's. I added a note to that effect in vm_version_linux_arm32.cpp so the ARM port maintainers could clean this up further if desired. > > Testing: > - All Oracle tiers 1-5 builds (which includes an ARMv7 build) > - GHA builds/tests > - Oracle tiers 1-3 sanity testing > > Zero changes coming in via [JDK-8319777](https://bugs.openjdk.org/browse/JDK-8319777) will be merged when they arrive. > > Thanks. David Holmes has updated the pull request incrementally with one additional commit since the last revision: Remove unnecessary includes of vm_version.hpp. Fix copyright years. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/16625/files - new: https://git.openjdk.org/jdk/pull/16625/files/65871144..597cef53 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=16625&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=16625&range=02-03 Stats: 4 lines in 4 files changed: 0 ins; 3 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/16625.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16625/head:pull/16625 PR: https://git.openjdk.org/jdk/pull/16625 From eosterlund at openjdk.org Tue Nov 21 06:06:11 2023 From: eosterlund at openjdk.org (Erik =?UTF-8?B?w5ZzdGVybHVuZA==?=) Date: Tue, 21 Nov 2023 06:06:11 GMT Subject: RFR: 8318776: Require supports_cx8 to always be true [v4] In-Reply-To: References:

Message-ID: On Tue, 21 Nov 2023 05:24:33 GMT, David Holmes wrote: >> As discussed in JBS all platforms (some tweaks to Zero are in progress) actually do support `cx8` i.e. 64-bit compare-and-exchange, so we can strip out the locked-based alternatives to using it and just add a guarantee that it is true at runtime. And all platforms except some ARM variants set `SUPPORTS_NATIVE_CX8`, so we can greatly simplify things. Summary of changes: >> - `_supports_cx8` field is only needed when `SUPPORTS_NATIVE_CX8` is not defined >> - Assertions for `supports_cx8()` are removed >> - Compiler predicates requiring `supports_cx8()` are removed >> - Access backend is greatly simplified without the need for lock-based alternative >> - `java.util.concurrent.AtomicLongFieldUpdater` is simplified without the need for a lock-based alternative >> >> I did consider moving all the ARM `kuser_helper` related code to be only defined when `SUPPORTS_NATIVE_CX8` is not defined, but there was a theoretical risk this could change the behaviour if ARMv7 binaries were run on other ARM CPU's. I added a note to that effect in vm_version_linux_arm32.cpp so the ARM port maintainers could clean this up further if desired. >> >> Testing: >> - All Oracle tiers 1-5 builds (which includes an ARMv7 build) >> - GHA builds/tests >> - Oracle tiers 1-3 sanity testing >> >> Zero changes coming in via [JDK-8319777](https://bugs.openjdk.org/browse/JDK-8319777) will be merged when they arrive. >> >> Thanks. > > David Holmes has updated the pull request incrementally with one additional commit since the last revision: > > Remove unnecessary includes of vm_version.hpp. > Fix copyright years. This looks great! ------------- Marked as reviewed by eosterlund (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16625#pullrequestreview-1741175703 From dholmes at openjdk.org Tue Nov 21 11:00:11 2023 From: dholmes at openjdk.org (David Holmes) Date: Tue, 21 Nov 2023 11:00:11 GMT Subject: RFR: 8318776: Require supports_cx8 to always be true [v4] In-Reply-To: References:

Message-ID: On Tue, 21 Nov 2023 06:03:38 GMT, Erik ?sterlund wrote: >> David Holmes has updated the pull request incrementally with one additional commit since the last revision: >> >> Remove unnecessary includes of vm_version.hpp. >> Fix copyright years. > > This looks great! > Thanks for the review @fisk ! I have to wait for a few Zero related PRs to get integrated then re-merge, before I can integrate. Zero patches were pushed, please re-merge. I checked current mainline works well with at least linux-arm-zero-fastdebug, and I would like to re-test it with this patch. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16625#issuecomment-1821309132 From cslucas at openjdk.org Tue Nov 21 17:58:35 2023 From: cslucas at openjdk.org (Cesar Soares Lucas) Date: Tue, 21 Nov 2023 17:58:35 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission [v2] In-Reply-To: References: Message-ID: > # Description > > Please review this PR with a patch to re-use the same C2_MacroAssembler object to emit all instructions in the same compilation unit. > > Overall, the change is pretty simple. However, due to the renaming of the variable to access C2_MacroAssembler, from `_masm.` to `masm->`, and also some method prototype changes, the patch became quite large. > > # Help Needed for Testing > > I don't have access to all platforms necessary to test this. I hope some other folks can help with testing on `S390`, `RISC-V` and `PPC`. > > # Testing status > > ## tier1 > > | | Win | Mac | Linux | > |----------|---------|---------|---------| > | ARM64 | | | | > | ARM32 | | | | > | x86 | | | | > | x64 | | | | > | PPC64 | | | | > | S390x | | | | > | RiscV | | | | Cesar Soares Lucas has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains two commits: - Catch up with changes on master - Reuse same C2_MacroAssembler object to emit instructions. ------------- Changes: https://git.openjdk.org/jdk/pull/16484/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16484&range=01 Stats: 3356 lines in 60 files changed: 1039 ins; 426 del; 1891 mod Patch: https://git.openjdk.org/jdk/pull/16484.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16484/head:pull/16484 PR: https://git.openjdk.org/jdk/pull/16484 From dholmes at openjdk.org Wed Nov 22 02:09:38 2023 From: dholmes at openjdk.org (David Holmes) Date: Wed, 22 Nov 2023 02:09:38 GMT Subject: RFR: 8318776: Require supports_cx8 to always be true [v5] In-Reply-To: References: Message-ID: > As discussed in JBS all platforms (some tweaks to Zero are in progress) actually do support `cx8` i.e. 64-bit compare-and-exchange, so we can strip out the locked-based alternatives to using it and just add a guarantee that it is true at runtime. And all platforms except some ARM variants set `SUPPORTS_NATIVE_CX8`, so we can greatly simplify things. Summary of changes: > - `_supports_cx8` field is only needed when `SUPPORTS_NATIVE_CX8` is not defined > - Assertions for `supports_cx8()` are removed > - Compiler predicates requiring `supports_cx8()` are removed > - Access backend is greatly simplified without the need for lock-based alternative > - `java.util.concurrent.AtomicLongFieldUpdater` is simplified without the need for a lock-based alternative > > I did consider moving all the ARM `kuser_helper` related code to be only defined when `SUPPORTS_NATIVE_CX8` is not defined, but there was a theoretical risk this could change the behaviour if ARMv7 binaries were run on other ARM CPU's. I added a note to that effect in vm_version_linux_arm32.cpp so the ARM port maintainers could clean this up further if desired. > > Testing: > - All Oracle tiers 1-5 builds (which includes an ARMv7 build) > - GHA builds/tests > - Oracle tiers 1-3 sanity testing > > Zero changes coming in via [JDK-8319777](https://bugs.openjdk.org/browse/JDK-8319777) will be merged when they arrive. > > Thanks. David Holmes has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains seven additional commits since the last revision: - Merge with master and update Zero code accordingly - Merge branch 'master' into 8318776-supports_cx8 - Remove unnecessary includes of vm_version.hpp. Fix copyright years. - Remove cx8 comment as no longer relevant (the spinlock is used regardless of cx8) - Remove suports_cx8() checks from gtest - Remove test for VMSupportsCX8 - 8318776: Require supports_cx8 to always be true ------------- Changes: - all: https://git.openjdk.org/jdk/pull/16625/files - new: https://git.openjdk.org/jdk/pull/16625/files/597cef53..aad0a4c4 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=16625&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=16625&range=03-04 Stats: 621905 lines in 1279 files changed: 89413 ins; 471113 del; 61379 mod Patch: https://git.openjdk.org/jdk/pull/16625.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16625/head:pull/16625 PR: https://git.openjdk.org/jdk/pull/16625 From dholmes at openjdk.org Wed Nov 22 02:21:05 2023 From: dholmes at openjdk.org (David Holmes) Date: Wed, 22 Nov 2023 02:21:05 GMT Subject: RFR: 8318776: Require supports_cx8 to always be true [v4] In-Reply-To: References:

Message-ID: On Tue, 21 Nov 2023 16:58:21 GMT, Aleksey Shipilev wrote: >> This looks great! > >> Thanks for the review @fisk ! I have to wait for a few Zero related PRs to get integrated then re-merge, before I can integrate. > > Zero patches were pushed, please re-merge. I checked current mainline works well with at least linux-arm-zero-fastdebug, and I would like to re-test it with this patch. @shipilev I have re-merged and update the Zero changes (ifdef around `_saupports_cx8`). @viktorklang-ora and/or @DougLea could I ask you to look at the `java.util.concurrent.AtomicLongFieldUpdater` changes please. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16625#issuecomment-1821983975 From shade at openjdk.org Wed Nov 22 09:00:13 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 22 Nov 2023 09:00:13 GMT Subject: RFR: 8318776: Require supports_cx8 to always be true [v5] In-Reply-To: References:

Message-ID: On Wed, 22 Nov 2023 08:57:11 GMT, Aleksey Shipilev wrote: > Zero tests are running. Caught the `guarantee` on linux-arm-zero-fastdebug! But that is actually the fault in my previous patch: #16779. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16625#issuecomment-1822510325 From dl at openjdk.org Wed Nov 22 15:53:12 2023 From: dl at openjdk.org (Doug Lea) Date: Wed, 22 Nov 2023 15:53:12 GMT Subject: RFR: 8318776: Require supports_cx8 to always be true [v5] In-Reply-To: References:

Message-ID: On Wed, 22 Nov 2023 02:09:38 GMT, David Holmes wrote: >> As discussed in JBS all platforms (some tweaks to Zero are in progress) actually do support `cx8` i.e. 64-bit compare-and-exchange, so we can strip out the locked-based alternatives to using it and just add a guarantee that it is true at runtime. And all platforms except some ARM variants set `SUPPORTS_NATIVE_CX8`, so we can greatly simplify things. Summary of changes: >> - `_supports_cx8` field is only needed when `SUPPORTS_NATIVE_CX8` is not defined >> - Assertions for `supports_cx8()` are removed >> - Compiler predicates requiring `supports_cx8()` are removed >> - Access backend is greatly simplified without the need for lock-based alternative >> - `java.util.concurrent.AtomicLongFieldUpdater` is simplified without the need for a lock-based alternative >> >> I did consider moving all the ARM `kuser_helper` related code to be only defined when `SUPPORTS_NATIVE_CX8` is not defined, but there was a theoretical risk this could change the behaviour if ARMv7 binaries were run on other ARM CPU's. I added a note to that effect in vm_version_linux_arm32.cpp so the ARM port maintainers could clean this up further if desired. >> >> Testing: >> - All Oracle tiers 1-5 builds (which includes an ARMv7 build) >> - GHA builds/tests >> - Oracle tiers 1-3 sanity testing >> >> Zero changes coming in via [JDK-8319777](https://bugs.openjdk.org/browse/JDK-8319777) will be merged when they arrive. >> >> Thanks. > > David Holmes has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains seven additional commits since the last revision: > > - Merge with master and update Zero code accordingly > - Merge branch 'master' into 8318776-supports_cx8 > - Remove unnecessary includes of vm_version.hpp. > Fix copyright years. > - Remove cx8 comment as no longer relevant (the spinlock is used regardless of cx8) > - Remove suports_cx8() checks from gtest > - Remove test for VMSupportsCX8 > - 8318776: Require supports_cx8 to always be true The deletion of backup code and the check for it in java.util.concurrent.AtomicLongFieldUpdater are clearly OK. We always thought the need for it was transient. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16625#issuecomment-1823026095 From tschatzl at openjdk.org Wed Nov 22 16:09:27 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 22 Nov 2023 16:09:27 GMT Subject: RFR: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading [v2] In-Reply-To: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> Message-ID: > Insert code blobs in a sorted fashion to exploit the finger-optimization when adding, making this procedure O(n) instead of O(n^2) > > Introduces a globally available ClassUnloadingContext that contains common methods pertaining to class and code unloading. GCs may use it to efficiently manage unlinked class loader datas and nmethods to allow use of common methods (unlink/merge). > > The steps typically are registering a new to be unlinked CLD/nmethod, and then purge its memory later. STW collectors perform this work in one big chunk taking the CodeCache_lock, for the entire duration, while concurrent collectors lock/unlock for every insertion to allow for concurrent users for the lock to progress. > > Some care has been taken to stay consistent with an "unloading = unlinking + purge" scheme; however particularly the existing CLD handling API (still) mixes unlinking and purging in its CLD::unload() call. To simplify this change that is mostly geared towards separating nmethod unlinking from purging, to make code blob freeing O(n) instead of O(n^2). > > Upcoming changes will > * separate nmethod unregistering from nmethod purging to allow doing that in bulk (for the STW collectors); that can significantly reduce code purging time for the STW collectors. > * better name the second stage of unlinking (called "cleaning" throughout, e.g. the work done in `G1CollectedHeap::complete_cleaning`) > * untangle CLD unlinking and what's called "cleaning" now to allow moving more stuff into the second unlinking stage for better parallelism > * G1: move some significant tasks from the remark pause to concurrent (unregistering nmethods, freeing code blobs and cld/metaspace purging) > * Maybe move Serial/Parallel GC metaspace purging closer to other unlinking/purging code to keep things local and allow easier logging. > > Please also first looking into the (small) PR this depends on. > > The crash on linux-x86 is fixed by PR#16766 which I split out for quicker reviews. > > Testing: tier1-7 > > Thanks, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: iwalulya review, naming ------------- Changes: - all: https://git.openjdk.org/jdk/pull/16759/files - new: https://git.openjdk.org/jdk/pull/16759/files/d63ff4a4..448232df Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=16759&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=16759&range=00-01 Stats: 14 lines in 2 files changed: 0 ins; 0 del; 14 mod Patch: https://git.openjdk.org/jdk/pull/16759.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16759/head:pull/16759 PR: https://git.openjdk.org/jdk/pull/16759 From tschatzl at openjdk.org Wed Nov 22 16:55:08 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Wed, 22 Nov 2023 16:55:08 GMT Subject: RFR: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading [v2] In-Reply-To: References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> Message-ID: On Wed, 22 Nov 2023 16:09:27 GMT, Thomas Schatzl wrote: >> Insert code blobs in a sorted fashion to exploit the finger-optimization when adding, making this procedure O(n) instead of O(n^2) >> >> Introduces a globally available ClassUnloadingContext that contains common methods pertaining to class and code unloading. GCs may use it to efficiently manage unlinked class loader datas and nmethods to allow use of common methods (unlink/merge). >> >> The steps typically are registering a new to be unlinked CLD/nmethod, and then purge its memory later. STW collectors perform this work in one big chunk taking the CodeCache_lock, for the entire duration, while concurrent collectors lock/unlock for every insertion to allow for concurrent users for the lock to progress. >> >> Some care has been taken to stay consistent with an "unloading = unlinking + purge" scheme; however particularly the existing CLD handling API (still) mixes unlinking and purging in its CLD::unload() call. To simplify this change that is mostly geared towards separating nmethod unlinking from purging, to make code blob freeing O(n) instead of O(n^2). >> >> Upcoming changes will >> * separate nmethod unregistering from nmethod purging to allow doing that in bulk (for the STW collectors); that can significantly reduce code purging time for the STW collectors. >> * better name the second stage of unlinking (called "cleaning" throughout, e.g. the work done in `G1CollectedHeap::complete_cleaning`) >> * untangle CLD unlinking and what's called "cleaning" now to allow moving more stuff into the second unlinking stage for better parallelism >> * G1: move some significant tasks from the remark pause to concurrent (unregistering nmethods, freeing code blobs and cld/metaspace purging) >> * Maybe move Serial/Parallel GC metaspace purging closer to other unlinking/purging code to keep things local and allow easier logging. >> >> These are the reason for the class hierarchy for `ClassUnloadingContext`: the goal is to ultimately have about this phasing (for G1): >> 1. collect all dead CLDs, using the `register_unloading_class_loader_data` method *only* >> 2. parallelize the stuff in `ClassLoaderData::unload()` in one way or another, adding them to the `complete_cleaning` (parallel) phase. >> 3. `purge_nmethods`, `free_code_blobs` and the `remove_unlinked_nmethods_from_code_root_set` (from JDK-8317007) will be concurrent. >> >> Particularly the split of `SystemDictionary::do_unloading` into "only" traversing the CLDs to find the dead ones and then in parallel process them in 2. a... > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > iwalulya review, naming I added some explanation of why there is a class hierarchy for `ClassUnloadingContext` in the description (and some further background). ------------- PR Comment: https://git.openjdk.org/jdk/pull/16759#issuecomment-1823139932 From dcubed at openjdk.org Wed Nov 22 18:39:10 2023 From: dcubed at openjdk.org (Daniel D. Daugherty) Date: Wed, 22 Nov 2023 18:39:10 GMT Subject: RFR: 8318776: Require supports_cx8 to always be true [v5] In-Reply-To: References:

Message-ID: <6TkAoQRK_49MtT6wjb_JAwFzUATAvAx_rX9yxMa9Vfs=.7d1d5bf6-f088-44df-9d01-cc336bdeecaa@github.com> On Wed, 22 Nov 2023 02:09:38 GMT, David Holmes wrote: >> As discussed in JBS all platforms (some tweaks to Zero are in progress) actually do support `cx8` i.e. 64-bit compare-and-exchange, so we can strip out the locked-based alternatives to using it and just add a guarantee that it is true at runtime. And all platforms except some ARM variants set `SUPPORTS_NATIVE_CX8`, so we can greatly simplify things. Summary of changes: >> - `_supports_cx8` field is only needed when `SUPPORTS_NATIVE_CX8` is not defined >> - Assertions for `supports_cx8()` are removed >> - Compiler predicates requiring `supports_cx8()` are removed >> - Access backend is greatly simplified without the need for lock-based alternative >> - `java.util.concurrent.AtomicLongFieldUpdater` is simplified without the need for a lock-based alternative >> >> I did consider moving all the ARM `kuser_helper` related code to be only defined when `SUPPORTS_NATIVE_CX8` is not defined, but there was a theoretical risk this could change the behaviour if ARMv7 binaries were run on other ARM CPU's. I added a note to that effect in vm_version_linux_arm32.cpp so the ARM port maintainers could clean this up further if desired. >> >> Testing: >> - All Oracle tiers 1-5 builds (which includes an ARMv7 build) >> - GHA builds/tests >> - Oracle tiers 1-3 sanity testing >> >> Zero changes coming in via [JDK-8319777](https://bugs.openjdk.org/browse/JDK-8319777) will be merged when they arrive. >> >> Thanks. > > David Holmes has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains seven additional commits since the last revision: > > - Merge with master and update Zero code accordingly > - Merge branch 'master' into 8318776-supports_cx8 > - Remove unnecessary includes of vm_version.hpp. > Fix copyright years. > - Remove cx8 comment as no longer relevant (the spinlock is used regardless of cx8) > - Remove suports_cx8() checks from gtest > - Remove test for VMSupportsCX8 > - 8318776: Require supports_cx8 to always be true Wow! This PR is much larger than I expected. Thumbs up! ------------- Marked as reviewed by dcubed (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16625#pullrequestreview-1745089631 From dcubed at openjdk.org Wed Nov 22 18:39:15 2023 From: dcubed at openjdk.org (Daniel D. Daugherty) Date: Wed, 22 Nov 2023 18:39:15 GMT Subject: RFR: 8318776: Require supports_cx8 to always be true [v5] In-Reply-To: References: