From haosun at openjdk.org Fri Dec 1 05:24:08 2023 From: haosun at openjdk.org (Hao Sun) Date: Fri, 1 Dec 2023 05:24:08 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission [v4] In-Reply-To: References:

Message-ID: On Wed, 29 Nov 2023 22:40:35 GMT, Cesar Soares Lucas wrote: >> # Description >> >> Please review this PR with a patch to re-use the same C2_MacroAssembler object to emit all instructions in the same compilation unit. >> >> Overall, the change is pretty simple. However, due to the renaming of the variable to access C2_MacroAssembler, from `_masm.` to `masm->`, and also some method prototype changes, the patch became quite large. >> >> # Help Needed for Testing >> >> I don't have access to all platforms necessary to test this. I hope some other folks can help with testing on `S390`, `RISC-V` and `PPC`. >> >> # Testing status >> >> ## tier1 >> >> | | Win | Mac | Linux | >> |----------|---------|---------|---------| >> | ARM64 | | | | >> | ARM32 | | | | >> | x86 | | | | >> | x64 | | | | >> | PPC64 | | | | >> | S390x | | | | >> | RiscV | | | | > > Cesar Soares Lucas has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains five commits: > > - Fix merge > - Catch up with master branch. > > Merge remote-tracking branch 'origin/master' into reuse-macroasm > - Some inst_mark fixes; Catch up with master. > - Catch up with changes on master > - Reuse same C2_MacroAssembler object to emit instructions. src/hotspot/share/jvmci/jvmciCodeInstaller.cpp line 1246: > 1244: if (_next_call_type == INVOKESTATIC || _next_call_type == INVOKESPECIAL) { > 1245: // Need a static call stub for transitions from compiled to interpreted. > 1246: C2_MacroAssembler masm(&buffer); Hi, I encountered one build failure: JDK build **without C2** fails on Linux/AArch64. The configure I used --with-debug-level=release --with-jvm-features=-compiler2 --disable-precompiled-headers The error log === Output from failing command(s) repeated here === * For target hotspot_variant-server_libjvm_gtest_objs_BUILD_GTEST_LIBJVM_link: /usr/bin/ld: /tmp/build-release/hotspot/variant-server/libjvm/objs/jvmciCodeInstaller.o: in function `C2_MacroAssembler::C2_MacroAssembler(CodeBuffer*)': make/hotspot/src/hotspot/share/opto/c2_MacroAssembler.hpp:38: undefined reference to `vtable for C2_MacroAssembler' /usr/bin/ld: make/hotspot/src/hotspot/share/opto/c2_MacroAssembler.hpp:38: undefined reference to `vtable for C2_MacroAssembler' collect2: error: ld returned 1 exit status * For target hotspot_variant-server_libjvm_objs_BUILD_LIBJVM_link: /usr/bin/ld: /tmp/build-release/hotspot/variant-server/libjvm/objs/jvmciCodeInstaller.o: in function `C2_MacroAssembler::C2_MacroAssembler(CodeBuffer*)': make/hotspot/src/hotspot/share/opto/c2_MacroAssembler.hpp:38: undefined reference to `vtable for C2_MacroAssembler' /usr/bin/ld: make/hotspot/src/hotspot/share/opto/c2_MacroAssembler.hpp:38: undefined reference to `vtable for C2_MacroAssembler' collect2: error: ld returned 1 exit status * All command lines available in /tmp/build-release/make-support/failure-logs. === End of repeated output === I suggest making the following change: Suggestion: MacroAssembler masm(&buffer); ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16484#discussion_r1411551297 From tschatzl at openjdk.org Fri Dec 1 09:15:08 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 1 Dec 2023 09:15:08 GMT Subject: RFR: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading [v3] In-Reply-To: References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com>

Message-ID: On Thu, 30 Nov 2023 10:34:37 GMT, Thomas Schatzl wrote: >> src/hotspot/share/gc/shared/classUnloadingContext.hpp line 63: >> >>> 61: }; >>> 62: >>> 63: class DefaultClassUnloadingContext : public ClassUnloadingContext { >> >> I don't understand why they need to be two classes, even after reading "These are the reason for the class hierarchy for...". The reference to future/other PR(s) in the description doesn't really help -- it's unclear what is *necessary* for the current PR and what is preparation for future PR(s). > > The base class is unnecessary for this change, but very nice to have for future changes. I'll just merge them for now, and separate them again later. As explained in the now removed original description: >These are the reason for the class hierarchy for `ClassUnloadingContext`: the goal is to ultimately have about this phasing (for G1): >1. collect all dead CLDs, using the `register_unloading_class_loader_data` method *only* >2. parallelize the stuff in `ClassLoaderData::unload()` in one way or another, adding them to the `complete_cleaning` >(parallel) phase. > >Particularly the split of `SystemDictionary::do_unloading` into "only" traversing the CLDs to find the dead ones and then in parallel process them in 2. above warrants a separate `ClassUnloadingContext` (to facilitate parallelism). I.e. the non-parallelized case does not need the necessary data structure complications and helper methods to do efficient parallel iteration. However as mentioned I removed the class hierarchy for this change as it's unnecessary for now; let's discuss this hierarchy separately. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16759#discussion_r1411815090 From lucy at openjdk.org Fri Dec 1 11:09:05 2023 From: lucy at openjdk.org (Lutz Schmidt) Date: Fri, 1 Dec 2023 11:09:05 GMT Subject: RFR: 8320807: [PPC64][ZGC] C1 generates wrong code for atomics In-Reply-To: References: Message-ID: On Mon, 27 Nov 2023 21:52:53 GMT, Martin Doerr wrote: > Debugging test failures on PPC64 in java/lang/Thread/virtual/stress/Skynet.java#ZGenerational has shown that the ldarx+stdcx_ loop for uncompressed Oops in `LIR_Assembler::atomic_op` is wrong: `__ mr(Rtmp, Robj);` is inside of the ldarx+stdcx_ loop, but must be outside of it. Repeated execution leads to wrong store value. > In addition, zBarrierSetC1.cpp expects `cas_obj` and `xchg` to contain all necessary memory barriers. That doesn't fit to the current PPC64 design which inserts memory barriers on LIR level instead. I've changed this and moved them into the assembler code for all GCs. > While debugging, I have optimized out an unnecessary branch in `ZBarrierSetAssembler::store_barrier_medium`. Looks good. Thanks for detecting and fixing this bug. ------------- Marked as reviewed by lucy (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16835#pullrequestreview-1759560030 From rrich at openjdk.org Fri Dec 1 11:47:07 2023 From: rrich at openjdk.org (Richard Reingruber) Date: Fri, 1 Dec 2023 11:47:07 GMT Subject: RFR: 8320807: [PPC64][ZGC] C1 generates wrong code for atomics In-Reply-To: References: Message-ID: On Mon, 27 Nov 2023 21:52:53 GMT, Martin Doerr wrote: > Debugging test failures on PPC64 in java/lang/Thread/virtual/stress/Skynet.java#ZGenerational has shown that the ldarx+stdcx_ loop for uncompressed Oops in `LIR_Assembler::atomic_op` is wrong: `__ mr(Rtmp, Robj);` is inside of the ldarx+stdcx_ loop, but must be outside of it. Repeated execution leads to wrong store value. > In addition, zBarrierSetC1.cpp expects `cas_obj` and `xchg` to contain all necessary memory barriers. That doesn't fit to the current PPC64 design which inserts memory barriers on LIR level instead. I've changed this and moved them into the assembler code for all GCs. > While debugging, I have optimized out an unnecessary branch in `ZBarrierSetAssembler::store_barrier_medium`. Looks good. Just two suggestions to improve comments. Cheers, Richard. src/hotspot/cpu/ppc/c1_LIRAssembler_ppc.cpp line 2603: > 2601: } > 2602: > 2603: // Volatile load may be followed by Unsafe CAS. Suggestion: // There might be a volatile load before this Unsafe CAS. src/hotspot/cpu/ppc/c1_LIRAssembler_ppc.cpp line 2984: > 2982: } > 2983: > 2984: // Volatile load may be followed by Unsafe OP. Suggestion: // There might be a volatile load before this Unsafe OP. ------------- Marked as reviewed by rrich (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16835#pullrequestreview-1759593616 PR Review Comment: https://git.openjdk.org/jdk/pull/16835#discussion_r1411967401 PR Review Comment: https://git.openjdk.org/jdk/pull/16835#discussion_r1411988559 From mdoerr at openjdk.org Fri Dec 1 13:22:39 2023 From: mdoerr at openjdk.org (Martin Doerr) Date: Fri, 1 Dec 2023 13:22:39 GMT Subject: RFR: 8320807: [PPC64][ZGC] C1 generates wrong code for atomics [v2] In-Reply-To: References: Message-ID: > Debugging test failures on PPC64 in java/lang/Thread/virtual/stress/Skynet.java#ZGenerational has shown that the ldarx+stdcx_ loop for uncompressed Oops in `LIR_Assembler::atomic_op` is wrong: `__ mr(Rtmp, Robj);` is inside of the ldarx+stdcx_ loop, but must be outside of it. Repeated execution leads to wrong store value. > In addition, zBarrierSetC1.cpp expects `cas_obj` and `xchg` to contain all necessary memory barriers. That doesn't fit to the current PPC64 design which inserts memory barriers on LIR level instead. I've changed this and moved them into the assembler code for all GCs. > While debugging, I have optimized out an unnecessary branch in `ZBarrierSetAssembler::store_barrier_medium`. Martin Doerr has updated the pull request incrementally with one additional commit since the last revision: Improve comments. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/16835/files - new: https://git.openjdk.org/jdk/pull/16835/files/54a0d25d..e6958e4c Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=16835&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=16835&range=00-01 Stats: 3 lines in 2 files changed: 0 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/16835.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16835/head:pull/16835 PR: https://git.openjdk.org/jdk/pull/16835 From mdoerr at openjdk.org Fri Dec 1 13:22:41 2023 From: mdoerr at openjdk.org (Martin Doerr) Date: Fri, 1 Dec 2023 13:22:41 GMT Subject: RFR: 8320807: [PPC64][ZGC] C1 generates wrong code for atomics In-Reply-To: References: Message-ID: On Mon, 27 Nov 2023 21:52:53 GMT, Martin Doerr wrote: > Debugging test failures on PPC64 in java/lang/Thread/virtual/stress/Skynet.java#ZGenerational has shown that the ldarx+stdcx_ loop for uncompressed Oops in `LIR_Assembler::atomic_op` is wrong: `__ mr(Rtmp, Robj);` is inside of the ldarx+stdcx_ loop, but must be outside of it. Repeated execution leads to wrong store value. > In addition, zBarrierSetC1.cpp expects `cas_obj` and `xchg` to contain all necessary memory barriers. That doesn't fit to the current PPC64 design which inserts memory barriers on LIR level instead. I've changed this and moved them into the assembler code for all GCs. > While debugging, I have optimized out an unnecessary branch in `ZBarrierSetAssembler::store_barrier_medium`. Thanks for the reviews! I've improved the comments as suggested. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16835#issuecomment-1836106556 From mdoerr at openjdk.org Fri Dec 1 14:47:17 2023 From: mdoerr at openjdk.org (Martin Doerr) Date: Fri, 1 Dec 2023 14:47:17 GMT Subject: Integrated: 8320807: [PPC64][ZGC] C1 generates wrong code for atomics In-Reply-To: References: Message-ID: On Mon, 27 Nov 2023 21:52:53 GMT, Martin Doerr wrote: > Debugging test failures on PPC64 in java/lang/Thread/virtual/stress/Skynet.java#ZGenerational has shown that the ldarx+stdcx_ loop for uncompressed Oops in `LIR_Assembler::atomic_op` is wrong: `__ mr(Rtmp, Robj);` is inside of the ldarx+stdcx_ loop, but must be outside of it. Repeated execution leads to wrong store value. > In addition, zBarrierSetC1.cpp expects `cas_obj` and `xchg` to contain all necessary memory barriers. That doesn't fit to the current PPC64 design which inserts memory barriers on LIR level instead. I've changed this and moved them into the assembler code for all GCs. > While debugging, I have optimized out an unnecessary branch in `ZBarrierSetAssembler::store_barrier_medium`. This pull request has now been integrated. Changeset: 3087e14c Author: Martin Doerr URL: https://git.openjdk.org/jdk/commit/3087e14cde9257680f0406b11942f9cb7739cb7b Stats: 121 lines in 4 files changed: 42 ins; 70 del; 9 mod 8320807: [PPC64][ZGC] C1 generates wrong code for atomics Reviewed-by: lucy, rrich ------------- PR: https://git.openjdk.org/jdk/pull/16835 From kdnilsen at openjdk.org Fri Dec 1 15:56:13 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 1 Dec 2023 15:56:13 GMT Subject: RFR: 8321149: GenShen: Test for is_good_progress() following degen needs to sum all available memoryChange behavior of is_good_progerss Message-ID: This reduces the sensitivity of the trigger that upgrades to Full GC following a completed degenerated GC. The change was motivated by examination of a test workload that required one more full GC and one more degenerated GC than was expected. Upon examination of the GC log, it was determined that the extra Full GC was triggered because the mutator free set following degen was approximately 10% below the critical threshold, even though the total available memory within heap was more than three fold the critical threshold. Following this change, no Full GCs were required and one fewer degenerated cycle was required (which had occurred immediately following the Full GC due to the long unproductive STW the pause caused by Full GC). P50 latency improved by 15%, and p100 latencies improved by over 100 fold. More comprehensive testing over a broader set of workloads reveals this change is not "universally better". Of particular concern is degradation of specjbb numbers on x86, but not aarch64, tests. I'm inclined to believe this change represents net improvement, but it would be best to delay integration until we have a better understanding of specjbb performance issues, and how they might be impacted by this change. Control: shenandoah-x86-template Experiment: fix-is-good-progress-gh-x86 Most impacted benchmarks | Most impacted metrics ------------------------------------------------------------------------------------------------------- Genshen/specjbb2015 | cpu_system Shenandoah/h2 | concurrent_evacuation Genshen/diluvian_large | transfer_old_from_satb Shenandoah/serial | critical_jops Genshen/extremem-large-45g | trigger_learn Only in experiment | Only in control ------------------------------------------------------------------------------------------------------- extremem-ff/trigger_failure | scimark.fft.small/concurrent_evacuation tomcat/cwr_total | scimark.lu.large/concurrent_strong_roots extremem-ff/degen_update_roots | scimark.fft.small/concurrent_thread_roots extremem-ff/du_thread_roots | scimark.fft.small/concurrent_update_thread_roots extremem-ff/du_total | scimark.fft.large/concurrent_weak_roots Shenandoah ------------------------------------------------------------------------------------------------------- +40.97% serial/concurrent_evacuation p=0.00668 Control: 16.703ms (+/- 6.30ms) 45 Test: 23.547ms (+/- 6.06ms) 8 +8.77% scimark.sor.large/cpu_system p=0.04575 Control: 2.293s (+/- 0.29s ) 60 Test: 2.494s (+/- 0.25s ) 10 +6.02% mpegaudio/cpu_system p=0.03141 Control: 5.712s (+/- 0.44s ) 60 Test: 6.056s (+/- 0.54s ) 10 -11.37% h2/dacapo_metered_latency_max p=0.04808 Control: 31.523ms (+/- 4.40ms) 60 Test: 28.305ms (+/- 3.24ms) 10 -11.37% h2/dacapo_simple_latency_max p=0.04811 Control: 31.523ms (+/- 4.40ms) 60 Test: 28.305ms (+/- 3.24ms) 10 -6.67% hyperalloc_a3072_o4096/trigger_learn p=0.01965 Control: 11.333 (+/- 0.90 ) 60 Test: 10.625 (+/- 0.67 ) 10 Genshen ------------------------------------------------------------------------------------------------------- +31.13% extremem-large-45g/transfer_old_from_satb p=0.01612 Control: 1.444ms (+/-267.77us) 25 Test: 1.893ms (+/-560.32us) 4 -44.61% diluvian_large/pause_degenerated_gc_n p=0.00862 Control: 23.097s (+/- 6.86s ) 36 Test: 15.972s (+/- 5.04s ) 11 -44.61% diluvian_large/pause_degenerated_gc_g p=0.00862 Control: 23.097s (+/- 6.86s ) 36 Test: 15.972s (+/- 5.04s ) 11 -13.37% crypto.signverify/pause_init_update_refs_g p=0.04082 Control: 2.807ms (+/-728.62us) 89 Test: 2.477ms (+/-714.60us) 20 -13.04% specjbb2015/trigger_failure p=0.04891 Control: 122.083 (+/- 22.98 ) 60 Test: 108.000 (+/- 28.48 ) 10 -11.70% specjbb2015/sla_100000_jops p=0.00312 Control: 6284.875 (+/-517.36 ) 60 Test: 5626.500 (+/-549.73 ) 10 -10.18% extremem-phased/trigger_spike p=0.01099 Control: 32.917 (+/- 3.76 ) 60 Test: 29.875 (+/- 2.64 ) 10 -8.36% specjbb2015/sla_75000_jops p=0.04222 Control: 6005.917 (+/-539.64 ) 60 Test: 5542.625 (+/-564.13 ) 10 -6.26% specjbb2015/critical_jops p=0.02604 Control: 5202.917 (+/-348.10 ) 60 Test: 4896.500 (+/-411.36 ) 10 ----------------------------------------- Pipeline: fix-is-good-progress-gh-aarch64 Elapsed: 8:31:03.815824 [kdnilsen at amazon.com](mailto:kdnilsen at amazon.com) : openjdk : fix-is-goo : 2023-11-30 23:37:57+00:00 : e62dd2 : Change behavior of is_good_progerss [kemperw at amazon.com](mailto:kemperw at amazon.com) : codepipeline-helpers : mainline : 2023-11-21 10:20:23-08:00 : f42398 : Add more detail from tlab messages b05cfa4a-66b5-4d94-81b6-35f965db87c0: Smoke: 0:00:00 Integration: 6:38:22.240000 294: Passed 294: Total Control: shenandoah-aarch64-template Experiment: fix-is-good-progress-gh-aarch64 Most impacted benchmarks | Most impacted metrics ------------------------------------------------------------------------------------------------------- Genshen/diluvian_large | cpu_system Shenandoah/diluvian_large | trigger_threshold Genshen/xalan | context_switch_count Genshen/specjbb2015 | fu_thread_roots Genshen/diluvian_medium | calculate_addresses Only in experiment | Only in control ------------------------------------------------------------------------------------------------------- avrora/jhiccup_max_pause | scimark.lu.large/cmr_thread_roots mpegaudio/concurrent_update_refs | scimark.sparse.large/cmr_thread_roots crypto.rsa/concurrent_thread_roots | mpegaudio/concurrent_mark_roots diluvian_large/dcu_unlink_clds | scimark.sparse.small/concurrent_update_thread_roots scimark.lu.large/pause_init_update_refs_g | diluvian_large/cwr_total Shenandoah ------------------------------------------------------------------------------------------------------- +175.00% diluvian_large/trigger_threshold p=0.00004 Control: 1.000 (+/- 0.00 ) 40 Test: 2.750 (+/- 0.48 ) 10 -20.30% jme/cpu_system p=0.01368 Control: 0.328s (+/- 0.06s ) 40 Test: 0.273s (+/- 0.04s ) 10 Genshen ------------------------------------------------------------------------------------------------------- -23.83% diluvian_large/pause_degenerated_gc_n p=0.00866 Control: 25.834s (+/- 4.48s ) 32 Test: 20.861s (+/- 6.63s ) 13 -23.83% diluvian_large/pause_degenerated_gc_g p=0.00866 Control: 25.834s (+/- 4.48s ) 32 Test: 20.861s (+/- 6.63s ) 13 -8.07% diluvian_large/calculate_addresses p=0.03028 Control: 2.854s (+/-284.07ms) 28 Test: 2.641s (+/-194.48ms) 10 -6.96% diluvian_medium/cpu_system p=0.03454 Control: 2.345s (+/- 0.21s ) 50 Test: 2.192s (+/- 0.19s ) 10 -5.73% specjbb2015/fu_thread_roots p=0.00000 Control: 2.346ms (+/-203.14us) 238 Test: 2.219ms (+/-126.57us) 48 -5.29% xalan/context_switch_count p=0.00851 Control: 24231.100 (+/-2546.76 ) 50 Test: 23013.125 (+/-972.88 ) 10 ------------- Commit messages: - Change behavior of is_good_progerss Changes: https://git.openjdk.org/shenandoah/pull/364/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=364&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8321149 Stats: 9 lines in 2 files changed: 8 ins; 0 del; 1 mod Patch: https://git.openjdk.org/shenandoah/pull/364.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/364/head:pull/364 PR: https://git.openjdk.org/shenandoah/pull/364 From roland at openjdk.org Fri Dec 1 16:55:08 2023 From: roland at openjdk.org (Roland Westrelin) Date: Fri, 1 Dec 2023 16:55:08 GMT Subject: RFR: 8321122: Shenandoah: Remove ShenandoahLoopOptsAfterExpansion flag In-Reply-To: References: Message-ID: <_PWNVaUyOIoUREKSS3D3ru2-GnCSzpZDoMdaYiwpIIY=.7a2dcf93-b7b3-4488-92fb-ae734e5b0d95@github.com> On Thu, 30 Nov 2023 16:00:17 GMT, Aleksey Shipilev wrote: > More flags cleanup. `ShenandoahLoopOptsAfterExpansion` was added long time ago to gate more aggressive barrier optimizations in C2. It was default since then. There seem to be no reason to keep it as the flag. Just checking, @rwestrel -- you don't need it anymore, right? It could be useful for diagnostic purposes. Say if a user reports a crash, one trouble shooting step could be to ask them to run with `ShenandoahLoopOptsAfterExpansion` off. This said, that never happened AFAIK. So I don't have a strong opinion. I'm also wondering if: they make an actual difference performance wise? with other changes that happened in the meantime, are they still doing anything useful? ------------- PR Comment: https://git.openjdk.org/jdk/pull/16908#issuecomment-1836454231 From wkemper at openjdk.org Fri Dec 1 18:14:44 2023 From: wkemper at openjdk.org (William Kemper) Date: Fri, 1 Dec 2023 18:14:44 GMT Subject: RFR: 8321149: GenShen: Test for is_good_progress() following degen needs to sum all available memory In-Reply-To: References: Message-ID: On Fri, 1 Dec 2023 15:51:34 GMT, Kelvin Nilsen wrote: > This reduces the sensitivity of the trigger that upgrades to Full GC following a completed degenerated GC. The change was motivated by examination of a test workload that required one more full GC and one more degenerated GC than was expected. Upon examination of the GC log, it was determined that the extra Full GC was triggered because the mutator free set following degen was approximately 10% below the critical threshold, even though the total available memory within heap was more than three fold the critical threshold. > > Following this change, no Full GCs were required and one fewer degenerated cycle was required (which had occurred immediately following the Full GC due to the long unproductive STW the pause caused by Full GC). P50 latency improved by 15%, and p100 latencies improved by over 100 fold. > > More comprehensive testing over a broader set of workloads reveals this change is not "universally better". Of particular concern is degradation of specjbb numbers on x86, but not aarch64, tests. I'm inclined to believe this change represents net improvement, but it would be best to delay integration until we have a better understanding of specjbb performance issues, and how they might be impacted by this change. > > > > Control: shenandoah-x86-template > Experiment: fix-is-good-progress-gh-x86 > > Most impacted benchmarks | Most impacted metrics > ------------------------------------------------------------------------------------------------------- > Genshen/specjbb2015 | cpu_system > Shenandoah/h2 | concurrent_evacuation > Genshen/diluvian_large | transfer_old_from_satb > Shenandoah/serial | critical_jops > Genshen/extremem-large-45g | trigger_learn > > > Only in experiment | Only in control > ------------------------------------------------------------------------------------------------------- > extremem-ff/trigger_failure | scimark.fft.small/concurrent_evacuation > tomcat/cwr_total | scimark.lu.large/concurrent_strong_roots > extremem-ff/de... src/hotspot/share/gc/shenandoah/shenandoahMetrics.cpp line 1: > 1: /* We need to think about what _good progress_ means... In my mind, it means the _mutators_ have enough memory to last them through a concurrent cycle. Maybe we ought to have this code evaluate the allocation rate and gc times as the heuristics do? Alternatively, we could also reduce `ShenandoahCriticalFreeThreshold` to achieve a similar end? Or, for generational mode, we should look at how much memory the degenerated cycle made available in the _young_ generation? ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/364#discussion_r1412432036 From kdnilsen at openjdk.org Fri Dec 1 23:53:21 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 1 Dec 2023 23:53:21 GMT Subject: RFR: 8321149: GenShen: Test for is_good_progress() following degen needs to sum all available memory In-Reply-To: References:

Message-ID: On Fri, 1 Dec 2023 18:12:05 GMT, William Kemper wrote: >> This reduces the sensitivity of the trigger that upgrades to Full GC following a completed degenerated GC. The change was motivated by examination of a test workload that required one more full GC and one more degenerated GC than was expected. Upon examination of the GC log, it was determined that the extra Full GC was triggered because the mutator free set following degen was approximately 10% below the critical threshold, even though the total available memory within heap was more than three fold the critical threshold. >> >> Following this change, no Full GCs were required and one fewer degenerated cycle was required (which had occurred immediately following the Full GC due to the long unproductive STW the pause caused by Full GC). P50 latency improved by 15%, and p100 latencies improved by over 100 fold. >> >> More comprehensive testing over a broader set of workloads reveals this change is not "universally better". Of particular concern is degradation of specjbb numbers on x86, but not aarch64, tests. I'm inclined to believe this change represents net improvement, but it would be best to delay integration until we have a better understanding of specjbb performance issues, and how they might be impacted by this change. >> >> >> >> Control: shenandoah-x86-template >> Experiment: fix-is-good-progress-gh-x86 >> >> Most impacted benchmarks | Most impacted metrics >> ------------------------------------------------------------------------------------------------------- >> Genshen/specjbb2015 | cpu_system >> Shenandoah/h2 | concurrent_evacuation >> Genshen/diluvian_large | transfer_old_from_satb >> Shenandoah/serial | critical_jops >> Genshen/extremem-large-45g | trigger_learn >> >> >> Only in experiment | Only in control >> ------------------------------------------------------------------------------------------------------- >> extremem-ff/trigger_failure | scimark.fft.small/concurrent_evacuation >> tomcat/cwr_total | scimark.lu.large/concurrent... > > src/hotspot/share/gc/shenandoah/shenandoahMetrics.cpp line 1: > >> 1: /* > > We need to think about what _good progress_ means... In my mind, it means the _mutators_ have enough memory to last them through a concurrent cycle. Maybe we ought to have this code evaluate the allocation rate and gc times as the heuristics do? Alternatively, we could also reduce `ShenandoahCriticalFreeThreshold` to achieve a similar end? Or, for generational mode, we should look at how much memory the degenerated cycle made available in the _young_ generation? Thanks. These are good insights. I will explore further. ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/364#discussion_r1412669834 From kdnilsen at openjdk.org Sat Dec 2 00:40:16 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 2 Dec 2023 00:40:16 GMT Subject: RFR: 8321149: GenShen: Test for is_good_progress() following degen needs to sum all available memory In-Reply-To: References: Message-ID: On Fri, 1 Dec 2023 15:51:34 GMT, Kelvin Nilsen wrote: > This reduces the sensitivity of the trigger that upgrades to Full GC following a completed degenerated GC. The change was motivated by examination of a test workload that required one more full GC and one more degenerated GC than was expected. Upon examination of the GC log, it was determined that the extra Full GC was triggered because the mutator free set following degen was approximately 10% below the critical threshold, even though the total available memory within heap was more than three fold the critical threshold. > > Following this change, no Full GCs were required and one fewer degenerated cycle was required (which had occurred immediately following the Full GC due to the long unproductive STW the pause caused by Full GC). P50 latency improved by 15%, and p100 latencies improved by over 100 fold. > > More comprehensive testing over a broader set of workloads reveals this change is not "universally better". Of particular concern is degradation of specjbb numbers on x86, but not aarch64, tests. I'm inclined to believe this change represents net improvement, but it would be best to delay integration until we have a better understanding of specjbb performance issues, and how they might be impacted by this change. > > > > Control: shenandoah-x86-template > Experiment: fix-is-good-progress-gh-x86 > > Most impacted benchmarks | Most impacted metrics > ------------------------------------------------------------------------------------------------------- > Genshen/specjbb2015 | cpu_system > Shenandoah/h2 | concurrent_evacuation > Genshen/diluvian_large | transfer_old_from_satb > Shenandoah/serial | critical_jops > Genshen/extremem-large-45g | trigger_learn > > > Only in experiment | Only in control > ------------------------------------------------------------------------------------------------------- > extremem-ff/trigger_failure | scimark.fft.small/concurrent_evacuation > tomcat/cwr_total | scimark.lu.large/concurrent_strong_roots > extremem-ff/de... I'm going back to the drawing board on this one... ------------- PR Comment: https://git.openjdk.org/shenandoah/pull/364#issuecomment-1836960846 From kdnilsen at openjdk.org Sat Dec 2 00:40:16 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Sat, 2 Dec 2023 00:40:16 GMT Subject: Withdrawn: 8321149: GenShen: Test for is_good_progress() following degen needs to sum all available memory In-Reply-To: References: Message-ID: On Fri, 1 Dec 2023 15:51:34 GMT, Kelvin Nilsen wrote: > This reduces the sensitivity of the trigger that upgrades to Full GC following a completed degenerated GC. The change was motivated by examination of a test workload that required one more full GC and one more degenerated GC than was expected. Upon examination of the GC log, it was determined that the extra Full GC was triggered because the mutator free set following degen was approximately 10% below the critical threshold, even though the total available memory within heap was more than three fold the critical threshold. > > Following this change, no Full GCs were required and one fewer degenerated cycle was required (which had occurred immediately following the Full GC due to the long unproductive STW the pause caused by Full GC). P50 latency improved by 15%, and p100 latencies improved by over 100 fold. > > More comprehensive testing over a broader set of workloads reveals this change is not "universally better". Of particular concern is degradation of specjbb numbers on x86, but not aarch64, tests. I'm inclined to believe this change represents net improvement, but it would be best to delay integration until we have a better understanding of specjbb performance issues, and how they might be impacted by this change. > > > > Control: shenandoah-x86-template > Experiment: fix-is-good-progress-gh-x86 > > Most impacted benchmarks | Most impacted metrics > ------------------------------------------------------------------------------------------------------- > Genshen/specjbb2015 | cpu_system > Shenandoah/h2 | concurrent_evacuation > Genshen/diluvian_large | transfer_old_from_satb > Shenandoah/serial | critical_jops > Genshen/extremem-large-45g | trigger_learn > > > Only in experiment | Only in control > ------------------------------------------------------------------------------------------------------- > extremem-ff/trigger_failure | scimark.fft.small/concurrent_evacuation > tomcat/cwr_total | scimark.lu.large/concurrent_strong_roots > extremem-ff/de... This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.org/shenandoah/pull/364 From tschatzl at openjdk.org Mon Dec 4 07:56:04 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 4 Dec 2023 07:56:04 GMT Subject: RFR: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading [v5] In-Reply-To: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> Message-ID: > Insert code blobs in a sorted fashion to exploit the finger-optimization when adding, making this procedure O(n) instead of O(n^2) > > Introduces a globally available ClassUnloadingContext that contains common methods pertaining to class and code unloading. GCs may use it to efficiently manage unlinked class loader datas and nmethods to allow use of common methods (unlink/merge). > > The steps typically are registering a new to be unlinked CLD/nmethod, and then purge its memory later. STW collectors perform this work in one big chunk taking the CodeCache_lock, for the entire duration, while concurrent collectors lock/unlock for every insertion to allow for concurrent users for the lock to progress. > > Some care has been taken to stay consistent with an "unloading = unlinking + purge" scheme; however particularly the existing CLD handling API (still) mixes unlinking and purging in its CLD::unload() call. To simplify this change that is mostly geared towards separating nmethod unlinking from purging, to make code blob freeing O(n) instead of O(n^2). > > Upcoming changes will > * separate nmethod unregistering from nmethod purging to allow doing that in bulk (for the STW collectors); that can significantly reduce code purging time for the STW collectors. > * better name the second stage of unlinking (called "cleaning" throughout, e.g. the work done in `G1CollectedHeap::complete_cleaning`) > * untangle CLD unlinking and what's called "cleaning" now to allow moving more stuff into the second unlinking stage for better parallelism > * G1: move some significant tasks from the remark pause to concurrent (unregistering nmethods, freeing code blobs and cld/metaspace purging) > * Maybe move Serial/Parallel GC metaspace purging closer to other unlinking/purging code to keep things local and allow easier logging. > > Please also first looking into the (small) PR this depends on. > > The crash on linux-x86 is fixed by PR#16766 which I split out for quicker reviews. > > Testing: tier1-7 > > Thanks, > Thomas Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains ten commits: - Merge branch 'master' into 8317809-sorted-insertion-of-free-blobs - remove trailing whitespace - fix indentation after recent commit - Address ayang/iwalulya review comments, remove inheritance in ClassUnloadingContext for now as unnecessary for this change, use iterators, other review comments - Merge branch 'master' into mergeme - iwalulya review, naming - 8317809 Insert code blobs in a sorted fashion to exploit the finger-optimization when adding, making this procedure O(n) instead of O(n^2) Introduce a globally available ClassUnloadingContext that contains common methods pertaining to class and code unloading. GCs may use it to efficiently manage unlinked class loader datas and nmethods to allow use of common methods (unlink/merge). The steps typically are registering a new to be unlinked CLD/nmethod, and then purge its memory later. STW collectors perform this work in one big chunk taking the CodeCache_lock, for the entire duration, while concurrent collectors lock/unlock for every insertion to allow for concurrent users for the lock to progress. Some care has been taken to stay consistent with an "unloading = unlinking + purge" scheme; however particularly the existing CLD handling API (still) mixes unlinking and purging in its CLD::unload() call. To simplify this change that is mostly geared towards separating nmethod unlinking from purging, to make code blob freeing O(n) instead of O(n^2). Upcoming changes will * separate nmethod unregistering from nmethod purging to allow doing that in bulk (for the STW collectors); that can significantly reduce code purging time for the STW collectors. * better name the second stage of unlinking (called "cleaning" throughout, e.g. the work done in `G1CollectedHeap::complete_cleaning`) * untangle CLD unlinking and what's called "cleaning" now to allow moving more stuff into the second unlinking stage for better parallelism * G1: move some signifcant tasks from the remark pause to concurrent (unregistering nmethods, freeing code blobs and cld/metaspace purging) * Maybe move Serial/Parallel GC metaspace purging closer to other unlinking/purging code to keep things local and allow easier logging. - Only run test case on debug VMs, sufficient - 8320331 g1 full gc "during" verification accesses half-unloaded metadata ------------- Changes: https://git.openjdk.org/jdk/pull/16759/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16759&range=04 Stats: 474 lines in 28 files changed: 347 ins; 83 del; 44 mod Patch: https://git.openjdk.org/jdk/pull/16759.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16759/head:pull/16759 PR: https://git.openjdk.org/jdk/pull/16759 From shade at openjdk.org Mon Dec 4 08:22:46 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 4 Dec 2023 08:22:46 GMT Subject: Integrated: 8321120: Shenandoah: Remove ShenandoahElasticTLAB flag In-Reply-To: References: Message-ID: <_41KszjxJh2oN8zr4F09pC57VosIQjSqsdK_7r7WQBg=.e0b34729-cfdd-4df1-9120-6578a80dcd72@github.com> On Thu, 30 Nov 2023 15:50:52 GMT, Aleksey Shipilev wrote: > We have added `ShenandoahElasticTLAB` a long time ago, to provide the escape hatch if elastic TLABs would misbehave. We have been running with elastic TLABs for years without problems. Taking care of elastic/non-elastic TLAB allocs adds to technical debt. > > Additional testing: > - [x] hotspot_gc_shenandoah This pull request has now been integrated. Changeset: 93b9235f Author: Aleksey Shipilev URL: https://git.openjdk.org/jdk/commit/93b9235f19dab2a0c08ba8a1afcc82ee71be4c0b Stats: 92 lines in 5 files changed: 0 ins; 87 del; 5 mod 8321120: Shenandoah: Remove ShenandoahElasticTLAB flag Reviewed-by: kdnilsen, ysr ------------- PR: https://git.openjdk.org/jdk/pull/16907 From shade at openjdk.org Mon Dec 4 08:23:41 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 4 Dec 2023 08:23:41 GMT Subject: RFR: 8321122: Shenandoah: Remove ShenandoahLoopOptsAfterExpansion flag In-Reply-To: <_PWNVaUyOIoUREKSS3D3ru2-GnCSzpZDoMdaYiwpIIY=.7a2dcf93-b7b3-4488-92fb-ae734e5b0d95@github.com> References: <_PWNVaUyOIoUREKSS3D3ru2-GnCSzpZDoMdaYiwpIIY=.7a2dcf93-b7b3-4488-92fb-ae734e5b0d95@github.com> Message-ID: On Fri, 1 Dec 2023 16:52:45 GMT, Roland Westrelin wrote: > > More flags cleanup. `ShenandoahLoopOptsAfterExpansion` was added long time ago to gate more aggressive barrier optimizations in C2. It was default since then. There seem to be no reason to keep it as the flag. Just checking, @rwestrel -- you don't need it anymore, right? > > It could be useful for diagnostic purposes. Say if a user reports a crash, one trouble shooting step could be to ask them to run with `ShenandoahLoopOptsAfterExpansion` off. This said, that never happened AFAIK. So I don't have a strong opinion. I'm also wondering if: they make an actual difference performance wise? with other changes that happened in the meantime, are they still doing anything useful? I think we want to keep the flag list from growing, and this is one of the steps of culling the flags we do not need anymore. I cannot remember a case when this flag was useful either. I think long-term we want to switch to late barrier insertion anyway, which would render this flag obsolete. We can remove the flag when that happens, if you prefer. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16908#issuecomment-1838046406 From ayang at openjdk.org Mon Dec 4 09:11:41 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 4 Dec 2023 09:11:41 GMT Subject: RFR: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading [v4] In-Reply-To: <2_ONmN3qxsdTIEJMbQhE82nBn10l_RnZm5-DZAmQn2I=.9ad49c74-3721-4299-8a9e-b8c1973eb494@github.com> References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> <2_ONmN3qxsdTIEJMbQhE82nBn10l_RnZm5-DZAmQn2I=.9ad49c74-3721-4299-8a9e-b8c1973eb494@github.com> Message-ID: On Thu, 30 Nov 2023 11:38:28 GMT, Thomas Schatzl wrote: >> Insert code blobs in a sorted fashion to exploit the finger-optimization when adding, making this procedure O(n) instead of O(n^2) >> >> Introduces a globally available ClassUnloadingContext that contains common methods pertaining to class and code unloading. GCs may use it to efficiently manage unlinked class loader datas and nmethods to allow use of common methods (unlink/merge). >> >> The steps typically are registering a new to be unlinked CLD/nmethod, and then purge its memory later. STW collectors perform this work in one big chunk taking the CodeCache_lock, for the entire duration, while concurrent collectors lock/unlock for every insertion to allow for concurrent users for the lock to progress. >> >> Some care has been taken to stay consistent with an "unloading = unlinking + purge" scheme; however particularly the existing CLD handling API (still) mixes unlinking and purging in its CLD::unload() call. To simplify this change that is mostly geared towards separating nmethod unlinking from purging, to make code blob freeing O(n) instead of O(n^2). >> >> Upcoming changes will >> * separate nmethod unregistering from nmethod purging to allow doing that in bulk (for the STW collectors); that can significantly reduce code purging time for the STW collectors. >> * better name the second stage of unlinking (called "cleaning" throughout, e.g. the work done in `G1CollectedHeap::complete_cleaning`) >> * untangle CLD unlinking and what's called "cleaning" now to allow moving more stuff into the second unlinking stage for better parallelism >> * G1: move some significant tasks from the remark pause to concurrent (unregistering nmethods, freeing code blobs and cld/metaspace purging) >> * Maybe move Serial/Parallel GC metaspace purging closer to other unlinking/purging code to keep things local and allow easier logging. >> >> Please also first looking into the (small) PR this depends on. >> >> The crash on linux-x86 is fixed by PR#16766 which I split out for quicker reviews. >> >> Testing: tier1-7 >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with three additional commits since the last revision: > > - remove trailing whitespace > - fix indentation after recent commit > - Address ayang/iwalulya review comments, remove inheritance in ClassUnloadingContext for now as unnecessary for this change, use iterators, other review comments src/hotspot/share/gc/g1/g1ConcurrentMark.cpp line 1686: > 1684: // Unload Klasses, String, Code Cache, etc. > 1685: if (ClassUnloadingWithConcurrentMark) { > 1686: _g1h->unload_classes_and_code("Class Unloading", &is_alive, _gc_timer_cm); Kind of preexisting: I'd not expect find "Class Unloading" inside `weak_refs_work()`. Its caller level is more reasonable, IMO. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16759#discussion_r1410762799 From ayang at openjdk.org Mon Dec 4 09:11:47 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 4 Dec 2023 09:11:47 GMT Subject: RFR: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading [v5] In-Reply-To: References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> Message-ID: On Mon, 4 Dec 2023 07:56:04 GMT, Thomas Schatzl wrote: >> Insert code blobs in a sorted fashion to exploit the finger-optimization when adding, making this procedure O(n) instead of O(n^2) >> >> Introduces a globally available ClassUnloadingContext that contains common methods pertaining to class and code unloading. GCs may use it to efficiently manage unlinked class loader datas and nmethods to allow use of common methods (unlink/merge). >> >> The steps typically are registering a new to be unlinked CLD/nmethod, and then purge its memory later. STW collectors perform this work in one big chunk taking the CodeCache_lock, for the entire duration, while concurrent collectors lock/unlock for every insertion to allow for concurrent users for the lock to progress. >> >> Some care has been taken to stay consistent with an "unloading = unlinking + purge" scheme; however particularly the existing CLD handling API (still) mixes unlinking and purging in its CLD::unload() call. To simplify this change that is mostly geared towards separating nmethod unlinking from purging, to make code blob freeing O(n) instead of O(n^2). >> >> Upcoming changes will >> * separate nmethod unregistering from nmethod purging to allow doing that in bulk (for the STW collectors); that can significantly reduce code purging time for the STW collectors. >> * better name the second stage of unlinking (called "cleaning" throughout, e.g. the work done in `G1CollectedHeap::complete_cleaning`) >> * untangle CLD unlinking and what's called "cleaning" now to allow moving more stuff into the second unlinking stage for better parallelism >> * G1: move some significant tasks from the remark pause to concurrent (unregistering nmethods, freeing code blobs and cld/metaspace purging) >> * Maybe move Serial/Parallel GC metaspace purging closer to other unlinking/purging code to keep things local and allow easier logging. >> >> Please also first looking into the (small) PR this depends on. >> >> The crash on linux-x86 is fixed by PR#16766 which I split out for quicker reviews. >> >> Testing: tier1-7 >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains ten commits: > > - Merge branch 'master' into 8317809-sorted-insertion-of-free-blobs > - remove trailing whitespace > - fix indentation after recent commit > - Address ayang/iwalulya review comments, remove inheritance in ClassUnloadingContext for now as unnecessary for this change, use iterators, other review comments > - Merge branch 'master' into mergeme > - iwalulya review, naming > - 8317809 Insert code blobs in a sorted fashion to exploit the finger-optimization when adding, making this procedure O(n) instead of O(n^2) > > Introduce a globally available ClassUnloadingContext that contains common methods pertaining to class and code unloading. > GCs may use it to efficiently manage unlinked class loader datas and nmethods to allow use of common methods (unlink/merge). > > The steps typically are registering a new to be unlinked CLD/nmethod, and then purge its memory later. STW collectors perform > this work in one big chunk taking the CodeCache_lock, for the entire duration, while concurrent collectors lock/unlock for every > insertion to allow for concurrent users for the lock to progress. > > Some care has been taken to stay consistent with an "unloading = unlinking + purge" scheme; however particularly the existing > CLD handling API (still) mixes unlinking and purging in its CLD::unload() call. To simplify this change that is mostly geared > towards separating nmethod unlinking from purging, to make code blob freeing O(n) instead of O(n^2). > > Upcoming changes will > * separate nmethod unregistering from nmethod purging to allow doing that in bulk (for the STW collectors); that can significantly > reduce code purging time for the STW collectors. > * better name the second stage of unlinking (called "cleaning" throughout, e.g. the work done in `G1CollectedHeap::complete_cleaning`) > * untangle CLD unlinking and what's called "cleaning" now to allow moving more stuff into the second unlinking stage for better > parallelism > * G1: move some signifcant tasks from the remark pause to concurrent (unregistering nmethods, freeing code blobs and cld/metaspace purging) > * Maybe move Serial/Parallel GC metaspace purging closer to other unlinking/purging code to keep things local and allow easier logging. > - Only run test case on debug VMs, sufficient > - 8320331 g1 full gc "during" verification accesses half-unloa... src/hotspot/share/gc/shared/classUnloadingContext.hpp line 38: > 36: static ClassUnloadingContext* _context; > 37: > 38: ClassLoaderData* volatile _cld_head; I don't get why `ClassLoaderData* volatile _cld_head;` needs to be inside `ClassUnloadingContext`. What's the motivation for the change to files in `src/hotspot/share/classfile/`? Seems that they have nothing to do with other fields inside this class, which are for sort-before-free of nmethods. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16759#discussion_r1413566268 From tschatzl at openjdk.org Mon Dec 4 11:15:41 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 4 Dec 2023 11:15:41 GMT Subject: RFR: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading [v4] In-Reply-To: References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> <2_ONmN3qxsdTIEJMbQhE82nBn10l_RnZm5-DZAmQn2I=.9ad49c74-3721-4299-8a9e-b8c1973eb494@github.com> Message-ID: On Thu, 30 Nov 2023 14:36:19 GMT, Albert Mingkun Yang wrote: >> Thomas Schatzl has updated the pull request incrementally with three additional commits since the last revision: >> >> - remove trailing whitespace >> - fix indentation after recent commit >> - Address ayang/iwalulya review comments, remove inheritance in ClassUnloadingContext for now as unnecessary for this change, use iterators, other review comments > > src/hotspot/share/gc/g1/g1ConcurrentMark.cpp line 1686: > >> 1684: // Unload Klasses, String, Code Cache, etc. >> 1685: if (ClassUnloadingWithConcurrentMark) { >> 1686: _g1h->unload_classes_and_code("Class Unloading", &is_alive, _gc_timer_cm); > > Kind of preexisting: I'd not expect find "Class Unloading" inside `weak_refs_work()`. Its caller level is more reasonable, IMO. So what is your suggestion here? In some way CLDs and nmethods are weak references, so I can see reason for the original placement, but since it's a fairly large chunk of functionality one might expect it at a higher level. If so, I can move it in a separate patch. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16759#discussion_r1413725314 From tschatzl at openjdk.org Mon Dec 4 11:15:46 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 4 Dec 2023 11:15:46 GMT Subject: RFR: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading [v5] In-Reply-To: References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com>

Message-ID: On Mon, 4 Dec 2023 09:08:29 GMT, Albert Mingkun Yang wrote: >> Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains ten commits: >> >> - Merge branch 'master' into 8317809-sorted-insertion-of-free-blobs >> - remove trailing whitespace >> - fix indentation after recent commit >> - Address ayang/iwalulya review comments, remove inheritance in ClassUnloadingContext for now as unnecessary for this change, use iterators, other review comments >> - Merge branch 'master' into mergeme >> - iwalulya review, naming >> - 8317809 Insert code blobs in a sorted fashion to exploit the finger-optimization when adding, making this procedure O(n) instead of O(n^2) >> >> Introduce a globally available ClassUnloadingContext that contains common methods pertaining to class and code unloading. >> GCs may use it to efficiently manage unlinked class loader datas and nmethods to allow use of common methods (unlink/merge). >> >> The steps typically are registering a new to be unlinked CLD/nmethod, and then purge its memory later. STW collectors perform >> this work in one big chunk taking the CodeCache_lock, for the entire duration, while concurrent collectors lock/unlock for every >> insertion to allow for concurrent users for the lock to progress. >> >> Some care has been taken to stay consistent with an "unloading = unlinking + purge" scheme; however particularly the existing >> CLD handling API (still) mixes unlinking and purging in its CLD::unload() call. To simplify this change that is mostly geared >> towards separating nmethod unlinking from purging, to make code blob freeing O(n) instead of O(n^2). >> >> Upcoming changes will >> * separate nmethod unregistering from nmethod purging to allow doing that in bulk (for the STW collectors); that can significantly >> reduce code purging time for the STW collectors. >> * better name the second stage of unlinking (called "cleaning" throughout, e.g. the work done in `G1CollectedHeap::complete_cleaning`) >> * untangle CLD unlinking and what's called "cleaning" now to allow moving more stuff into the second unlinking stage for better >> parallelism >> * G1: move some signifcant tasks from the remark pause to concurrent (unregistering nmethods, freeing code blobs and cld/metaspace purging) >> * Maybe move Serial/Parallel GC metaspace purging closer to other unlinking/purging code to keep things local and allow easier logging. >> - Only run test case on debug VMs, sufficient >> ... > > src/hotspot/share/gc/shared/classUnloadingContext.hpp line 38: > >> 36: static ClassUnloadingContext* _context; >> 37: >> 38: ClassLoaderData* volatile _cld_head; > > I don't get why `ClassLoaderData* volatile _cld_head;` needs to be inside `ClassUnloadingContext`. What's the motivation for the change to files in `src/hotspot/share/classfile/`? Seems that they have nothing to do with other fields inside this class, which are for sort-before-free of nmethods. This change introduces a `ClassUnloadingContext`, not a `NmethodUnloadingContext`, and class (and nmethod) unloading consists of both (unloaded) class handling and (unloaded) nmethod handling. I did not want to first introduce a `NmethodUnloadingContext` that will soon be changed to the `ClassUnloadingContext` anyway as for easier parallel access to the CLDs a bit later. A `ClassUnloadingContext` without handling CLDs seems odd, and just fixing the nmethod stuff only seems just busywork for both the author and the reviewers later (for the sake of having minimal patches). However if you insist on changing this, I can remove the CLD handling and rename everything now and later again. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16759#discussion_r1413723381 From tschatzl at openjdk.org Mon Dec 4 12:39:59 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 4 Dec 2023 12:39:59 GMT Subject: RFR: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading [v6] In-Reply-To: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> Message-ID: > Insert code blobs in a sorted fashion to exploit the finger-optimization when adding, making this procedure O(n) instead of O(n^2) > > Introduces a globally available ClassUnloadingContext that contains common methods pertaining to class and code unloading. GCs may use it to efficiently manage unlinked class loader datas and nmethods to allow use of common methods (unlink/merge). > > The steps typically are registering a new to be unlinked CLD/nmethod, and then purge its memory later. STW collectors perform this work in one big chunk taking the CodeCache_lock, for the entire duration, while concurrent collectors lock/unlock for every insertion to allow for concurrent users for the lock to progress. > > Some care has been taken to stay consistent with an "unloading = unlinking + purge" scheme; however particularly the existing CLD handling API (still) mixes unlinking and purging in its CLD::unload() call. To simplify this change that is mostly geared towards separating nmethod unlinking from purging, to make code blob freeing O(n) instead of O(n^2). > > Upcoming changes will > * separate nmethod unregistering from nmethod purging to allow doing that in bulk (for the STW collectors); that can significantly reduce code purging time for the STW collectors. > * better name the second stage of unlinking (called "cleaning" throughout, e.g. the work done in `G1CollectedHeap::complete_cleaning`) > * untangle CLD unlinking and what's called "cleaning" now to allow moving more stuff into the second unlinking stage for better parallelism > * G1: move some significant tasks from the remark pause to concurrent (unregistering nmethods, freeing code blobs and cld/metaspace purging) > * Maybe move Serial/Parallel GC metaspace purging closer to other unlinking/purging code to keep things local and allow easier logging. > > Please also first looking into the (small) PR this depends on. > > The crash on linux-x86 is fixed by PR#16766 which I split out for quicker reviews. > > Testing: tier1-7 > > Thanks, > Thomas Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: ayang review: move class unloading outside of weak_refs_work ------------- Changes: - all: https://git.openjdk.org/jdk/pull/16759/files - new: https://git.openjdk.org/jdk/pull/16759/files/f8ddc131..4afd996a Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=16759&range=05 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=16759&range=04-05 Stats: 13 lines in 1 file changed: 7 ins; 6 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/16759.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16759/head:pull/16759 PR: https://git.openjdk.org/jdk/pull/16759 From ayang at openjdk.org Mon Dec 4 13:00:46 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 4 Dec 2023 13:00:46 GMT Subject: RFR: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading [v6] In-Reply-To: References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> Message-ID: On Mon, 4 Dec 2023 12:39:59 GMT, Thomas Schatzl wrote: >> Insert code blobs in a sorted fashion to exploit the finger-optimization when adding, making this procedure O(n) instead of O(n^2) >> >> Introduces a globally available ClassUnloadingContext that contains common methods pertaining to class and code unloading. GCs may use it to efficiently manage unlinked class loader datas and nmethods to allow use of common methods (unlink/merge). >> >> The steps typically are registering a new to be unlinked CLD/nmethod, and then purge its memory later. STW collectors perform this work in one big chunk taking the CodeCache_lock, for the entire duration, while concurrent collectors lock/unlock for every insertion to allow for concurrent users for the lock to progress. >> >> Some care has been taken to stay consistent with an "unloading = unlinking + purge" scheme; however particularly the existing CLD handling API (still) mixes unlinking and purging in its CLD::unload() call. To simplify this change that is mostly geared towards separating nmethod unlinking from purging, to make code blob freeing O(n) instead of O(n^2). >> >> Upcoming changes will >> * separate nmethod unregistering from nmethod purging to allow doing that in bulk (for the STW collectors); that can significantly reduce code purging time for the STW collectors. >> * better name the second stage of unlinking (called "cleaning" throughout, e.g. the work done in `G1CollectedHeap::complete_cleaning`) >> * untangle CLD unlinking and what's called "cleaning" now to allow moving more stuff into the second unlinking stage for better parallelism >> * G1: move some significant tasks from the remark pause to concurrent (unregistering nmethods, freeing code blobs and cld/metaspace purging) >> * Maybe move Serial/Parallel GC metaspace purging closer to other unlinking/purging code to keep things local and allow easier logging. >> >> Please also first looking into the (small) PR this depends on. >> >> The crash on linux-x86 is fixed by PR#16766 which I split out for quicker reviews. >> >> Testing: tier1-7 >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > ayang review: move class unloading outside of weak_refs_work Only one minor & subjective comment. src/hotspot/share/gc/shared/classUnloadingContext.hpp line 69: > 67: void register_unlinked_nmethod(nmethod* nm); > 68: void purge_nmethods(); > 69: void free_code_blobs(); I feel this is exposing too much detail, especially when the adjacent API just combines them. ------------- Marked as reviewed by ayang (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16759#pullrequestreview-1762337991 PR Review Comment: https://git.openjdk.org/jdk/pull/16759#discussion_r1413833775 From tschatzl at openjdk.org Mon Dec 4 13:55:41 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 4 Dec 2023 13:55:41 GMT Subject: RFR: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading [v6] In-Reply-To: References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> Message-ID: On Mon, 4 Dec 2023 12:39:59 GMT, Thomas Schatzl wrote: >> Insert code blobs in a sorted fashion to exploit the finger-optimization when adding, making this procedure O(n) instead of O(n^2) >> >> Introduces a globally available ClassUnloadingContext that contains common methods pertaining to class and code unloading. GCs may use it to efficiently manage unlinked class loader datas and nmethods to allow use of common methods (unlink/merge). >> >> The steps typically are registering a new to be unlinked CLD/nmethod, and then purge its memory later. STW collectors perform this work in one big chunk taking the CodeCache_lock, for the entire duration, while concurrent collectors lock/unlock for every insertion to allow for concurrent users for the lock to progress. >> >> Some care has been taken to stay consistent with an "unloading = unlinking + purge" scheme; however particularly the existing CLD handling API (still) mixes unlinking and purging in its CLD::unload() call. To simplify this change that is mostly geared towards separating nmethod unlinking from purging, to make code blob freeing O(n) instead of O(n^2). >> >> Upcoming changes will >> * separate nmethod unregistering from nmethod purging to allow doing that in bulk (for the STW collectors); that can significantly reduce code purging time for the STW collectors. >> * better name the second stage of unlinking (called "cleaning" throughout, e.g. the work done in `G1CollectedHeap::complete_cleaning`) >> * untangle CLD unlinking and what's called "cleaning" now to allow moving more stuff into the second unlinking stage for better parallelism >> * G1: move some significant tasks from the remark pause to concurrent (unregistering nmethods, freeing code blobs and cld/metaspace purging) >> * Maybe move Serial/Parallel GC metaspace purging closer to other unlinking/purging code to keep things local and allow easier logging. >> >> Please also first looking into the (small) PR this depends on. >> >> The crash on linux-x86 is fixed by PR#16766 which I split out for quicker reviews. >> >> Testing: tier1-7 >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > ayang review: move class unloading outside of weak_refs_work Fwiw, to put this change in a bit more context: it is part of a series of changes to improve class unloading performance back to pre-jdk21 levels (and better). The basic plan: * this change, [JDK-8317809](https://bugs.openjdk.org/browse/JDK-8317809), that improves nmethod sorting/free list handling (and introduces the ClassUnloadingContext) * [JDK-8317007](https://bugs.openjdk.org/browse/JDK-8317007) that allows bulk unregistering of nmethods instead of (slow) per-nmethod unregistering (also out for review) With the above two changes, Remark pause time should be <= before removal of the code root sweeper (lots of changes went in already that improved time taken for various parts of the class/code unloading). I am planning the following follow-ups in the next few months (after FC time will be spent on bugfixing, and holidays coming up): * (for G1) move out several parts of class unloading into the concurrent phase, at least this will include - bulk nmethod unregistering ([JDK-8317007](https://bugs.openjdk.org/browse/JDK-8317007)) - nmethod code blob freeing (this change) - metaspace unloading Not necessarily in a single change; this basically halves g1 remark pause times again in my testing. * split up and parallelize ClassLoaderData unloading; currently with this change, when registering CLDs CLD->unload() is immediately called as before. However this is wasteful as most of that method can either be "obviously" parallelized or made so that other tasks can run in parallel. So the plan is that class unloading (`SystemDictionary::do_unloading`) will be split into a part that iterates only over the CLD list to determine dead ones, and a parallel part. There are no CR/PRs out for these latter two items, but hopefully this will short of making everything concurrent keep class/code unloading times low enough for some time. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16759#issuecomment-1838687260 From roland at openjdk.org Mon Dec 4 14:16:40 2023 From: roland at openjdk.org (Roland Westrelin) Date: Mon, 4 Dec 2023 14:16:40 GMT Subject: RFR: 8321122: Shenandoah: Remove ShenandoahLoopOptsAfterExpansion flag In-Reply-To: References: <_PWNVaUyOIoUREKSS3D3ru2-GnCSzpZDoMdaYiwpIIY=.7a2dcf93-b7b3-4488-92fb-ae734e5b0d95@github.com> Message-ID: <8yB2OpjTUTHG6E1Wb_Ry5F9HuMinHSY-yt-2eDSB3zw=.8878b365-2275-4227-a8fe-058c6cb7cebe@github.com> On Mon, 4 Dec 2023 08:21:01 GMT, Aleksey Shipilev wrote: > I think we want to keep the flag list from growing, and this is one of the steps of culling the flags we do not need anymore. I cannot remember a case when this flag was useful either. Ok. Then. > I think long-term we want to switch to late barrier insertion anyway, which would render this flag obsolete. They seem to be different things to me. Having the ability to optimize the barrier after it's expanded or inserted seems like a useful thing to have in any case. Is there evidence that late barrier insertion would solve existing issues? ------------- PR Comment: https://git.openjdk.org/jdk/pull/16908#issuecomment-1838723895 From shade at openjdk.org Mon Dec 4 15:47:37 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 4 Dec 2023 15:47:37 GMT Subject: RFR: 8321122: Shenandoah: Remove ShenandoahLoopOptsAfterExpansion flag In-Reply-To: <8yB2OpjTUTHG6E1Wb_Ry5F9HuMinHSY-yt-2eDSB3zw=.8878b365-2275-4227-a8fe-058c6cb7cebe@github.com> References: <_PWNVaUyOIoUREKSS3D3ru2-GnCSzpZDoMdaYiwpIIY=.7a2dcf93-b7b3-4488-92fb-ae734e5b0d95@github.com> <8yB2OpjTUTHG6E1Wb_Ry5F9HuMinHSY-yt-2eDSB3zw=.8878b365-2275-4227-a8fe-058c6cb7cebe@github.com> Message-ID: <9wc8yRtwoi-AmBI_kEQOJSwXnDRzlQaBVSOA8GpVO84=.9f60a36f-c26e-4e1f-b2cb-050780ed5ff6@github.com> On Mon, 4 Dec 2023 14:13:56 GMT, Roland Westrelin wrote: > > I think long-term we want to switch to late barrier insertion anyway, which would render this flag obsolete. > > They seem to be different things to me. Having the ability to optimize the barrier after it's expanded or inserted seems like a useful thing to have in any case. Is there evidence that late barrier insertion would solve existing issues? I thought late barrier insertion basically avoids dealing with any optimizer questions, as we insert the barriers close to emit. So if we are to do late barrier insertion, we don't need the bulk of barrier optimization code? Maybe I misunderstand where late barrier insertion fits in C2. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16908#issuecomment-1838919728 From roland at openjdk.org Mon Dec 4 15:55:40 2023 From: roland at openjdk.org (Roland Westrelin) Date: Mon, 4 Dec 2023 15:55:40 GMT Subject: RFR: 8321122: Shenandoah: Remove ShenandoahLoopOptsAfterExpansion flag In-Reply-To: <9wc8yRtwoi-AmBI_kEQOJSwXnDRzlQaBVSOA8GpVO84=.9f60a36f-c26e-4e1f-b2cb-050780ed5ff6@github.com> References: <_PWNVaUyOIoUREKSS3D3ru2-GnCSzpZDoMdaYiwpIIY=.7a2dcf93-b7b3-4488-92fb-ae734e5b0d95@github.com> <8yB2OpjTUTHG6E1Wb_Ry5F9HuMinHSY-yt-2eDSB3zw=.8878b365-2275-4227-a8fe-058c6cb7cebe@github.com> <9wc8yRtwoi-AmBI_kEQOJSwXnDRzlQaBVSOA8GpVO84=.9f60a36f-c26e-4e1f-b2cb-050780ed5ff6@github.com> Message-ID: On Mon, 4 Dec 2023 15:45:14 GMT, Aleksey Shipilev wrote: > I thought late barrier insertion basically avoids dealing with any optimizer questions, as we insert the barriers close to emit. So if we are to do late barrier insertion, we don't need the bulk of barrier optimization code? Maybe I misunderstand where late barrier insertion fits in C2. But the flag you're removing controls optimizations of the barrier code once expanded. For instance: 1) a' = barrier(a); b' = barrier(b); is expanded to: 2) if (heap_stable) { a' = a; } else { a' = slow_case(a); } if (heap_stable) { b' = b; } else { b' = slow_case(b); } which is then optimized under the `ShenandoahLoopOptsAfterExpansion` flag to: 3) if (heap_stable) { a' = a; b' = b; } else { a' = slow_case(a); b' = slow_case(b); } Late insertion gives 2) without 1) but it doesn't give you 3). Depending where insertion happens (I haven't looked at zgc code) you could still do 3) next or it would be too late. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16908#issuecomment-1838935805 From iwalulya at openjdk.org Tue Dec 5 10:17:36 2023 From: iwalulya at openjdk.org (Ivan Walulya) Date: Tue, 5 Dec 2023 10:17:36 GMT Subject: RFR: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading [v6] In-Reply-To: References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> Message-ID: <6SIvBc11xioUfOBp7ywrO5T1wbE3yvwo6kCLXfYVz6c=.2284c684-38d3-436e-9bf0-34a662a5a73f@github.com> On Mon, 4 Dec 2023 12:39:59 GMT, Thomas Schatzl wrote: >> Insert code blobs in a sorted fashion to exploit the finger-optimization when adding, making this procedure O(n) instead of O(n^2) >> >> Introduces a globally available ClassUnloadingContext that contains common methods pertaining to class and code unloading. GCs may use it to efficiently manage unlinked class loader datas and nmethods to allow use of common methods (unlink/merge). >> >> The steps typically are registering a new to be unlinked CLD/nmethod, and then purge its memory later. STW collectors perform this work in one big chunk taking the CodeCache_lock, for the entire duration, while concurrent collectors lock/unlock for every insertion to allow for concurrent users for the lock to progress. >> >> Some care has been taken to stay consistent with an "unloading = unlinking + purge" scheme; however particularly the existing CLD handling API (still) mixes unlinking and purging in its CLD::unload() call. To simplify this change that is mostly geared towards separating nmethod unlinking from purging, to make code blob freeing O(n) instead of O(n^2). >> >> Upcoming changes will >> * separate nmethod unregistering from nmethod purging to allow doing that in bulk (for the STW collectors); that can significantly reduce code purging time for the STW collectors. >> * better name the second stage of unlinking (called "cleaning" throughout, e.g. the work done in `G1CollectedHeap::complete_cleaning`) >> * untangle CLD unlinking and what's called "cleaning" now to allow moving more stuff into the second unlinking stage for better parallelism >> * G1: move some significant tasks from the remark pause to concurrent (unregistering nmethods, freeing code blobs and cld/metaspace purging) >> * Maybe move Serial/Parallel GC metaspace purging closer to other unlinking/purging code to keep things local and allow easier logging. >> >> Please also first looking into the (small) PR this depends on. >> >> The crash on linux-x86 is fixed by PR#16766 which I split out for quicker reviews. >> >> Testing: tier1-7 >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > ayang review: move class unloading outside of weak_refs_work Marked as reviewed by iwalulya (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/16759#pullrequestreview-1764645335 From tschatzl at openjdk.org Tue Dec 5 10:40:57 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 5 Dec 2023 10:40:57 GMT Subject: RFR: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading [v6] In-Reply-To: References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com>

Message-ID: <3gKHRtjSjtbzvd3oXVu6w5axjWfSg9CDLVO53OipxTM=.373a814d-fc5c-41e4-86af-b8f133b6e255@github.com> On Mon, 4 Dec 2023 12:57:57 GMT, Albert Mingkun Yang wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> ayang review: move class unloading outside of weak_refs_work > > Only one minor & subjective comment. Thanks @albertnetymk @walulyai for your reviews ------------- PR Comment: https://git.openjdk.org/jdk/pull/16759#issuecomment-1840480439 From tschatzl at openjdk.org Tue Dec 5 10:40:59 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 5 Dec 2023 10:40:59 GMT Subject: Integrated: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading In-Reply-To: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> Message-ID: On Tue, 21 Nov 2023 11:03:12 GMT, Thomas Schatzl wrote: > Insert code blobs in a sorted fashion to exploit the finger-optimization when adding, making this procedure O(n) instead of O(n^2) > > Introduces a globally available ClassUnloadingContext that contains common methods pertaining to class and code unloading. GCs may use it to efficiently manage unlinked class loader datas and nmethods to allow use of common methods (unlink/merge). > > The steps typically are registering a new to be unlinked CLD/nmethod, and then purge its memory later. STW collectors perform this work in one big chunk taking the CodeCache_lock, for the entire duration, while concurrent collectors lock/unlock for every insertion to allow for concurrent users for the lock to progress. > > Some care has been taken to stay consistent with an "unloading = unlinking + purge" scheme; however particularly the existing CLD handling API (still) mixes unlinking and purging in its CLD::unload() call. To simplify this change that is mostly geared towards separating nmethod unlinking from purging, to make code blob freeing O(n) instead of O(n^2). > > Upcoming changes will > * separate nmethod unregistering from nmethod purging to allow doing that in bulk (for the STW collectors); that can significantly reduce code purging time for the STW collectors. > * better name the second stage of unlinking (called "cleaning" throughout, e.g. the work done in `G1CollectedHeap::complete_cleaning`) > * untangle CLD unlinking and what's called "cleaning" now to allow moving more stuff into the second unlinking stage for better parallelism > * G1: move some significant tasks from the remark pause to concurrent (unregistering nmethods, freeing code blobs and cld/metaspace purging) > * Maybe move Serial/Parallel GC metaspace purging closer to other unlinking/purging code to keep things local and allow easier logging. > > Please also first looking into the (small) PR this depends on. > > The crash on linux-x86 is fixed by PR#16766 which I split out for quicker reviews. > > Testing: tier1-7 > > Thanks, > Thomas This pull request has now been integrated. Changeset: 30817b74 Author: Thomas Schatzl URL: https://git.openjdk.org/jdk/commit/30817b742300f10f566e6aee3a8c1f8af4ab3083 Stats: 483 lines in 28 files changed: 352 ins; 87 del; 44 mod 8317809: Insertion of free code blobs into code cache can be very slow during class unloading Reviewed-by: iwalulya, ayang ------------- PR: https://git.openjdk.org/jdk/pull/16759 From kdnilsen at openjdk.org Tue Dec 5 14:46:24 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 5 Dec 2023 14:46:24 GMT Subject: RFR: 8321401: GenShen: Each mutator must see FullGC before throwing OOM Message-ID: Require each thread to observe unproductive Full GC before it throws OOM exception. ------------- Commit messages: - Fix test and remove conditionally compiled redundant code - Require full gc before throwing out-of-memory Changes: https://git.openjdk.org/shenandoah/pull/365/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=365&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8321401 Stats: 15 lines in 2 files changed: 2 ins; 12 del; 1 mod Patch: https://git.openjdk.org/shenandoah/pull/365.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/365/head:pull/365 PR: https://git.openjdk.org/shenandoah/pull/365 From kdnilsen at openjdk.org Tue Dec 5 14:51:34 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 5 Dec 2023 14:51:34 GMT Subject: RFR: 8321401: GenShen: Each mutator must see FullGC before throwing OOM [v2] In-Reply-To: References: Message-ID: > Require each thread to observe unproductive Full GC before it throws OOM exception. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Fixup comment ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/365/files - new: https://git.openjdk.org/shenandoah/pull/365/files/92b814f3..6d6f16db Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=365&range=01 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=365&range=00-01 Stats: 3 lines in 1 file changed: 2 ins; 0 del; 1 mod Patch: https://git.openjdk.org/shenandoah/pull/365.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/365/head:pull/365 PR: https://git.openjdk.org/shenandoah/pull/365 From coleenp at openjdk.org Tue Dec 5 15:50:59 2023 From: coleenp at openjdk.org (Coleen Phillimore) Date: Tue, 5 Dec 2023 15:50:59 GMT Subject: RFR: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading [v6] In-Reply-To: References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com> Message-ID: On Mon, 4 Dec 2023 12:39:59 GMT, Thomas Schatzl wrote: >> Insert code blobs in a sorted fashion to exploit the finger-optimization when adding, making this procedure O(n) instead of O(n^2) >> >> Introduces a globally available ClassUnloadingContext that contains common methods pertaining to class and code unloading. GCs may use it to efficiently manage unlinked class loader datas and nmethods to allow use of common methods (unlink/merge). >> >> The steps typically are registering a new to be unlinked CLD/nmethod, and then purge its memory later. STW collectors perform this work in one big chunk taking the CodeCache_lock, for the entire duration, while concurrent collectors lock/unlock for every insertion to allow for concurrent users for the lock to progress. >> >> Some care has been taken to stay consistent with an "unloading = unlinking + purge" scheme; however particularly the existing CLD handling API (still) mixes unlinking and purging in its CLD::unload() call. To simplify this change that is mostly geared towards separating nmethod unlinking from purging, to make code blob freeing O(n) instead of O(n^2). >> >> Upcoming changes will >> * separate nmethod unregistering from nmethod purging to allow doing that in bulk (for the STW collectors); that can significantly reduce code purging time for the STW collectors. >> * better name the second stage of unlinking (called "cleaning" throughout, e.g. the work done in `G1CollectedHeap::complete_cleaning`) >> * untangle CLD unlinking and what's called "cleaning" now to allow moving more stuff into the second unlinking stage for better parallelism >> * G1: move some significant tasks from the remark pause to concurrent (unregistering nmethods, freeing code blobs and cld/metaspace purging) >> * Maybe move Serial/Parallel GC metaspace purging closer to other unlinking/purging code to keep things local and allow easier logging. >> >> Please also first looking into the (small) PR this depends on. >> >> The crash on linux-x86 is fixed by PR#16766 which I split out for quicker reviews. >> >> Testing: tier1-7 >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > ayang review: move class unloading outside of weak_refs_work src/hotspot/share/gc/shared/classUnloadingContext.cpp line 91: > 89: cld->classes_do(f); > 90: } > 91: } I don't understand why CLDG specific methods were moved here. They should be unaware of nmethod purging. and these 4 methods don't have any nmethod purging in them either and are specific to the CLDG implementation. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16759#discussion_r1415844874 From cslucas at openjdk.org Tue Dec 5 16:57:59 2023 From: cslucas at openjdk.org (Cesar Soares Lucas) Date: Tue, 5 Dec 2023 16:57:59 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission [v4] In-Reply-To: References:

Message-ID: On Fri, 1 Dec 2023 03:07:33 GMT, Hao Sun wrote: >> Cesar Soares Lucas has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains five commits: >> >> - Fix merge >> - Catch up with master branch. >> >> Merge remote-tracking branch 'origin/master' into reuse-macroasm >> - Some inst_mark fixes; Catch up with master. >> - Catch up with changes on master >> - Reuse same C2_MacroAssembler object to emit instructions. > > src/hotspot/share/jvmci/jvmciCodeInstaller.cpp line 1246: > >> 1244: if (_next_call_type == INVOKESTATIC || _next_call_type == INVOKESPECIAL) { >> 1245: // Need a static call stub for transitions from compiled to interpreted. >> 1246: C2_MacroAssembler masm(&buffer); > > Hi, I encountered one build failure: JDK build **without C2** fails on Linux/AArch64. > > The configure I used > > > --with-debug-level=release --with-jvm-features=-compiler2 --disable-precompiled-headers > > > The error log > > > === Output from failing command(s) repeated here === > * For target hotspot_variant-server_libjvm_gtest_objs_BUILD_GTEST_LIBJVM_link: > /usr/bin/ld: /tmp/build-release/hotspot/variant-server/libjvm/objs/jvmciCodeInstaller.o: in function `C2_MacroAssembler::C2_MacroAssembler(CodeBuffer*)': > make/hotspot/src/hotspot/share/opto/c2_MacroAssembler.hpp:38: undefined reference to `vtable for C2_MacroAssembler' > /usr/bin/ld: make/hotspot/src/hotspot/share/opto/c2_MacroAssembler.hpp:38: undefined reference to `vtable for C2_MacroAssembler' > collect2: error: ld returned 1 exit status > * For target hotspot_variant-server_libjvm_objs_BUILD_LIBJVM_link: > /usr/bin/ld: /tmp/build-release/hotspot/variant-server/libjvm/objs/jvmciCodeInstaller.o: in function `C2_MacroAssembler::C2_MacroAssembler(CodeBuffer*)': > make/hotspot/src/hotspot/share/opto/c2_MacroAssembler.hpp:38: undefined reference to `vtable for C2_MacroAssembler' > /usr/bin/ld: make/hotspot/src/hotspot/share/opto/c2_MacroAssembler.hpp:38: undefined reference to `vtable for C2_MacroAssembler' > collect2: error: ld returned 1 exit status > > * All command lines available in /tmp/build-release/make-support/failure-logs. > === End of repeated output === > > > I suggest making the following change: > > Suggestion: > > MacroAssembler masm(&buffer); Thank you for catch @shqking ! ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16484#discussion_r1415982687 From cslucas at openjdk.org Tue Dec 5 16:57:55 2023 From: cslucas at openjdk.org (Cesar Soares Lucas) Date: Tue, 5 Dec 2023 16:57:55 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission [v5] In-Reply-To: References: Message-ID: > # Description > > Please review this PR with a patch to re-use the same C2_MacroAssembler object to emit all instructions in the same compilation unit. > > Overall, the change is pretty simple. However, due to the renaming of the variable to access C2_MacroAssembler, from `_masm.` to `masm->`, and also some method prototype changes, the patch became quite large. > > # Help Needed for Testing > > I don't have access to all platforms necessary to test this. I hope some other folks can help with testing on `S390`, `RISC-V` and `PPC`. > > # Testing status > > ## tier1 > > | | Win | Mac | Linux | > |----------|---------|---------|---------| > | ARM64 | | | | > | ARM32 | | | | > | x86 | | | | > | x64 | | | | > | PPC64 | | | | > | S390x | | | | > | RiscV | | | | Cesar Soares Lucas has updated the pull request incrementally with one additional commit since the last revision: Fix build, copyright dates, m4 files. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/16484/files - new: https://git.openjdk.org/jdk/pull/16484/files/afe48fe7..d950806f Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=16484&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=16484&range=03-04 Stats: 18 lines in 12 files changed: 0 ins; 3 del; 15 mod Patch: https://git.openjdk.org/jdk/pull/16484.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16484/head:pull/16484 PR: https://git.openjdk.org/jdk/pull/16484 From tschatzl at openjdk.org Tue Dec 5 18:49:52 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 5 Dec 2023 18:49:52 GMT Subject: RFR: 8317809: Insertion of free code blobs into code cache can be very slow during class unloading [v6] In-Reply-To: References: <_dcFF70_w7IXSjb6w-HuHCBkPyS3a6NlzejtqdfdYnM=.74e0f9df-eca3-49b0-be68-2d5824c16003@github.com>

Message-ID: On Tue, 5 Dec 2023 15:47:57 GMT, Coleen Phillimore wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> ayang review: move class unloading outside of weak_refs_work > > src/hotspot/share/gc/shared/classUnloadingContext.cpp line 91: > >> 89: cld->classes_do(f); >> 90: } >> 91: } > > I don't understand why CLDG specific methods were moved here. They should be unaware of nmethod purging. and these 4 methods don't have any nmethod purging in them either and are specific to the CLDG implementation. The idea is to have GC take control how the unloading CLDs are stored/which data structure it is going to use to manage them to ultimately allow more control about class unloading for parallelization. Which on the one hand makes pauses shorter (for stw collectors), and on the other hand decreases the time the CLDG_lock is held (not sure it is nice that the concurrent collectors currently may hold that one for ~100ms in my test...). I believe having the linked list of unloading CLDs embedded in the CLDs for use by the GC not only seems wrong (i.e. it's a GC data structure located in runtime code) but is also very limiting (need to have one for all, fixed singly linked list). This change moves knowledge of how unloading CLDs are managed to GC area - runtime code just tells GC that a particular CLD is unloading. (Currently the `ClassUnloadingContext` also calls the `unload` method during registration to keep current functionality, but the plan is to separate the step of registration and actual unloading to allow custom handling of the second part; the registering, although it's still walking a singly linked list, is comparatively fast). These four methods provide a thin abstraction over the CLDs that are unloading (that runtime doesn't need and should not worry about imo). With that in place it is possible to slice the actual unloading work into phases according to dependencies (depending on GC if desired), potentially overlapping with other existing phases in collectors already allowing that (e.g. the parallel code unloading, but that is only an implementation detail to reduce overall parallel phases), or even moving some of that work sometime else (the `CLD::unload()` method unfortunately currently may do some memory freeing too). However most time is spent in notifying various components which can be parallelized (at least parallelize the different types of notifications). ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16759#discussion_r1416139587 From shade at openjdk.org Tue Dec 5 18:51:36 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Tue, 5 Dec 2023 18:51:36 GMT Subject: RFR: 8321122: Shenandoah: Remove ShenandoahLoopOptsAfterExpansion flag In-Reply-To: References: <_PWNVaUyOIoUREKSS3D3ru2-GnCSzpZDoMdaYiwpIIY=.7a2dcf93-b7b3-4488-92fb-ae734e5b0d95@github.com> <8yB2OpjTUTHG6E1Wb_Ry5F9HuMinHSY-yt-2eDSB3zw=.8878b365-2275-4227-a8fe-058c6cb7cebe@github.com> <9wc8yRtwoi-AmBI_kEQOJSwXnDRzlQaBVSOA8GpVO84=.9f60a36f-c26e-4e1f-b2cb-050780ed5ff6@github.com> Message-ID: On Mon, 4 Dec 2023 15:52:53 GMT, Roland Westrelin wrote: > > I thought late barrier insertion basically avoids dealing with any optimizer questions, as we insert the barriers close to emit. So if we are to do late barrier insertion, we don't need the bulk of barrier optimization code? Maybe I misunderstand where late barrier insertion fits in C2. > > But the flag you're removing controls optimizations of the barrier code once expanded. Ah, yes. The late insertion relies on assumption that the barrier is somehow lightweight to not require post-insertion optimization. That might not be true for Shenandoah (and G1) barriers. Anyway, would you like to keep the flag in, or should we integrate this PR? ------------- PR Comment: https://git.openjdk.org/jdk/pull/16908#issuecomment-1841419106 From shade at openjdk.org Tue Dec 5 19:53:44 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Tue, 5 Dec 2023 19:53:44 GMT Subject: RFR: 8321410: Shenandoah: Remove ShenandoahSuspendibleWorkers flag Message-ID: We have flipped `ShenandoahSuspendibleWorkers` to true in [JDK-8305403](https://bugs.openjdk.org/browse/JDK-8305403), and backported it back to 17.0.8. We seem to be in consensus that turning this flag off would likely to cause correctness issues: https://github.com/openjdk/jdk/pull/13309#discussion_r1160934484 So we might as well remove this flag altogether. Additional testing: - [ ] `hotspot_gc_shenandoah` ------------- Commit messages: - Fix Changes: https://git.openjdk.org/jdk/pull/16984/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16984&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8321410 Stats: 40 lines in 6 files changed: 0 ins; 33 del; 7 mod Patch: https://git.openjdk.org/jdk/pull/16984.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16984/head:pull/16984 PR: https://git.openjdk.org/jdk/pull/16984 From kdnilsen at openjdk.org Tue Dec 5 20:36:41 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 5 Dec 2023 20:36:41 GMT Subject: RFR: 8321401: Shenandoah: Each mutator must see FullGC before throwing OOM Message-ID: Require each thread to observe unproductive Full GC before it throws OOM exception. ------------- Commit messages: - Require each thread to see full GC before throwing OOM Changes: https://git.openjdk.org/jdk/pull/16985/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16985&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8321401 Stats: 12 lines in 1 file changed: 0 ins; 12 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/16985.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16985/head:pull/16985 PR: https://git.openjdk.org/jdk/pull/16985 From ysr at openjdk.org Tue Dec 5 23:30:35 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Tue, 5 Dec 2023 23:30:35 GMT Subject: RFR: 8321401: Shenandoah: Each mutator must see FullGC before throwing OOM In-Reply-To: References: Message-ID: On Tue, 5 Dec 2023 20:30:54 GMT, Kelvin Nilsen wrote: > Require each thread to observe unproductive Full GC before it throws OOM exception. src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 885: > 883: // This will notify the collector to start a cycle, but will raise > 884: // an OOME to the mutator if the last Full GCs have not made progress. > 885: if (result == nullptr && !req.is_lab_alloc() && get_gc_no_progress_count() > ShenandoahNoProgressThreshold) { Does this render `ShenandoahNoProgressThreshold` a dead option? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16985#discussion_r1416398792 From wkemper at openjdk.org Tue Dec 5 23:40:36 2023 From: wkemper at openjdk.org (William Kemper) Date: Tue, 5 Dec 2023 23:40:36 GMT Subject: RFR: 8321401: Shenandoah: Each mutator must see FullGC before throwing OOM In-Reply-To: References: Message-ID: On Tue, 5 Dec 2023 20:30:54 GMT, Kelvin Nilsen wrote: > Require each thread to observe unproductive Full GC before it throws OOM exception. src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 885: > 883: // This will notify the collector to start a cycle, but will raise > 884: // an OOME to the mutator if the last Full GCs have not made progress. > 885: if (result == nullptr && !req.is_lab_alloc() && get_gc_no_progress_count() > ShenandoahNoProgressThreshold) { We need to be careful here. This essentially backs out the fix for: * https://bugs.openjdk.org/browse/JDK-8316632 * https://bugs.openjdk.org/browse/JDK-8314935 For the original change, I ran tier1, tier2 and tier3 tests with Shenandoah enabled on x86 and aarch64, per @shipilev 's request (https://github.com/openjdk/jdk/pull/15852#issuecomment-1729099298). make test TEST="tier1 tier2 tier3" TEST_VM_OPTS="-XX:+UseShenandoahGC" CONF=fastdebug These tests, specifically, will likely timeout without this check: make test TEST="runtime/reflect/ReflectOutOfMemoryError.java gc/InfiniteList.java runtime/ClassInitErrors/TestOutOfMemoryDuringInit.java compiler/uncommontrap/TestDeoptOOM.java" TEST_VM_OPTS="-Xlog:gc*=info:file=/tmp/oome.log -XX:+UseShenandoahGC" JTREG="REPEAT_COUNT=50" CONF=fastdebug ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16985#discussion_r1416404947 From shade at openjdk.org Wed Dec 6 14:10:01 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 6 Dec 2023 14:10:01 GMT Subject: RFR: 8321410: Shenandoah: Remove ShenandoahSuspendibleWorkers flag [v2] In-Reply-To: References: Message-ID: > We have flipped `ShenandoahSuspendibleWorkers` to true in [JDK-8305403](https://bugs.openjdk.org/browse/JDK-8305403), and backported it back to 17.0.8. We seem to be in consensus that turning this flag off would likely to cause correctness issues: > https://github.com/openjdk/jdk/pull/13309#discussion_r1160934484 > > So we might as well remove this flag altogether. > > Additional testing: > - [x] `hotspot_gc_shenandoah` Aleksey Shipilev has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: - Merge branch 'master' into JDK-8321410-shenandoah-suspendible-workers - Fix ------------- Changes: - all: https://git.openjdk.org/jdk/pull/16984/files - new: https://git.openjdk.org/jdk/pull/16984/files/0370357f..3e0a580e Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=16984&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=16984&range=00-01 Stats: 31526 lines in 319 files changed: 9148 ins; 21344 del; 1034 mod Patch: https://git.openjdk.org/jdk/pull/16984.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16984/head:pull/16984 PR: https://git.openjdk.org/jdk/pull/16984 From kdnilsen at openjdk.org Wed Dec 6 14:16:42 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 6 Dec 2023 14:16:42 GMT Subject: RFR: 8321401: Shenandoah: Each mutator must see FullGC before throwing OOM In-Reply-To: References:

Message-ID: On Tue, 5 Dec 2023 23:28:18 GMT, Y. Srinivas Ramakrishna wrote: >> Require each thread to observe unproductive Full GC before it throws OOM exception. > > src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 885: > >> 883: // This will notify the collector to start a cycle, but will raise >> 884: // an OOME to the mutator if the last Full GCs have not made progress. >> 885: if (result == nullptr && !req.is_lab_alloc() && get_gc_no_progress_count() > ShenandoahNoProgressThreshold) { > > Does this render `ShenandoahNoProgressThreshold` a dead option? Good point. It is no longer relevant. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16985#discussion_r1417391908 From shade at openjdk.org Wed Dec 6 14:16:49 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 6 Dec 2023 14:16:49 GMT Subject: RFR: 8320969: Shenandoah: Enforce stable number of GC workers [v2] In-Reply-To: <54bP7p1EQWWE8YXN94AlKVabtJUOFpw-wZ7fXOmxG_0=.7ad5f3d4-fe36-4ea3-8e38-0561020cc043@github.com> References: <54bP7p1EQWWE8YXN94AlKVabtJUOFpw-wZ7fXOmxG_0=.7ad5f3d4-fe36-4ea3-8e38-0561020cc043@github.com> Message-ID: <-BNpjrXyRpuAVDceEf1xu0jTJK1dhxBNw22HuXwaDjs=.1cbe5133-b6bb-4558-bb75-65091052b696@github.com> > See bug for rationale. > > Additional testing: > - [x] MacOS AArch64 server fastdebug, `hotspot_gc_shenandoah` > - [x] Linux x86_64 server fastdebug, `tier{1,2,3}` with `-XX:+UseShenandoahGC` > - [x] Linux AArch64 server fastdebug, `tier{1,2,3}` with `-XX:+UseShenandoahGC` Aleksey Shipilev has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: - Merge branch 'master' into JDK-8320969-shenandoah-stable-gc-workers - Reinstate the .cpp policy code - Merge branch 'master' into JDK-8320969-shenandoah-stable-gc-workers - No dynamic GC workers ------------- Changes: - all: https://git.openjdk.org/jdk/pull/16878/files - new: https://git.openjdk.org/jdk/pull/16878/files/56d597be..e5826b15 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=16878&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=16878&range=00-01 Stats: 85136 lines in 1765 files changed: 39360 ins; 39156 del; 6620 mod Patch: https://git.openjdk.org/jdk/pull/16878.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16878/head:pull/16878 PR: https://git.openjdk.org/jdk/pull/16878 From kdnilsen at openjdk.org Wed Dec 6 14:16:50 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 6 Dec 2023 14:16:50 GMT Subject: RFR: 8320969: Shenandoah: Enforce stable number of GC workers [v2] In-Reply-To: <-BNpjrXyRpuAVDceEf1xu0jTJK1dhxBNw22HuXwaDjs=.1cbe5133-b6bb-4558-bb75-65091052b696@github.com> References: <54bP7p1EQWWE8YXN94AlKVabtJUOFpw-wZ7fXOmxG_0=.7ad5f3d4-fe36-4ea3-8e38-0561020cc043@github.com> <-BNpjrXyRpuAVDceEf1xu0jTJK1dhxBNw22HuXwaDjs=.1cbe5133-b6bb-4558-bb75-65091052b696@github.com> Message-ID: <9uUyltcRLs6culCIRl6du_O3IY56egv6EsjO-RZsUPQ=.132127b3-3b47-4d46-b131-e6997a7be2ba@github.com> On Wed, 6 Dec 2023 14:13:46 GMT, Aleksey Shipilev wrote: >> See bug for rationale. >> >> Additional testing: >> - [x] MacOS AArch64 server fastdebug, `hotspot_gc_shenandoah` >> - [x] Linux x86_64 server fastdebug, `tier{1,2,3}` with `-XX:+UseShenandoahGC` >> - [x] Linux AArch64 server fastdebug, `tier{1,2,3}` with `-XX:+UseShenandoahGC` > > Aleksey Shipilev has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: > > - Merge branch 'master' into JDK-8320969-shenandoah-stable-gc-workers > - Reinstate the .cpp policy code > - Merge branch 'master' into JDK-8320969-shenandoah-stable-gc-workers > - No dynamic GC workers This does conflict with my changes, but I'd be glad to rebase my code on this foundation. This is simpler and more sensible. ------------- Marked as reviewed by kdnilsen (no project role). PR Review: https://git.openjdk.org/jdk/pull/16878#pullrequestreview-1757886141 From shade at openjdk.org Wed Dec 6 14:16:52 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 6 Dec 2023 14:16:52 GMT Subject: RFR: 8320969: Shenandoah: Enforce stable number of GC workers In-Reply-To: <54bP7p1EQWWE8YXN94AlKVabtJUOFpw-wZ7fXOmxG_0=.7ad5f3d4-fe36-4ea3-8e38-0561020cc043@github.com> References: <54bP7p1EQWWE8YXN94AlKVabtJUOFpw-wZ7fXOmxG_0=.7ad5f3d4-fe36-4ea3-8e38-0561020cc043@github.com> Message-ID: On Wed, 29 Nov 2023 11:01:32 GMT, Aleksey Shipilev wrote: > See bug for rationale. > > Additional testing: > - [x] MacOS AArch64 server fastdebug, `hotspot_gc_shenandoah` > - [x] Linux x86_64 server fastdebug, `tier{1,2,3}` with `-XX:+UseShenandoahGC` > - [x] Linux AArch64 server fastdebug, `tier{1,2,3}` with `-XX:+UseShenandoahGC` Right, that makes sense. I'll rework the patch a bit to still allow _some_ policy adjustments without relying on shared policy. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16878#issuecomment-1832404059 From kdnilsen at openjdk.org Wed Dec 6 15:36:52 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 6 Dec 2023 15:36:52 GMT Subject: RFR: 8321401: GenShen: Each mutator must see FullGC before throwing OOM [v3] In-Reply-To: References: Message-ID: > Require each thread to observe unproductive Full GC before it throws OOM exception. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Remove ShenandoahNoProgressThreshold With this change, this command-line option has no effect. ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/365/files - new: https://git.openjdk.org/shenandoah/pull/365/files/6d6f16db..bbff38a3 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=365&range=02 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=365&range=01-02 Stats: 5 lines in 1 file changed: 0 ins; 5 del; 0 mod Patch: https://git.openjdk.org/shenandoah/pull/365.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/365/head:pull/365 PR: https://git.openjdk.org/shenandoah/pull/365 From kdnilsen at openjdk.org Wed Dec 6 15:39:41 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 6 Dec 2023 15:39:41 GMT Subject: RFR: 8321401: Shenandoah: Each mutator must see FullGC before throwing OOM In-Reply-To: References:

Message-ID: On Wed, 6 Dec 2023 14:13:35 GMT, Kelvin Nilsen wrote: >> src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 885: >> >>> 883: // This will notify the collector to start a cycle, but will raise >>> 884: // an OOME to the mutator if the last Full GCs have not made progress. >>> 885: if (result == nullptr && !req.is_lab_alloc() && get_gc_no_progress_count() > ShenandoahNoProgressThreshold) { >> >> Does this render `ShenandoahNoProgressThreshold` a dead option? > > Good point. It is no longer relevant. I'll remove that option for coherency in this PR. Still need to resolve the question of too-much-time for OOM, mentioned below. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16985#discussion_r1417523732 From kdnilsen at openjdk.org Wed Dec 6 15:39:43 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 6 Dec 2023 15:39:43 GMT Subject: RFR: 8321401: Shenandoah: Each mutator must see FullGC before throwing OOM In-Reply-To: References:

Message-ID: On Tue, 5 Dec 2023 23:38:14 GMT, William Kemper wrote: >> Require each thread to observe unproductive Full GC before it throws OOM exception. > > src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 885: > >> 883: // This will notify the collector to start a cycle, but will raise >> 884: // an OOME to the mutator if the last Full GCs have not made progress. >> 885: if (result == nullptr && !req.is_lab_alloc() && get_gc_no_progress_count() > ShenandoahNoProgressThreshold) { > > We need to be careful here. This essentially backs out the fix for: > * https://bugs.openjdk.org/browse/JDK-8316632 > * https://bugs.openjdk.org/browse/JDK-8314935 > > For the original change, I ran tier1, tier2 and tier3 tests with Shenandoah enabled on x86 and aarch64, per @shipilev 's request (https://github.com/openjdk/jdk/pull/15852#issuecomment-1729099298). > > make test TEST="tier1 tier2 tier3" TEST_VM_OPTS="-XX:+UseShenandoahGC" CONF=fastdebug > > These tests, specifically, will likely timeout without this check: > > make test TEST="runtime/reflect/ReflectOutOfMemoryError.java gc/InfiniteList.java runtime/ClassInitErrors/TestOutOfMemoryDuringInit.java compiler/uncommontrap/TestDeoptOOM.java" TEST_VM_OPTS="-Xlog:gc*=info:file=/tmp/oome.log -XX:+UseShenandoahGC" JTREG="REPEAT_COUNT=50" CONF=fastdebug I'll run these tests and report the results. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16985#discussion_r1417525665 From roland at openjdk.org Wed Dec 6 15:44:39 2023 From: roland at openjdk.org (Roland Westrelin) Date: Wed, 6 Dec 2023 15:44:39 GMT Subject: RFR: 8321122: Shenandoah: Remove ShenandoahLoopOptsAfterExpansion flag In-Reply-To: References: Message-ID: On Thu, 30 Nov 2023 16:00:17 GMT, Aleksey Shipilev wrote: > More flags cleanup. `ShenandoahLoopOptsAfterExpansion` was added long time ago to gate more aggressive barrier optimizations in C2. It was default since then. There seem to be no reason to keep it as the flag. Just checking, @rwestrel -- you don't need it anymore, right? > > Additional testing: > - [x] `hotspot_gc_shenandoah` Marked as reviewed by roland (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/16908#pullrequestreview-1767912880 From roland at openjdk.org Wed Dec 6 15:44:42 2023 From: roland at openjdk.org (Roland Westrelin) Date: Wed, 6 Dec 2023 15:44:42 GMT Subject: RFR: 8321122: Shenandoah: Remove ShenandoahLoopOptsAfterExpansion flag In-Reply-To: References: <_PWNVaUyOIoUREKSS3D3ru2-GnCSzpZDoMdaYiwpIIY=.7a2dcf93-b7b3-4488-92fb-ae734e5b0d95@github.com> <8yB2OpjTUTHG6E1Wb_Ry5F9HuMinHSY-yt-2eDSB3zw=.8878b365-2275-4227-a8fe-058c6cb7cebe@github.com> <9wc8yRtwoi-AmBI_kEQOJSwXnDRzlQaBVSOA8GpVO84=.9f60a36f-c26e-4e1f-b2cb-050780ed5ff6@github.com>

Message-ID: On Tue, 5 Dec 2023 18:49:17 GMT, Aleksey Shipilev wrote: > Anyway, would you like to keep the flag in, or should we integrate this PR? I'm fine with the PR. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16908#issuecomment-1843140749 From kdnilsen at openjdk.org Wed Dec 6 16:31:53 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 6 Dec 2023 16:31:53 GMT Subject: RFR: 8321401: Shenandoah: Each mutator must see FullGC before throwing OOM In-Reply-To: References:

Message-ID: On Wed, 6 Dec 2023 15:36:58 GMT, Kelvin Nilsen wrote: >> src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 885: >> >>> 883: // This will notify the collector to start a cycle, but will raise >>> 884: // an OOME to the mutator if the last Full GCs have not made progress. >>> 885: if (result == nullptr && !req.is_lab_alloc() && get_gc_no_progress_count() > ShenandoahNoProgressThreshold) { >> >> We need to be careful here. This essentially backs out the fix for: >> * https://bugs.openjdk.org/browse/JDK-8316632 >> * https://bugs.openjdk.org/browse/JDK-8314935 >> >> For the original change, I ran tier1, tier2 and tier3 tests with Shenandoah enabled on x86 and aarch64, per @shipilev 's request (https://github.com/openjdk/jdk/pull/15852#issuecomment-1729099298). >> >> make test TEST="tier1 tier2 tier3" TEST_VM_OPTS="-XX:+UseShenandoahGC" CONF=fastdebug >> >> These tests, specifically, will likely timeout without this check: >> >> make test TEST="runtime/reflect/ReflectOutOfMemoryError.java gc/InfiniteList.java runtime/ClassInitErrors/TestOutOfMemoryDuringInit.java compiler/uncommontrap/TestDeoptOOM.java" TEST_VM_OPTS="-Xlog:gc*=info:file=/tmp/oome.log -XX:+UseShenandoahGC" JTREG="REPEAT_COUNT=50" CONF=fastdebug > > I'll run these tests and report the results. For the four explicit tests, 50 runs each, all passed. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16985#discussion_r1417622364 From kdnilsen at openjdk.org Wed Dec 6 16:31:55 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 6 Dec 2023 16:31:55 GMT Subject: RFR: 8321401: Shenandoah: Each mutator must see FullGC before throwing OOM In-Reply-To: References:

Message-ID: <6oAqAKQLPZUZwNiHbR-lKFnbZHvvJZYkXMJWskEpvCg=.b1d6f83e-e9b8-4ca5-a289-70460078a206@github.com> On Wed, 6 Dec 2023 16:25:50 GMT, Kelvin Nilsen wrote: >> I'll run these tests and report the results. > > For the four explicit tests, 50 runs each, all passed. Am wondering if there might have been some other compounding issue, since resolved, that contributed to the original issue. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16985#discussion_r1417626100 From wkemper at openjdk.org Wed Dec 6 16:53:39 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 6 Dec 2023 16:53:39 GMT Subject: RFR: 8321401: Shenandoah: Each mutator must see FullGC before throwing OOM In-Reply-To: <6oAqAKQLPZUZwNiHbR-lKFnbZHvvJZYkXMJWskEpvCg=.b1d6f83e-e9b8-4ca5-a289-70460078a206@github.com> References:

<6oAqAKQLPZUZwNiHbR-lKFnbZHvvJZYkXMJWskEpvCg=.b1d6f83e-e9b8-4ca5-a289-70460078a206@github.com> Message-ID: On Wed, 6 Dec 2023 16:28:28 GMT, Kelvin Nilsen wrote: >> For the four explicit tests, 50 runs each, all passed. > > Am wondering if there might have been some other compounding issue, since resolved, that contributed to the original issue. Okay, thanks for the additional testing. In this case, we should also remove the `ShenandoahNoProgressThreshold` flag. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/16985#discussion_r1417658766 From shade at openjdk.org Wed Dec 6 17:34:35 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 6 Dec 2023 17:34:35 GMT Subject: RFR: 8321122: Shenandoah: Remove ShenandoahLoopOptsAfterExpansion flag In-Reply-To: References: Message-ID: On Thu, 30 Nov 2023 16:00:17 GMT, Aleksey Shipilev wrote: > More flags cleanup. `ShenandoahLoopOptsAfterExpansion` was added long time ago to gate more aggressive barrier optimizations in C2. It was default since then. There seem to be no reason to keep it as the flag. Just checking, @rwestrel -- you don't need it anymore, right? > > Additional testing: > - [x] `hotspot_gc_shenandoah` Looking for blessing from @rkennke as well. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16908#issuecomment-1843350229 From shade at openjdk.org Wed Dec 6 17:35:37 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 6 Dec 2023 17:35:37 GMT Subject: RFR: 8321410: Shenandoah: Remove ShenandoahSuspendibleWorkers flag [v2] In-Reply-To: References:

Message-ID: <-19lIb_rpCq7aEXckFs-FPMZt5sElABtD5JAbWgRoS0=.bf8b824e-eec9-403d-9ffc-8274d56472cb@github.com> On Wed, 6 Dec 2023 14:10:01 GMT, Aleksey Shipilev wrote: >> We have flipped `ShenandoahSuspendibleWorkers` to true in [JDK-8305403](https://bugs.openjdk.org/browse/JDK-8305403), and backported it back to 17.0.8. We seem to be in consensus that turning this flag off would likely to cause correctness issues: >> https://github.com/openjdk/jdk/pull/13309#discussion_r1160934484 >> >> So we might as well remove this flag altogether. >> >> Additional testing: >> - [x] `hotspot_gc_shenandoah` > > Aleksey Shipilev has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: > > - Merge branch 'master' into JDK-8321410-shenandoah-suspendible-workers > - Fix Yes it is a correctness issue, let's get rid of the flag. I believe that you could also replace ShenandoahSuspendibleThreadSet* with just regular SuspendibleThreadSet*. ------------- Marked as reviewed by rkennke (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16984#pullrequestreview-1768538629 From rkennke at openjdk.org Wed Dec 6 20:21:37 2023 From: rkennke at openjdk.org (Roman Kennke) Date: Wed, 6 Dec 2023 20:21:37 GMT Subject: RFR: 8321122: Shenandoah: Remove ShenandoahLoopOptsAfterExpansion flag In-Reply-To: References: Message-ID: On Thu, 30 Nov 2023 16:00:17 GMT, Aleksey Shipilev wrote: > More flags cleanup. `ShenandoahLoopOptsAfterExpansion` was added long time ago to gate more aggressive barrier optimizations in C2. It was default since then. There seem to be no reason to keep it as the flag. Just checking, @rwestrel -- you don't need it anymore, right? > > Additional testing: > - [x] `hotspot_gc_shenandoah` Seems ok to remove the flag, but not sure if the optimization pass is still useful at all? It might be a left-over from pre-LRB and not do anything useful? (I don't know that, I haven't checked, but it could be) ------------- Marked as reviewed by rkennke (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16908#pullrequestreview-1768547290 From ysr at openjdk.org Wed Dec 6 20:24:37 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 6 Dec 2023 20:24:37 GMT Subject: RFR: 8320969: Shenandoah: Enforce stable number of GC workers [v2] In-Reply-To: <-BNpjrXyRpuAVDceEf1xu0jTJK1dhxBNw22HuXwaDjs=.1cbe5133-b6bb-4558-bb75-65091052b696@github.com> References: <54bP7p1EQWWE8YXN94AlKVabtJUOFpw-wZ7fXOmxG_0=.7ad5f3d4-fe36-4ea3-8e38-0561020cc043@github.com> <-BNpjrXyRpuAVDceEf1xu0jTJK1dhxBNw22HuXwaDjs=.1cbe5133-b6bb-4558-bb75-65091052b696@github.com> Message-ID: On Wed, 6 Dec 2023 14:16:49 GMT, Aleksey Shipilev wrote: >> See bug for rationale. >> >> Additional testing: >> - [x] MacOS AArch64 server fastdebug, `hotspot_gc_shenandoah` >> - [x] Linux x86_64 server fastdebug, `tier{1,2,3}` with `-XX:+UseShenandoahGC` >> - [x] Linux AArch64 server fastdebug, `tier{1,2,3}` with `-XX:+UseShenandoahGC` > > Aleksey Shipilev has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: > > - Merge branch 'master' into JDK-8320969-shenandoah-stable-gc-workers > - Reinstate the .cpp policy code > - Merge branch 'master' into JDK-8320969-shenandoah-stable-gc-workers > - No dynamic GC workers Reviewed! ------------- Marked as reviewed by ysr (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16878#pullrequestreview-1768551188 From shade at openjdk.org Wed Dec 6 21:19:44 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 6 Dec 2023 21:19:44 GMT Subject: RFR: 8321410: Shenandoah: Remove ShenandoahSuspendibleWorkers flag [v2] In-Reply-To: References:

Message-ID: On Wed, 6 Dec 2023 14:10:01 GMT, Aleksey Shipilev wrote: >> We have flipped `ShenandoahSuspendibleWorkers` to true in [JDK-8305403](https://bugs.openjdk.org/browse/JDK-8305403), and backported it back to 17.0.8. We seem to be in consensus that turning this flag off would likely to cause correctness issues: >> https://github.com/openjdk/jdk/pull/13309#discussion_r1160934484 >> >> So we might as well remove this flag altogether. >> >> Additional testing: >> - [x] `hotspot_gc_shenandoah` > > Aleksey Shipilev has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: > > - Merge branch 'master' into JDK-8321410-shenandoah-suspendible-workers > - Fix Thanks! ------------- PR Comment: https://git.openjdk.org/jdk/pull/16984#issuecomment-1843704071 From shade at openjdk.org Wed Dec 6 21:19:46 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 6 Dec 2023 21:19:46 GMT Subject: Integrated: 8321410: Shenandoah: Remove ShenandoahSuspendibleWorkers flag In-Reply-To: References: Message-ID: <_CIa-YLyPVjpixKvO-R22TYFDUGaoZgyxGu4zHd4wrk=.829c59f7-afea-4849-b6e3-9d9ea965f486@github.com> On Tue, 5 Dec 2023 19:47:31 GMT, Aleksey Shipilev wrote: > We have flipped `ShenandoahSuspendibleWorkers` to true in [JDK-8305403](https://bugs.openjdk.org/browse/JDK-8305403), and backported it back to 17.0.8. We seem to be in consensus that turning this flag off would likely to cause correctness issues: > https://github.com/openjdk/jdk/pull/13309#discussion_r1160934484 > > So we might as well remove this flag altogether. > > Additional testing: > - [x] `hotspot_gc_shenandoah` This pull request has now been integrated. Changeset: 2830dd2a Author: Aleksey Shipilev URL: https://git.openjdk.org/jdk/commit/2830dd2a7d3b933fbddca64ca0ac7a91e7ab0775 Stats: 40 lines in 6 files changed: 0 ins; 33 del; 7 mod 8321410: Shenandoah: Remove ShenandoahSuspendibleWorkers flag Reviewed-by: rkennke ------------- PR: https://git.openjdk.org/jdk/pull/16984 From shade at openjdk.org Wed Dec 6 21:22:43 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 6 Dec 2023 21:22:43 GMT Subject: RFR: 8321122: Shenandoah: Remove ShenandoahLoopOptsAfterExpansion flag In-Reply-To: References: Message-ID: <0sM-y3lB6TW6iAhaIOJbA1zQGRPjNvjeKe_0cu29fw0=.27ee70a8-3fa7-4d10-bb11-0c401ee645c3@github.com> On Thu, 30 Nov 2023 16:00:17 GMT, Aleksey Shipilev wrote: > More flags cleanup. `ShenandoahLoopOptsAfterExpansion` was added long time ago to gate more aggressive barrier optimizations in C2. It was default since then. There seem to be no reason to keep it as the flag. Just checking, @rwestrel -- you don't need it anymore, right? > > Additional testing: > - [x] `hotspot_gc_shenandoah` Thanks! ------------- PR Comment: https://git.openjdk.org/jdk/pull/16908#issuecomment-1843705358 From shade at openjdk.org Wed Dec 6 21:22:44 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 6 Dec 2023 21:22:44 GMT Subject: Integrated: 8321122: Shenandoah: Remove ShenandoahLoopOptsAfterExpansion flag In-Reply-To: References: Message-ID: On Thu, 30 Nov 2023 16:00:17 GMT, Aleksey Shipilev wrote: > More flags cleanup. `ShenandoahLoopOptsAfterExpansion` was added long time ago to gate more aggressive barrier optimizations in C2. It was default since then. There seem to be no reason to keep it as the flag. Just checking, @rwestrel -- you don't need it anymore, right? > > Additional testing: > - [x] `hotspot_gc_shenandoah` This pull request has now been integrated. Changeset: b02fc868 Author: Aleksey Shipilev URL: https://git.openjdk.org/jdk/commit/b02fc8681e57d75b5e05ce98216c53ff4d1e3b5d Stats: 18 lines in 2 files changed: 5 ins; 11 del; 2 mod 8321122: Shenandoah: Remove ShenandoahLoopOptsAfterExpansion flag Reviewed-by: kdnilsen, roland, rkennke ------------- PR: https://git.openjdk.org/jdk/pull/16908 From shade at openjdk.org Thu Dec 7 09:36:44 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Thu, 7 Dec 2023 09:36:44 GMT Subject: RFR: 8320969: Shenandoah: Enforce stable number of GC workers [v2] In-Reply-To: <-BNpjrXyRpuAVDceEf1xu0jTJK1dhxBNw22HuXwaDjs=.1cbe5133-b6bb-4558-bb75-65091052b696@github.com> References: <54bP7p1EQWWE8YXN94AlKVabtJUOFpw-wZ7fXOmxG_0=.7ad5f3d4-fe36-4ea3-8e38-0561020cc043@github.com> <-BNpjrXyRpuAVDceEf1xu0jTJK1dhxBNw22HuXwaDjs=.1cbe5133-b6bb-4558-bb75-65091052b696@github.com> Message-ID: On Wed, 6 Dec 2023 14:16:49 GMT, Aleksey Shipilev wrote: >> See bug for rationale. >> >> Additional testing: >> - [x] MacOS AArch64 server fastdebug, `hotspot_gc_shenandoah` >> - [x] Linux x86_64 server fastdebug, `tier{1,2,3}` with `-XX:+UseShenandoahGC` >> - [x] Linux AArch64 server fastdebug, `tier{1,2,3}` with `-XX:+UseShenandoahGC` > > Aleksey Shipilev has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: > > - Merge branch 'master' into JDK-8320969-shenandoah-stable-gc-workers > - Reinstate the .cpp policy code > - Merge branch 'master' into JDK-8320969-shenandoah-stable-gc-workers > - No dynamic GC workers All right, thanks all! ------------- PR Comment: https://git.openjdk.org/jdk/pull/16878#issuecomment-1844995137 From shade at openjdk.org Thu Dec 7 09:36:46 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Thu, 7 Dec 2023 09:36:46 GMT Subject: Integrated: 8320969: Shenandoah: Enforce stable number of GC workers In-Reply-To: <54bP7p1EQWWE8YXN94AlKVabtJUOFpw-wZ7fXOmxG_0=.7ad5f3d4-fe36-4ea3-8e38-0561020cc043@github.com> References: <54bP7p1EQWWE8YXN94AlKVabtJUOFpw-wZ7fXOmxG_0=.7ad5f3d4-fe36-4ea3-8e38-0561020cc043@github.com> Message-ID: On Wed, 29 Nov 2023 11:01:32 GMT, Aleksey Shipilev wrote: > See bug for rationale. > > Additional testing: > - [x] MacOS AArch64 server fastdebug, `hotspot_gc_shenandoah` > - [x] Linux x86_64 server fastdebug, `tier{1,2,3}` with `-XX:+UseShenandoahGC` > - [x] Linux AArch64 server fastdebug, `tier{1,2,3}` with `-XX:+UseShenandoahGC` This pull request has now been integrated. Changeset: 656b4462 Author: Aleksey Shipilev URL: https://git.openjdk.org/jdk/commit/656b4462895da51765e473d425c87f920a30c891 Stats: 138 lines in 5 files changed: 10 ins; 110 del; 18 mod 8320969: Shenandoah: Enforce stable number of GC workers Reviewed-by: kdnilsen, wkemper, ysr ------------- PR: https://git.openjdk.org/jdk/pull/16878 From wkemper at openjdk.org Thu Dec 7 14:17:45 2023 From: wkemper at openjdk.org (William Kemper) Date: Thu, 7 Dec 2023 14:17:45 GMT Subject: RFR: Merge openjdk/jdk:master [v4] In-Reply-To: References: Message-ID: > Merges tag jdk-22+26 William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 112 commits: - 8320945: problemlist tests failing on latest Windows 11 update Reviewed-by: lmesnik - 8210410: Refactor java.util.Currency:i18n shell tests to plain java tests Reviewed-by: naoto, lancea - 8320942: Only set openjdk-target when cross compiling linux-aarch64 Reviewed-by: ihse, erikj - 8320937: support latest VS2022 MSC_VER in abstract_vm_version.cpp Reviewed-by: dholmes, shade - 8319444: Unhelpful failure output in TestLegalNotices Reviewed-by: hannesw, jjg - 8313816: Accessing jmethodID might lead to spurious crashes Reviewed-by: coleenp - 8320877: Shenandoah: Remove ShenandoahUnloadClassesFrequency support Reviewed-by: wkemper, kdnilsen, rkennke - 8320806: Augment test/langtools/tools/javac/versions/Versions.java for JDK 22 language changes Reviewed-by: jlahoda, vromero - 8320940: Fix typo in java.lang.Double Reviewed-by: rriggs, iris, shade, lancea, bpb - 8320907: Shenandoah: Remove ShenandoahSelfFixing flag Reviewed-by: wkemper, zgu, kdnilsen, rkennke - ... and 102 more: https://git.openjdk.org/shenandoah/compare/e47cf611...ea6e92ed ------------- Changes: https://git.openjdk.org/shenandoah/pull/363/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=363&range=03 Stats: 16460 lines in 646 files changed: 11110 ins; 3130 del; 2220 mod Patch: https://git.openjdk.org/shenandoah/pull/363.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/363/head:pull/363 PR: https://git.openjdk.org/shenandoah/pull/363 From kdnilsen at openjdk.org Fri Dec 8 00:00:11 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 8 Dec 2023 00:00:11 GMT Subject: RFR: 8321401: Shenandoah: Each mutator must see FullGC before throwing OOM [v2] In-Reply-To: References: Message-ID: > Require each thread to observe unproductive Full GC before it throws OOM exception. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Track GC utilization and implement gc_overhead_limit_was_exceeded ------------- Changes: - all: https://git.openjdk.org/jdk/pull/16985/files - new: https://git.openjdk.org/jdk/pull/16985/files/8d0515b7..c80c049e Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=16985&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=16985&range=00-01 Stats: 327 lines in 9 files changed: 321 ins; 0 del; 6 mod Patch: https://git.openjdk.org/jdk/pull/16985.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16985/head:pull/16985 PR: https://git.openjdk.org/jdk/pull/16985 From wkemper at openjdk.org Fri Dec 8 00:16:48 2023 From: wkemper at openjdk.org (William Kemper) Date: Fri, 8 Dec 2023 00:16:48 GMT Subject: RFR: Merge openjdk/jdk:master [v5] In-Reply-To: References: Message-ID: > Merges tag jdk-22+26 William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 114 commits: - Use max capacity, rather than soft max when computing runway - Merge remote-tracking branch 'shenandoah/master' into merge-jdk-22+26 - 8320945: problemlist tests failing on latest Windows 11 update Reviewed-by: lmesnik - 8210410: Refactor java.util.Currency:i18n shell tests to plain java tests Reviewed-by: naoto, lancea - 8320942: Only set openjdk-target when cross compiling linux-aarch64 Reviewed-by: ihse, erikj - 8320937: support latest VS2022 MSC_VER in abstract_vm_version.cpp Reviewed-by: dholmes, shade - 8319444: Unhelpful failure output in TestLegalNotices Reviewed-by: hannesw, jjg - 8313816: Accessing jmethodID might lead to spurious crashes Reviewed-by: coleenp - 8320877: Shenandoah: Remove ShenandoahUnloadClassesFrequency support Reviewed-by: wkemper, kdnilsen, rkennke - 8320806: Augment test/langtools/tools/javac/versions/Versions.java for JDK 22 language changes Reviewed-by: jlahoda, vromero - ... and 104 more: https://git.openjdk.org/shenandoah/compare/2618aa69...99df412f ------------- Changes: https://git.openjdk.org/shenandoah/pull/363/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=363&range=04 Stats: 16463 lines in 648 files changed: 11110 ins; 3132 del; 2221 mod Patch: https://git.openjdk.org/shenandoah/pull/363.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/363/head:pull/363 PR: https://git.openjdk.org/shenandoah/pull/363 From kdnilsen at openjdk.org Fri Dec 8 03:47:48 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 8 Dec 2023 03:47:48 GMT Subject: RFR: 8321401: Shenandoah: Each mutator must see FullGC before throwing OOM [v3] In-Reply-To: References: Message-ID: > Require each thread to observe unproductive Full GC before it throws OOM exception. Kelvin Nilsen has updated the pull request incrementally with one additional commit since the last revision: Fix several bugs for GCU limit exceeded implementation ------------- Changes: - all: https://git.openjdk.org/jdk/pull/16985/files - new: https://git.openjdk.org/jdk/pull/16985/files/c80c049e..89aa26fe Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=16985&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=16985&range=01-02 Stats: 32 lines in 3 files changed: 23 ins; 3 del; 6 mod Patch: https://git.openjdk.org/jdk/pull/16985.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16985/head:pull/16985 PR: https://git.openjdk.org/jdk/pull/16985 From wkemper at openjdk.org Fri Dec 8 17:22:06 2023 From: wkemper at openjdk.org (William Kemper) Date: Fri, 8 Dec 2023 17:22:06 GMT Subject: RFR: 8321605: GenShen: Old generation reference process is never reset Message-ID: Shenandoah's reference processor workers keep state in thread local variables that need to be `reset` during init mark. Prior to this change, this was not happening for the old generation - resulting in an accumulation of counters. ------------- Commit messages: - Reset old gen reference processor thread locals when bootstrapping old marking Changes: https://git.openjdk.org/shenandoah/pull/367/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=367&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8321605 Stats: 1 line in 1 file changed: 1 ins; 0 del; 0 mod Patch: https://git.openjdk.org/shenandoah/pull/367.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/367/head:pull/367 PR: https://git.openjdk.org/shenandoah/pull/367 From kdnilsen at openjdk.org Fri Dec 8 19:44:08 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Fri, 8 Dec 2023 19:44:08 GMT Subject: RFR: 8321605: GenShen: Old generation reference process is never reset In-Reply-To: References: Message-ID: On Fri, 8 Dec 2023 17:17:29 GMT, William Kemper wrote: > Shenandoah's reference processor workers keep state in thread local variables that need to be `reset` during init mark. Prior to this change, this was not happening for the old generation - resulting in an accumulation of counters. Marked as reviewed by kdnilsen (Committer). ------------- PR Review: https://git.openjdk.org/shenandoah/pull/367#pullrequestreview-1773050939 From ysr at openjdk.org Fri Dec 8 19:44:10 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Fri, 8 Dec 2023 19:44:10 GMT Subject: RFR: 8321605: GenShen: Old generation reference process is never reset In-Reply-To: References: Message-ID: On Fri, 8 Dec 2023 17:17:29 GMT, William Kemper wrote: > Shenandoah's reference processor workers keep state in thread local variables that need to be `reset` during init mark. Prior to this change, this was not happening for the old generation - resulting in an accumulation of counters. src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 688: > 686: ShenandoahInitMarkUpdateRegionStateClosure cl; > 687: heap->parallel_heap_region_iterate(&cl); > 688: heap->old_generation()->ref_processor()->reset_thread_locals(); Why don't we reset the thread locals in the else arm as well? Is there a less error-prone way of doing this at the time that a new GC cycle that will do discovery is set up? I realize my questions may be a bit naive as I don't have a full understanding of how the reference processor's state/initialization is handled overall. For example, we do a bunch of reference processor related stuff further below at line 699. Why not reset the state of thread locals there instead? Would that miss someting? If I look at the places where this state is reset elsewhere, it seems to occur in the same places where the policy is initialized. If that's done at each cycle that'll do ref-proc, may be that's always the place to reset state/counts? ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/367#discussion_r1420949117 From ysr at openjdk.org Fri Dec 8 19:44:12 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Fri, 8 Dec 2023 19:44:12 GMT Subject: RFR: 8321605: GenShen: Old generation reference process is never reset In-Reply-To: References:

Message-ID: On Fri, 8 Dec 2023 19:38:54 GMT, Y. Srinivas Ramakrishna wrote: >> Shenandoah's reference processor workers keep state in thread local variables that need to be `reset` during init mark. Prior to this change, this was not happening for the old generation - resulting in an accumulation of counters. > > src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 688: > >> 686: ShenandoahInitMarkUpdateRegionStateClosure cl; >> 687: heap->parallel_heap_region_iterate(&cl); >> 688: heap->old_generation()->ref_processor()->reset_thread_locals(); > > Why don't we reset the thread locals in the else arm as well? > > Is there a less error-prone way of doing this at the time that a new GC cycle that will do discovery is set up? > > I realize my questions may be a bit naive as I don't have a full understanding of how the reference processor's state/initialization is handled overall. > > For example, we do a bunch of reference processor related stuff further below at line 699. Why not reset the state of thread locals there instead? Would that miss someting? > > If I look at the places where this state is reset elsewhere, it seems to occur in the same places where the policy is initialized. If that's done at each cycle that'll do ref-proc, may be that's always the place to reset state/counts? I see now that this resets the old generation's reference processor. I suspect this is a consequence of the way in which old and young are being handled. A cleaner separation of the two for the generational case might help in the future, and avoid the current more error-prone structure. ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/367#discussion_r1420959847 From wkemper at openjdk.org Fri Dec 8 20:50:57 2023 From: wkemper at openjdk.org (William Kemper) Date: Fri, 8 Dec 2023 20:50:57 GMT Subject: RFR: 8321605: GenShen: Old generation reference process is never reset In-Reply-To: References:

Message-ID: On Fri, 8 Dec 2023 19:41:39 GMT, Y. Srinivas Ramakrishna wrote: >> src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 688: >> >>> 686: ShenandoahInitMarkUpdateRegionStateClosure cl; >>> 687: heap->parallel_heap_region_iterate(&cl); >>> 688: heap->old_generation()->ref_processor()->reset_thread_locals(); >> >> Why don't we reset the thread locals in the else arm as well? >> >> Is there a less error-prone way of doing this at the time that a new GC cycle that will do discovery is set up? >> >> I realize my questions may be a bit naive as I don't have a full understanding of how the reference processor's state/initialization is handled overall. >> >> For example, we do a bunch of reference processor related stuff further below at line 699. Why not reset the state of thread locals there instead? Would that miss someting? >> >> If I look at the places where this state is reset elsewhere, it seems to occur in the same places where the policy is initialized. If that's done at each cycle that'll do ref-proc, may be that's always the place to reset state/counts? > > I see now that this resets the old generation's reference processor. I suspect this is a consequence of the way in which old and young are being handled. A cleaner separation of the two for the generational case might help in the future, and avoid the current more error-prone structure. I agree, the "bootstrap" cycle for the old generation is poorly modeled. ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/367#discussion_r1421012978 From wkemper at openjdk.org Fri Dec 8 20:50:57 2023 From: wkemper at openjdk.org (William Kemper) Date: Fri, 8 Dec 2023 20:50:57 GMT Subject: Integrated: 8321605: GenShen: Old generation reference process is never reset In-Reply-To: References: Message-ID: <1MzNd3c7Odf9gTIL0lnvHuphLx7w0wP2tAhgAAG2Cl8=.feaffe05-4e60-4db4-88b2-8a27dc43e1c8@github.com> On Fri, 8 Dec 2023 17:17:29 GMT, William Kemper wrote: > Shenandoah's reference processor workers keep state in thread local variables that need to be `reset` during init mark. Prior to this change, this was not happening for the old generation - resulting in an accumulation of counters. This pull request has now been integrated. Changeset: 2458f136 Author: William Kemper URL: https://git.openjdk.org/shenandoah/commit/2458f13684d9c2665c7c86dffcc23a24d1e73797 Stats: 1 line in 1 file changed: 1 ins; 0 del; 0 mod 8321605: GenShen: Old generation reference process is never reset Reviewed-by: kdnilsen ------------- PR: https://git.openjdk.org/shenandoah/pull/367 From rkennke at openjdk.org Mon Dec 11 16:59:51 2023 From: rkennke at openjdk.org (Roman Kennke) Date: Mon, 11 Dec 2023 16:59:51 GMT Subject: RFR: 8305896: Alternative full GC forwarding [v56] In-Reply-To: References: Message-ID: > Currently, the full-GC modes of Serial, Shenandoah and G1 GCs are forwarding objects by over-writing the object header with the new object location. Unfortunately, for compact object headers ([JDK-8294992](https://bugs.openjdk.org/browse/JDK-8294992)) this would not work, because the crucial class information is also stored in the header, and we could no longer iterate over objects until the headers would be restored. Also, the preserved-headers tables would grow quite large. > > I propose to use an alternative algorithm for full-GC (sliding-GC) forwarding that uses a special encoding so that the forwarding information fits into the lowest 32 bits of the header. > > It exploits the insight that, with sliding GCs, objects from one region will only ever be forwarded to one of two possible target regions. For this to work, we need to divide the heap into equal-sized regions. This is already the case for Shenandoah and G1, and can easily be overlaid for Serial GC, by assuming either the whole heap as a single region (if it fits) or by using SpaceAlignment-sized virtual regions. > > We also build and maintain a table that has N elements, where N is the number of regions. Each entry is two addresses, which are the start-address of the possible target regions for each source region. > > With this, forwarding information would be encoded like this: > - Bits 0 and 1: same as before, we put in '11' to indicate that the object is forwarded. > - Bit 2: Used for 'fallback'-forwarding (see below) > - Bit 3: Selects the target region 0 or 1. Look up the base address in the table (see above) > - Bits 4..31 The number of heap words from the target base address > > This works well for all sliding GCs in Serial, G1 and Shenandoah. The exception is in G1, there is a special mode called 'serial compaction' which acts as a last-last-ditch effort to squeeze more space out of the heap by re-forwarding the tails of the compaction chains. Unfortunately, this breaks the assumption of the sliding-forwarding-table. When that happens, we initialize a fallback table, which is a simple open hash-table, and set the Bit 2 in the forwarding to indicate that we shall look up the forwardee in the fallback-table. > > All the table accesses can be done unsynchronized because: > - Serial GC is single-threaded anyway > - In G1 and Shenandoah, GC worker threads divide up the work such that each worker does disjoint sets of regions. > - G1 serial compaction is single-threaded > > The change introduces a new (experimental) flag -... Roman Kennke has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 123 commits: - Merge branch 'master' into JDK-8305896 - Merge remote-tracking branch 'upstream/master' into JDK-8305896 - Fix build - Merge remote-tracking branch 'upstream/master' into JDK-8305896 - Merge remote-tracking branch 'upstream/master' into JDK-8305896 - Merge branch 'master' into JDK-8305896 - Various cleanups - Some @shipilev comments from downstream review - Further templatize Serial GC's adjust_pointers() - Merge branch 'master' into JDK-8305896 - ... and 113 more: https://git.openjdk.org/jdk/compare/3c6459e1...d67b42f5 ------------- Changes: https://git.openjdk.org/jdk/pull/13582/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=13582&range=55 Stats: 1203 lines in 39 files changed: 1079 ins; 10 del; 114 mod Patch: https://git.openjdk.org/jdk/pull/13582.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/13582/head:pull/13582 PR: https://git.openjdk.org/jdk/pull/13582 From stefank at openjdk.org Mon Dec 11 19:37:44 2023 From: stefank at openjdk.org (Stefan Karlsson) Date: Mon, 11 Dec 2023 19:37:44 GMT Subject: RFR: 8321812: Update GC tests to use execute[Limited]TestJava Message-ID: A lot of our tests use a multi-step recipe to spawn and wait for a process. Here's an example ProcessBuilder pb = ProcessTools.createTestJavaProcessBuilder( "-XX:-UseTLAB", "-XX:+UnlockDiagnosticVMOptions", "-XX:+VerifyDuringStartup", "-Xlog:gc+verify=debug", "-version"); OutputAnalyzer output = new OutputAnalyzer(pb.start()); ... do something with output and wait for the process to complete ... These are the steps involved: 1) Create a `ProcessBuilder` 2) Call `ProcessBuilder::start` 3) Create an `OutputAnalyzer` 4) Perform an operation that finally waits for the process to, at least partially, complete (OutputAnalyzer::getOutput, OutputAnalyzer::shouldHaveExitValue(), and more). Almost all our tests could be converted to use a single call to `ProcessTools.executeTestJava` (or `executeLimitedTestJava`), which spawns the process, makes sure that it has fully completed, and then returns a filled-in OutputAnalyzer to the caller. The above example would become: OutputAnalyzer output = ProcessTools.executeTestJava( "-XX:-UseTLAB", "-XX:+UnlockDiagnosticVMOptions", "-XX:+VerifyDuringStartup", "-Xlog:gc+verify=debug", "-version"); I propose that we make this change in the GC tests, to make our code simpler and hopefully easier to read. Note: There's a few changes to the throws statements because some ProcessTools APIs throws IOException while others throw Exception. Testing: I've done testing on a similar set of changes, but I'm going to run the appropriate, final tests while this is being considered/reviewed. ------------- Depends on: https://git.openjdk.org/jdk/pull/17049 Commit messages: - 8321812: Update GC tests to use execute[Limited]TestJava Changes: https://git.openjdk.org/jdk/pull/17067/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=17067&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8321812 Stats: 665 lines in 131 files changed: 32 ins; 216 del; 417 mod Patch: https://git.openjdk.org/jdk/pull/17067.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/17067/head:pull/17067 PR: https://git.openjdk.org/jdk/pull/17067 From wkemper at openjdk.org Tue Dec 12 17:15:24 2023 From: wkemper at openjdk.org (William Kemper) Date: Tue, 12 Dec 2023 17:15:24 GMT Subject: RFR: Merge openjdk/jdk:master [v6] In-Reply-To: References: Message-ID: > Merges tag jdk-22+26 William Kemper has updated the pull request incrementally with two additional commits since the last revision: - Revert "Use API for version of jtreg used to build jdk21" This reverts commit 4cbbf23918a2297bed0008906c5342f47aab0de7. - Use API for version of jtreg used to build jdk21 ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/363/files - new: https://git.openjdk.org/shenandoah/pull/363/files/99df412f..e65e8f9e Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=363&range=05 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=363&range=04-05 Stats: 0 lines in 0 files changed: 0 ins; 0 del; 0 mod Patch: https://git.openjdk.org/shenandoah/pull/363.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/363/head:pull/363 PR: https://git.openjdk.org/shenandoah/pull/363 From wkemper at openjdk.org Tue Dec 12 22:25:53 2023 From: wkemper at openjdk.org (William Kemper) Date: Tue, 12 Dec 2023 22:25:53 GMT Subject: Integrated: Merge openjdk/jdk:master In-Reply-To: References: Message-ID: <_c9FY05HRhWDQC1yD6FxyOfK8RYyAYQVedhO8G5djFM=.d9e72476-e2c5-436f-8eb3-96af1805f317@github.com> On Thu, 30 Nov 2023 14:10:08 GMT, William Kemper wrote: > Merges tag jdk-22+26 This pull request has now been integrated. Changeset: 0a0562a5 Author: William Kemper URL: https://git.openjdk.org/shenandoah/commit/0a0562a5303e656c0d01a49b3d3d8e015d5b5e87 Stats: 16463 lines in 648 files changed: 11110 ins; 3132 del; 2221 mod Merge ------------- PR: https://git.openjdk.org/shenandoah/pull/363 From wkemper at openjdk.org Tue Dec 12 22:43:25 2023 From: wkemper at openjdk.org (William Kemper) Date: Tue, 12 Dec 2023 22:43:25 GMT Subject: RFR: Merge openjdk/jdk:master Message-ID: Merges tag jdk-22+27 ------------- Commit messages: - 8319313: G1: Rename G1EvacFailureInjector appropriately - 8320370: NMT: Change MallocMemorySnapshot to simplify code. - 8321223: Implementation of Scoped Values (Second Preview) - 8321001: RISC-V: C2 SignumVF - 8316454: JFR break locale settings - 8321470: ThreadLocal.nextHashCode can be static final - 8321467: MemorySegment.setString(long, String, Charset) throws IAE(Misaligned access) - 8321505: JFR: Update views - 8321519: Typo in exception message - 8320805: JFR: Create view for deprecated methods - ... and 133 more: https://git.openjdk.org/shenandoah/compare/ea6e92ed...86f9b3f5 The webrev contains the conflicts with master: - merge conflicts: https://webrevs.openjdk.org/?repo=shenandoah&pr=368&range=00.conflicts Changes: https://git.openjdk.org/shenandoah/pull/368/files Stats: 88230 lines in 1720 files changed: 41554 ins; 40178 del; 6498 mod Patch: https://git.openjdk.org/shenandoah/pull/368.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/368/head:pull/368 PR: https://git.openjdk.org/shenandoah/pull/368 From wkemper at openjdk.org Tue Dec 12 22:54:47 2023 From: wkemper at openjdk.org (William Kemper) Date: Tue, 12 Dec 2023 22:54:47 GMT Subject: RFR: 8321937: GenShen: Sync up 21u based repo Message-ID: Backport recent Shenandoah changes ------------- Commit messages: - Use API for version of jtreg used to build jdk21 - 8316226: GenShen: Consider forcing auto-tenure age to be greater than 1 - 8320336: GenShen: Reduce proactive humongous defragmentation efforts - 8320112: GenShen: Improve end of process report - 8320119: GenShen: Correct misspellings of parsable - 8319342: GenShen: Reset the count of degenerated cycles in a row following Full GC - 8319931: GenShen: Increase no progress threshold for TestThreadFailure - 8319867: GenShen: Make old regions parsable at end of concurrent cycles - 8318462: [GenShen] Prevent unsafe access to displaced mark-word - 8319671: GenShen: Old marking may encounter invalid pointers in SATB buffers - ... and 3 more: https://git.openjdk.org/shenandoah-jdk21u/compare/57fb4b2b...0afff542 Changes: https://git.openjdk.org/shenandoah-jdk21u/pull/4/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=4&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8321937 Stats: 925 lines in 38 files changed: 438 ins; 225 del; 262 mod Patch: https://git.openjdk.org/shenandoah-jdk21u/pull/4.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk21u.git pull/4/head:pull/4 PR: https://git.openjdk.org/shenandoah-jdk21u/pull/4 From wkemper at openjdk.org Wed Dec 13 01:14:54 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 13 Dec 2023 01:14:54 GMT Subject: RFR: Merge openjdk/jdk:master [v2] In-Reply-To: References: Message-ID: > Merges tag jdk-22+27 William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 144 commits: - Merge remote-tracking branch 'shenandoah/master' into merge-jdk-22+27 - 8319313: G1: Rename G1EvacFailureInjector appropriately Reviewed-by: mli, iwalulya, ayang - 8320370: NMT: Change MallocMemorySnapshot to simplify code. Reviewed-by: stuefe, gziemski, stefank - 8321223: Implementation of Scoped Values (Second Preview) Reviewed-by: psandoz, mcimadamore - 8321001: RISC-V: C2 SignumVF 8321002: RISC-V: C2 SignumVD Reviewed-by: fyang - 8316454: JFR break locale settings Reviewed-by: mgronlun - 8321470: ThreadLocal.nextHashCode can be static final Reviewed-by: shade, jpai - 8321467: MemorySegment.setString(long, String, Charset) throws IAE(Misaligned access) Reviewed-by: pminborg - 8321505: JFR: Update views Reviewed-by: mgronlun - 8321519: Typo in exception message Reviewed-by: jpai - ... and 134 more: https://git.openjdk.org/shenandoah/compare/0a0562a5...0b638103 ------------- Changes: https://git.openjdk.org/shenandoah/pull/368/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=368&range=01 Stats: 88269 lines in 1722 files changed: 41555 ins; 40212 del; 6502 mod Patch: https://git.openjdk.org/shenandoah/pull/368.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/368/head:pull/368 PR: https://git.openjdk.org/shenandoah/pull/368 From ysr at openjdk.org Wed Dec 13 01:36:02 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 13 Dec 2023 01:36:02 GMT Subject: RFR: 8321939: [GenShen] ShenandoahOldEvacRatioPercent=100 fails with divide-by-zero Message-ID: The code that used the option ShenandoahOldEvacRatioPercent (SOEP) didn't correctly deal with the extremal setting of 100. Corrected the error and did some light refactoring in the vicinity, getting rid of some dead variables, adding documentation, and assertions. Also modified slightly the documentation of the option. **Testing:** - [ ] GHA - [ ] code pipeline testing - [ ] specjbb with SOEP=default,0,100 to verify that the issue is fixed ------------- Commit messages: - jcheck trailing white-space. - Merge branch 'master' into tenure_budget_8314599 - Fix another case where divide-by-zero could occur when - Fix divide-by-zero when ShenandoahOldEvacRatioPercent==100; minor Changes: https://git.openjdk.org/shenandoah/pull/369/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=369&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8321939 Stats: 172 lines in 3 files changed: 62 ins; 68 del; 42 mod Patch: https://git.openjdk.org/shenandoah/pull/369.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/369/head:pull/369 PR: https://git.openjdk.org/shenandoah/pull/369 From ysr at openjdk.org Wed Dec 13 01:39:47 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 13 Dec 2023 01:39:47 GMT Subject: RFR: 8321939: [GenShen] ShenandoahOldEvacRatioPercent=100 fails with divide-by-zero [v2] In-Reply-To: References: Message-ID: > The code that used the option ShenandoahOldEvacRatioPercent (SOEP) didn't correctly deal with the extremal setting of 100. Corrected the error and did some light refactoring in the vicinity, getting rid of some dead variables, adding documentation, and assertions. Also modified slightly the documentation of the option. > > **Testing:** > - [ ] GHA > - [ ] code pipeline testing > - [ ] specjbb with SOEP=default,0,100 to verify that the issue is fixed Y. Srinivas Ramakrishna has updated the pull request incrementally with one additional commit since the last revision: Fix a scope issue with a local variable. ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/369/files - new: https://git.openjdk.org/shenandoah/pull/369/files/7166147f..03e55dee Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=369&range=01 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=369&range=00-01 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/shenandoah/pull/369.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/369/head:pull/369 PR: https://git.openjdk.org/shenandoah/pull/369 From wkemper at openjdk.org Wed Dec 13 16:53:22 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 13 Dec 2023 16:53:22 GMT Subject: RFR: Merge openjdk/jdk:master [v2] In-Reply-To: References:

Message-ID: On Wed, 13 Dec 2023 01:14:54 GMT, William Kemper wrote: >> Merges tag jdk-22+27 > > William Kemper has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 144 commits: > > - Merge remote-tracking branch 'shenandoah/master' into merge-jdk-22+27 > - 8319313: G1: Rename G1EvacFailureInjector appropriately > > Reviewed-by: mli, iwalulya, ayang > - 8320370: NMT: Change MallocMemorySnapshot to simplify code. > > Reviewed-by: stuefe, gziemski, stefank > - 8321223: Implementation of Scoped Values (Second Preview) > > Reviewed-by: psandoz, mcimadamore > - 8321001: RISC-V: C2 SignumVF > 8321002: RISC-V: C2 SignumVD > > Reviewed-by: fyang > - 8316454: JFR break locale settings > > Reviewed-by: mgronlun > - 8321470: ThreadLocal.nextHashCode can be static final > > Reviewed-by: shade, jpai > - 8321467: MemorySegment.setString(long, String, Charset) throws IAE(Misaligned access) > > Reviewed-by: pminborg > - 8321505: JFR: Update views > > Reviewed-by: mgronlun > - 8321519: Typo in exception message > > Reviewed-by: jpai > - ... and 134 more: https://git.openjdk.org/shenandoah/compare/0a0562a5...0b638103 Single test failure in tier3, looks unrelated to Shenandoah. ------------- PR Comment: https://git.openjdk.org/shenandoah/pull/368#issuecomment-1854324380 From wkemper at openjdk.org Wed Dec 13 16:53:24 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 13 Dec 2023 16:53:24 GMT Subject: Integrated: Merge openjdk/jdk:master In-Reply-To: References: Message-ID: <1wlQyMRhG0hyg8JLA5IuWraQx8a46Mg2H0EGafc0sSI=.80a32000-0ca3-4361-b2ba-0edefaab0893@github.com> On Tue, 12 Dec 2023 22:38:01 GMT, William Kemper wrote: > Merges tag jdk-22+27 This pull request has now been integrated. Changeset: ba3f619a Author: William Kemper URL: https://git.openjdk.org/shenandoah/commit/ba3f619a6962e5f94a1fcd72ed48f4138229f361 Stats: 88269 lines in 1722 files changed: 41555 ins; 40212 del; 6502 mod Merge ------------- PR: https://git.openjdk.org/shenandoah/pull/368 From wkemper at openjdk.org Wed Dec 13 16:56:02 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 13 Dec 2023 16:56:02 GMT Subject: RFR: Merge openjdk/jdk21u:master [v2] In-Reply-To: References: Message-ID: <79LsBTDjdfRrxCjWtGbnOBwFA1CmWdj-b9w2RaNpcSg=.22fa2ba3-9ff2-422a-9bcd-9bf92d62754e@github.com> > Merge tip of jdk21u William Kemper has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: - Merge 'openjdk-21u/master' into 'shenandoah-jdk21u/master' - 8320913: GenShen: Bootstrap 21u backports repo Reviewed-by: ysr, kdnilsen ------------- Changes: - all: https://git.openjdk.org/shenandoah-jdk21u/pull/2/files - new: https://git.openjdk.org/shenandoah-jdk21u/pull/2/files/7c145a52..7c145a52 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=2&range=01 - incr: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=2&range=00-01 Stats: 0 lines in 0 files changed: 0 ins; 0 del; 0 mod Patch: https://git.openjdk.org/shenandoah-jdk21u/pull/2.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk21u.git pull/2/head:pull/2 PR: https://git.openjdk.org/shenandoah-jdk21u/pull/2 From wkemper at openjdk.org Wed Dec 13 16:56:04 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 13 Dec 2023 16:56:04 GMT Subject: Integrated: Merge openjdk/jdk21u:master In-Reply-To: References: Message-ID: <-_WdDOUlLLfJfw1OwTaIeDAgnJBji6L01U1TFkRfTtQ=.cc6ab936-a10b-441a-99f8-5ca0a3ff026b@github.com> On Wed, 29 Nov 2023 21:59:12 GMT, William Kemper wrote: > Merge tip of jdk21u This pull request has now been integrated. Changeset: 4609d721 Author: William Kemper URL: https://git.openjdk.org/shenandoah-jdk21u/commit/4609d72106788cdc85d419fb2e35bf597e6f8995 Stats: 17927 lines in 348 files changed: 11041 ins; 1810 del; 5076 mod Merge ------------- PR: https://git.openjdk.org/shenandoah-jdk21u/pull/2 From wkemper at openjdk.org Wed Dec 13 17:08:48 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 13 Dec 2023 17:08:48 GMT Subject: RFR: 8321937: GenShen: Sync up 21u based repo [v2] In-Reply-To: References: Message-ID: > Backport recent Shenandoah changes William Kemper has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 14 additional commits since the last revision: - Merge remote-tracking branch 'shenandoah-jdk21u/master' into merge-shen-master-into-shen-21 - Use API for version of jtreg used to build jdk21 - 8316226: GenShen: Consider forcing auto-tenure age to be greater than 1 Reviewed-by: wkemper, kdnilsen - 8320336: GenShen: Reduce proactive humongous defragmentation efforts Reviewed-by: wkemper, ysr - 8320112: GenShen: Improve end of process report Reviewed-by: kdnilsen - 8320119: GenShen: Correct misspellings of parsable Reviewed-by: kdnilsen, ysr - 8319342: GenShen: Reset the count of degenerated cycles in a row following Full GC Reviewed-by: wkemper, ysr - 8319931: GenShen: Increase no progress threshold for TestThreadFailure Reviewed-by: kdnilsen - 8319867: GenShen: Make old regions parsable at end of concurrent cycles Reviewed-by: kdnilsen, ysr - 8318462: [GenShen] Prevent unsafe access to displaced mark-word Reviewed-by: kdnilsen, wkemper, ysr - ... and 4 more: https://git.openjdk.org/shenandoah-jdk21u/compare/9b613688...555c581a ------------- Changes: - all: https://git.openjdk.org/shenandoah-jdk21u/pull/4/files - new: https://git.openjdk.org/shenandoah-jdk21u/pull/4/files/0afff542..555c581a Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=4&range=01 - incr: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=4&range=00-01 Stats: 17927 lines in 348 files changed: 11041 ins; 1810 del; 5076 mod Patch: https://git.openjdk.org/shenandoah-jdk21u/pull/4.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk21u.git pull/4/head:pull/4 PR: https://git.openjdk.org/shenandoah-jdk21u/pull/4 From wkemper at openjdk.org Wed Dec 13 20:37:14 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 13 Dec 2023 20:37:14 GMT Subject: RFR: Merge openjdk/jdk:master Message-ID: Merges tag jdk-23+1 ------------- Commit messages: - 8319413: Start of release updates for JDK 23 The merge commit only contains trivial merges, so no merge-specific webrevs have been generated. Changes: https://git.openjdk.org/shenandoah/pull/370/files Stats: 4872 lines in 98 files changed: 4828 ins; 0 del; 44 mod Patch: https://git.openjdk.org/shenandoah/pull/370.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/370/head:pull/370 PR: https://git.openjdk.org/shenandoah/pull/370 From kdnilsen at openjdk.org Wed Dec 13 21:23:12 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 13 Dec 2023 21:23:12 GMT Subject: RFR: 8321939: [GenShen] ShenandoahOldEvacRatioPercent=100 fails with divide-by-zero [v2] In-Reply-To: References:

Message-ID: On Wed, 13 Dec 2023 01:39:47 GMT, Y. Srinivas Ramakrishna wrote: >> The code that used the option ShenandoahOldEvacRatioPercent (SOEP) didn't correctly deal with the extremal setting of 100. Corrected the error and did some light refactoring in the vicinity, getting rid of some dead variables, adding documentation, and assertions. Also modified slightly the documentation of the option. >> >> **Testing:** >> - [x] GHA >> - [ ] code pipeline testing >> - [ ] specjbb with SOEP=default,0,100 to verify that the issue is fixed > > Y. Srinivas Ramakrishna has updated the pull request incrementally with one additional commit since the last revision: > > Fix a scope issue with a local variable. Marked as reviewed by kdnilsen (Committer). src/hotspot/share/gc/shenandoah/shenandoahGeneration.cpp line 269: > 267: // SOEP/100 = OE/TE > 268: // = OE/(OE+YE) > 269: // => SOEP/(100-SOEP) = OE/((OE+YE)-OE) I get lost at this step. Here's how I see the algebra from this point forward: (OE + YE) * SOEP == 100 OE (multiply both sides by 100 (OE + YE) YE * SOEP = 100 OE - SOEP * OE (subtract SOEP * OE from both sides) YE * SOEP = OE * (100 - SOEP) OE = YE * SOEP / (100 - SOEP) (we agree on final result. I just don't follow the justification for step at line 269. maybe you can explain that line better?) src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 1206: > 1204: // = OE/YE > 1205: // => OE = YE*SOEP/(100-SOEP) > 1206: Same comment as above, regarding the algebra steps. src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 1242: > 1240: const bool doing_promotions = promo_load > 0; > 1241: if (doing_promotions) { > 1242: // We're only promoting and we have a maximum bound on the amount to be promoted With reference to the preceding comment, we should probably add an assert here: assert(!doing_mixed, "Don't do mixed and promotions in the same cycle") Or maybe this is no longer true. It doesn't have to be true. The code that follows seems to suggest this is no longer true. It was a simplification at some point in the implementation history. If it is no longer true, let's remove that comment. ------------- PR Review: https://git.openjdk.org/shenandoah/pull/369#pullrequestreview-1780476196 PR Review Comment: https://git.openjdk.org/shenandoah/pull/369#discussion_r1425876097 PR Review Comment: https://git.openjdk.org/shenandoah/pull/369#discussion_r1425879329 PR Review Comment: https://git.openjdk.org/shenandoah/pull/369#discussion_r1425893069 From ysr at openjdk.org Wed Dec 13 21:46:24 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 13 Dec 2023 21:46:24 GMT Subject: RFR: 8321939: [GenShen] ShenandoahOldEvacRatioPercent=100 fails with divide-by-zero [v2] In-Reply-To: References:

Message-ID: On Wed, 13 Dec 2023 20:56:19 GMT, Kelvin Nilsen wrote: >> Y. Srinivas Ramakrishna has updated the pull request incrementally with one additional commit since the last revision: >> >> Fix a scope issue with a local variable. > > src/hotspot/share/gc/shenandoah/shenandoahGeneration.cpp line 269: > >> 267: // SOEP/100 = OE/TE >> 268: // = OE/(OE+YE) >> 269: // => SOEP/(100-SOEP) = OE/((OE+YE)-OE) > > I get lost at this step. Here's how I see the algebra from this point forward: > (OE + YE) * SOEP == 100 OE (multiply both sides by 100 (OE + YE) > YE * SOEP = 100 OE - SOEP * OE (subtract SOEP * OE from both sides) > YE * SOEP = OE * (100 - SOEP) > OE = YE * SOEP / (100 - SOEP) > > (we agree on final result. I just don't follow the justification for step at line 269. maybe you can explain that line better?) ah, yes, I used the so-called "componendo-dividendo" rule from middle school, in this instantiation of the more general schema: If a/b = c/d, then a/(b-a) = c/(d-c) It is perhaps not very well-known in modern ratio-proportion pedagogy in grade-school, but is very useful for quick arithemtic. https://hsm.stackexchange.com/questions/7544/etymology-of-some-terms-used-in-ratio-and-proportion-in-old-algebra-textbooks ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/369#discussion_r1425916254 From wkemper at openjdk.org Wed Dec 13 21:47:37 2023 From: wkemper at openjdk.org (William Kemper) Date: Wed, 13 Dec 2023 21:47:37 GMT Subject: RFR: 8321937: GenShen: Sync up 21u based repo [v3] In-Reply-To: References: Message-ID: > Backport recent Shenandoah changes William Kemper has updated the pull request incrementally with one additional commit since the last revision: Remove use of option which has not been backported yet ------------- Changes: - all: https://git.openjdk.org/shenandoah-jdk21u/pull/4/files - new: https://git.openjdk.org/shenandoah-jdk21u/pull/4/files/555c581a..b6415bdd Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=4&range=02 - incr: https://webrevs.openjdk.org/?repo=shenandoah-jdk21u&pr=4&range=01-02 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/shenandoah-jdk21u/pull/4.diff Fetch: git fetch https://git.openjdk.org/shenandoah-jdk21u.git pull/4/head:pull/4 PR: https://git.openjdk.org/shenandoah-jdk21u/pull/4 From ysr at openjdk.org Wed Dec 13 21:58:23 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 13 Dec 2023 21:58:23 GMT Subject: RFR: 8321939: [GenShen] ShenandoahOldEvacRatioPercent=100 fails with divide-by-zero [v2] In-Reply-To: References:

Message-ID: On Wed, 13 Dec 2023 21:43:52 GMT, Y. Srinivas Ramakrishna wrote: >> src/hotspot/share/gc/shenandoah/shenandoahGeneration.cpp line 269: >> >>> 267: // SOEP/100 = OE/TE >>> 268: // = OE/(OE+YE) >>> 269: // => SOEP/(100-SOEP) = OE/((OE+YE)-OE) >> >> I get lost at this step. Here's how I see the algebra from this point forward: >> (OE + YE) * SOEP == 100 OE (multiply both sides by 100 (OE + YE) >> YE * SOEP = 100 OE - SOEP * OE (subtract SOEP * OE from both sides) >> YE * SOEP = OE * (100 - SOEP) >> OE = YE * SOEP / (100 - SOEP) >> >> (we agree on final result. I just don't follow the justification for step at line 269. maybe you can explain that line better?) > > ah, yes, I used the so-called "componendo-dividendo" rule from middle school, in this instantiation of the more general schema: > > If a/b = c/d, then a/(b-a) = c/(d-c) > > It is perhaps not very well-known in modern ratio-proportion pedagogy in grade-school, but is very useful for quick arithemtic. > > https://hsm.stackexchange.com/questions/7544/etymology-of-some-terms-used-in-ratio-and-proportion-in-old-algebra-textbooks I'll add the justification in the comment. ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/369#discussion_r1425924525 From ysr at openjdk.org Wed Dec 13 21:58:23 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 13 Dec 2023 21:58:23 GMT Subject: RFR: 8321939: [GenShen] ShenandoahOldEvacRatioPercent=100 fails with divide-by-zero [v2] In-Reply-To: References:

Message-ID: On Wed, 13 Dec 2023 21:16:08 GMT, Kelvin Nilsen wrote: >> Y. Srinivas Ramakrishna has updated the pull request incrementally with one additional commit since the last revision: >> >> Fix a scope issue with a local variable. > > src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 1242: > >> 1240: const bool doing_promotions = promo_load > 0; >> 1241: if (doing_promotions) { >> 1242: // We're only promoting and we have a maximum bound on the amount to be promoted > > With reference to the preceding comment, we should probably add an assert here: assert(!doing_mixed, "Don't do mixed and promotions in the same cycle") > > Or maybe this is no longer true. It doesn't have to be true. The code that follows seems to suggest this is no longer true. It was a simplification at some point in the implementation history. If it is no longer true, let's remove that comment. I believe you are right. The comment appears obsolete. I'll check/confirm and delete the comment. Thanks! ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/369#discussion_r1425925729 From kdnilsen at openjdk.org Wed Dec 13 22:05:11 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 13 Dec 2023 22:05:11 GMT Subject: RFR: 8321939: [GenShen] ShenandoahOldEvacRatioPercent=100 fails with divide-by-zero [v2] In-Reply-To: References:

Message-ID: On Wed, 13 Dec 2023 21:54:06 GMT, Y. Srinivas Ramakrishna wrote: >> ah, yes, I used the so-called "componendo-dividendo" rule from middle school, in this instantiation of the more general schema: >> >> If a/b = c/d, then a/(b-a) = c/(d-c) >> >> It is perhaps not very well-known in modern ratio-proportion pedagogy in grade-school, but is very useful for quick arithemtic. >> >> https://hsm.stackexchange.com/questions/7544/etymology-of-some-terms-used-in-ratio-and-proportion-in-old-algebra-textbooks > > I'll add the justification in the comment. Thanks. Your recollection of middle-school math is better than mine... :) ------------- PR Review Comment: https://git.openjdk.org/shenandoah/pull/369#discussion_r1425931096 From wkemper at openjdk.org Thu Dec 14 19:20:00 2023 From: wkemper at openjdk.org (William Kemper) Date: Thu, 14 Dec 2023 19:20:00 GMT Subject: RFR: 8321816: GenShen: Provide a minimum amount of time for an old collection to run Message-ID: Young collections are allowed to interrupt old collections. Under heavy load, this may result in starvation of old collections - which may in turn lead to even more memory pressure on the young generation. Adding a minimum guaranteed amount of time for the old gc to make progress will help avoid this situation. ------------- Commit messages: - Merge remote-tracking branch 'shenandoah/master' into reduce-old-starvation - Merge remote-tracking branch 'shenandoah/master' into reduce-old-starvation - Experiment with a minimum old marking increment Changes: https://git.openjdk.org/shenandoah/pull/371/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=371&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8321816 Stats: 20 lines in 3 files changed: 19 ins; 0 del; 1 mod Patch: https://git.openjdk.org/shenandoah/pull/371.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/371/head:pull/371 PR: https://git.openjdk.org/shenandoah/pull/371 From wkemper at openjdk.org Thu Dec 14 19:25:51 2023 From: wkemper at openjdk.org (William Kemper) Date: Thu, 14 Dec 2023 19:25:51 GMT Subject: RFR: 8321815: Shenandoah: gc state should be synchronized to java threads only once per safepoint Message-ID: Presently, every change to the gc change is distributed to all java threads. The gc state is only changed on safepoints. For applications with a very large number of java threads this may increase time on the safepoint. It would be better to synchronize the gc state only once per safepoint. ------------- Commit messages: - Fix whitespace - Sync gc state with threads on VM thread - Synchronize gc state with java threads once, when exiting safepoint Changes: https://git.openjdk.org/jdk/pull/17112/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=17112&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8321815 Stats: 25 lines in 5 files changed: 17 ins; 1 del; 7 mod Patch: https://git.openjdk.org/jdk/pull/17112.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/17112/head:pull/17112 PR: https://git.openjdk.org/jdk/pull/17112 From kdnilsen at openjdk.org Thu Dec 14 19:32:17 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 14 Dec 2023 19:32:17 GMT Subject: RFR: 8321937: GenShen: Sync up 21u based repo [v3] In-Reply-To: References:

Message-ID: On Wed, 13 Dec 2023 21:47:37 GMT, William Kemper wrote: >> Backport recent Shenandoah changes > > William Kemper has updated the pull request incrementally with one additional commit since the last revision: > > Remove use of option which has not been backported yet Thanks for wading through all these diffs. ------------- Marked as reviewed by kdnilsen (Committer). PR Review: https://git.openjdk.org/shenandoah-jdk21u/pull/4#pullrequestreview-1782562933 From kdnilsen at openjdk.org Thu Dec 14 19:33:17 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 14 Dec 2023 19:33:17 GMT Subject: RFR: 8321816: GenShen: Provide a minimum amount of time for an old collection to run In-Reply-To: References: Message-ID: <4_IpPiVZ5sqTlWz6qjEVlnmzBNf0Bv-JwY2MQuWvAPg=.d764f8c3-c74c-47f6-bf10-77dc5bcec234@github.com> On Thu, 14 Dec 2023 19:11:34 GMT, William Kemper wrote: > Young collections are allowed to interrupt old collections. Under heavy load, this may result in starvation of old collections - which may in turn lead to even more memory pressure on the young generation. Adding a minimum guaranteed amount of time for the old gc to make progress will help avoid this situation. Marked as reviewed by kdnilsen (Committer). ------------- PR Review: https://git.openjdk.org/shenandoah/pull/371#pullrequestreview-1782564672 From kdnilsen at openjdk.org Thu Dec 14 19:36:38 2023 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 14 Dec 2023 19:36:38 GMT Subject: RFR: 8321815: Shenandoah: gc state should be synchronized to java threads only once per safepoint In-Reply-To: References: Message-ID: On Thu, 14 Dec 2023 19:14:07 GMT, William Kemper wrote: > Presently, every change to the gc change is distributed to all java threads. The gc state is only changed on safepoints. For applications with a very large number of java threads this may increase time on the safepoint. It would be better to synchronize the gc state only once per safepoint. Am wondering if we measure any improvement in time-at-safepoints for workloads with large number of threads? ------------- Marked as reviewed by kdnilsen (no project role). PR Review: https://git.openjdk.org/jdk/pull/17112#pullrequestreview-1782569774 From cslucas at openjdk.org Thu Dec 14 19:44:09 2023 From: cslucas at openjdk.org (Cesar Soares Lucas) Date: Thu, 14 Dec 2023 19:44:09 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission [v6] In-Reply-To: References: Message-ID: > # Description > > Please review this PR with a patch to re-use the same C2_MacroAssembler object to emit all instructions in the same compilation unit. > > Overall, the change is pretty simple. However, due to the renaming of the variable to access C2_MacroAssembler, from `_masm.` to `masm->`, and also some method prototype changes, the patch became quite large. > > # Help Needed for Testing > > I don't have access to all platforms necessary to test this. I hope some other folks can help with testing on `S390`, `RISC-V` and `PPC`. > > # Testing status > > ## tier1 > > | | Win | Mac | Linux | > |----------|---------|---------|---------| > | ARM64 | | | | > | ARM32 | | | | > | x86 | | | | > | x64 | | | | > | PPC64 | | | | > | S390x | | | | > | RiscV | | | | Cesar Soares Lucas has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains seven commits: - Merge with origin/master - Fix build, copyright dates, m4 files. - Fix merge - Catch up with master branch. Merge remote-tracking branch 'origin/master' into reuse-macroasm - Some inst_mark fixes; Catch up with master. - Catch up with changes on master - Reuse same C2_MacroAssembler object to emit instructions. ------------- Changes: https://git.openjdk.org/jdk/pull/16484/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16484&range=05 Stats: 2446 lines in 61 files changed: 106 ins; 434 del; 1906 mod Patch: https://git.openjdk.org/jdk/pull/16484.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16484/head:pull/16484 PR: https://git.openjdk.org/jdk/pull/16484 From wkemper at openjdk.org Thu Dec 14 22:36:35 2023 From: wkemper at openjdk.org (William Kemper) Date: Thu, 14 Dec 2023 22:36:35 GMT Subject: Integrated: 8321937: GenShen: Sync up 21u based repo In-Reply-To: References: Message-ID: On Fri, 8 Dec 2023 21:53:42 GMT, William Kemper wrote: > Backport recent Shenandoah changes This pull request has now been integrated. Changeset: f680de9a Author: William Kemper URL: https://git.openjdk.org/shenandoah-jdk21u/commit/f680de9a39ec699208e68600a854de849d321e21 Stats: 924 lines in 37 files changed: 438 ins; 225 del; 261 mod 8321937: GenShen: Sync up 21u based repo Reviewed-by: kdnilsen ------------- PR: https://git.openjdk.org/shenandoah-jdk21u/pull/4 From luhenry at openjdk.org Fri Dec 15 01:49:41 2023 From: luhenry at openjdk.org (Ludovic Henry) Date: Fri, 15 Dec 2023 01:49:41 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission [v6] In-Reply-To: References:

Message-ID: On Thu, 14 Dec 2023 19:44:09 GMT, Cesar Soares Lucas wrote: >> # Description >> >> Please review this PR with a patch to re-use the same C2_MacroAssembler object to emit all instructions in the same compilation unit. >> >> Overall, the change is pretty simple. However, due to the renaming of the variable to access C2_MacroAssembler, from `_masm.` to `masm->`, and also some method prototype changes, the patch became quite large. >> >> # Help Needed for Testing >> >> I don't have access to all platforms necessary to test this. I hope some other folks can help with testing on `S390`, `RISC-V` and `PPC`. >> >> # Testing status >> >> ## tier1 >> >> | | Win | Mac | Linux | >> |----------|---------|---------|---------| >> | ARM64 | | | | >> | ARM32 | | | | >> | x86 | | | | >> | x64 | | | | >> | PPC64 | | | | >> | S390x | | | | >> | RiscV | | | | > > Cesar Soares Lucas has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains seven commits: > > - Merge with origin/master > - Fix build, copyright dates, m4 files. > - Fix merge > - Catch up with master branch. > > Merge remote-tracking branch 'origin/master' into reuse-macroasm > - Some inst_mark fixes; Catch up with master. > - Catch up with changes on master > - Reuse same C2_MacroAssembler object to emit instructions. I've verified it works on riscv64, passing hotspot tier1 and tier2 tests. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16484#issuecomment-1857140962 From ysr at openjdk.org Fri Dec 15 08:40:37 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Fri, 15 Dec 2023 08:40:37 GMT Subject: RFR: 8322150: [GenShen] TestPeriodicGC fails with "Generational mode: Should have periodic young GC in logs" Message-ID: 8322150: [GenShen] TestPeriodicGC fails with "Generational mode: Should have periodic young GC in logs" ------------- Commit messages: - Merge branch 'master' into should_unload_classes - JDK-8322150 should_unload_classes should return true only if a metaspace Changes: https://git.openjdk.org/shenandoah/pull/372/files Webrev: https://webrevs.openjdk.org/?repo=shenandoah&pr=372&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8322150 Stats: 2 lines in 1 file changed: 1 ins; 0 del; 1 mod Patch: https://git.openjdk.org/shenandoah/pull/372.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/372/head:pull/372 PR: https://git.openjdk.org/shenandoah/pull/372 From ysr at openjdk.org Fri Dec 15 09:19:37 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Fri, 15 Dec 2023 09:19:37 GMT Subject: RFR: 8321939: [GenShen] ShenandoahOldEvacRatioPercent=100 fails with divide-by-zero [v3] In-Reply-To: References: Message-ID: > The code that used the option ShenandoahOldEvacRatioPercent (SOEP) didn't correctly deal with the extremal setting of 100. Corrected the error and did some light refactoring in the vicinity, getting rid of some dead variables, adding documentation, and assertions. Also modified slightly the documentation of the option. > > **Testing:** > - [x] GHA > - [ ] code pipeline testing > - [x] specjbb with SOEP=default,0,100 to verify that the issue is fixed Y. Srinivas Ramakrishna has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains ten additional commits since the last revision: - Merge branch 'should_unload_classes' into tenure_budget_8314599 - Merge branch 'master' into should_unload_classes - JDK-8322150 should_unload_classes should return true only if a metaspace oom has been signalled, and other conditions are met. - Merge branch 'master' into tenure_budget_8314599 - Fix a scope issue with a local variable. - jcheck trailing white-space. - Merge branch 'master' into tenure_budget_8314599 - Fix another case where divide-by-zero could occur when ShenandoahOldEvacRatioPercent is set to 100. Some slight refactoring in the method. - Fix divide-by-zero when ShenandoahOldEvacRatioPercent==100; minor refactor & modification of documentation of flag ------------- Changes: - all: https://git.openjdk.org/shenandoah/pull/369/files - new: https://git.openjdk.org/shenandoah/pull/369/files/03e55dee..67318a30 Webrevs: - full: https://webrevs.openjdk.org/?repo=shenandoah&pr=369&range=02 - incr: https://webrevs.openjdk.org/?repo=shenandoah&pr=369&range=01-02 Stats: 88271 lines in 1723 files changed: 41556 ins; 40212 del; 6503 mod Patch: https://git.openjdk.org/shenandoah/pull/369.diff Fetch: git fetch https://git.openjdk.org/shenandoah.git pull/369/head:pull/369 PR: https://git.openjdk.org/shenandoah/pull/369 From ysr at openjdk.org Fri Dec 15 09:24:15 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Fri, 15 Dec 2023 09:24:15 GMT Subject: RFR: 8321939: [GenShen] ShenandoahOldEvacRatioPercent=100 fails with divide-by-zero [v3] In-Reply-To: References:

Message-ID: On Fri, 15 Dec 2023 09:19:37 GMT, Y. Srinivas Ramakrishna wrote: >> The code that used the option ShenandoahOldEvacRatioPercent (SOEP) didn't correctly deal with the extremal setting of 100. Corrected the error and did some light refactoring in the vicinity, getting rid of some dead variables, adding documentation, and assertions. Also modified slightly the documentation of the option. >> >> **Testing:** >> - [x] GHA >> - [ ] code pipeline testing >> - [x] specjbb with SOEP=default,0,100 to verify that the issue is fixed > > Y. Srinivas Ramakrishna has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains ten additional commits since the last revision: > > - Merge branch 'should_unload_classes' into tenure_budget_8314599 > - Merge branch 'master' into should_unload_classes > - JDK-8322150 should_unload_classes should return true only if a metaspace > oom has been signalled, and other conditions are met. > - Merge branch 'master' into tenure_budget_8314599 > - Fix a scope issue with a local variable. > - jcheck trailing white-space. > - Merge branch 'master' into tenure_budget_8314599 > - Fix another case where divide-by-zero could occur when > ShenandoahOldEvacRatioPercent is set to 100. Some slight refactoring in > the method. > - Fix divide-by-zero when ShenandoahOldEvacRatioPercent==100; minor > refactor & modification of documentation of flag This PR is blocked until PR https://github.com/openjdk/shenandoah/pull/372 integrates. src/hotspot/share/gc/shenandoah/heuristics/shenandoahHeuristics.cpp line 1: > 1: /* Please ignore the changes in this file for now. It'll go away once the fix for the linked ticket https://bugs.openjdk.org/browse/JDK-8322150 is committed. This PR is blocked until that ticet's PR is committed. ------------- PR Comment: https://git.openjdk.org/shenandoah/pull/369#issuecomment-1857546018 PR Review Comment: https://git.openjdk.org/shenandoah/pull/369#discussion_r1427743546 From ysr at openjdk.org Fri Dec 15 16:35:15 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Fri, 15 Dec 2023 16:35:15 GMT Subject: RFR: 8321816: GenShen: Provide a minimum amount of time for an old collection to run In-Reply-To: References: Message-ID: <2KWP95bvJ9D7WXDPM0_eg6AKoqHWlxk7PK9kZhYxSZE=.acfba3ce-7b6c-43f7-8822-2b2c5a5dd523@github.com> On Thu, 14 Dec 2023 19:11:34 GMT, William Kemper wrote: > Young collections are allowed to interrupt old collections. Under heavy load, this may result in starvation of old collections - which may in turn lead to even more memory pressure on the young generation. Adding a minimum guaranteed amount of time for the old gc to make progress will help avoid this situation. Looks generally good, but a couple of clarification/documentation requests/questions. src/hotspot/share/gc/shenandoah/shenandoahRegulatorThread.cpp line 79: > 77: } > 78: } else { > 79: if (should_start_young()) { 1. Could you explain (and document) why we don't do the same check `should_start_young` also at line 88 further below, _then_ `start_young_cycle`, where we find the control thread in the mode `servicing_old`? 2. The split of the roles of checks between the control points of this loop and the heuristics, and the actual requests to start a cycle look like they could do with a bit of clean up for consistency. At the current time (not as a result of your current change), the checks and the actions are scattered over a few routines so that reasoning about their correctness (or their behaviour) can be a bit challenging. This latter clean-up can, however, wait for the future. src/hotspot/share/gc/shenandoah/shenandoahRegulatorThread.hpp line 91: > 89: bool request_concurrent_gc(ShenandoahGenerationType generation); > 90: > 91: bool should_start_young(); Please add a brief line of documentation. ------------- PR Review: https://git.openjdk.org/shenandoah/pull/371#pullrequestreview-1784487189 PR Review Comment: https://git.openjdk.org/shenandoah/pull/371#discussion_r1428177814 PR Review Comment: https://git.openjdk.org/shenandoah/pull/371#discussion_r1428178613 From ysr at openjdk.org Fri Dec 15 16:52:40 2023 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Fri, 15 Dec 2023 16:52:40 GMT Subject: RFR: 8321815: Shenandoah: gc state should be synchronized to java threads only once per safepoint In-Reply-To: References: Message-ID: <2YbhDiO35Y6hAbuZwTaaUDWyC5av3ArrQLwZgtcIkhY=.39c627e5-21fc-49e7-aff6-5295833c58fa@github.com> On Thu, 14 Dec 2023 19:14:07 GMT, William Kemper wrote: > Presently, every change to the gc change is distributed to all java threads. The gc state is only changed on safepoints. For applications with a very large number of java threads this may increase time on the safepoint. It would be better to synchronize the gc state only once per safepoint. This seems reasonable. How many unnecessary state change distributions to threads do we save from this, so as to quantify the savings? It would seem as if, as a verification mechanism that no state changes were dropped from distribution, it might be a good idea to assert in both prologue and epilogue of ShenandoahHeap stop-world ops that the `_gc_state_changed` flag is false, indicating that the last state change was "consumed and distributed" by any stop-world op? ------------- PR Review: https://git.openjdk.org/jdk/pull/17112#pullrequestreview-1784520403 From wkemper at openjdk.org Fri Dec 15 17:29:38 2023 From: wkemper at openjdk.org (William Kemper) Date: Fri, 15 Dec 2023 17:29:38 GMT Subject: RFR: 8321815: Shenandoah: gc state should be synchronized to java threads only once per safepoint In-Reply-To: References: Message-ID: <7XBwU6CWI5uLBUVhZwqcd2K8RSMKyCFqc9GYXWU2CW8=.93723fbc-2975-40da-8cc0-4f2b3b70dc0b@github.com> On Thu, 14 Dec 2023 19:14:07 GMT, William Kemper wrote: > Presently, every change to the gc change is distributed to all java threads. The gc state is only changed on safepoints. For applications with a very large number of java threads this may increase time on the safepoint. It would be better to synchronize the gc state only once per safepoint. This instrumentation isn't included in this PR, but it shows how often GC state changes in a typical cycle: [40.118s][31676][info ][gc,start ] GC(1) Pause Init Mark (unload classes) [40.118s][31676][info ][gc ] GC(1) GC state = 22 [40.118s][31676][info ][gc ] GC(1) Pause Init Mark (unload classes) 0.119ms [41.646s][31676][info ][gc,start ] GC(1) Pause Final Mark (unload classes) [41.646s][31676][info ][gc ] GC(1) GC state = 0 [41.646s][31676][info ][gc ] GC(1) GC state = 10 [41.648s][31676][info ][gc ] GC(1) GC state = 14 [41.648s][31676][info ][gc ] GC(1) GC state = 15 [41.648s][31676][info ][gc ] GC(1) Pause Final Mark (unload classes) 2.324ms [42.774s][31676][info ][gc,start ] GC(1) Pause Init Update Refs [42.774s][31676][info ][gc ] GC(1) GC state = 11 [42.774s][31676][info ][gc ] GC(1) GC state = 1 [42.774s][31676][info ][gc ] GC(1) GC state = 9 [42.774s][31676][info ][gc ] GC(1) Pause Init Update Refs 0.108ms [43.040s][31676][info ][gc,start ] GC(1) Pause Final Update Refs [43.040s][31676][info ][gc ] GC(1) GC state = 1 [43.040s][31676][info ][gc ] GC(1) GC state = 0 [43.040s][31676][info ][gc ] GC(1) Pause Final Update Refs 0.162ms And for some degenerated cycles, many of these changes may happen on the same safepoint: [60.839s][31676][info ][gc,start ] GC(17) Pause Degenerated GC (Mark) [61.417s][31676][info ][gc ] GC(17) GC state = 0 [62.291s][31676][info ][gc ] GC(17) GC state = 4 [62.291s][31676][info ][gc ] GC(17) GC state = 5 [62.309s][31676][info ][gc ] GC(17) GC state = 1 [62.309s][31676][info ][gc ] GC(17) GC state = 1 [62.309s][31676][info ][gc ] GC(17) GC state = 9 [63.372s][31676][info ][gc ] GC(17) GC state = 1 [63.372s][31676][info ][gc ] GC(17) GC state = 0 [63.373s][31676][info ][gc ] GC(17) Pause Degenerated GC (Mark) 31103M->24719M(32768M) 2534.356ms I can add the assertions you described in the prologue and epilogue methods. For what it's worth, changes to the gc state already assert the changes occur on a safepoint. ------------- PR Comment: https://git.openjdk.org/jdk/pull/17112#issuecomment-1858231413 From cslucas at openjdk.org Fri Dec 15 18:21:47 2023 From: cslucas at openjdk.org (Cesar Soares Lucas) Date: Fri, 15 Dec 2023 18:21:47 GMT Subject: RFR: JDK-8241503: C2: Share MacroAssembler between mach nodes during code emission [v6] In-Reply-To: References:

Message-ID: On Fri, 15 Dec 2023 01:47:26 GMT, Ludovic Henry wrote: >> Cesar Soares Lucas has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains seven commits: >> >> - Merge with origin/master >> - Fix build, copyright dates, m4 files. >> - Fix merge >> - Catch up with master branch. >> >> Merge remote-tracking branch 'origin/master' into reuse-macroasm >> - Some inst_mark fixes; Catch up with master. >> - Catch up with changes on master >> - Reuse same C2_MacroAssembler object to emit instructions. > > I've verified it works on riscv64, passing hotspot tier1 and tier2 tests. Thank you for testing @luhenry . Which OS did you run the tests? @offamitkumar, @TheRealMDoerr - can you please re-run the tests on the platforms convenient for you? ------------- PR Comment: https://git.openjdk.org/jdk/pull/16484#issuecomment-1858306441 From wkemper at openjdk.org Fri Dec 15 18:31:44 2023 From: wkemper at openjdk.org (William Kemper) Date: Fri, 15 Dec 2023 18:31:44 GMT Subject: RFR: Merge openjdk/jdk:master [v2] In-Reply-To: