From mli at openjdk.org Sun Oct 1 10:28:36 2023 From: mli at openjdk.org (Hamlin Li) Date: Sun, 1 Oct 2023 10:28:36 GMT Subject: RFR: 8317317: G1: Make TestG1RemSetFlags use createTestJvm In-Reply-To: <9idI83bH5iOe-X793ARiN2Xi4SSOxpzPseaSLqlKSzM=.6be2e8ca-d17e-4e75-84e1-9436ac5a6438@github.com> References: <9idI83bH5iOe-X793ARiN2Xi4SSOxpzPseaSLqlKSzM=.6be2e8ca-d17e-4e75-84e1-9436ac5a6438@github.com> Message-ID: On Fri, 29 Sep 2023 14:51:23 GMT, Leo Korinth wrote: > Minor testing done, I will test more later with other fixes. Marked as reviewed by mli (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15989#pullrequestreview-1651878135 From mli at openjdk.org Sun Oct 1 10:28:36 2023 From: mli at openjdk.org (Hamlin Li) Date: Sun, 1 Oct 2023 10:28:36 GMT Subject: RFR: 8317218: G1: Make TestG1HeapRegionSize use createTestJvm In-Reply-To: References: Message-ID: On Thu, 28 Sep 2023 09:07:04 GMT, Leo Korinth wrote: > Tested with: > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestG1HeapRegionSize.java JTREG='JAVA_OPTIONS=-XX:+UseG1GC'` -> PASS 1 > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestG1HeapRegionSize.java JTREG='JAVA_OPTIONS=-XX:+UseZGC'` -> TOTAL 0 > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestG1HeapRegionSize.java JTREG='JAVA_OPTIONS=-XX:G1HeapRegionSize=80000'` > TOTAL 0 > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestG1HeapRegionSize.java JTREG='JAVA_OPTIONS='` -> PASS 1 Marked as reviewed by mli (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15959#pullrequestreview-1651878187 From mli at openjdk.org Sun Oct 1 10:28:43 2023 From: mli at openjdk.org (Hamlin Li) Date: Sun, 1 Oct 2023 10:28:43 GMT Subject: RFR: 8317188: G1: Make TestG1ConcRefinementThreads use createTestJvm In-Reply-To: <0DdQPK9T0TaJ9GKOZpnPkf_sbMWj7KRNSOnZCCdx8gw=.bce252ce-ea85-4eb6-b7b1-87f9be203ff7@github.com> References: <0DdQPK9T0TaJ9GKOZpnPkf_sbMWj7KRNSOnZCCdx8gw=.bce252ce-ea85-4eb6-b7b1-87f9be203ff7@github.com> Message-ID: On Thu, 28 Sep 2023 08:34:31 GMT, Leo Korinth wrote: > Testing after changes: > > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestG1ConcRefinementThreads.java JTREG='JAVA_OPTIONS=-XX:+UseG1GC' ` -> PASS 1 > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestG1ConcRefinementThreads.java JTREG='JAVA_OPTIONS=-XX:+UseZGC' ` -> TOTAL 0 > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestG1ConcRefinementThreads.java JTREG='JAVA_OPTIONS=-XX:G1ConcRefinementThreads=42' ` -> TOTAL 0 Marked as reviewed by mli (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15958#pullrequestreview-1651878205 From mli at openjdk.org Sun Oct 1 10:28:46 2023 From: mli at openjdk.org (Hamlin Li) Date: Sun, 1 Oct 2023 10:28:46 GMT Subject: RFR: 8317042: G1: Make TestG1ConcMarkStepDurationMillis use createTestJvm In-Reply-To: References: Message-ID: On Wed, 27 Sep 2023 16:37:54 GMT, Leo Korinth wrote: > Use createTestJvm. > > Tested with: `-XX+UseG1GC` (pass 1), `-XX+UseZGC` (total 0), `-XX:G1ConcMarkStepDurationMillis=23` (total 0) and no options (pass 1) Marked as reviewed by mli (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15948#pullrequestreview-1651878221 From mli at openjdk.org Sun Oct 1 10:28:53 2023 From: mli at openjdk.org (Hamlin Li) Date: Sun, 1 Oct 2023 10:28:53 GMT Subject: RFR: 8317316: G1: Make TestG1PercentageOptions use createTestJvm In-Reply-To: <0C-HGCwaWhhH9Lo6IhAoSoOAFu7O-7oMJ0MtAizZxCU=.a67d9a2b-1fee-433c-8f23-670892f7b4e8@github.com> References: <0C-HGCwaWhhH9Lo6IhAoSoOAFu7O-7oMJ0MtAizZxCU=.a67d9a2b-1fee-433c-8f23-670892f7b4e8@github.com> Message-ID: On Fri, 29 Sep 2023 14:24:07 GMT, Leo Korinth wrote: > Done basic testing, will do more testing before pushing (together with other tests) Marked as reviewed by mli (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15987#pullrequestreview-1651878165 From mli at openjdk.org Sun Oct 1 10:43:33 2023 From: mli at openjdk.org (Hamlin Li) Date: Sun, 1 Oct 2023 10:43:33 GMT Subject: RFR: 8316973: GC: Make TestDisableDefaultGC use createTestJvm In-Reply-To: References: Message-ID: On Tue, 26 Sep 2023 17:24:22 GMT, Leo Korinth wrote: > There seems there is no need to strip external vm flags. The `@requires vm.gc=="null"` will handle the case when we iterate different GCs and we will then just skip the test. > > Tested with: > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestDisableDefaultGC.java JTREG='RETAIN=all;VERBOSE=all;JAVA_OPTIONS=-XX:+UseG1GC'` -> total 0 > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestDisableDefaultGC.java` -> success > > started tier 1-5 Marked as reviewed by mli (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15931#pullrequestreview-1651880228 From mli at openjdk.org Sun Oct 1 10:44:43 2023 From: mli at openjdk.org (Hamlin Li) Date: Sun, 1 Oct 2023 10:44:43 GMT Subject: RFR: 8316410: GC: Make TestCompressedClassFlags use createTestJvm In-Reply-To: References: Message-ID: On Tue, 26 Sep 2023 16:50:26 GMT, Leo Korinth wrote: > Use createTestJvm. > > Tested with: > > make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestCompressedClassFlags.java JTREG='RETAIN=all;VERBOSE=all;JAVA_OPTIONS=-XX:+UseSerialGC' ==> TOTAL:1 PASS:1 > make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestCompressedClassFlags.java JTREG='RETAIN=all;VERBOSE=all;JAVA_OPTIONS=-XX:+UseParallelGC' ==> TOTAL:1 PASS:1 > make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestCompressedClassFlags.java JTREG='RETAIN=all;VERBOSE=all;JAVA_OPTIONS=-XX:+UseG1GC' ==> TOTAL:1 PASS:1 > make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestCompressedClassFlags.java JTREG='RETAIN=all;VERBOSE=all;JAVA_OPTIONS=-XX:+UseZGC' ==> TOTAL:1 PASS:1 > make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestCompressedClassFlags.java JTREG='RETAIN=all;VERBOSE=all;JAVA_OPTIONS=-XX:+UseZGC -XX:+ZGenerational' ==> TOTAL:1 PASS:1 > make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestCompressedClassFlags.java JTREG='RETAIN=all;VERBOSE=all;JAVA_OPTIONS=-XX:+UseShenandoahGC' ==> TOTAL:1 PASS:1 > make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestCompressedClassFlags.java JTREG='RETAIN=all;VERBOSE=all;JAVA_OPTIONS=-XX:+UnlockExperimentalVMOptions -XX:+UseEpsilonGC' ==> TOTAL:1 PASS:1 > make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestCompressedClassFlags.java JTREG='RETAIN=all;VERBOSE=all;JAVA_OPTIONS=-XX:CompressedClassSpaceSize=1g -XX:-UseCompressedClassPointers' ==> TOTAL:0 > make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestCompressedClassFlags.java JTREG='RETAIN=all;VERBOSE=all;JAVA_OPTIONS=-XX:CompressedClassSpaceSize=1g' ==> TOTAL:0 > make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestCompressedClassFlags.java JTREG='RETAIN=all;VERBOSE=all;JAVA_OPTIONS=-XX:-UseCompressedClassPointers' ==> TOTAL:0 > make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestCompressedClassFlags.java JTREG='RETAIN=all;VERBOSE=all;JAVA_OPTIONS=-XX:+UseCompressedClassPointers' ==> TOTAL:0 Marked as reviewed by mli (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15929#pullrequestreview-1651880326 From mli at openjdk.org Sun Oct 1 10:49:38 2023 From: mli at openjdk.org (Hamlin Li) Date: Sun, 1 Oct 2023 10:49:38 GMT Subject: RFR: 8315692: Parallelize gc/stress/TestStressRSetCoarsening.java test In-Reply-To: References: Message-ID: <6VUav0RO4LriO26hZ_OrTxnDDw0Ri2_Gx1rnMeGgzCg=.687a8fb2-dace-40cf-b01c-56e4d4b5e591@github.com> On Mon, 18 Sep 2023 13:24:59 GMT, Soumadipta Roy wrote: > Looking at tier4 tests, gc/stress tests take lots of time per test. For example TestStressRSetCoarsening takes about 33 minutes to runin fastdebug mode in macos arm cpu. This limits effective parallelism of tier4 testing on large machines. We can parallelize its `@run` configs to improve effective parallelism for tier4. Below are some of the observation from before any change, partial parallelization and full parallelization: > > * before_release : **585.70s user 23.80s system 106% cpu 9:32.16 total** > * before_fastdebug : **2033.77s user 30.04s system 105% cpu 32:43.10 total** > * fully-parallelized_fastdebug : **2246.94s user 36.97s system 135% cpu 28:07.24 total** > * fully-parallelized_release : **463.52s user 31.54s system 234% cpu 3:31.19 total** > * partially-parallelized_release : **461.15s user 20.88s system 257% cpu 3:06.91 total** > > Even though partial parallelization shows better results it has anomaly 33%-50% of the time where the run results are same as before_release. I have runn each of the above combinations multiple times to establish a consistency and fully parallel gives us the most benefit in terms of total time without deviating too much on user and system times. Marked as reviewed by mli (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15788#pullrequestreview-1651881178 From lmesnik at openjdk.org Sun Oct 1 20:52:39 2023 From: lmesnik at openjdk.org (Leonid Mesnik) Date: Sun, 1 Oct 2023 20:52:39 GMT Subject: RFR: 8315692: Parallelize gc/stress/TestStressRSetCoarsening.java test In-Reply-To: References: Message-ID: On Mon, 18 Sep 2023 13:24:59 GMT, Soumadipta Roy wrote: > Looking at tier4 tests, gc/stress tests take lots of time per test. For example TestStressRSetCoarsening takes about 33 minutes to runin fastdebug mode in macos arm cpu. This limits effective parallelism of tier4 testing on large machines. We can parallelize its `@run` configs to improve effective parallelism for tier4. Below are some of the observation from before any change, partial parallelization and full parallelization: > > * before_release : **585.70s user 23.80s system 106% cpu 9:32.16 total** > * before_fastdebug : **2033.77s user 30.04s system 105% cpu 32:43.10 total** > * fully-parallelized_fastdebug : **2246.94s user 36.97s system 135% cpu 28:07.24 total** > * fully-parallelized_release : **463.52s user 31.54s system 234% cpu 3:31.19 total** > * partially-parallelized_release : **461.15s user 20.88s system 257% cpu 3:06.91 total** > > Even though partial parallelization shows better results it has anomaly 33%-50% of the time where the run results are same as before_release. I have runn each of the above combinations multiple times to establish a consistency and fully parallel gives us the most benefit in terms of total time without deviating too much on user and system times. Thanks for doing this!. You could add meaninful test id if you want. ------------- Marked as reviewed by lmesnik (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/15788#pullrequestreview-1651961743 From lmesnik at openjdk.org Sun Oct 1 21:08:35 2023 From: lmesnik at openjdk.org (Leonid Mesnik) Date: Sun, 1 Oct 2023 21:08:35 GMT Subject: RFR: 8316973: GC: Make TestDisableDefaultGC use createTestJvm In-Reply-To: References: Message-ID: On Tue, 26 Sep 2023 17:24:22 GMT, Leo Korinth wrote: > There seems there is no need to strip external vm flags. The `@requires vm.gc=="null"` will handle the case when we iterate different GCs and we will then just skip the test. > > Tested with: > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestDisableDefaultGC.java JTREG='RETAIN=all;VERBOSE=all;JAVA_OPTIONS=-XX:+UseG1GC'` -> total 0 > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestDisableDefaultGC.java` -> success > > started tier 1-5 Marked as reviewed by lmesnik (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15931#pullrequestreview-1651963605 From tschatzl at openjdk.org Mon Oct 2 08:09:50 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 2 Oct 2023 08:09:50 GMT Subject: RFR: 8315503: G1: Code root scan causes long GC pauses due to imbalanced iteration [v4] In-Reply-To: References:

Message-ID: On Mon, 25 Sep 2023 17:26:56 GMT, Ivan Walulya wrote: >> Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains four commits: >> >> - Merge branch 'master' into 8315503-code-root-scan-imbalance >> - iwalulya review - more (gtest) cleanup >> - iwalulya review >> - initial version that seems to work >> >> Contains kludge to avoid modification of currently scanned code root set. >> Ought to be fixed differently. >> >> Contains debug code in table scanners of CodeRootSet/CardSet to find out problems with table growing >> >> Hashcode hack for code root set, using copy&paste ZHash >> >> Shrink table after clean >> >> Bulk removal of nmethods from code root sets after class unloading. From Ivan. >> >> Cleanup, resize after bulk delete, hashcode verification > > Still LGTM! Thanks @walulyai @albertnetymk for your reviews ------------- PR Comment: https://git.openjdk.org/jdk/pull/15811#issuecomment-1742516392 From tschatzl at openjdk.org Mon Oct 2 08:33:16 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 2 Oct 2023 08:33:16 GMT Subject: Integrated: 8315503: G1: Code root scan causes long GC pauses due to imbalanced iteration In-Reply-To: References: Message-ID: On Tue, 19 Sep 2023 08:04:23 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that modifies the code root (remembered) set to use the CHT as internal representation. > > This removes lots of locking (inhibiting throughput), provides automatic balancing for the code root scan phase, and (parallel) bulk unregistering of nmethdos during code cache unloading improving performance of various pauses that deal with code root sets. > > With a stress test that frequently loads and unloads 6000 classes and associated methods from them we could previously see the following issues: > > During collection pauses: > > [4179,965s][gc,phases ] GC(273) Evacuate Collection Set: 812,18ms > [..] > [4179,965s][gc,phases ] GC(273) Code Root Scan (ms): Min: 0,00, Avg: 59,03, Max: 775,12, Diff: 775,12, Sum: 944,44, Workers: 16 > [...] > [4179,965s][gc,phases ] GC(273) Termination (ms): Min: 0,03, Avg: 643,90, Max: 690,96, Diff: 690,93, Sum: 10302,47, Workers: 16 > > > Code root scan now reduces to ~22ms max on average in this case. > > We have recently seen some imbalances in code root scan and long Remark pauses (thankfully not to that extreme) in other real-world applications too: > > [2466.979s][gc,phases ] GC(131) Code Root Scan (ms): Min: 0.0, Avg: 5.7, Max: 46.4, Diff: 46.4, Sum: 57.0, Workers: 10 > > > Some random comment: > * the mutex for the CHT had to be decreased in priority by one to not conflict with `CodeCache_lock`. This does not seem to be detrimental otherwise. At the same time, I had to move the locks at `nosafepoint-3` to `nosafepoint-4` as well to keep previous ordering. All mutexes with uses of `nosafepoint` as their rank seem to be good now. > > Testing: tier1-5 > > Thanks, > Thomas This pull request has now been integrated. Changeset: 795e5dcc Author: Thomas Schatzl URL: https://git.openjdk.org/jdk/commit/795e5dcc856491031b87a1f2a942681a582673ab Stats: 382 lines in 13 files changed: 218 ins; 114 del; 50 mod 8315503: G1: Code root scan causes long GC pauses due to imbalanced iteration Co-authored-by: Ivan Walulya Reviewed-by: iwalulya, ayang ------------- PR: https://git.openjdk.org/jdk/pull/15811 From shade at openjdk.org Mon Oct 2 08:41:35 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 2 Oct 2023 08:41:35 GMT Subject: RFR: 8316929: Shenandoah: Shenandoah degenerated GC and full GC need to cleanup old OopMapCache entries In-Reply-To: References: Message-ID: On Tue, 26 Sep 2023 12:48:12 GMT, Zhengyu Gu wrote: > During STW root scan, interpreted frame's oop map may be cached. But due to limited cache size (32 entries per instance class), entries may be evicted to old entries list due to collision, should be cleanup in VM_Operation's doit_epilogue(), or risk leaking memory. > > Test: > hotspot_gc_shenandoah (fastdebug and release on MacOSX) This looks okay, but I think you can "just" add `OMC::cleanup_old_entries` in `VM_ShenandoahReferenceOperation` destructor, without introducing the intermediary? ------------- PR Review: https://git.openjdk.org/jdk/pull/15921#pullrequestreview-1652325437 From lkorinth at openjdk.org Mon Oct 2 10:05:36 2023 From: lkorinth at openjdk.org (Leo Korinth) Date: Mon, 2 Oct 2023 10:05:36 GMT Subject: RFR: 8317343: GC: Make TestHeapFreeRatio use createTestJvm Message-ID: <90LqPWddFASKjNrWHZVoKgqHHAgrUBg0FUFnqtMJDCw=.012774fa-388e-4ef2-af4f-059c1a9ec41b@github.com> This fix is implicitly dependent on https://github.com/openjdk/jdk/pull/15986/files for `@requires opt.x` support. Initial testing passes, but I will do more (tier) testing before pushing. ------------- Commit messages: - 8317343: GC: Make TestHeapFreeRatio use createTestJvm Changes: https://git.openjdk.org/jdk/pull/16007/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16007&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8317343 Stats: 3 lines in 1 file changed: 1 ins; 0 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/16007.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16007/head:pull/16007 PR: https://git.openjdk.org/jdk/pull/16007 From duke at openjdk.org Mon Oct 2 10:35:34 2023 From: duke at openjdk.org (Soumadipta Roy) Date: Mon, 2 Oct 2023 10:35:34 GMT Subject: RFR: 8315692: Parallelize gc/stress/TestStressRSetCoarsening.java test In-Reply-To: References:

Message-ID: On Sun, 1 Oct 2023 20:49:39 GMT, Leonid Mesnik wrote: > Thanks for doing this!. You could add meaninful test id if you want. @lmesnik I will push another commit with test ids ------------- PR Comment: https://git.openjdk.org/jdk/pull/15788#issuecomment-1742774439 From duke at openjdk.org Mon Oct 2 11:55:09 2023 From: duke at openjdk.org (Soumadipta Roy) Date: Mon, 2 Oct 2023 11:55:09 GMT Subject: RFR: 8315692: Parallelize gc/stress/TestStressRSetCoarsening.java test [v2] In-Reply-To: References: Message-ID: > Looking at tier4 tests, gc/stress tests take lots of time per test. For example TestStressRSetCoarsening takes about 33 minutes to runin fastdebug mode in macos arm cpu. This limits effective parallelism of tier4 testing on large machines. We can parallelize its `@run` configs to improve effective parallelism for tier4. Below are some of the observation from before any change, partial parallelization and full parallelization: > > * before_release : **585.70s user 23.80s system 106% cpu 9:32.16 total** > * before_fastdebug : **2033.77s user 30.04s system 105% cpu 32:43.10 total** > * fully-parallelized_fastdebug : **2246.94s user 36.97s system 135% cpu 28:07.24 total** > * fully-parallelized_release : **463.52s user 31.54s system 234% cpu 3:31.19 total** > * partially-parallelized_release : **461.15s user 20.88s system 257% cpu 3:06.91 total** > > Even though partial parallelization shows better results it has anomaly 33%-50% of the time where the run results are same as before_release. I have runn each of the above combinations multiple times to establish a consistency and fully parallel gives us the most benefit in terms of total time without deviating too much on user and system times. Soumadipta Roy has updated the pull request incrementally with one additional commit since the last revision: Cosmetic fixes Fixing summary positioning along with some spacing and spelling issues. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/15788/files - new: https://git.openjdk.org/jdk/pull/15788/files/004bda6c..8c73db56 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=15788&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=15788&range=00-01 Stats: 8 lines in 1 file changed: 1 ins; 1 del; 6 mod Patch: https://git.openjdk.org/jdk/pull/15788.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/15788/head:pull/15788 PR: https://git.openjdk.org/jdk/pull/15788 From duke at openjdk.org Mon Oct 2 11:55:10 2023 From: duke at openjdk.org (Soumadipta Roy) Date: Mon, 2 Oct 2023 11:55:10 GMT Subject: RFR: 8315692: Parallelize gc/stress/TestStressRSetCoarsening.java test [v2] In-Reply-To: References:

Message-ID: On Mon, 2 Oct 2023 10:33:00 GMT, Soumadipta Roy wrote: > Thanks for doing this!. You could add meaninful test id if you want. @lmesnik Tried to add some test ids locally, however the naming doesn't sound or look that that great for people to concur unanimously. So keeping it to default. ------------- PR Comment: https://git.openjdk.org/jdk/pull/15788#issuecomment-1742869496 From lkorinth at openjdk.org Mon Oct 2 11:55:33 2023 From: lkorinth at openjdk.org (Leo Korinth) Date: Mon, 2 Oct 2023 11:55:33 GMT Subject: RFR: 8317347: Parallel: Make TestInitialTenuringThreshold use createTestJvm Message-ID: Also add parallel flag for first invocation (this is a bug). Initial testing ok, but will run tiers before pushing. ------------- Commit messages: - 8317347: Parallel: Make TestInitialTenuringThreshold use createTestJvm Changes: https://git.openjdk.org/jdk/pull/16009/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16009&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8317347 Stats: 5 lines in 1 file changed: 1 ins; 0 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/16009.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16009/head:pull/16009 PR: https://git.openjdk.org/jdk/pull/16009 From shade at openjdk.org Mon Oct 2 12:00:12 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 2 Oct 2023 12:00:12 GMT Subject: RFR: 8315692: Parallelize gc/stress/TestStressRSetCoarsening.java test [v2] In-Reply-To: References:

Message-ID: On Mon, 2 Oct 2023 11:55:09 GMT, Soumadipta Roy wrote: >> Looking at tier4 tests, gc/stress tests take lots of time per test. For example TestStressRSetCoarsening takes about 33 minutes to runin fastdebug mode in macos arm cpu. This limits effective parallelism of tier4 testing on large machines. We can parallelize its `@run` configs to improve effective parallelism for tier4. Below are some of the observation from before any change, partial parallelization and full parallelization: >> >> * before_release : **585.70s user 23.80s system 106% cpu 9:32.16 total** >> * before_fastdebug : **2033.77s user 30.04s system 105% cpu 32:43.10 total** >> * fully-parallelized_fastdebug : **2246.94s user 36.97s system 135% cpu 28:07.24 total** >> * fully-parallelized_release : **463.52s user 31.54s system 234% cpu 3:31.19 total** >> * partially-parallelized_release : **461.15s user 20.88s system 257% cpu 3:06.91 total** >> >> Even though partial parallelization shows better results it has anomaly 33%-50% of the time where the run results are same as before_release. I have runn each of the above combinations multiple times to establish a consistency and fully parallel gives us the most benefit in terms of total time without deviating too much on user and system times. > > Soumadipta Roy has updated the pull request incrementally with one additional commit since the last revision: > > Cosmetic fixes > > Fixing summary positioning along with some spacing and spelling issues. Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15788#pullrequestreview-1652648456 From zgu at openjdk.org Mon Oct 2 13:37:05 2023 From: zgu at openjdk.org (Zhengyu Gu) Date: Mon, 2 Oct 2023 13:37:05 GMT Subject: RFR: 8316929: Shenandoah: Shenandoah degenerated GC and full GC need to cleanup old OopMapCache entries [v2] In-Reply-To: References: Message-ID: > During STW root scan, interpreted frame's oop map may be cached. But due to limited cache size (32 entries per instance class), entries may be evicted to old entries list due to collision, should be cleanup in VM_Operation's doit_epilogue(), or risk leaking memory. > > Test: > hotspot_gc_shenandoah (fastdebug and release on MacOSX) Zhengyu Gu has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision: - Merge branch 'master' into JDK-8316929 - cleanup - 8316929: Shenandoah: Shenandoah degenerated GC and full GC need to cleanup old OopMapCache entries ------------- Changes: - all: https://git.openjdk.org/jdk/pull/15921/files - new: https://git.openjdk.org/jdk/pull/15921/files/92dc4f4e..b84d2cad Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=15921&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=15921&range=00-01 Stats: 7940 lines in 283 files changed: 5780 ins; 1041 del; 1119 mod Patch: https://git.openjdk.org/jdk/pull/15921.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/15921/head:pull/15921 PR: https://git.openjdk.org/jdk/pull/15921 From zgu at openjdk.org Mon Oct 2 13:41:56 2023 From: zgu at openjdk.org (Zhengyu Gu) Date: Mon, 2 Oct 2023 13:41:56 GMT Subject: RFR: 8316929: Shenandoah: Shenandoah degenerated GC and full GC need to cleanup old OopMapCache entries [v3] In-Reply-To: References: Message-ID: > During STW root scan, interpreted frame's oop map may be cached. But due to limited cache size (32 entries per instance class), entries may be evicted to old entries list due to collision, should be cleanup in VM_Operation's doit_epilogue(), or risk leaking memory. > > Test: > hotspot_gc_shenandoah (fastdebug and release on MacOSX) Zhengyu Gu has updated the pull request incrementally with one additional commit since the last revision: Fix order ------------- Changes: - all: https://git.openjdk.org/jdk/pull/15921/files - new: https://git.openjdk.org/jdk/pull/15921/files/b84d2cad..c1e9cf27 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=15921&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=15921&range=01-02 Stats: 2 lines in 1 file changed: 1 ins; 1 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/15921.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/15921/head:pull/15921 PR: https://git.openjdk.org/jdk/pull/15921 From zgu at openjdk.org Mon Oct 2 13:48:58 2023 From: zgu at openjdk.org (Zhengyu Gu) Date: Mon, 2 Oct 2023 13:48:58 GMT Subject: RFR: 8316929: Shenandoah: Shenandoah degenerated GC and full GC need to cleanup old OopMapCache entries [v4] In-Reply-To: References: Message-ID: > During STW root scan, interpreted frame's oop map may be cached. But due to limited cache size (32 entries per instance class), entries may be evicted to old entries list due to collision, should be cleanup in VM_Operation's doit_epilogue(), or risk leaking memory. > > Test: > hotspot_gc_shenandoah (fastdebug and release on MacOSX) Zhengyu Gu has updated the pull request incrementally with one additional commit since the last revision: More cleanup ------------- Changes: - all: https://git.openjdk.org/jdk/pull/15921/files - new: https://git.openjdk.org/jdk/pull/15921/files/c1e9cf27..4f41dc14 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=15921&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=15921&range=02-03 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/15921.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/15921/head:pull/15921 PR: https://git.openjdk.org/jdk/pull/15921 From zgu at openjdk.org Mon Oct 2 13:51:13 2023 From: zgu at openjdk.org (Zhengyu Gu) Date: Mon, 2 Oct 2023 13:51:13 GMT Subject: RFR: 8316929: Shenandoah: Shenandoah degenerated GC and full GC need to cleanup old OopMapCache entries [v3] In-Reply-To: References:

Message-ID: On Mon, 2 Oct 2023 13:41:56 GMT, Zhengyu Gu wrote: >> During STW root scan, interpreted frame's oop map may be cached. But due to limited cache size (32 entries per instance class), entries may be evicted to old entries list due to collision, should be cleanup in VM_Operation's doit_epilogue(), or risk leaking memory. >> >> Test: >> hotspot_gc_shenandoah (fastdebug and release on MacOSX) > > Zhengyu Gu has updated the pull request incrementally with one additional commit since the last revision: > > Fix order > This looks okay, but I think you can "just" add `OMC::cleanup_old_entries` in `VM_ShenandoahReferenceOperation` destructor, without introducing the intermediary? > This looks okay, but I think you can "just" add `OMC::cleanup_old_entries` in `VM_ShenandoahReferenceOperation` destructor, without introducing the intermediary? Place the call in `doit_epilogue()` for the symmetry with other [GCs](https://github.com/openjdk/jdk/blob/master/src/hotspot/share/gc/shared/gcVMOperations.cpp#L135). But you are right, we don't need another VM operation. ------------- PR Comment: https://git.openjdk.org/jdk/pull/15921#issuecomment-1743052006 From shade at openjdk.org Mon Oct 2 14:32:14 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 2 Oct 2023 14:32:14 GMT Subject: RFR: 8316929: Shenandoah: Shenandoah degenerated GC and full GC need to cleanup old OopMapCache entries [v4] In-Reply-To: References:

Message-ID: On Mon, 2 Oct 2023 13:48:58 GMT, Zhengyu Gu wrote: >> During STW root scan, interpreted frame's oop map may be cached. But due to limited cache size (32 entries per instance class), entries may be evicted to old entries list due to collision, should be cleanup in VM_Operation's doit_epilogue(), or risk leaking memory. >> >> Test: >> hotspot_gc_shenandoah (fastdebug and release on MacOSX) > > Zhengyu Gu has updated the pull request incrementally with one additional commit since the last revision: > > More cleanup Looks fine, thanks! ------------- Marked as reviewed by shade (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/15921#pullrequestreview-1652875148 From lkorinth at openjdk.org Mon Oct 2 15:17:28 2023 From: lkorinth at openjdk.org (Leo Korinth) Date: Mon, 2 Oct 2023 15:17:28 GMT Subject: RFR: 8317358: G1: Make TestMaxNewSize use createTestJvm Message-ID: In addition remove deprecated `Long(long)` constructor and rewrite `compareTo` to use `> 0` instead of `== 1`. Also remove unused `isRunningG1(String[] args)` and `checkIncompatibleNewSize(String[] flags)` Minimal testing completed, will run tier testing before pushing. ------------- Commit messages: - 8317358: G1: Make TestMaxNewSize use createTestJvm Changes: https://git.openjdk.org/jdk/pull/16012/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16012&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8317358 Stats: 30 lines in 1 file changed: 0 ins; 23 del; 7 mod Patch: https://git.openjdk.org/jdk/pull/16012.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16012/head:pull/16012 PR: https://git.openjdk.org/jdk/pull/16012 From duke at openjdk.org Mon Oct 2 15:20:31 2023 From: duke at openjdk.org (Soumadipta Roy) Date: Mon, 2 Oct 2023 15:20:31 GMT Subject: Integrated: 8315692: Parallelize gc/stress/TestStressRSetCoarsening.java test In-Reply-To: References: Message-ID: On Mon, 18 Sep 2023 13:24:59 GMT, Soumadipta Roy wrote: > Looking at tier4 tests, gc/stress tests take lots of time per test. For example TestStressRSetCoarsening takes about 33 minutes to runin fastdebug mode in macos arm cpu. This limits effective parallelism of tier4 testing on large machines. We can parallelize its `@run` configs to improve effective parallelism for tier4. Below are some of the observation from before any change, partial parallelization and full parallelization: > > * before_release : **585.70s user 23.80s system 106% cpu 9:32.16 total** > * before_fastdebug : **2033.77s user 30.04s system 105% cpu 32:43.10 total** > * fully-parallelized_fastdebug : **2246.94s user 36.97s system 135% cpu 28:07.24 total** > * fully-parallelized_release : **463.52s user 31.54s system 234% cpu 3:31.19 total** > * partially-parallelized_release : **461.15s user 20.88s system 257% cpu 3:06.91 total** > > Even though partial parallelization shows better results it has anomaly 33%-50% of the time where the run results are same as before_release. I have runn each of the above combinations multiple times to establish a consistency and fully parallel gives us the most benefit in terms of total time without deviating too much on user and system times. This pull request has now been integrated. Changeset: a564d436 Author: Soumadipta Roy Committer: Aleksey Shipilev URL: https://git.openjdk.org/jdk/commit/a564d436c722f14041231158f21c4ad3a2f6a3a5 Stats: 63 lines in 1 file changed: 55 ins; 1 del; 7 mod 8315692: Parallelize gc/stress/TestStressRSetCoarsening.java test Reviewed-by: shade, mli, lmesnik, tschatzl ------------- PR: https://git.openjdk.org/jdk/pull/15788 From wkemper at openjdk.org Mon Oct 2 20:18:31 2023 From: wkemper at openjdk.org (William Kemper) Date: Mon, 2 Oct 2023 20:18:31 GMT Subject: RFR: 8316632: Shenandoah: Raise OOME when gc threshold is exceeded [v3] In-Reply-To: <2bXcUWi2pXLKmpiPTNIT2_xu2fGBwPHB5YUMInN-OFo=.92656fc9-7549-41fa-bb9d-445cee3db5f8@github.com> References:

<2bXcUWi2pXLKmpiPTNIT2_xu2fGBwPHB5YUMInN-OFo=.92656fc9-7549-41fa-bb9d-445cee3db5f8@github.com> Message-ID: On Fri, 29 Sep 2023 16:28:30 GMT, Aleksey Shipilev wrote: >> William Kemper has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains eight additional commits since the last revision: >> >> - Merge remote-tracking branch 'openjdk/master' into shenandoah-oome-redux >> - Extend exemption for EATests that rely on timely OOME to Shenandoah >> - Improve comment, increase default for no progress threshold >> - Allocator should not reset bad progress count >> - Allocator should not reset bad progress count >> - Fix 32-bit build error >> - Do not use atomics in header >> - Signal gc threshold exceeded when appropriate > > test/hotspot/jtreg/TEST.groups line 612: > >> 610: gtest/NMTGtests.java >> 611: >> 612: hotspot_oome = \ > > Adding new test groups require additional attention. I think this should be removed. > > Note that during debugging/testing, you can run the same by: > > > $ make test TEST="runtime/reflect/ReflectOutOfMemoryError.java gc/InfiniteList.java runtime/ClassInitErrors/TestOutOfMemoryDuringInit.java" I'll revert this. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/15852#discussion_r1343104448 From zgu at openjdk.org Mon Oct 2 20:56:33 2023 From: zgu at openjdk.org (Zhengyu Gu) Date: Mon, 2 Oct 2023 20:56:33 GMT Subject: RFR: 8316929: Shenandoah: Shenandoah degenerated GC and full GC need to cleanup old OopMapCache entries [v4] In-Reply-To: References:

Message-ID: On Mon, 2 Oct 2023 14:29:11 GMT, Aleksey Shipilev wrote: >> Zhengyu Gu has updated the pull request incrementally with one additional commit since the last revision: >> >> More cleanup > > Looks fine, thanks! Thanks, @shipilev ------------- PR Comment: https://git.openjdk.org/jdk/pull/15921#issuecomment-1743693867 From zgu at openjdk.org Mon Oct 2 20:56:38 2023 From: zgu at openjdk.org (Zhengyu Gu) Date: Mon, 2 Oct 2023 20:56:38 GMT Subject: Integrated: 8316929: Shenandoah: Shenandoah degenerated GC and full GC need to cleanup old OopMapCache entries In-Reply-To: References: Message-ID: On Tue, 26 Sep 2023 12:48:12 GMT, Zhengyu Gu wrote: > During STW root scan, interpreted frame's oop map may be cached. But due to limited cache size (32 entries per instance class), entries may be evicted to old entries list due to collision, should be cleanup in VM_Operation's doit_epilogue(), or risk leaking memory. > > Test: > hotspot_gc_shenandoah (fastdebug and release on MacOSX) This pull request has now been integrated. Changeset: e25121d1 Author: Zhengyu Gu URL: https://git.openjdk.org/jdk/commit/e25121d1d908bd74e7a5914d85284ab322bed1a3 Stats: 2 lines in 1 file changed: 2 ins; 0 del; 0 mod 8316929: Shenandoah: Shenandoah degenerated GC and full GC need to cleanup old OopMapCache entries Reviewed-by: shade ------------- PR: https://git.openjdk.org/jdk/pull/15921 From wkemper at openjdk.org Mon Oct 2 21:31:13 2023 From: wkemper at openjdk.org (William Kemper) Date: Mon, 2 Oct 2023 21:31:13 GMT Subject: RFR: 8316632: Shenandoah: Raise OOME when gc threshold is exceeded [v4] In-Reply-To: References: Message-ID: > Shenandoah will run back-to-back full GCs and _almost_ grind mutator progress to a halt before eventually exhausting memory. This change will have Shenandoah raise a gc threshold exceeded exception if the collector fails to make progress after `ShenandoahNoProgressThreshold` full GC cycles (default is 3). William Kemper has updated the pull request incrementally with two additional commits since the last revision: - Merge check for no-progress into retry allocation block - Revert change to TEST.groups ------------- Changes: - all: https://git.openjdk.org/jdk/pull/15852/files - new: https://git.openjdk.org/jdk/pull/15852/files/cd4989e7..1971467f Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=15852&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=15852&range=02-03 Stats: 33 lines in 2 files changed: 10 ins; 14 del; 9 mod Patch: https://git.openjdk.org/jdk/pull/15852.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/15852/head:pull/15852 PR: https://git.openjdk.org/jdk/pull/15852 From wkemper at openjdk.org Mon Oct 2 21:31:19 2023 From: wkemper at openjdk.org (William Kemper) Date: Mon, 2 Oct 2023 21:31:19 GMT Subject: RFR: 8316632: Shenandoah: Raise OOME when gc threshold is exceeded [v3] In-Reply-To: <2bXcUWi2pXLKmpiPTNIT2_xu2fGBwPHB5YUMInN-OFo=.92656fc9-7549-41fa-bb9d-445cee3db5f8@github.com> References:

<2bXcUWi2pXLKmpiPTNIT2_xu2fGBwPHB5YUMInN-OFo=.92656fc9-7549-41fa-bb9d-445cee3db5f8@github.com> Message-ID: On Fri, 29 Sep 2023 16:40:22 GMT, Aleksey Shipilev wrote: >> William Kemper has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains eight additional commits since the last revision: >> >> - Merge remote-tracking branch 'openjdk/master' into shenandoah-oome-redux >> - Extend exemption for EATests that rely on timely OOME to Shenandoah >> - Improve comment, increase default for no progress threshold >> - Allocator should not reset bad progress count >> - Allocator should not reset bad progress count >> - Fix 32-bit build error >> - Do not use atomics in header >> - Signal gc threshold exceeded when appropriate > > src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 877: > >> 875: } >> 876: >> 877: if (result == nullptr && !req.is_lab_alloc() && get_gc_no_progress_count() > ShenandoahNoProgressThreshold) { > > Can this be moved to the block that already does allocation-after-gc retries? That block already exits with `nullptr` (implies delivering OOME), and it already calls `handle_alloc_failure` (thus triggering GC). We "only" need it to specialize for `is_lab_alloc` and `ShenandoahNoProgressThreshold`? > > This PR changes that block anyway... Yes, I've made this change as you suggest, but I'm not sure it improves readability. Note that when the `ShenandoahNoProgressThreshold` is exceeded, we do _not_ retry the allocation or wait for another gc cycle to complete. > src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 902: > >> 900: ResourceMark rm; >> 901: log_debug(gc, alloc)("Thread: %s, Result: " PTR_FORMAT ", Shared: %s, Size: " SIZE_FORMAT ", Original: " SIZE_FORMAT ", Latest: " SIZE_FORMAT, >> 902: Thread::current()->name(), p2i(result), BOOL_TO_STR(!req.is_lab_alloc()), req.size(), original_count, get_gc_no_progress_count()); > > There is `type_string()` that can be used instead of `is_lab_alloc()` here. TIL - thank you. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/15852#discussion_r1343174921 PR Review Comment: https://git.openjdk.org/jdk/pull/15852#discussion_r1343175156 From ayang at openjdk.org Tue Oct 3 08:30:05 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 3 Oct 2023 08:30:05 GMT Subject: RFR: 8317354: Serial: Move DirtyCardToOopClosure to gc/serial folder Message-ID: <49vr8f73ylqcYr7iaxBziG_cTPkZjATyCiCHwRxSoFg=.ac721666-4eb0-45a7-b312-b8f2d90c8f91@github.com> Simple relocating code to more appropriate folder. `ClearNoncleanCardWrapper` is moved to cpp as well. One can sth like `git diff

--color-moved=dimmed_zebra` to better review this PR. ------------- Commit messages: - s1-dirty-card-closure Changes: https://git.openjdk.org/jdk/pull/16025/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16025&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8317354 Stats: 362 lines in 4 files changed: 181 ins; 181 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/16025.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16025/head:pull/16025 PR: https://git.openjdk.org/jdk/pull/16025 From tschatzl at openjdk.org Tue Oct 3 10:20:32 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 3 Oct 2023 10:20:32 GMT Subject: RFR: 8317354: Serial: Move DirtyCardToOopClosure to gc/serial folder In-Reply-To: <49vr8f73ylqcYr7iaxBziG_cTPkZjATyCiCHwRxSoFg=.ac721666-4eb0-45a7-b312-b8f2d90c8f91@github.com> References: <49vr8f73ylqcYr7iaxBziG_cTPkZjATyCiCHwRxSoFg=.ac721666-4eb0-45a7-b312-b8f2d90c8f91@github.com> Message-ID: On Tue, 3 Oct 2023 08:12:42 GMT, Albert Mingkun Yang wrote: > Simple relocating code to more appropriate folder. `ClearNoncleanCardWrapper` is moved to cpp as well. > > One can sth like `git diff

--color-moved=dimmed_zebra` to better review this PR. Lgtm ------------- Marked as reviewed by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16025#pullrequestreview-1654854908 From tschatzl at openjdk.org Tue Oct 3 10:22:32 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 3 Oct 2023 10:22:32 GMT Subject: RFR: 8317042: G1: Make TestG1ConcMarkStepDurationMillis use createTestJvm In-Reply-To: References: Message-ID: On Wed, 27 Sep 2023 16:37:54 GMT, Leo Korinth wrote: > Use createTestJvm. > > Tested with: `-XX+UseG1GC` (pass 1), `-XX+UseZGC` (total 0), `-XX:G1ConcMarkStepDurationMillis=23` (total 0) and no options (pass 1) Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15948#pullrequestreview-1654858311 From tschatzl at openjdk.org Tue Oct 3 10:23:33 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 3 Oct 2023 10:23:33 GMT Subject: RFR: 8317188: G1: Make TestG1ConcRefinementThreads use createTestJvm In-Reply-To: <0DdQPK9T0TaJ9GKOZpnPkf_sbMWj7KRNSOnZCCdx8gw=.bce252ce-ea85-4eb6-b7b1-87f9be203ff7@github.com> References: <0DdQPK9T0TaJ9GKOZpnPkf_sbMWj7KRNSOnZCCdx8gw=.bce252ce-ea85-4eb6-b7b1-87f9be203ff7@github.com> Message-ID: On Thu, 28 Sep 2023 08:34:31 GMT, Leo Korinth wrote: > Testing after changes: > > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestG1ConcRefinementThreads.java JTREG='JAVA_OPTIONS=-XX:+UseG1GC' ` -> PASS 1 > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestG1ConcRefinementThreads.java JTREG='JAVA_OPTIONS=-XX:+UseZGC' ` -> TOTAL 0 > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestG1ConcRefinementThreads.java JTREG='JAVA_OPTIONS=-XX:G1ConcRefinementThreads=42' ` -> TOTAL 0 Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15958#pullrequestreview-1654860006 From tschatzl at openjdk.org Tue Oct 3 10:24:35 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 3 Oct 2023 10:24:35 GMT Subject: RFR: 8317218: G1: Make TestG1HeapRegionSize use createTestJvm In-Reply-To: References: Message-ID: On Thu, 28 Sep 2023 09:07:04 GMT, Leo Korinth wrote: > Tested with: > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestG1HeapRegionSize.java JTREG='JAVA_OPTIONS=-XX:+UseG1GC'` -> PASS 1 > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestG1HeapRegionSize.java JTREG='JAVA_OPTIONS=-XX:+UseZGC'` -> TOTAL 0 > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestG1HeapRegionSize.java JTREG='JAVA_OPTIONS=-XX:G1HeapRegionSize=80000'` > TOTAL 0 > `make run-test TEST=open/test/hotspot/jtreg/gc/arguments/TestG1HeapRegionSize.java JTREG='JAVA_OPTIONS='` -> PASS 1 Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15959#pullrequestreview-1654861258 From tschatzl at openjdk.org Tue Oct 3 10:25:32 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 3 Oct 2023 10:25:32 GMT Subject: RFR: 8317316: G1: Make TestG1PercentageOptions use createTestJvm In-Reply-To: <0C-HGCwaWhhH9Lo6IhAoSoOAFu7O-7oMJ0MtAizZxCU=.a67d9a2b-1fee-433c-8f23-670892f7b4e8@github.com> References: <0C-HGCwaWhhH9Lo6IhAoSoOAFu7O-7oMJ0MtAizZxCU=.a67d9a2b-1fee-433c-8f23-670892f7b4e8@github.com> Message-ID: On Fri, 29 Sep 2023 14:24:07 GMT, Leo Korinth wrote: > Done basic testing, will do more testing before pushing (together with other tests) Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15987#pullrequestreview-1654863391 From tschatzl at openjdk.org Tue Oct 3 10:28:35 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 3 Oct 2023 10:28:35 GMT Subject: RFR: 8317343: GC: Make TestHeapFreeRatio use createTestJvm In-Reply-To: <90LqPWddFASKjNrWHZVoKgqHHAgrUBg0FUFnqtMJDCw=.012774fa-388e-4ef2-af4f-059c1a9ec41b@github.com> References: <90LqPWddFASKjNrWHZVoKgqHHAgrUBg0FUFnqtMJDCw=.012774fa-388e-4ef2-af4f-059c1a9ec41b@github.com> Message-ID: On Mon, 2 Oct 2023 09:57:55 GMT, Leo Korinth wrote: > This fix is implicitly dependent on https://github.com/openjdk/jdk/pull/15986/files for `@requires opt.x` support. Initial testing passes, but I will do more (tier) testing before pushing. Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/16007#pullrequestreview-1654868543 From tschatzl at openjdk.org Tue Oct 3 10:28:35 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 3 Oct 2023 10:28:35 GMT Subject: RFR: 8317317: G1: Make TestG1RemSetFlags use createTestJvm In-Reply-To: <9idI83bH5iOe-X793ARiN2Xi4SSOxpzPseaSLqlKSzM=.6be2e8ca-d17e-4e75-84e1-9436ac5a6438@github.com> References: <9idI83bH5iOe-X793ARiN2Xi4SSOxpzPseaSLqlKSzM=.6be2e8ca-d17e-4e75-84e1-9436ac5a6438@github.com> Message-ID: On Fri, 29 Sep 2023 14:51:23 GMT, Leo Korinth wrote: > Minor testing done, I will test more later with other fixes. Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15989#pullrequestreview-1654867432 From duke at openjdk.org Tue Oct 3 14:12:12 2023 From: duke at openjdk.org (Soumadipta Roy) Date: Tue, 3 Oct 2023 14:12:12 GMT Subject: RFR: 8316608: Enable parallelism in vmTestbase/gc/vector tests Message-ID: The commit includes changes to unblock parallelism for more `hotspot:tier4` tests. in `vmTestbase/gc/vector` tests. Below are the before and after run comparisons: * before_fastdebug: **3480.71s user 83.97s system 830% cpu 7:09.41 total** * before_release: **1369.61s user 147.03s system 371% cpu 6:48.63 total** * after_fastdebug: **2214.52s user 63.19s system 2374% cpu 1:35.94 total** * after_release: **1130.28s user 110.97s system 2478% cpu 50.089 total** ------------- Commit messages: - 8316608: Enable parallelism in vmTestbase/gc/vector tests Changes: https://git.openjdk.org/jdk/pull/16028/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16028&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8316608 Stats: 299 lines in 13 files changed: 0 ins; 299 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/16028.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16028/head:pull/16028 PR: https://git.openjdk.org/jdk/pull/16028 From dcubed at openjdk.org Tue Oct 3 18:28:01 2023 From: dcubed at openjdk.org (Daniel D. Daugherty) Date: Tue, 3 Oct 2023 18:28:01 GMT Subject: RFR: 8317446: ProblemList gc/arguments/TestNewSizeFlags.java on macosx-aarch64 in Xcomp Message-ID: Trivial fixes to ProblemList noisy tests in the JDK22 CI; [JDK-8317446](https://bugs.openjdk.org/browse/JDK-8317446) ProblemList gc/arguments/TestNewSizeFlags.java on macosx-aarch64 in Xcomp [JDK-8317448](https://bugs.openjdk.org/browse/JDK-8317448) ProblemList compiler/interpreter/TestVerifyStackAfterDeopt.java on macosx-aarch64 in Xcomp [JDK-8317449](https://bugs.openjdk.org/browse/JDK-8317449) ProblemList serviceability/jvmti/stress/StackTrace/NotSuspended/GetStackTraceNotSuspendedStressTest.java on several platforms ------------- Commit messages: - 8317449: ProblemList serviceability/jvmti/stress/StackTrace/NotSuspended/GetStackTraceNotSuspendedStressTest.java on several platforms - 8317448: ProblemList compiler/interpreter/TestVerifyStackAfterDeopt.java on macosx-aarch64 in Xcomp - 8317446: ProblemList gc/arguments/TestNewSizeFlags.java on macosx-aarch64 in Xcomp Changes: https://git.openjdk.org/jdk/pull/16031/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16031&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8317446 Stats: 5 lines in 2 files changed: 5 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/16031.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16031/head:pull/16031 PR: https://git.openjdk.org/jdk/pull/16031 From thartmann at openjdk.org Tue Oct 3 18:33:38 2023 From: thartmann at openjdk.org (Tobias Hartmann) Date: Tue, 3 Oct 2023 18:33:38 GMT Subject: RFR: 8317446: ProblemList gc/arguments/TestNewSizeFlags.java on macosx-aarch64 in Xcomp In-Reply-To: References: Message-ID: On Tue, 3 Oct 2023 17:53:27 GMT, Daniel D. Daugherty wrote: > Trivial fixes to ProblemList noisy tests in the JDK22 CI; > > [JDK-8317446](https://bugs.openjdk.org/browse/JDK-8317446) ProblemList gc/arguments/TestNewSizeFlags.java on macosx-aarch64 in Xcomp > [JDK-8317448](https://bugs.openjdk.org/browse/JDK-8317448) ProblemList compiler/interpreter/TestVerifyStackAfterDeopt.java on macosx-aarch64 in Xcomp > [JDK-8317449](https://bugs.openjdk.org/browse/JDK-8317449) ProblemList serviceability/jvmti/stress/StackTrace/NotSuspended/GetStackTraceNotSuspendedStressTest.java on several platforms Looks good and trivial. ------------- Marked as reviewed by thartmann (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16031#pullrequestreview-1655866995 From dcubed at openjdk.org Tue Oct 3 19:18:50 2023 From: dcubed at openjdk.org (Daniel D. Daugherty) Date: Tue, 3 Oct 2023 19:18:50 GMT Subject: RFR: 8317446: ProblemList gc/arguments/TestNewSizeFlags.java on macosx-aarch64 in Xcomp In-Reply-To: References:

Message-ID: On Tue, 3 Oct 2023 18:30:44 GMT, Tobias Hartmann wrote: >> Trivial fixes to ProblemList noisy tests in the JDK22 CI; >> >> [JDK-8317446](https://bugs.openjdk.org/browse/JDK-8317446) ProblemList gc/arguments/TestNewSizeFlags.java on macosx-aarch64 in Xcomp >> [JDK-8317448](https://bugs.openjdk.org/browse/JDK-8317448) ProblemList compiler/interpreter/TestVerifyStackAfterDeopt.java on macosx-aarch64 in Xcomp >> [JDK-8317449](https://bugs.openjdk.org/browse/JDK-8317449) ProblemList serviceability/jvmti/stress/StackTrace/NotSuspended/GetStackTraceNotSuspendedStressTest.java on several platforms > > Looks good and trivial. @TobiHartmann - Thanks for the fast review! ------------- PR Comment: https://git.openjdk.org/jdk/pull/16031#issuecomment-1745573937 From dcubed at openjdk.org Tue Oct 3 19:22:11 2023 From: dcubed at openjdk.org (Daniel D. Daugherty) Date: Tue, 3 Oct 2023 19:22:11 GMT Subject: Integrated: 8317446: ProblemList gc/arguments/TestNewSizeFlags.java on macosx-aarch64 in Xcomp In-Reply-To: References: Message-ID: On Tue, 3 Oct 2023 17:53:27 GMT, Daniel D. Daugherty wrote: > Trivial fixes to ProblemList noisy tests in the JDK22 CI; > > [JDK-8317446](https://bugs.openjdk.org/browse/JDK-8317446) ProblemList gc/arguments/TestNewSizeFlags.java on macosx-aarch64 in Xcomp > [JDK-8317448](https://bugs.openjdk.org/browse/JDK-8317448) ProblemList compiler/interpreter/TestVerifyStackAfterDeopt.java on macosx-aarch64 in Xcomp > [JDK-8317449](https://bugs.openjdk.org/browse/JDK-8317449) ProblemList serviceability/jvmti/stress/StackTrace/NotSuspended/GetStackTraceNotSuspendedStressTest.java on several platforms This pull request has now been integrated. Changeset: 8ff10a0d Author: Daniel D. Daugherty URL: https://git.openjdk.org/jdk/commit/8ff10a0d3520fbeae9fe7aac4226d65b93ec79f8 Stats: 5 lines in 2 files changed: 5 ins; 0 del; 0 mod 8317446: ProblemList gc/arguments/TestNewSizeFlags.java on macosx-aarch64 in Xcomp 8317448: ProblemList compiler/interpreter/TestVerifyStackAfterDeopt.java on macosx-aarch64 in Xcomp 8317449: ProblemList serviceability/jvmti/stress/StackTrace/NotSuspended/GetStackTraceNotSuspendedStressTest.java on several platforms Reviewed-by: thartmann ------------- PR: https://git.openjdk.org/jdk/pull/16031 From duke at openjdk.org Tue Oct 3 21:37:46 2023 From: duke at openjdk.org (Diego Pino Navarro) Date: Tue, 3 Oct 2023 21:37:46 GMT Subject: RFR: 8308507: G1: GClocker induced GCs can starve threads requiring memory leading to OOME [v10] In-Reply-To: <2meSHY2dSGLyLNXPTeSwOBbsCh6Ki0itMEfh2x-GhUM=.a715c087-a300-452f-b406-04168843b363@github.com> References: <2meSHY2dSGLyLNXPTeSwOBbsCh6Ki0itMEfh2x-GhUM=.a715c087-a300-452f-b406-04168843b363@github.com> Message-ID: On Mon, 12 Jun 2023 13:32:52 GMT, Ivan Walulya wrote: >> Please review this change which fixes the thread starvation problem during allocation for G1. >> >> The starvation problem is not limited to GCLocker, however, currently, it manifests as an OOME only when GCLocker is active. In other cases, the starvation only affects the "starved" thread as it may loop indefinitely. >> >> Starvation with an active GCLocker happens as below: >> >> 1. Thread A tries to allocate memory as normal, and tries to start a GC; the GCLocker is active and so the thread gets stalled waiting for the GC. >> 2. GCLocker induced GC executes and frees some memory. >> 3. Thread A does not get any of that memory, but other threads also waiting for memory. >> 4. Goto 1 until the gclocker retry count has been reached. >> >> In this change, we take the general approach to solving starvation problems with announcement tables (request queues). On slow allocation, a thread that wishes to complete an Allocation GC and then attempt an allocation, announces its allocation request before proceeding to participate in a race to execute a GC safepoint. Whichever thread succeeds in executing the Allocation GC safepoint will be tasked with completing all allocation requests that were announced before the safepoint. This guarantees that all announced allocation requests are either satisfied during the safepoint, or failed in case there is not enough memory to complete all requests. This effectively deals with the starvation issue and reduces the number of allocation GCs triggered. >> >> Note: The change also adopts ZList from ZGC and makes it available under utilities as DoublyLinkedList with slight modifications. >> >> Testing: Tier 1-7 > > Ivan Walulya has updated the pull request incrementally with one additional commit since the last revision: > > clean up keep alive+1 ------------- PR Comment: https://git.openjdk.org/jdk/pull/14077#issuecomment-1682616848 From shade at openjdk.org Wed Oct 4 08:02:35 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 4 Oct 2023 08:02:35 GMT Subject: RFR: 8316608: Enable parallelism in vmTestbase/gc/vector tests In-Reply-To: References: Message-ID: On Tue, 3 Oct 2023 14:02:09 GMT, Soumadipta Roy wrote: > The commit includes changes to unblock parallelism for more `hotspot:tier4` tests. in `vmTestbase/gc/vector` tests. > > Below are the before and after run comparisons: > > # Fastdebug > before: 3480.71s user 83.97s system 830% cpu 7:09.41 total > after: 2214.52s user 63.19s system 2374% cpu 1:35.94 total > > # Release > before: 1369.61s user 147.03s system 371% cpu 6:48.63 total > after: 1130.28s user 110.97s system 2478% cpu 50.089 total I think this is good, but @tschatzl and @lmesnik need to be looped in for visibility. ------------- Marked as reviewed by shade (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16028#pullrequestreview-1656903673 From aph at openjdk.org Wed Oct 4 08:31:47 2023 From: aph at openjdk.org (Andrew Haley) Date: Wed, 4 Oct 2023 08:31:47 GMT Subject: RFR: 8317446: ProblemList gc/arguments/TestNewSizeFlags.java on macosx-aarch64 in Xcomp In-Reply-To: References: Message-ID: On Tue, 3 Oct 2023 17:53:27 GMT, Daniel D. Daugherty wrote: > Trivial fixes to ProblemList noisy tests in the JDK22 CI; > > [JDK-8317446](https://bugs.openjdk.org/browse/JDK-8317446) ProblemList gc/arguments/TestNewSizeFlags.java on macosx-aarch64 in Xcomp > [JDK-8317448](https://bugs.openjdk.org/browse/JDK-8317448) ProblemList compiler/interpreter/TestVerifyStackAfterDeopt.java on macosx-aarch64 in Xcomp > [JDK-8317449](https://bugs.openjdk.org/browse/JDK-8317449) ProblemList serviceability/jvmti/stress/StackTrace/NotSuspended/GetStackTraceNotSuspendedStressTest.java on several platforms What is this all about? I can't see any link to a root cause. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16031#issuecomment-1746387857 From thartmann at openjdk.org Wed Oct 4 11:30:47 2023 From: thartmann at openjdk.org (Tobias Hartmann) Date: Wed, 4 Oct 2023 11:30:47 GMT Subject: RFR: 8317446: ProblemList gc/arguments/TestNewSizeFlags.java on macosx-aarch64 in Xcomp In-Reply-To: References: Message-ID: On Tue, 3 Oct 2023 17:53:27 GMT, Daniel D. Daugherty wrote: > Trivial fixes to ProblemList noisy tests in the JDK22 CI; > > [JDK-8317446](https://bugs.openjdk.org/browse/JDK-8317446) ProblemList gc/arguments/TestNewSizeFlags.java on macosx-aarch64 in Xcomp > [JDK-8317448](https://bugs.openjdk.org/browse/JDK-8317448) ProblemList compiler/interpreter/TestVerifyStackAfterDeopt.java on macosx-aarch64 in Xcomp > [JDK-8317449](https://bugs.openjdk.org/browse/JDK-8317449) ProblemList serviceability/jvmti/stress/StackTrace/NotSuspended/GetStackTraceNotSuspendedStressTest.java on several platforms This change is simply problem listing multiple tests. As described in the [OpenJDK Developers? Guide](https://openjdk.org/guide/#problemlisting-jtreg-tests), the problem listing issues are sub-tasks of the actual (still open) issues and are linked by Skara in the "Issues" section of this PR. The corresponding JBS issues are also listed in the problem list entries themselves. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16031#issuecomment-1746682420 From ayang at openjdk.org Wed Oct 4 11:52:41 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 4 Oct 2023 11:52:41 GMT Subject: RFR: 8310031: Parallel: Implement better work distribution for large object arrays in old gen [v12] In-Reply-To: <50VtEqmFaxK4NnbXqU54rQW8R1YrGDa6HukQOuniupE=.5a5365f1-546a-4c48-a763-9248346c6593@github.com> References: <50VtEqmFaxK4NnbXqU54rQW8R1YrGDa6HukQOuniupE=.5a5365f1-546a-4c48-a763-9248346c6593@github.com> Message-ID: On Thu, 28 Sep 2023 07:41:18 GMT, Richard Reingruber wrote: >> This pr introduces parallel scanning of large object arrays in the old generation containing roots for young collections of Parallel GC. This allows for better distribution of the actual work (following the array references) as opposed to "stealing" from other task queues which can lead to inverse scaling demonstrated by small tests (attached to JDK-8310031) and also observed in gerrit production systems. >> >> The algorithm to share scanning large arrays is supposed to be a straight >> forward extension of the scheme implemented in >> `PSCardTable::scavenge_contents_parallel`. >> >> - A worker scans the part of a large array located in its stripe >> >> - Except for the end of the large array reaching into a stripe which is scanned by the thread owning the previous stripe. This is just what the current implementation does: it skips objects crossing into the stripe. >> >> - For this it is necessary that large arrays cover at least 3 stripes (see `PSCardTable::large_obj_arr_min_words`) >> >> The implementation also makes use of the precise card marks for arrays. Only dirty regions are actually scanned. >> >> #### Performance testing >> >> ##### BigArrayInOldGenRR.java >> >> [BigArrayInOldGenRR.java](https://bugs.openjdk.org/secure/attachment/104422/BigArrayInOldGenRR.java) is a micro benchmark that assigns new objects to a large array in a loop. Creating new array elements triggers young collections. In each collection the large array is scanned because of its references to the new elements in the young generation. The benchmark score is the geometric mean of the duration of the last 5 young collections (lower is better). >> >> [BigArrayInOldGenRR.pdf](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.pdf)([BigArrayInOldGenRR.ods](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.ods)) presents the benchmark results with 1 to 64 gc threads. >> >> Observations >> >> * JDK22 scales inversely. Adding gc threads prolongues young collections. With 32 threads young collections take ~15x longer than single threaded. >> >> * Fixed JDK22 scales well. Adding gc theads reduces the duration of young collections. With 32 threads young collections are 5x shorter than single threaded. >> >> * With just 1 gc thread there is a regression. Young collections are 1.5x longer with the fix. I assume the reason is that the iteration over the array elements is interrupted at the end of a stripe which makes it less efficient. The prize for parallelization is paid ... > > Richard Reingruber has updated the pull request incrementally with one additional commit since the last revision: > > Remove stripe size adaptations and cache potentially expensive start array queries Performed additional performance testing on the latest revision of this pull request and https://github.com/openjdk/jdk/compare/master...albertnetymk:jdk:pgc-precise-obj-arr?expand=1 (made sure they were on top of the same master commit) Couldn't identify significant differences when running micro benchmarks from the JBS ticket with different gc-threads; implemented various tweaks but the distinction between the two approaches remains mostly marginal. Both methods exhibit substantial improvements over the master, as demonstrated earlier. No performance difference observed in pjbb2005 between the master, this pull request, and shadow-card-table. (I had difficulty in running `timefold`, so I asked Thomas for help about it.) The cost of malloc + memset for the shadow-card-table is ~0.26ms per 1G of old-gen (each card being 512 bytes) (raw data: 0.553169 ms for 4395946 cards). Since the shadow-card-table approach doesn't result in any noticeable regression, offers better scalability for large-array-objects, and comes with the lowest implementation complexity, I am inclined to settle for the shadow-card-table approach for now and explore more sophisticated optimizations later on. What are others' thoughts on this direction? ------------- PR Comment: https://git.openjdk.org/jdk/pull/14846#issuecomment-1746714371 From rrich at openjdk.org Wed Oct 4 11:59:39 2023 From: rrich at openjdk.org (Richard Reingruber) Date: Wed, 4 Oct 2023 11:59:39 GMT Subject: RFR: 8310031: Parallel: Implement better work distribution for large object arrays in old gen [v12] In-Reply-To: <50VtEqmFaxK4NnbXqU54rQW8R1YrGDa6HukQOuniupE=.5a5365f1-546a-4c48-a763-9248346c6593@github.com> References: <50VtEqmFaxK4NnbXqU54rQW8R1YrGDa6HukQOuniupE=.5a5365f1-546a-4c48-a763-9248346c6593@github.com> Message-ID: On Thu, 28 Sep 2023 07:41:18 GMT, Richard Reingruber wrote: >> This pr introduces parallel scanning of large object arrays in the old generation containing roots for young collections of Parallel GC. This allows for better distribution of the actual work (following the array references) as opposed to "stealing" from other task queues which can lead to inverse scaling demonstrated by small tests (attached to JDK-8310031) and also observed in gerrit production systems. >> >> The algorithm to share scanning large arrays is supposed to be a straight >> forward extension of the scheme implemented in >> `PSCardTable::scavenge_contents_parallel`. >> >> - A worker scans the part of a large array located in its stripe >> >> - Except for the end of the large array reaching into a stripe which is scanned by the thread owning the previous stripe. This is just what the current implementation does: it skips objects crossing into the stripe. >> >> - For this it is necessary that large arrays cover at least 3 stripes (see `PSCardTable::large_obj_arr_min_words`) >> >> The implementation also makes use of the precise card marks for arrays. Only dirty regions are actually scanned. >> >> #### Performance testing >> >> ##### BigArrayInOldGenRR.java >> >> [BigArrayInOldGenRR.java](https://bugs.openjdk.org/secure/attachment/104422/BigArrayInOldGenRR.java) is a micro benchmark that assigns new objects to a large array in a loop. Creating new array elements triggers young collections. In each collection the large array is scanned because of its references to the new elements in the young generation. The benchmark score is the geometric mean of the duration of the last 5 young collections (lower is better). >> >> [BigArrayInOldGenRR.pdf](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.pdf)([BigArrayInOldGenRR.ods](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.ods)) presents the benchmark results with 1 to 64 gc threads. >> >> Observations >> >> * JDK22 scales inversely. Adding gc threads prolongues young collections. With 32 threads young collections take ~15x longer than single threaded. >> >> * Fixed JDK22 scales well. Adding gc theads reduces the duration of young collections. With 32 threads young collections are 5x shorter than single threaded. >> >> * With just 1 gc thread there is a regression. Young collections are 1.5x longer with the fix. I assume the reason is that the iteration over the array elements is interrupted at the end of a stripe which makes it less efficient. The prize for parallelization is paid ... > > Richard Reingruber has updated the pull request incrementally with one additional commit since the last revision: > > Remove stripe size adaptations and cache potentially expensive start array queries We've had a public holiday yesterday. I'm still working on the version without shadow card table. It is more complex, and finding the issues is time consuming but I'd like to finish the work on it. ------------- PR Comment: https://git.openjdk.org/jdk/pull/14846#issuecomment-1746726305 From iwalulya at openjdk.org Wed Oct 4 14:12:39 2023 From: iwalulya at openjdk.org (Ivan Walulya) Date: Wed, 4 Oct 2023 14:12:39 GMT Subject: RFR: 8317354: Serial: Move DirtyCardToOopClosure to gc/serial folder In-Reply-To: <49vr8f73ylqcYr7iaxBziG_cTPkZjATyCiCHwRxSoFg=.ac721666-4eb0-45a7-b312-b8f2d90c8f91@github.com> References: <49vr8f73ylqcYr7iaxBziG_cTPkZjATyCiCHwRxSoFg=.ac721666-4eb0-45a7-b312-b8f2d90c8f91@github.com> Message-ID: <-ucg1BAbRw9VQhMzuOSmPARKhb45VOi6rv2dV2_aYK8=.7ff5b862-84d7-42f3-8b4f-cf5dc49c1ce5@github.com> On Tue, 3 Oct 2023 08:12:42 GMT, Albert Mingkun Yang wrote: > Simple relocating code to more appropriate folder. `ClearNoncleanCardWrapper` is moved to cpp as well. > > One can sth like `git diff

--color-moved=dimmed_zebra` to better review this PR. Marked as reviewed by iwalulya (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/16025#pullrequestreview-1657635747 From ayang at openjdk.org Wed Oct 4 14:18:55 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 4 Oct 2023 14:18:55 GMT Subject: RFR: 8317354: Serial: Move DirtyCardToOopClosure to gc/serial folder In-Reply-To: <49vr8f73ylqcYr7iaxBziG_cTPkZjATyCiCHwRxSoFg=.ac721666-4eb0-45a7-b312-b8f2d90c8f91@github.com> References: <49vr8f73ylqcYr7iaxBziG_cTPkZjATyCiCHwRxSoFg=.ac721666-4eb0-45a7-b312-b8f2d90c8f91@github.com> Message-ID: On Tue, 3 Oct 2023 08:12:42 GMT, Albert Mingkun Yang wrote: > Simple relocating code to more appropriate folder. `ClearNoncleanCardWrapper` is moved to cpp as well. > > One can sth like `git diff

--color-moved=dimmed_zebra` to better review this PR. Thanks for the review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16025#issuecomment-1746966388 From ayang at openjdk.org Wed Oct 4 14:18:56 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 4 Oct 2023 14:18:56 GMT Subject: Integrated: 8317354: Serial: Move DirtyCardToOopClosure to gc/serial folder In-Reply-To: <49vr8f73ylqcYr7iaxBziG_cTPkZjATyCiCHwRxSoFg=.ac721666-4eb0-45a7-b312-b8f2d90c8f91@github.com> References: <49vr8f73ylqcYr7iaxBziG_cTPkZjATyCiCHwRxSoFg=.ac721666-4eb0-45a7-b312-b8f2d90c8f91@github.com> Message-ID: On Tue, 3 Oct 2023 08:12:42 GMT, Albert Mingkun Yang wrote: > Simple relocating code to more appropriate folder. `ClearNoncleanCardWrapper` is moved to cpp as well. > > One can sth like `git diff

--color-moved=dimmed_zebra` to better review this PR. This pull request has now been integrated. Changeset: 4195246f Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/4195246fba721934f2b2c0525b1d5b2fe4b08122 Stats: 362 lines in 4 files changed: 181 ins; 181 del; 0 mod 8317354: Serial: Move DirtyCardToOopClosure to gc/serial folder Reviewed-by: tschatzl, iwalulya ------------- PR: https://git.openjdk.org/jdk/pull/16025 From jwaters at openjdk.org Thu Oct 5 10:12:26 2023 From: jwaters at openjdk.org (Julian Waters) Date: Thu, 5 Oct 2023 10:12:26 GMT Subject: RFR: 8315880: change LockingMode default from LM_LEGACY to LM_LIGHTWEIGHT In-Reply-To: References: Message-ID: On Mon, 18 Sep 2023 20:47:13 GMT, Daniel D. Daugherty wrote: > Change the default of LockingMode to LM_LIGHTWEIGHT from LM_LEGACY. > > This fix has been tested with 3 Mach5 Tier[1-8] runs and a 4th is in process. I think this may have broken JDK-8288293 ? Is there any compiler specific implementation involved with the new lightweight locking? Just a quick question, I can figure out the rest on my own ------------- PR Comment: https://git.openjdk.org/jdk/pull/15797#issuecomment-1748553071 From rrich at openjdk.org Thu Oct 5 10:22:45 2023 From: rrich at openjdk.org (Richard Reingruber) Date: Thu, 5 Oct 2023 10:22:45 GMT Subject: RFR: 8310031: Parallel: Implement better work distribution for large object arrays in old gen [v13] In-Reply-To: References: Message-ID: > This pr introduces parallel scanning of large object arrays in the old generation containing roots for young collections of Parallel GC. This allows for better distribution of the actual work (following the array references) as opposed to "stealing" from other task queues which can lead to inverse scaling demonstrated by small tests (attached to JDK-8310031) and also observed in gerrit production systems. > > The algorithm to share scanning large arrays is supposed to be a straight > forward extension of the scheme implemented in > `PSCardTable::scavenge_contents_parallel`. > > - A worker scans the part of a large array located in its stripe > > - Except for the end of the large array reaching into a stripe which is scanned by the thread owning the previous stripe. This is just what the current implementation does: it skips objects crossing into the stripe. > > - For this it is necessary that large arrays cover at least 3 stripes (see `PSCardTable::large_obj_arr_min_words`) > > The implementation also makes use of the precise card marks for arrays. Only dirty regions are actually scanned. > > #### Performance testing > > ##### BigArrayInOldGenRR.java > > [BigArrayInOldGenRR.java](https://bugs.openjdk.org/secure/attachment/104422/BigArrayInOldGenRR.java) is a micro benchmark that assigns new objects to a large array in a loop. Creating new array elements triggers young collections. In each collection the large array is scanned because of its references to the new elements in the young generation. The benchmark score is the geometric mean of the duration of the last 5 young collections (lower is better). > > [BigArrayInOldGenRR.pdf](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.pdf)([BigArrayInOldGenRR.ods](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.ods)) presents the benchmark results with 1 to 64 gc threads. > > Observations > > * JDK22 scales inversely. Adding gc threads prolongues young collections. With 32 threads young collections take ~15x longer than single threaded. > > * Fixed JDK22 scales well. Adding gc theads reduces the duration of young collections. With 32 threads young collections are 5x shorter than single threaded. > > * With just 1 gc thread there is a regression. Young collections are 1.5x longer with the fix. I assume the reason is that the iteration over the array elements is interrupted at the end of a stripe which makes it less efficient. The prize for parallelization is paid without actually doing it. Also ParallelGC will use at lea... Richard Reingruber has updated the pull request incrementally with two additional commits since the last revision: - Split work strictly at stripe boundaries - Reset to master ------------- Changes: - all: https://git.openjdk.org/jdk/pull/14846/files - new: https://git.openjdk.org/jdk/pull/14846/files/50737dda..817b164c Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=14846&range=12 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=14846&range=11-12 Stats: 505 lines in 8 files changed: 197 ins; 272 del; 36 mod Patch: https://git.openjdk.org/jdk/pull/14846.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/14846/head:pull/14846 PR: https://git.openjdk.org/jdk/pull/14846 From rrich at openjdk.org Thu Oct 5 10:23:13 2023 From: rrich at openjdk.org (Richard Reingruber) Date: Thu, 5 Oct 2023 10:23:13 GMT Subject: RFR: 8310031: Parallel: Implement better work distribution for large object arrays in old gen [v12] In-Reply-To: <50VtEqmFaxK4NnbXqU54rQW8R1YrGDa6HukQOuniupE=.5a5365f1-546a-4c48-a763-9248346c6593@github.com> References: <50VtEqmFaxK4NnbXqU54rQW8R1YrGDa6HukQOuniupE=.5a5365f1-546a-4c48-a763-9248346c6593@github.com> Message-ID: On Thu, 28 Sep 2023 07:41:18 GMT, Richard Reingruber wrote: >> This pr introduces parallel scanning of large object arrays in the old generation containing roots for young collections of Parallel GC. This allows for better distribution of the actual work (following the array references) as opposed to "stealing" from other task queues which can lead to inverse scaling demonstrated by small tests (attached to JDK-8310031) and also observed in gerrit production systems. >> >> The algorithm to share scanning large arrays is supposed to be a straight >> forward extension of the scheme implemented in >> `PSCardTable::scavenge_contents_parallel`. >> >> - A worker scans the part of a large array located in its stripe >> >> - Except for the end of the large array reaching into a stripe which is scanned by the thread owning the previous stripe. This is just what the current implementation does: it skips objects crossing into the stripe. >> >> - For this it is necessary that large arrays cover at least 3 stripes (see `PSCardTable::large_obj_arr_min_words`) >> >> The implementation also makes use of the precise card marks for arrays. Only dirty regions are actually scanned. >> >> #### Performance testing >> >> ##### BigArrayInOldGenRR.java >> >> [BigArrayInOldGenRR.java](https://bugs.openjdk.org/secure/attachment/104422/BigArrayInOldGenRR.java) is a micro benchmark that assigns new objects to a large array in a loop. Creating new array elements triggers young collections. In each collection the large array is scanned because of its references to the new elements in the young generation. The benchmark score is the geometric mean of the duration of the last 5 young collections (lower is better). >> >> [BigArrayInOldGenRR.pdf](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.pdf)([BigArrayInOldGenRR.ods](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.ods)) presents the benchmark results with 1 to 64 gc threads. >> >> Observations >> >> * JDK22 scales inversely. Adding gc threads prolongues young collections. With 32 threads young collections take ~15x longer than single threaded. >> >> * Fixed JDK22 scales well. Adding gc theads reduces the duration of young collections. With 32 threads young collections are 5x shorter than single threaded. >> >> * With just 1 gc thread there is a regression. Young collections are 1.5x longer with the fix. I assume the reason is that the iteration over the array elements is interrupted at the end of a stripe which makes it less efficient. The prize for parallelization is paid ... > > Richard Reingruber has updated the pull request incrementally with one additional commit since the last revision: > > Remove stripe size adaptations and cache potentially expensive start array queries With the last version the work is strictly limited to stripes. * Work partitioning happens at stripe boundaries splitting also non-array objects if necessary. * A worker thread accesses only card table entries corresponding to the stripe it's working on. * Object arrays are scanned precisely. * Non-object arrays are not scanned precisely. * The implementation is inspired by Albert's work (especially the `ObjStartCache` which prevents `ObjectStartArray` performance issues with very large arrays). It does not duplicate the card table though. Instead it copies imprecise card marks from (non-array) object start to the first card of stripes reached (see `PSCardTable::pre_scavenge`). * `ObjStartCache` is a class because I need to pass it by value to `find_first_clean_card`. I haven't seen a significant difference between this version, Albert's work and the baseline running the card_scan* tests (except for the improvement over the baseline of course). The new test [`card_scan_big_instances.java`](https://bugs.openjdk.org/secure/attachment/106702/card_scan_big_instances.java) fills the old generation with large (32K) non-array instances. It shows the costs of copying imprecise card marks or the complete card table: Baseline -------- $ H=64g; T=16 ; jdk-baseline/bin/java -Xms${H} -Xmx${H} -XX:+UseParallelGC -XX:ParallelGCThreads=$T -Xlog:gc=trace card_scan_big_instances 24 [0.005s][info][gc] Using Parallel BIG_INSTANCE_SIZE_BYTES:32768 (32K) bigInstancesCount:786432 [6.439s][trace][gc] GC(0) PSYoung generation size at maximum: 22369280K [6.439s][info ][gc] GC(0) Pause Young (Allocation Failure) 16384M->14095M(62805M) 2965.766ms ### System.gc [10.871s][trace][gc] GC(1) PSYoung generation size at maximum: 22369280K [10.871s][info ][gc] GC(1) Pause Young (System.gc()) 24926M->24600M(62805M) 2785.957ms [11.835s][info ][gc] GC(2) Pause Full (System.gc()) 24600M->24600M(62805M) 963.882ms ### System.gc done [14.442s][trace][gc] GC(3) PSYoung generation size at maximum: 22369280K [14.442s][info ][gc] GC(3) Pause Young (Allocation Failure) 40984M->24600M(62805M) 28.970ms [16.967s][trace][gc] GC(4) PSYoung generation size at maximum: 22369280K [16.967s][info ][gc] GC(4) Pause Young (Allocation Failure) 40984M->24600M(62805M) 30.074ms [19.490s][trace][gc] GC(5) PSYoung generation size at maximum: 22369280K [19.490s][info ][gc] GC(5) Pause Young (Allocation Failure) 40984M->24600M(62805M) 30.643ms Albert's first draft -------------------- $ H=64g; T=16 ; jdk-alb/bin/java -Xms${H} -Xmx${H} -XX:+UseParallelGC -XX:ParallelGCThreads=$T -Xlog:gc=trace card_scan_big_instances 24 [0.005s][info][gc] Using Parallel BIG_INSTANCE_SIZE_BYTES:32768 (32K) bigInstancesCount:786432 [6.410s][trace][gc] GC(0) PSYoung generation size at maximum: 22369280K [6.410s][info ][gc] GC(0) Pause Young (Allocation Failure) 16384M->14095M(62805M) 2957.403ms ### System.gc [10.734s][trace][gc] GC(1) PSYoung generation size at maximum: 22369280K [10.734s][info ][gc] GC(1) Pause Young (System.gc()) 24926M->24600M(62805M) 2656.551ms [11.713s][info ][gc] GC(2) Pause Full (System.gc()) 24600M->24600M(62805M) 978.694ms ### System.gc done [14.587s][trace][gc] GC(3) PSYoung generation size at maximum: 22369280K [14.587s][info ][gc] GC(3) Pause Young (Allocation Failure) 40984M->24600M(62805M) 74.870ms [17.132s][trace][gc] GC(4) PSYoung generation size at maximum: 22369280K [17.132s][info ][gc] GC(4) Pause Young (Allocation Failure) 40984M->24600M(62805M) 74.031ms [19.678s][trace][gc] GC(5) PSYoung generation size at maximum: 22369280K [19.678s][info ][gc] GC(5) Pause Young (Allocation Failure) 40984M->24600M(62805M) 71.357ms New --- $ H=64g; T=16 ; jdk-new/bin/java -Xms${H} -Xmx${H} -XX:+UseParallelGC -XX:ParallelGCThreads=$T -Xlog:gc=trace card_scan_big_instances 24 [0.006s][info][gc] Using Parallel BIG_INSTANCE_SIZE_BYTES:32768 (32K) bigInstancesCount:786432 [5.643s][trace][gc] GC(0) PSYoung generation size at maximum: 22369280K [5.643s][info ][gc] GC(0) Pause Young (Allocation Failure) 16384M->14095M(62805M) 2196.615ms ### System.gc [8.875s][trace][gc] GC(1) PSYoung generation size at maximum: 22369280K [8.875s][info ][gc] GC(1) Pause Young (System.gc()) 24926M->24601M(62805M) 1432.776ms [10.303s][info ][gc] GC(2) Pause Full (System.gc()) 24601M->24600M(62805M) 1428.142ms ### System.gc done [13.464s][trace][gc] GC(3) PSYoung generation size at maximum: 22369280K [13.464s][info ][gc] GC(3) Pause Young (Allocation Failure) 40984M->24600M(62805M) 106.833ms [15.929s][trace][gc] GC(4) PSYoung generation size at maximum: 22369280K [15.929s][info ][gc] GC(4) Pause Young (Allocation Failure) 40984M->24600M(62805M) 103.752ms [18.397s][trace][gc] GC(5) PSYoung generation size at maximum: 22369280K [18.397s][info ][gc] GC(5) Pause Young (Allocation Failure) 40984M->24600M(62805M) 106.153ms ------------- PR Comment: https://git.openjdk.org/jdk/pull/14846#issuecomment-1748600398 From rkennke at openjdk.org Thu Oct 5 10:23:27 2023 From: rkennke at openjdk.org (Roman Kennke) Date: Thu, 5 Oct 2023 10:23:27 GMT Subject: RFR: 8315880: change LockingMode default from LM_LEGACY to LM_LIGHTWEIGHT In-Reply-To: References:

Message-ID: <3pcyxBEm99WQE_cOsMPRu5sqFS9altVDvks4BqRw3nY=.37f3a180-f800-4b2c-bf4e-9e4d40e5e898@github.com> On Thu, 5 Oct 2023 10:09:33 GMT, Julian Waters wrote: > I think this may have broken JDK-8288293 ? > > Is there any compiler specific implementation involved with the new lightweight locking? Just a quick question, I can figure out the rest on my own Not that I am aware of. AFAIK, it works fine with various versions of GCC on Linux, Xcode on MacOS and VS on Windows. ------------- PR Comment: https://git.openjdk.org/jdk/pull/15797#issuecomment-1748605692 From tschatzl at openjdk.org Thu Oct 5 10:35:10 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 5 Oct 2023 10:35:10 GMT Subject: RFR: 8317358: G1: Make TestMaxNewSize use createTestJvm In-Reply-To: References: Message-ID: <_pmezp7zYPvMSNwaAHR71gZV3tnGp8vIu7LYYEqeCeM=.f8602dc1-ed70-4d17-9d32-4a36efc17c9b@github.com> On Mon, 2 Oct 2023 15:09:49 GMT, Leo Korinth wrote: > In addition remove deprecated `Long(long)` constructor and rewrite `compareTo` to use `> 0` instead of `== 1`. > > Also remove unused `isRunningG1(String[] args)` and `checkIncompatibleNewSize(String[] flags)` > > Minimal testing completed, will run tier testing before pushing. Good. ------------- Marked as reviewed by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16012#pullrequestreview-1659504330 From tschatzl at openjdk.org Thu Oct 5 10:37:11 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 5 Oct 2023 10:37:11 GMT Subject: RFR: 8317347: Parallel: Make TestInitialTenuringThreshold use createTestJvm In-Reply-To: References: Message-ID: <28X1pwXMd9BwfV7BaJ2xvspvvykSb95UTkw8lDUMSUA=.d8ccbcac-7642-459c-9331-5dbc99987c18@github.com> On Mon, 2 Oct 2023 11:48:18 GMT, Leo Korinth wrote: > Also add parallel flag for first invocation (this is a bug). Initial testing ok, but will run tiers before pushing. Seems good. ------------- Marked as reviewed by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16009#pullrequestreview-1659507211 From tschatzl at openjdk.org Thu Oct 5 10:42:10 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 5 Oct 2023 10:42:10 GMT Subject: RFR: 8317318: Serial: Change GenCollectedHeap to SerialHeap in whitebox [v2] In-Reply-To: References:

Message-ID: On Sat, 30 Sep 2023 17:27:42 GMT, Albert Mingkun Yang wrote: >> Use more precise type for Serial GC. I also added a ` ShouldNotReachHere()` there, because using serial-heap when serial-gc is not used seems problematic. > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains one commit: > > s1-prims lgtm ------------- Marked as reviewed by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/15988#pullrequestreview-1659517772 From ayang at openjdk.org Thu Oct 5 12:37:10 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 5 Oct 2023 12:37:10 GMT Subject: RFR: 8317592: Serial: Remove Space::toContiguousSpace Message-ID: Simple removing unnecessary abstraction after using more precise type. ------------- Commit messages: - s1-remove-to-contiguous Changes: https://git.openjdk.org/jdk/pull/16054/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16054&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8317592 Stats: 15 lines in 2 files changed: 0 ins; 11 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/16054.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16054/head:pull/16054 PR: https://git.openjdk.org/jdk/pull/16054 From ayang at openjdk.org Thu Oct 5 12:43:40 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 5 Oct 2023 12:43:40 GMT Subject: RFR: 8317594: G1: Refactor find_empty_from_idx_reverse Message-ID: Simple range boundary value adjustment to remove some redundant operations inside/around this method. ------------- Commit messages: - g1-hr-manager Changes: https://git.openjdk.org/jdk/pull/16055/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16055&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8317594 Stats: 22 lines in 2 files changed: 5 ins; 0 del; 17 mod Patch: https://git.openjdk.org/jdk/pull/16055.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16055/head:pull/16055 PR: https://git.openjdk.org/jdk/pull/16055 From tschatzl at openjdk.org Thu Oct 5 14:24:31 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 5 Oct 2023 14:24:31 GMT Subject: RFR: 8317594: G1: Refactor find_empty_from_idx_reverse In-Reply-To: References: Message-ID: On Thu, 5 Oct 2023 12:34:12 GMT, Albert Mingkun Yang wrote: > Simple range boundary value adjustment to remove some redundant operations inside/around this method. Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/16055#pullrequestreview-1659929400 From tschatzl at openjdk.org Thu Oct 5 14:25:00 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 5 Oct 2023 14:25:00 GMT Subject: RFR: 8317592: Serial: Remove Space::toContiguousSpace In-Reply-To: References: Message-ID: On Thu, 5 Oct 2023 12:28:23 GMT, Albert Mingkun Yang wrote: > Simple removing unnecessary abstraction after using more precise type. Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/16054#pullrequestreview-1659931734 From rrich at openjdk.org Thu Oct 5 15:04:54 2023 From: rrich at openjdk.org (Richard Reingruber) Date: Thu, 5 Oct 2023 15:04:54 GMT Subject: RFR: 8310031: Parallel: Implement better work distribution for large object arrays in old gen [v13] In-Reply-To: References:

Message-ID: On Thu, 5 Oct 2023 10:22:45 GMT, Richard Reingruber wrote: >> This pr introduces parallel scanning of large object arrays in the old generation containing roots for young collections of Parallel GC. This allows for better distribution of the actual work (following the array references) as opposed to "stealing" from other task queues which can lead to inverse scaling demonstrated by small tests (attached to JDK-8310031) and also observed in gerrit production systems. >> >> The algorithm to share scanning large arrays is supposed to be a straight >> forward extension of the scheme implemented in >> `PSCardTable::scavenge_contents_parallel`. >> >> - A worker scans the part of a large array located in its stripe >> >> - Except for the end of the large array reaching into a stripe which is scanned by the thread owning the previous stripe. This is just what the current implementation does: it skips objects crossing into the stripe. >> >> - For this it is necessary that large arrays cover at least 3 stripes (see `PSCardTable::large_obj_arr_min_words`) >> >> The implementation also makes use of the precise card marks for arrays. Only dirty regions are actually scanned. >> >> #### Performance testing >> >> ##### BigArrayInOldGenRR.java >> >> [BigArrayInOldGenRR.java](https://bugs.openjdk.org/secure/attachment/104422/BigArrayInOldGenRR.java) is a micro benchmark that assigns new objects to a large array in a loop. Creating new array elements triggers young collections. In each collection the large array is scanned because of its references to the new elements in the young generation. The benchmark score is the geometric mean of the duration of the last 5 young collections (lower is better). >> >> [BigArrayInOldGenRR.pdf](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.pdf)([BigArrayInOldGenRR.ods](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.ods)) presents the benchmark results with 1 to 64 gc threads. >> >> Observations >> >> * JDK22 scales inversely. Adding gc threads prolongues young collections. With 32 threads young collections take ~15x longer than single threaded. >> >> * Fixed JDK22 scales well. Adding gc theads reduces the duration of young collections. With 32 threads young collections are 5x shorter than single threaded. >> >> * With just 1 gc thread there is a regression. Young collections are 1.5x longer with the fix. I assume the reason is that the iteration over the array elements is interrupted at the end of a stripe which makes it less efficient. The prize for parallelization is paid ... > > Richard Reingruber has updated the pull request incrementally with two additional commits since the last revision: > > - Split work strictly at stripe boundaries > - Reset to master The difference to the baseline in the `card_scan_big_instances.java` test is < 5ms when the card mark copying to the stripes is done in parallel. It would be possible to improve this further: once a thread has completed its part of the copying it could begin scanning its stripes. Not sure if it's worth it. Testing: langtools:tier1 TEST_VM_OPTS="-XX:+UseParallelGC" hotspot:tier1 ------------- PR Comment: https://git.openjdk.org/jdk/pull/14846#issuecomment-1749053411 From rrich at openjdk.org Thu Oct 5 15:24:30 2023 From: rrich at openjdk.org (Richard Reingruber) Date: Thu, 5 Oct 2023 15:24:30 GMT Subject: RFR: 8310031: Parallel: Implement better work distribution for large object arrays in old gen [v14] In-Reply-To: References: Message-ID: > This pr introduces parallel scanning of large object arrays in the old generation containing roots for young collections of Parallel GC. This allows for better distribution of the actual work (following the array references) as opposed to "stealing" from other task queues which can lead to inverse scaling demonstrated by small tests (attached to JDK-8310031) and also observed in gerrit production systems. > > The algorithm to share scanning large arrays is supposed to be a straight > forward extension of the scheme implemented in > `PSCardTable::scavenge_contents_parallel`. > > - A worker scans the part of a large array located in its stripe > > - Except for the end of the large array reaching into a stripe which is scanned by the thread owning the previous stripe. This is just what the current implementation does: it skips objects crossing into the stripe. > > - For this it is necessary that large arrays cover at least 3 stripes (see `PSCardTable::large_obj_arr_min_words`) > > The implementation also makes use of the precise card marks for arrays. Only dirty regions are actually scanned. > > #### Performance testing > > ##### BigArrayInOldGenRR.java > > [BigArrayInOldGenRR.java](https://bugs.openjdk.org/secure/attachment/104422/BigArrayInOldGenRR.java) is a micro benchmark that assigns new objects to a large array in a loop. Creating new array elements triggers young collections. In each collection the large array is scanned because of its references to the new elements in the young generation. The benchmark score is the geometric mean of the duration of the last 5 young collections (lower is better). > > [BigArrayInOldGenRR.pdf](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.pdf)([BigArrayInOldGenRR.ods](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.ods)) presents the benchmark results with 1 to 64 gc threads. > > Observations > > * JDK22 scales inversely. Adding gc threads prolongues young collections. With 32 threads young collections take ~15x longer than single threaded. > > * Fixed JDK22 scales well. Adding gc theads reduces the duration of young collections. With 32 threads young collections are 5x shorter than single threaded. > > * With just 1 gc thread there is a regression. Young collections are 1.5x longer with the fix. I assume the reason is that the iteration over the array elements is interrupted at the end of a stripe which makes it less efficient. The prize for parallelization is paid without actually doing it. Also ParallelGC will use at lea... Richard Reingruber has updated the pull request incrementally with one additional commit since the last revision: Parallel copying of imprecise marks to stripes ------------- Changes: - all: https://git.openjdk.org/jdk/pull/14846/files - new: https://git.openjdk.org/jdk/pull/14846/files/817b164c..22fe8496 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=14846&range=13 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=14846&range=12-13 Stats: 61 lines in 3 files changed: 32 ins; 23 del; 6 mod Patch: https://git.openjdk.org/jdk/pull/14846.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/14846/head:pull/14846 PR: https://git.openjdk.org/jdk/pull/14846 From dcubed at openjdk.org Thu Oct 5 19:59:11 2023 From: dcubed at openjdk.org (Daniel D. Daugherty) Date: Thu, 5 Oct 2023 19:59:11 GMT Subject: RFR: 8315880: change LockingMode default from LM_LEGACY to LM_LIGHTWEIGHT In-Reply-To: References: Message-ID: On Mon, 18 Sep 2023 20:47:13 GMT, Daniel D. Daugherty wrote: > Change the default of LockingMode to LM_LIGHTWEIGHT from LM_LEGACY. > > This fix has been tested with 3 Mach5 Tier[1-8] runs and a 4th is in process. That would be unexpected breakage indeed... ------------- PR Comment: https://git.openjdk.org/jdk/pull/15797#issuecomment-1749557286 From jwaters at openjdk.org Fri Oct 6 05:06:12 2023 From: jwaters at openjdk.org (Julian Waters) Date: Fri, 6 Oct 2023 05:06:12 GMT Subject: RFR: 8315880: change LockingMode default from LM_LEGACY to LM_LIGHTWEIGHT In-Reply-To: References:

Message-ID: On Thu, 5 Oct 2023 19:56:00 GMT, Daniel D. Daugherty wrote: > That would be unexpected breakage indeed... Haha, indeed. Since this was just a default flag change, it's likely the entire thing was broken on my end to begin with, just that it never manifested since I never used the lightweight mode > > I think this may have broken JDK-8288293 ? > > Is there any compiler specific implementation involved with the new lightweight locking? Just a quick question, I can figure out the rest on my own > > Not that I am aware of. AFAIK, it works fine with various versions of GCC on Linux, Xcode on MacOS and VS on Windows. Not with gcc on Windows, unfortunately. Looks like I have some work cut out for me. Thanks for the replies ------------- PR Comment: https://git.openjdk.org/jdk/pull/15797#issuecomment-1749988003 From rrich at openjdk.org Fri Oct 6 11:17:12 2023 From: rrich at openjdk.org (Richard Reingruber) Date: Fri, 6 Oct 2023 11:17:12 GMT Subject: RFR: 8310031: Parallel: Implement better work distribution for large object arrays in old gen [v15] In-Reply-To: References: Message-ID: > This pr introduces parallel scanning of large object arrays in the old generation containing roots for young collections of Parallel GC. This allows for better distribution of the actual work (following the array references) as opposed to "stealing" from other task queues which can lead to inverse scaling demonstrated by small tests (attached to JDK-8310031) and also observed in gerrit production systems. > > The algorithm to share scanning large arrays is supposed to be a straight > forward extension of the scheme implemented in > `PSCardTable::scavenge_contents_parallel`. > > - A worker scans the part of a large array located in its stripe > > - Except for the end of the large array reaching into a stripe which is scanned by the thread owning the previous stripe. This is just what the current implementation does: it skips objects crossing into the stripe. > > - For this it is necessary that large arrays cover at least 3 stripes (see `PSCardTable::large_obj_arr_min_words`) > > The implementation also makes use of the precise card marks for arrays. Only dirty regions are actually scanned. > > #### Performance testing > > ##### BigArrayInOldGenRR.java > > [BigArrayInOldGenRR.java](https://bugs.openjdk.org/secure/attachment/104422/BigArrayInOldGenRR.java) is a micro benchmark that assigns new objects to a large array in a loop. Creating new array elements triggers young collections. In each collection the large array is scanned because of its references to the new elements in the young generation. The benchmark score is the geometric mean of the duration of the last 5 young collections (lower is better). > > [BigArrayInOldGenRR.pdf](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.pdf)([BigArrayInOldGenRR.ods](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.ods)) presents the benchmark results with 1 to 64 gc threads. > > Observations > > * JDK22 scales inversely. Adding gc threads prolongues young collections. With 32 threads young collections take ~15x longer than single threaded. > > * Fixed JDK22 scales well. Adding gc theads reduces the duration of young collections. With 32 threads young collections are 5x shorter than single threaded. > > * With just 1 gc thread there is a regression. Young collections are 1.5x longer with the fix. I assume the reason is that the iteration over the array elements is interrupted at the end of a stripe which makes it less efficient. The prize for parallelization is paid without actually doing it. Also ParallelGC will use at lea... Richard Reingruber has updated the pull request incrementally with two additional commits since the last revision: - Missed acquire semantics - Overlap scavenge with pre-scavenge ------------- Changes: - all: https://git.openjdk.org/jdk/pull/14846/files - new: https://git.openjdk.org/jdk/pull/14846/files/22fe8496..d845e650 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=14846&range=14 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=14846&range=13-14 Stats: 114 lines in 3 files changed: 80 ins; 26 del; 8 mod Patch: https://git.openjdk.org/jdk/pull/14846.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/14846/head:pull/14846 PR: https://git.openjdk.org/jdk/pull/14846 From rrich at openjdk.org Fri Oct 6 11:17:25 2023 From: rrich at openjdk.org (Richard Reingruber) Date: Fri, 6 Oct 2023 11:17:25 GMT Subject: RFR: 8310031: Parallel: Implement better work distribution for large object arrays in old gen [v14] In-Reply-To: References:

Message-ID: On Thu, 5 Oct 2023 15:24:30 GMT, Richard Reingruber wrote: >> This pr introduces parallel scanning of large object arrays in the old generation containing roots for young collections of Parallel GC. This allows for better distribution of the actual work (following the array references) as opposed to "stealing" from other task queues which can lead to inverse scaling demonstrated by small tests (attached to JDK-8310031) and also observed in gerrit production systems. >> >> The algorithm to share scanning large arrays is supposed to be a straight >> forward extension of the scheme implemented in >> `PSCardTable::scavenge_contents_parallel`. >> >> - A worker scans the part of a large array located in its stripe >> >> - Except for the end of the large array reaching into a stripe which is scanned by the thread owning the previous stripe. This is just what the current implementation does: it skips objects crossing into the stripe. >> >> - For this it is necessary that large arrays cover at least 3 stripes (see `PSCardTable::large_obj_arr_min_words`) >> >> The implementation also makes use of the precise card marks for arrays. Only dirty regions are actually scanned. >> >> #### Performance testing >> >> ##### BigArrayInOldGenRR.java >> >> [BigArrayInOldGenRR.java](https://bugs.openjdk.org/secure/attachment/104422/BigArrayInOldGenRR.java) is a micro benchmark that assigns new objects to a large array in a loop. Creating new array elements triggers young collections. In each collection the large array is scanned because of its references to the new elements in the young generation. The benchmark score is the geometric mean of the duration of the last 5 young collections (lower is better). >> >> [BigArrayInOldGenRR.pdf](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.pdf)([BigArrayInOldGenRR.ods](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.ods)) presents the benchmark results with 1 to 64 gc threads. >> >> Observations >> >> * JDK22 scales inversely. Adding gc threads prolongues young collections. With 32 threads young collections take ~15x longer than single threaded. >> >> * Fixed JDK22 scales well. Adding gc theads reduces the duration of young collections. With 32 threads young collections are 5x shorter than single threaded. >> >> * With just 1 gc thread there is a regression. Young collections are 1.5x longer with the fix. I assume the reason is that the iteration over the array elements is interrupted at the end of a stripe which makes it less efficient. The prize for parallelization is paid ... > > Richard Reingruber has updated the pull request incrementally with one additional commit since the last revision: > > Parallel copying of imprecise marks to stripes I think it would be possible to combine the two approaches: we would have a read only copy of the card table only for the current stripe. This would reduce the required extra memory to just `num_cards_in_stripe * active_workers` (128 * active_workers) bytes. The readonly copy could be a local variable (on stack) in `scavenge_contents_parallel`. What do you think? ------------- PR Comment: https://git.openjdk.org/jdk/pull/14846#issuecomment-1750252198 From coleenp at openjdk.org Fri Oct 6 11:51:46 2023 From: coleenp at openjdk.org (Coleen Phillimore) Date: Fri, 6 Oct 2023 11:51:46 GMT Subject: RFR: 8317440: Lock rank checking fails when code root set is modified with the Servicelock held after JDK-8315503 In-Reply-To: References: Message-ID: On Thu, 5 Oct 2023 17:19:35 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that fixes lock ranking after recent changes to the code root set, now using a CHT. > > The issue came up because the lock rank of the CHT lock has been larger than the rank of the Servicethread_lock where it is possible that code roots can be added. > > The suggested solution is to fix up the lock rankings to work; actually this PR contains two variants: > 1) one that statically sets the lock ranks of the CHT lock (and the ThreadSMR_lock that can be used during CHT operation) to something smaller than Servicethread_lock. > 2) one that allows setting of the CHT lock rank via parameter as well (the last commit changed the code to variant 1). > > The other lock ranking changes to Metaspace_lock and ContinuationRelativize_lock are simply undos of the respective changes in [JDK-8315503](https://bugs.openjdk.org/browse/JDK-8315503). > > Testing: tier1-8 for variant 2), tier 1-7 for variant 1) > > Thanks, > Thomas Variant 1 seems ok. Uses of the CHT shouldn't take locks, so having a low lock ranking for CHT lock seems like it'll be fine (I can't find where it takes the ThreadsSMRDelete_lock). If any of this breaks, we can try approach #2 next. ------------- Marked as reviewed by coleenp (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16062#pullrequestreview-1661696681 From iwalulya at openjdk.org Fri Oct 6 11:58:44 2023 From: iwalulya at openjdk.org (Ivan Walulya) Date: Fri, 6 Oct 2023 11:58:44 GMT Subject: RFR: 8317343: GC: Make TestHeapFreeRatio use createTestJvm In-Reply-To: <90LqPWddFASKjNrWHZVoKgqHHAgrUBg0FUFnqtMJDCw=.012774fa-388e-4ef2-af4f-059c1a9ec41b@github.com> References: <90LqPWddFASKjNrWHZVoKgqHHAgrUBg0FUFnqtMJDCw=.012774fa-388e-4ef2-af4f-059c1a9ec41b@github.com> Message-ID: On Mon, 2 Oct 2023 09:57:55 GMT, Leo Korinth wrote: > This fix is implicitly dependent on https://github.com/openjdk/jdk/pull/15986/files for `@requires opt.x` support. Initial testing passes, but I will do more (tier) testing before pushing. Marked as reviewed by iwalulya (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/16007#pullrequestreview-1661706197 From iwalulya at openjdk.org Fri Oct 6 11:58:47 2023 From: iwalulya at openjdk.org (Ivan Walulya) Date: Fri, 6 Oct 2023 11:58:47 GMT Subject: RFR: 8317318: Serial: Change GenCollectedHeap to SerialHeap in whitebox [v2] In-Reply-To: References:

Message-ID: On Fri, 6 Oct 2023 11:17:12 GMT, Richard Reingruber wrote: >> This pr introduces parallel scanning of large object arrays in the old generation containing roots for young collections of Parallel GC. This allows for better distribution of the actual work (following the array references) as opposed to "stealing" from other task queues which can lead to inverse scaling demonstrated by small tests (attached to JDK-8310031) and also observed in gerrit production systems. >> >> The algorithm to share scanning large arrays is supposed to be a straight >> forward extension of the scheme implemented in >> `PSCardTable::scavenge_contents_parallel`. >> >> - A worker scans the part of a large array located in its stripe >> >> - Except for the end of the large array reaching into a stripe which is scanned by the thread owning the previous stripe. This is just what the current implementation does: it skips objects crossing into the stripe. >> >> - For this it is necessary that large arrays cover at least 3 stripes (see `PSCardTable::large_obj_arr_min_words`) >> >> The implementation also makes use of the precise card marks for arrays. Only dirty regions are actually scanned. >> >> #### Performance testing >> >> ##### BigArrayInOldGenRR.java >> >> [BigArrayInOldGenRR.java](https://bugs.openjdk.org/secure/attachment/104422/BigArrayInOldGenRR.java) is a micro benchmark that assigns new objects to a large array in a loop. Creating new array elements triggers young collections. In each collection the large array is scanned because of its references to the new elements in the young generation. The benchmark score is the geometric mean of the duration of the last 5 young collections (lower is better). >> >> [BigArrayInOldGenRR.pdf](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.pdf)([BigArrayInOldGenRR.ods](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.ods)) presents the benchmark results with 1 to 64 gc threads. >> >> Observations >> >> * JDK22 scales inversely. Adding gc threads prolongues young collections. With 32 threads young collections take ~15x longer than single threaded. >> >> * Fixed JDK22 scales well. Adding gc theads reduces the duration of young collections. With 32 threads young collections are 5x shorter than single threaded. >> >> * With just 1 gc thread there is a regression. Young collections are 1.5x longer with the fix. I assume the reason is that the iteration over the array elements is interrupted at the end of a stripe which makes it less efficient. The prize for parallelization is paid ... > > Richard Reingruber has updated the pull request incrementally with two additional commits since the last revision: > > - Missed acquire semantics > - Overlap scavenge with pre-scavenge I find pre-processing card-table removes much complexity in determining which (part of) obj belongs to current stripe. However, synchronizing with actual scavenging introduce some complexity. The fact that `find_first_clean_card` copies the cached-obj-start is easy to miss and hard to reason IMO. > we would have a read only copy of the card table only for the current stripe. It would still require pre-processing card-table, right? Otherwise, I don't see how one can work around the "interference" across stripes. Maybe this can simplify the impl of `find_first_clean_card`. I am not too concerned about the regression observed for "large (32K) non-array instances", because that pattern is not common in java and the pause-time is still reasonable (<100ms). The long-term optimization (or the redemption of the extra-mem-requirement) I have in mind is to use 1 bit (instead of 1 byte) for a card -- Parallel requires only a boolean info for a particular card. One can even pre-alloc two card-tables now that each card-table is 1/8 of its original size, to avoid calling malloc inside young-gc-pause. My preference is some simple code without much regression. Ofc, this is quite subjective. ------------- PR Comment: https://git.openjdk.org/jdk/pull/14846#issuecomment-1750541087 From ayang at openjdk.org Fri Oct 6 12:19:57 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Fri, 6 Oct 2023 12:19:57 GMT Subject: RFR: 8317592: Serial: Remove Space::toContiguousSpace In-Reply-To: References: Message-ID: On Thu, 5 Oct 2023 12:28:23 GMT, Albert Mingkun Yang wrote: > Simple removing unnecessary abstraction after using more precise type. Thanks for the review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16054#issuecomment-1750542055 From ayang at openjdk.org Fri Oct 6 12:19:58 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Fri, 6 Oct 2023 12:19:58 GMT Subject: Integrated: 8317592: Serial: Remove Space::toContiguousSpace In-Reply-To: References: Message-ID: On Thu, 5 Oct 2023 12:28:23 GMT, Albert Mingkun Yang wrote: > Simple removing unnecessary abstraction after using more precise type. This pull request has now been integrated. Changeset: 691db5df Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/691db5df73a48cf7d78cb6b5f5085a3219baca50 Stats: 15 lines in 2 files changed: 0 ins; 11 del; 4 mod 8317592: Serial: Remove Space::toContiguousSpace Reviewed-by: tschatzl, iwalulya ------------- PR: https://git.openjdk.org/jdk/pull/16054 From ayang at openjdk.org Fri Oct 6 12:20:49 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Fri, 6 Oct 2023 12:20:49 GMT Subject: RFR: 8317318: Serial: Change GenCollectedHeap to SerialHeap in whitebox [v2] In-Reply-To: References:

Message-ID: On Sat, 30 Sep 2023 17:27:42 GMT, Albert Mingkun Yang wrote: >> Use more precise type for Serial GC. I also added a ` ShouldNotReachHere()` there, because using serial-heap when serial-gc is not used seems problematic. > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains one additional commit since the last revision: > > s1-prims Thanks for the review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/15988#issuecomment-1750541862 From ayang at openjdk.org Fri Oct 6 12:20:50 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Fri, 6 Oct 2023 12:20:50 GMT Subject: Integrated: 8317318: Serial: Change GenCollectedHeap to SerialHeap in whitebox In-Reply-To: References: Message-ID: On Fri, 29 Sep 2023 14:47:19 GMT, Albert Mingkun Yang wrote: > Use more precise type for Serial GC. I also added a ` ShouldNotReachHere()` there, because using serial-heap when serial-gc is not used seems problematic. This pull request has now been integrated. Changeset: b3cc0c84 Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/b3cc0c84316dd59f406a6fa23fcaf3d029910843 Stats: 11 lines in 1 file changed: 8 ins; 1 del; 2 mod 8317318: Serial: Change GenCollectedHeap to SerialHeap in whitebox Reviewed-by: tschatzl, iwalulya ------------- PR: https://git.openjdk.org/jdk/pull/15988 From ayang at openjdk.org Fri Oct 6 12:33:45 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Fri, 6 Oct 2023 12:33:45 GMT Subject: RFR: 8317440: Lock rank checking fails when code root set is modified with the Servicelock held after JDK-8315503 In-Reply-To: References: Message-ID: On Thu, 5 Oct 2023 17:19:35 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that fixes lock ranking after recent changes to the code root set, now using a CHT. > > The issue came up because the lock rank of the CHT lock has been larger than the rank of the Servicethread_lock where it is possible that code roots can be added. > > The suggested solution is to fix up the lock rankings to work; actually this PR contains two variants: > 1) one that statically sets the lock ranks of the CHT lock (and the ThreadSMR_lock that can be used during CHT operation) to something smaller than Servicethread_lock. > 2) one that allows setting of the CHT lock rank via parameter as well (the last commit changed the code to variant 1). > > The other lock ranking changes to Metaspace_lock and ContinuationRelativize_lock are simply undos of the respective changes in [JDK-8315503](https://bugs.openjdk.org/browse/JDK-8315503). > > Testing: tier1-8 for variant 2), tier 1-7 for variant 1) > > Thanks, > Thomas Marked as reviewed by ayang (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/16062#pullrequestreview-1661761085 From ayang at openjdk.org Fri Oct 6 12:38:39 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Fri, 6 Oct 2023 12:38:39 GMT Subject: RFR: 8317675: Serial: Move gc/shared/generation to serial folder Message-ID: Simple moving files/renamings. ------------- Commit messages: - s1-generation Changes: https://git.openjdk.org/jdk/pull/16072/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16072&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8317675 Stats: 20 lines in 12 files changed: 8 ins; 8 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/16072.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16072/head:pull/16072 PR: https://git.openjdk.org/jdk/pull/16072 From tschatzl at openjdk.org Fri Oct 6 12:51:08 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 6 Oct 2023 12:51:08 GMT Subject: RFR: 8317440: Lock rank checking fails when code root set is modified with the Servicelock held after JDK-8315503 In-Reply-To: References:

Message-ID: On Fri, 6 Oct 2023 11:49:21 GMT, Coleen Phillimore wrote: > Variant 1 seems ok. Uses of the CHT shouldn't take locks, so having a low lock ranking for CHT lock seems like it'll be fine (I can't find where it takes the ThreadsSMRDelete_lock). If any of this breaks, we can try approach #2 next. Thread lists synchronization in `GlobalCounter::write_synchronize()` uses the `ThreadsSMRDelete_lock` via `JavaThreadIteratorWithHandle`->`ThreadsListHandle`->`SafeThreadsListPtr` in the destructor ... ------------- PR Comment: https://git.openjdk.org/jdk/pull/16062#issuecomment-1750614008 From zgu at openjdk.org Fri Oct 6 13:37:00 2023 From: zgu at openjdk.org (Zhengyu Gu) Date: Fri, 6 Oct 2023 13:37:00 GMT Subject: RFR: 8317466: Enable interpreter oopMapCache for concurrent GCs Message-ID: Interpreter oop maps are computed lazily during GC root scan and they are expensive to compute. GCs uses a small hash table per instance class to cache computed oop maps during STW root scan, but not for concurrent root scan. This patch is intended to enable `OopMapCache` for concurrent GCs. Test: tier1 and tier2 fastdebug and release on MacOSX, Linux 86_84 and Linux 86_32. ------------- Commit messages: - Fix merge conflicts - Merge - cleanup - Merge branch 'master' into JDK-8317466 - Cleanup - Merge branch 'master' into oopmapcache_for_concurrent_root_scan - v2 - v1 - v0 - 8317240: Promptly free OopMapEntry after fail to insert the entry to OopMapCache Changes: https://git.openjdk.org/jdk/pull/16074/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16074&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8317466 Stats: 56 lines in 10 files changed: 20 ins; 15 del; 21 mod Patch: https://git.openjdk.org/jdk/pull/16074.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16074/head:pull/16074 PR: https://git.openjdk.org/jdk/pull/16074 From lmesnik at openjdk.org Fri Oct 6 16:19:08 2023 From: lmesnik at openjdk.org (Leonid Mesnik) Date: Fri, 6 Oct 2023 16:19:08 GMT Subject: RFR: 8316608: Enable parallelism in vmTestbase/gc/vector tests In-Reply-To: References: Message-ID: On Tue, 3 Oct 2023 14:02:09 GMT, Soumadipta Roy wrote: > The commit includes changes to unblock parallelism for more `hotspot:tier4` tests. in `vmTestbase/gc/vector` tests. > > Below are the before and after run comparisons: > > # Fastdebug > before: 3480.71s user 83.97s system 830% cpu 7:09.41 total > after: 2214.52s user 63.19s system 2374% cpu 1:35.94 total > > # Release > before: 1369.61s user 147.03s system 371% cpu 6:48.63 total > after: 1130.28s user 110.97s system 2478% cpu 50.089 total testing didn't show any issues ------------- Marked as reviewed by lmesnik (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16028#pullrequestreview-1662232566 From iwalulya at openjdk.org Sat Oct 7 08:58:32 2023 From: iwalulya at openjdk.org (Ivan Walulya) Date: Sat, 7 Oct 2023 08:58:32 GMT Subject: RFR: 8317594: G1: Refactor find_empty_from_idx_reverse In-Reply-To: References: Message-ID: <7HiwT6WwWBRZTatZKunKqaUp0emZLdZiqp2LcWN5r-0=.08e945b6-bdcf-40ef-8405-1a270085e634@github.com> On Thu, 5 Oct 2023 12:34:12 GMT, Albert Mingkun Yang wrote: > Simple range boundary value adjustment to remove some redundant operations inside/around this method. LGTM! Nit: Variable renaming `cur` to `i` results in more modifications than necessary, don't see the motivation for such renaming. ------------- Marked as reviewed by iwalulya (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16055#pullrequestreview-1663026170 From mli at openjdk.org Sat Oct 7 20:12:02 2023 From: mli at openjdk.org (Hamlin Li) Date: Sat, 7 Oct 2023 20:12:02 GMT Subject: RFR: 8317675: Serial: Move gc/shared/generation to serial folder In-Reply-To: References: Message-ID: On Fri, 6 Oct 2023 12:30:14 GMT, Albert Mingkun Yang wrote: > Simple moving files/renamings. LGTM ------------- Marked as reviewed by mli (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16072#pullrequestreview-1663104787 From mli at openjdk.org Sat Oct 7 20:13:14 2023 From: mli at openjdk.org (Hamlin Li) Date: Sat, 7 Oct 2023 20:13:14 GMT Subject: RFR: 8314259: G1: Fix -Wconversion warnings of double to uint in G1CardSetConfiguration In-Reply-To: <9cfdAUfFRfTMkU4ztmJ5VcVQVARPi44PYGkRR_8gFCY=.b9064371-9d80-43af-85ae-d1ca88e1c776@github.com> References: <9cfdAUfFRfTMkU4ztmJ5VcVQVARPi44PYGkRR_8gFCY=.b9064371-9d80-43af-85ae-d1ca88e1c776@github.com> Message-ID: On Tue, 15 Aug 2023 08:34:44 GMT, Albert Mingkun Yang wrote: > Adding explicit double to int conversion. Marked as reviewed by mli (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/15284#pullrequestreview-1663104870 From tschatzl at openjdk.org Mon Oct 9 08:28:32 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 9 Oct 2023 08:28:32 GMT Subject: RFR: 8317440: Lock rank checking fails when code root set is modified with the Servicelock held after JDK-8315503 In-Reply-To: References:

Message-ID: On Fri, 6 Oct 2023 11:49:21 GMT, Coleen Phillimore wrote: >> Hi all, >> >> please review this change that fixes lock ranking after recent changes to the code root set, now using a CHT. >> >> The issue came up because the lock rank of the CHT lock has been larger than the rank of the Servicethread_lock where it is possible that code roots can be added. >> >> The suggested solution is to fix up the lock rankings to work; actually this PR contains two variants: >> 1) one that statically sets the lock ranks of the CHT lock (and the ThreadSMR_lock that can be used during CHT operation) to something smaller than Servicethread_lock. >> 2) one that allows setting of the CHT lock rank via parameter as well (the last commit changed the code to variant 1). >> >> The other lock ranking changes to Metaspace_lock and ContinuationRelativize_lock are simply undos of the respective changes in [JDK-8315503](https://bugs.openjdk.org/browse/JDK-8315503). >> >> Testing: tier1-8 for variant 2), tier 1-7 for variant 1) >> >> Thanks, >> Thomas > > Variant 1 seems ok. Uses of the CHT shouldn't take locks, so having a low lock ranking for CHT lock seems like it'll be fine (I can't find where it takes the ThreadsSMRDelete_lock). If any of this breaks, we can try approach #2 next. Thanks @coleenp @albertnetymk for your reviews ------------- PR Comment: https://git.openjdk.org/jdk/pull/16062#issuecomment-1752548969 From tschatzl at openjdk.org Mon Oct 9 08:31:41 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 9 Oct 2023 08:31:41 GMT Subject: Integrated: 8317440: Lock rank checking fails when code root set is modified with the Servicelock held after JDK-8315503 In-Reply-To: References: Message-ID: On Thu, 5 Oct 2023 17:19:35 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that fixes lock ranking after recent changes to the code root set, now using a CHT. > > The issue came up because the lock rank of the CHT lock has been larger than the rank of the Servicethread_lock where it is possible that code roots can be added. > > The suggested solution is to fix up the lock rankings to work; actually this PR contains two variants: > 1) one that statically sets the lock ranks of the CHT lock (and the ThreadSMR_lock that can be used during CHT operation) to something smaller than Servicethread_lock. > 2) one that allows setting of the CHT lock rank via parameter as well (the last commit changed the code to variant 1). > > The other lock ranking changes to Metaspace_lock and ContinuationRelativize_lock are simply undos of the respective changes in [JDK-8315503](https://bugs.openjdk.org/browse/JDK-8315503). > > Testing: tier1-8 for variant 2), tier 1-7 for variant 1) > > Thanks, > Thomas This pull request has now been integrated. Changeset: 0cf1a558 Author: Thomas Schatzl URL: https://git.openjdk.org/jdk/commit/0cf1a558bacf18d9fc41e43fb5e9eba39dc51f2e Stats: 4 lines in 2 files changed: 0 ins; 0 del; 4 mod 8317440: Lock rank checking fails when code root set is modified with the Servicelock held after JDK-8315503 Reviewed-by: coleenp, ayang ------------- PR: https://git.openjdk.org/jdk/pull/16062 From lkorinth at openjdk.org Mon Oct 9 09:17:44 2023 From: lkorinth at openjdk.org (Leo Korinth) Date: Mon, 9 Oct 2023 09:17:44 GMT Subject: RFR: 8317228: GC: Make TestXXXHeapSizeFlags use createTestJvm In-Reply-To: References: Message-ID: <49kT79BxjQG9RM9Y96eyATKoBZRRqPuO0F1wR7SsFoY=.2418dfbf-fd86-4c5e-bb87-a9ab994e8088@github.com> On Fri, 29 Sep 2023 13:44:26 GMT, Leo Korinth wrote: > Minor testing done, I will test more later with other fixes. I have tested this with tier1-5 on x86. ------------- PR Comment: https://git.openjdk.org/jdk/pull/15986#issuecomment-1752585262 From ayang at openjdk.org Mon Oct 9 10:42:43 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 9 Oct 2023 10:42:43 GMT Subject: RFR: 8317594: G1: Refactor find_empty_from_idx_reverse In-Reply-To: References: Message-ID: On Thu, 5 Oct 2023 12:34:12 GMT, Albert Mingkun Yang wrote: > Simple range boundary value adjustment to remove some redundant operations inside/around this method. Thanks for the review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16055#issuecomment-1752761563 From ayang at openjdk.org Mon Oct 9 10:42:44 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 9 Oct 2023 10:42:44 GMT Subject: Integrated: 8317594: G1: Refactor find_empty_from_idx_reverse In-Reply-To: References: Message-ID: On Thu, 5 Oct 2023 12:34:12 GMT, Albert Mingkun Yang wrote: > Simple range boundary value adjustment to remove some redundant operations inside/around this method. This pull request has now been integrated. Changeset: a57ae7e7 Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/a57ae7e7d4c84b012e4a3533f316c4e7e6f99bb7 Stats: 22 lines in 2 files changed: 5 ins; 0 del; 17 mod 8317594: G1: Refactor find_empty_from_idx_reverse Reviewed-by: tschatzl, iwalulya ------------- PR: https://git.openjdk.org/jdk/pull/16055 From sjohanss at openjdk.org Mon Oct 9 10:59:33 2023 From: sjohanss at openjdk.org (Stefan Johansson) Date: Mon, 9 Oct 2023 10:59:33 GMT Subject: RFR: 8317358: G1: Make TestMaxNewSize use createTestJvm In-Reply-To: References: Message-ID: On Mon, 2 Oct 2023 15:09:49 GMT, Leo Korinth wrote: > In addition remove deprecated `Long(long)` constructor and rewrite `compareTo` to use `> 0` instead of `== 1`. > > Also remove unused `isRunningG1(String[] args)` and `checkIncompatibleNewSize(String[] flags)` > > Minimal testing completed, will run tier testing before pushing. >From what I can tell this test now fails when run with for example `-Xmn1g`. So I'm not sure if it really is a good idea to change this test to use `createTestJvm`. We could of course add this option to the requires list, but there are more options that can cause failures as well. ------------- Changes requested by sjohanss (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16012#pullrequestreview-1664196844 From coleenp at openjdk.org Mon Oct 9 14:09:02 2023 From: coleenp at openjdk.org (Coleen Phillimore) Date: Mon, 9 Oct 2023 14:09:02 GMT Subject: RFR: 8317730: Change byte_size to return size_t In-Reply-To: References: Message-ID: On Mon, 9 Oct 2023 12:57:34 GMT, Albert Mingkun Yang wrote: > Simple signature update to `byte_size` to match expectation from callers. This looks good. ------------- Marked as reviewed by coleenp (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16100#pullrequestreview-1664533639 From iwalulya at openjdk.org Mon Oct 9 15:22:55 2023 From: iwalulya at openjdk.org (Ivan Walulya) Date: Mon, 9 Oct 2023 15:22:55 GMT Subject: RFR: 8170817: G1: Returning MinTLABSize from unsafe_max_tlab_alloc causes TLAB flapping Message-ID: <3_DGucK_oOxNiifa0k42NxgdbGz3MVkArCdFJsGNqMs=.fb5a3f0f-bf8e-4310-a878-66e55e6e3c36@github.com> Hi all, Please review this small change to return the maximum allowed TLAB size from `unsafe_max_tlab_alloc` in case the current region does not have enough free space to allocate MinTLABSize. We assume that the next TLAB allocation will happen in a new region. Testing: Tier 1-3. Thanks ------------- Commit messages: - initial Changes: https://git.openjdk.org/jdk/pull/16102/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16102&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8170817 Stats: 8 lines in 1 file changed: 5 ins; 2 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/16102.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16102/head:pull/16102 PR: https://git.openjdk.org/jdk/pull/16102 From rrich at openjdk.org Mon Oct 9 15:34:05 2023 From: rrich at openjdk.org (Richard Reingruber) Date: Mon, 9 Oct 2023 15:34:05 GMT Subject: RFR: 8310031: Parallel: Implement better work distribution for large object arrays in old gen [v15] In-Reply-To: References:

Message-ID: On Fri, 6 Oct 2023 12:15:40 GMT, Albert Mingkun Yang wrote: > I find pre-processing card-table removes much complexity in determining which (part of) obj belongs to current stripe. However, synchronizing with actual scavenging introduce some complexity. The complexity for synchronization is not too bad though. Also it only comes from overlapping card table preprocessing with scavenging. I think this could be removed again without loosing performance. > The fact that `find_first_clean_card` copies the cached-obj-start is easy to miss Yes, it is easy to miss. I thought it was a minor detail anyway. > and hard to reason IMO. It could be passed by reference if the query in `process_range` would be pulled up before the `find_first_clean_card` call. Let me know if you think that was better. > > we would have a read only copy of the card table only for the current stripe. > > It would still require pre-processing card-table, right? Otherwise, I don't see how one can work around the "interference" across stripes. Maybe this can simplify the impl of `find_first_clean_card`. That's correct. The implementation should be straight forward. I think I'll experiment with it. > > I am not too concerned about the regression observed for "large (32K) non-array instances", because that pattern is not common in java and the pause-time is still reasonable (<100ms). Agreed. > The long-term optimization (or the redemption of the extra-mem-requirement) I have in mind is to use 1 bit (instead of 1 byte) for a card -- Parallel requires only a boolean info for a particular card. One can even pre-alloc two card-tables now that each card-table is 1/8 of its original size, to avoid calling malloc inside young-gc-pause. > > My preference is some simple code without much regression. Ofc, this is quite subjective. Sure. My first preference would be that the change can be backported. We were discussing internally if the increased memory consumption could be an issue. Since environments that are sensitive to this either configure serial or g1 we thought it could be ok. At least from our point of view. ------------- PR Comment: https://git.openjdk.org/jdk/pull/14846#issuecomment-1753232330 From rrich at openjdk.org Mon Oct 9 15:42:02 2023 From: rrich at openjdk.org (Richard Reingruber) Date: Mon, 9 Oct 2023 15:42:02 GMT Subject: RFR: 8310031: Parallel: Implement better work distribution for large object arrays in old gen [v16] In-Reply-To: References: Message-ID: <7MCfxKnwPdEgJ_bTJ6T-WGBaiUTm3v_zNuED3OdbNP0=.beb49d29-ad0e-40ab-a0f8-0fff5373dbc4@github.com> > This pr introduces parallel scanning of large object arrays in the old generation containing roots for young collections of Parallel GC. This allows for better distribution of the actual work (following the array references) as opposed to "stealing" from other task queues which can lead to inverse scaling demonstrated by small tests (attached to JDK-8310031) and also observed in gerrit production systems. > > The algorithm to share scanning large arrays is supposed to be a straight > forward extension of the scheme implemented in > `PSCardTable::scavenge_contents_parallel`. > > - A worker scans the part of a large array located in its stripe > > - Except for the end of the large array reaching into a stripe which is scanned by the thread owning the previous stripe. This is just what the current implementation does: it skips objects crossing into the stripe. > > - For this it is necessary that large arrays cover at least 3 stripes (see `PSCardTable::large_obj_arr_min_words`) > > The implementation also makes use of the precise card marks for arrays. Only dirty regions are actually scanned. > > #### Performance testing > > ##### BigArrayInOldGenRR.java > > [BigArrayInOldGenRR.java](https://bugs.openjdk.org/secure/attachment/104422/BigArrayInOldGenRR.java) is a micro benchmark that assigns new objects to a large array in a loop. Creating new array elements triggers young collections. In each collection the large array is scanned because of its references to the new elements in the young generation. The benchmark score is the geometric mean of the duration of the last 5 young collections (lower is better). > > [BigArrayInOldGenRR.pdf](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.pdf)([BigArrayInOldGenRR.ods](https://cr.openjdk.org/~rrich/webrevs/8310031/BigArrayInOldGenRR.ods)) presents the benchmark results with 1 to 64 gc threads. > > Observations > > * JDK22 scales inversely. Adding gc threads prolongues young collections. With 32 threads young collections take ~15x longer than single threaded. > > * Fixed JDK22 scales well. Adding gc theads reduces the duration of young collections. With 32 threads young collections are 5x shorter than single threaded. > > * With just 1 gc thread there is a regression. Young collections are 1.5x longer with the fix. I assume the reason is that the iteration over the array elements is interrupted at the end of a stripe which makes it less efficient. The prize for parallelization is paid without actually doing it. Also ParallelGC will use at lea... Richard Reingruber has updated the pull request incrementally with two additional commits since the last revision: - find_first_clean_card: return end_card if final object extends beyond it. - Cleanup ------------- Changes: - all: https://git.openjdk.org/jdk/pull/14846/files - new: https://git.openjdk.org/jdk/pull/14846/files/d845e650..272ab97b Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=14846&range=15 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=14846&range=14-15 Stats: 6 lines in 1 file changed: 3 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/14846.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/14846/head:pull/14846 PR: https://git.openjdk.org/jdk/pull/14846 From wkemper at openjdk.org Mon Oct 9 16:55:37 2023 From: wkemper at openjdk.org (William Kemper) Date: Mon, 9 Oct 2023 16:55:37 GMT Subject: RFR: 8317535 Shenandoah: Remove unused code Message-ID: Tested with `hotspot_gc_shenandoah`, `specjbb`, `specjvm`, `dacapo`, `extremem` and `heapothesys`. ------------- Commit messages: - Merge upstream - Remove unused code and other minor fixes Changes: https://git.openjdk.org/jdk/pull/16104/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16104&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8317535 Stats: 185 lines in 20 files changed: 0 ins; 179 del; 6 mod Patch: https://git.openjdk.org/jdk/pull/16104.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16104/head:pull/16104 PR: https://git.openjdk.org/jdk/pull/16104 From kbarrett at openjdk.org Mon Oct 9 18:41:01 2023 From: kbarrett at openjdk.org (Kim Barrett) Date: Mon, 9 Oct 2023 18:41:01 GMT Subject: RFR: 8317730: Change byte_size to return size_t In-Reply-To: References: Message-ID: <2Lc8rkVmxtf8GLdNC2tKWRnoBin5TixdrtBKNiWHm6U=.ad5dceb1-5155-44f1-929a-d1fd4fc45335@github.com> On Mon, 9 Oct 2023 12:57:34 GMT, Albert Mingkun Yang wrote: > Simple signature update to `byte_size` to match expectation from callers. Looks good. I checked all the uses, and they all are dealing with size_t. Thanks for spotting and fixing. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16100#pullrequestreview-1665088255 From duke at openjdk.org Mon Oct 9 18:59:08 2023 From: duke at openjdk.org (Soumadipta Roy) Date: Mon, 9 Oct 2023 18:59:08 GMT Subject: Integrated: 8316608: Enable parallelism in vmTestbase/gc/vector tests In-Reply-To: References: Message-ID: On Tue, 3 Oct 2023 14:02:09 GMT, Soumadipta Roy wrote: > The commit includes changes to unblock parallelism for more `hotspot:tier4` tests. in `vmTestbase/gc/vector` tests. > > Below are the before and after run comparisons: > > # Fastdebug > before: 3480.71s user 83.97s system 830% cpu 7:09.41 total > after: 2214.52s user 63.19s system 2374% cpu 1:35.94 total > > # Release > before: 1369.61s user 147.03s system 371% cpu 6:48.63 total > after: 1130.28s user 110.97s system 2478% cpu 50.089 total This pull request has now been integrated. Changeset: f61499c7 Author: Soumadipta Roy Committer: Paul Hohensee URL: https://git.openjdk.org/jdk/commit/f61499c73fe03e2e3680d7f58a84183364c5c5ac Stats: 299 lines in 13 files changed: 0 ins; 299 del; 0 mod 8316608: Enable parallelism in vmTestbase/gc/vector tests Reviewed-by: shade, lmesnik ------------- PR: https://git.openjdk.org/jdk/pull/16028 From shade at openjdk.org Mon Oct 9 21:03:23 2023 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 9 Oct 2023 21:03:23 GMT Subject: RFR: 8317755: G1: Periodic GC interval should test for the last whole heap GC Message-ID: See the description in the bug. Fortunately, we already track the last whole-heap GC. The new regression test verifies the behavior. Additional testing: - [ ] Linux x86_64 fastdebug `tier1 tier2 tier3` ------------- Commit messages: - Keep the cast - Fix Changes: https://git.openjdk.org/jdk/pull/16107/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16107&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8317755 Stats: 161 lines in 2 files changed: 158 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/16107.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16107/head:pull/16107 PR: https://git.openjdk.org/jdk/pull/16107 From ayang at openjdk.org Tue Oct 10 09:45:13 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 10 Oct 2023 09:45:13 GMT Subject: RFR: 8310031: Parallel: Implement better work distribution for large object arrays in old gen [v15] In-Reply-To: References:

Message-ID: On Mon, 9 Oct 2023 15:31:22 GMT, Richard Reingruber wrote: > Also it only comes from overlapping card table preprocessing with scavenging. I think this could be removed again without loosing performance. That complexity is uncalled for if its benefit is marginal. > It could be passed by reference if the query in process_range would be pulled up before the find_first_clean_card call. > The implementation should be straight forward. I think I'll experiment with it. Could it be updated to not query object-start? That would remove much complexity inside that method. Additionally, I wonder if the scanning-dirty-chunk iteration can be simplified a bit: the num of calls to `scan_obj_with_limit` seems excessive and it's not obvious whether it's intended or not that `continue` skips `drain_stacks_cond_depth`). If so, dirtying-first-card-inside-a-stripe probably strikes the best balance btw complexity and performance/mem-overhead for now. Otherwise, I prefer shadow-card-table for its simplicity and the mem-overhead issue can be addressed later on. ------------- PR Comment: https://git.openjdk.org/jdk/pull/14846#issuecomment-1754833838 From tschatzl at openjdk.org Tue Oct 10 11:20:55 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 10 Oct 2023 11:20:55 GMT Subject: RFR: 8317755: G1: Periodic GC interval should test for the last whole heap GC In-Reply-To: References: Message-ID: <7S39vb4eh2w9761PKaTUG5er4IYi_ZqOOWR4cYukZB4=.3f6c8780-5b96-4ce6-92d4-56d2aa03a12b@github.com> On Mon, 9 Oct 2023 20:46:44 GMT, Aleksey Shipilev wrote: > See the description in the bug. Fortunately, we already track the last whole-heap GC. The new regression test verifies the behavior. > > Additional testing: > - [x] Linux x86_64 fastdebug `tier1 tier2 tier3` Initial comments when looking at it: >From what I understand from the CR description, the VM is not completely idle but has GCs now and then which do not (or almost do not) increase old gen occupancy. So the problem seems to be that the existing policy for the periodic gcs does not detect idleness (and the request to give back memory) in this case. The JEP mentions the `G1PeriodicGCSystemLoadThreshold` option to improve upon that. Maybe use of it is an option in this case? This is probably a significant behavioral change which I think is unintended: if the user enabled to use execution of a full gc instead of a concurrent marking - then instead of chugging along if there is at least some activity, there would be regular full gcs now regardless of "being idle" (depending on how you see idle; very long periodis of only young gcs can be induced by e.g. an overly large heap). I do not think this is expected, even if the user asked for full gcs for periodic gcs. Because this topic about periodic gcs has come up a few times, the RFE does not describe the actual situation you are in, so it would be interesting to have more information. For example, would it be acceptable to have some kind of interaction with the VM to set e.g. `SoftMaxHeapSize` at idle? It is kind of hard to cover all "idle" situations and this has been part of the discussion in the original JEP, and this change seems to try to fix this with the imo fairly crude hammer of "guarantee a whole heap analysis" regularly instead. Which is not necessarily incompatible with the current mechanism (i.e. an either-or situation). There may be other aspects of this suggestion. Formal things: This is a significant behavioral change which needs a CSR imo. The referenced JEP exactly defines the expected behavior to listen to young gcs. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16107#issuecomment-1755080550 From tschatzl at openjdk.org Tue Oct 10 11:21:00 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 10 Oct 2023 11:21:00 GMT Subject: RFR: 8317755: G1: Periodic GC interval should test for the last whole heap GC In-Reply-To: <7S39vb4eh2w9761PKaTUG5er4IYi_ZqOOWR4cYukZB4=.3f6c8780-5b96-4ce6-92d4-56d2aa03a12b@github.com> References: <7S39vb4eh2w9761PKaTUG5er4IYi_ZqOOWR4cYukZB4=.3f6c8780-5b96-4ce6-92d4-56d2aa03a12b@github.com> Message-ID: On Tue, 10 Oct 2023 11:09:11 GMT, Thomas Schatzl wrote: > Formal things: This is a significant behavioral change which needs a CSR imo. The referenced JEP exactly defines the expected behavior to listen to young gcs. Note that this requirement can be removed again if there are significant changes, so probably don't start writing yet... however the current change as is would need one imo. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16107#issuecomment-1755091876 From ayang at openjdk.org Tue Oct 10 11:55:27 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 10 Oct 2023 11:55:27 GMT Subject: RFR: 8317730: Change byte_size to return size_t In-Reply-To: References: Message-ID: On Mon, 9 Oct 2023 12:57:34 GMT, Albert Mingkun Yang wrote: > Simple signature update to `byte_size` to match expectation from callers. Thanks for the review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16100#issuecomment-1755200730 From ayang at openjdk.org Tue Oct 10 12:00:02 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 10 Oct 2023 12:00:02 GMT Subject: Integrated: 8317730: Change byte_size to return size_t In-Reply-To: References: Message-ID: On Mon, 9 Oct 2023 12:57:34 GMT, Albert Mingkun Yang wrote: > Simple signature update to `byte_size` to match expectation from callers. This pull request has now been integrated. Changeset: fb4098ff Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/fb4098ff1a7cca5ec42600f9ab753681961bb1ad Stats: 2 lines in 1 file changed: 0 ins; 0 del; 2 mod 8317730: Change byte_size to return size_t Reviewed-by: coleenp, kbarrett ------------- PR: https://git.openjdk.org/jdk/pull/16100 From ayang at openjdk.org Tue Oct 10 12:02:58 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 10 Oct 2023 12:02:58 GMT Subject: RFR: 8317797: G1: Remove unimplemented predict_will_fit Message-ID: Trivial removing dead code. ------------- Commit messages: - trivial Changes: https://git.openjdk.org/jdk/pull/16116/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=16116&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8317797 Stats: 8 lines in 1 file changed: 0 ins; 8 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/16116.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/16116/head:pull/16116 PR: https://git.openjdk.org/jdk/pull/16116 From tschatzl at openjdk.org Tue Oct 10 12:48:50 2023 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 10 Oct 2023 12:48:50 GMT Subject: RFR: 8317797: G1: Remove unimplemented predict_will_fit In-Reply-To: References: Message-ID: On Tue, 10 Oct 2023 11:53:29 GMT, Albert Mingkun Yang wrote: > Trivial removing dead code. Lgtm and trivial. ------------- Marked as reviewed by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/16116#pullrequestreview-1667576382 From ayang at openjdk.org Wed Oct 11 09:25:24 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 11 Oct 2023 09:25:24 GMT Subject: RFR: 8317797: G1: Remove unimplemented predict_will_fit In-Reply-To: References: Message-ID: On Tue, 10 Oct 2023 11:53:29 GMT, Albert Mingkun Yang wrote: > Trivial removing dead code. Thanks for the review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16116#issuecomment-1757239181 From ayang at openjdk.org Wed Oct 11 09:25:25 2023 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 11 Oct 2023 09:25:25 GMT Subject: Integrated: 8317797: G1: Remove unimplemented predict_will_fit In-Reply-To: