From ayang at openjdk.org Thu Aug 1 07:31:43 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 1 Aug 2024 07:31:43 GMT Subject: RFR: 8337546: Remove unused GCCause::_adaptive_size_policy In-Reply-To: References: Message-ID: On Wed, 31 Jul 2024 11:25:50 GMT, Albert Mingkun Yang wrote: > Trivial removing an unused gc-cause; it was previously used by Parallel only. Thanks for review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/20403#issuecomment-2262242210 From ayang at openjdk.org Thu Aug 1 07:31:43 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 1 Aug 2024 07:31:43 GMT Subject: Integrated: 8337546: Remove unused GCCause::_adaptive_size_policy In-Reply-To: References: Message-ID: <6THEqrfMC8jW6TBFfLMIn8XdDslUFXP9jBtYzc0jOKc=.474e0a7c-827c-4519-948e-db8aecc15722@github.com> On Wed, 31 Jul 2024 11:25:50 GMT, Albert Mingkun Yang wrote: > Trivial removing an unused gc-cause; it was previously used by Parallel only. This pull request has now been integrated. Changeset: cf1230a5 Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/cf1230a5f7e5ae4c72ec6243fff1d0b0eb27779a Stats: 13 lines in 4 files changed: 0 ins; 11 del; 2 mod 8337546: Remove unused GCCause::_adaptive_size_policy Reviewed-by: tschatzl, kbarrett ------------- PR: https://git.openjdk.org/jdk/pull/20403 From ayang at openjdk.org Thu Aug 1 07:43:56 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 1 Aug 2024 07:43:56 GMT Subject: RFR: 8337641: G1: Remove unused G1CollectedHeap::alloc_highest_free_region Message-ID: Trivial removing dead code. ------------- Commit messages: - g1-trivial Changes: https://git.openjdk.org/jdk/pull/20415/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=20415&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8337641 Stats: 18 lines in 2 files changed: 0 ins; 18 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/20415.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20415/head:pull/20415 PR: https://git.openjdk.org/jdk/pull/20415 From ayang at openjdk.org Thu Aug 1 07:49:02 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 1 Aug 2024 07:49:02 GMT Subject: RFR: 8337642: Serial: Remove redundant counter update in DefNewGeneration::gc_epilogue Message-ID: Trivial removing an empty method call. (Only subclasses have non-empty method body, which is not used by Serial.) ------------- Commit messages: - s1-trivial Changes: https://git.openjdk.org/jdk/pull/20416/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=20416&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8337642 Stats: 1 line in 1 file changed: 0 ins; 1 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/20416.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20416/head:pull/20416 PR: https://git.openjdk.org/jdk/pull/20416 From tschatzl at openjdk.org Thu Aug 1 07:53:30 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 1 Aug 2024 07:53:30 GMT Subject: RFR: 8337641: G1: Remove unused G1CollectedHeap::alloc_highest_free_region In-Reply-To: References: Message-ID: On Thu, 1 Aug 2024 07:39:31 GMT, Albert Mingkun Yang wrote: > Trivial removing dead code. `G1HeapRegionManager::find_highest_free()` can also be removed. ------------- Changes requested by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20415#pullrequestreview-2211909772 From tschatzl at openjdk.org Thu Aug 1 08:39:32 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 1 Aug 2024 08:39:32 GMT Subject: RFR: 8337642: Serial: Remove redundant counter update in DefNewGeneration::gc_epilogue In-Reply-To: References: Message-ID: On Thu, 1 Aug 2024 07:43:05 GMT, Albert Mingkun Yang wrote: > Trivial removing an empty method call. (Only subclasses have non-empty method body, which is not used by Serial.) I disagree with this change: when using `GCPolicyCounters`, the implied contract seems to be that `update_counters` is called at appropriate locations (e.g. `gc_epilogue`), even if empty, exactly to abstract away differences in the collectors wrt to usage. Looking at the users, it rather seems G1 being wrong in not calling this. ------------- PR Review: https://git.openjdk.org/jdk/pull/20416#pullrequestreview-2212007311 From ayang at openjdk.org Thu Aug 1 09:49:47 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 1 Aug 2024 09:49:47 GMT Subject: RFR: 8337641: G1: Remove unused G1CollectedHeap::alloc_highest_free_region [v2] In-Reply-To: References: Message-ID: > Trivial removing dead code. Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: review ------------- Changes: - all: https://git.openjdk.org/jdk/pull/20415/files - new: https://git.openjdk.org/jdk/pull/20415/files/01b06d59..1ff8da35 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=20415&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=20415&range=00-01 Stats: 27 lines in 2 files changed: 0 ins; 27 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/20415.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20415/head:pull/20415 PR: https://git.openjdk.org/jdk/pull/20415 From ayang at openjdk.org Thu Aug 1 09:53:31 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 1 Aug 2024 09:53:31 GMT Subject: RFR: 8337642: Serial: Remove redundant counter update in DefNewGeneration::gc_epilogue In-Reply-To: References:

Message-ID: <7-9_ZwueknC-L3QGr2xeMipto28jnnmtsLMGNKs3ouA=.19c3c6bd-2d1d-43e0-b48e-de6df2da3032@github.com> On Thu, 1 Aug 2024 08:37:10 GMT, Thomas Schatzl wrote: > the implied contract seems to be that update_counters is called at appropriate locations Mabye we can remove it from the base class. Callers of this method always live in gc-specific location where the concrete policy-counter type is known. ------------- PR Comment: https://git.openjdk.org/jdk/pull/20416#issuecomment-2262625045 From stefank at openjdk.org Thu Aug 1 12:23:57 2024 From: stefank at openjdk.org (Stefan Karlsson) Date: Thu, 1 Aug 2024 12:23:57 GMT Subject: RFR: 8337658: ZGC: Move soft reference handling out of the driver loop function Message-ID: The ZDriver code is written to be neat and have a clear outline. The soft reference handling distracts when reading this code. I propose that we hide it a bit. I've also clarified in comments and names that the code is dealing with clearing of *all* references. ------------- Commit messages: - 8337658: ZGC: Move soft reference handling out of the driver loop function Changes: https://git.openjdk.org/jdk/pull/20418/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=20418&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8337658 Stats: 51 lines in 8 files changed: 20 ins; 4 del; 27 mod Patch: https://git.openjdk.org/jdk/pull/20418.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20418/head:pull/20418 PR: https://git.openjdk.org/jdk/pull/20418 From duke at openjdk.org Thu Aug 1 12:58:30 2024 From: duke at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Thu, 1 Aug 2024 12:58:30 GMT Subject: RFR: 8337658: ZGC: Move soft reference handling out of the driver loop function In-Reply-To: References: Message-ID: <-La3_J21R2DpekRekPcRg4yDUUt7QJ5MfsyQjWznr0o=.fc95a797-44f0-4140-af5c-46ca6a2ef0a0@github.com> On Thu, 1 Aug 2024 12:19:04 GMT, Stefan Karlsson wrote: > The ZDriver code is written to be neat and have a clear outline. The soft reference handling distracts when reading this code. I propose that we hide it a bit. > > I've also clarified in comments and names that the code is dealing with clearing of *all* references. I think this change is good and agree that `ZDriverMajor::run_thread()` becomes easier to read. Since the policy is now read and set in the construction of the `ZDriverScopeMajor`, a new getter is needed from `ZGenerationOld` and in turn `ZReferenceProcessor` to retrieve the policy for the gc request. The naming clarifications seem appropriate. ------------- PR Comment: https://git.openjdk.org/jdk/pull/20418#issuecomment-2262971897 From tschatzl at openjdk.org Thu Aug 1 13:35:34 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Thu, 1 Aug 2024 13:35:34 GMT Subject: RFR: 8337641: G1: Remove unused G1CollectedHeap::alloc_highest_free_region [v2] In-Reply-To: References:

Message-ID: <8h0n7NHo5JaK_BUG_kJAqEe9LxKXvUa3hBx61xvyURM=.cacd30cc-c91a-4bea-88f3-9971810a8961@github.com> On Thu, 1 Aug 2024 09:49:47 GMT, Albert Mingkun Yang wrote: >> Trivial removing dead code. > > Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: > > review lgtm and trivial. ------------- Marked as reviewed by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20415#pullrequestreview-2212776652 From ayang at openjdk.org Thu Aug 1 13:44:35 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 1 Aug 2024 13:44:35 GMT Subject: RFR: 8337641: G1: Remove unused G1CollectedHeap::alloc_highest_free_region [v2] In-Reply-To: References:

Message-ID: <5ULVBkCxofwqMvWaQA579_ERMrUF6CskwU049ouXHeU=.94534efb-5531-46be-890f-c221c98c5428@github.com> On Thu, 1 Aug 2024 09:49:47 GMT, Albert Mingkun Yang wrote: >> Trivial removing dead code. > > Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: > > review Thanks for review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/20415#issuecomment-2263075474 From ayang at openjdk.org Thu Aug 1 13:44:36 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 1 Aug 2024 13:44:36 GMT Subject: Integrated: 8337641: G1: Remove unused G1CollectedHeap::alloc_highest_free_region In-Reply-To: References: Message-ID: On Thu, 1 Aug 2024 07:39:31 GMT, Albert Mingkun Yang wrote: > Trivial removing dead code. This pull request has now been integrated. Changeset: 022899a7 Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/022899a7eb0100bd6d738471f52e5028e3e5f18e Stats: 45 lines in 4 files changed: 0 ins; 45 del; 0 mod 8337641: G1: Remove unused G1CollectedHeap::alloc_highest_free_region Reviewed-by: tschatzl ------------- PR: https://git.openjdk.org/jdk/pull/20415 From duke at openjdk.org Thu Aug 1 18:24:38 2024 From: duke at openjdk.org (duke) Date: Thu, 1 Aug 2024 18:24:38 GMT Subject: Withdrawn: 8331723: Serial: Remove the unused parameter of the method SerialHeap::gc_prologue In-Reply-To: References: Message-ID: On Sun, 12 May 2024 09:27:36 GMT, xiaotaonan wrote: > Serial: Remove the unused parameter of the method SerialHeap::gc_prologue This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.org/jdk/pull/19207 From nprasad at openjdk.org Thu Aug 1 21:13:48 2024 From: nprasad at openjdk.org (Neethu Prasad) Date: Thu, 1 Aug 2024 21:13:48 GMT Subject: RFR: 8336299: Improve GCLocker stall diagnostics [v4] In-Reply-To: References: Message-ID: <6WQVJcdTTpmLHN11SLuikvQGtPYOiB82dFd-cShd-Qk=.5a20a8f4-cab9-4fd2-9f00-93c064fe7ceb@github.com> > **Notes** > Adding logs to get more visibility into how fast a thread resumes from allocation stall. > > **Testing** > * tier 1, tier 2, hotspot_gc tests. > > Example log messages > > 1. Last thread exiting. Performing GC after exiting critical section. Thread "main" 0 locked. > > 2. Thread exiting critical region Thread "main" 0 locked. > > 3. Thread stalled by JNI critical section. Resumed after 586ms. Thread "Thread-0". > > 4. Thread blocked to enter critical region. Resumed after 1240ms. Thread "SIGINT handler". Neethu Prasad has updated the pull request incrementally with one additional commit since the last revision: Address formating issue and code clean up feedback ------------- Changes: - all: https://git.openjdk.org/jdk/pull/20277/files - new: https://git.openjdk.org/jdk/pull/20277/files/c6b66ceb..c53dc9cf Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=20277&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=20277&range=02-03 Stats: 52 lines in 2 files changed: 23 ins; 29 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/20277.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20277/head:pull/20277 PR: https://git.openjdk.org/jdk/pull/20277 From nprasad at openjdk.org Fri Aug 2 02:58:37 2024 From: nprasad at openjdk.org (Neethu Prasad) Date: Fri, 2 Aug 2024 02:58:37 GMT Subject: RFR: 8336299: Improve GCLocker stall diagnostics In-Reply-To: References:

Message-ID: On Wed, 24 Jul 2024 08:27:49 GMT, Thomas Schatzl wrote: > It might also be nice to give an example of such a new message in the CR. updated PR summary. Examples are as below 1. Last thread exiting. Performing GC after exiting critical section. Thread "main" 0 locked. 2. Thread exiting critical region Thread "main" 0 locked. 3. Thread stalled by JNI critical section. Resumed after 586ms. Thread "Thread-0". 4. Thread blocked to enter critical region. Resumed after 1240ms. Thread "SIGINT handler". ------------- PR Comment: https://git.openjdk.org/jdk/pull/20277#issuecomment-2264414125 From nprasad at openjdk.org Fri Aug 2 02:58:38 2024 From: nprasad at openjdk.org (Neethu Prasad) Date: Fri, 2 Aug 2024 02:58:38 GMT Subject: RFR: 8336299: Improve GCLocker stall diagnostics [v4] In-Reply-To: References:

Message-ID: On Wed, 24 Jul 2024 08:23:54 GMT, Thomas Schatzl wrote: >> Neethu Prasad has updated the pull request incrementally with one additional commit since the last revision: >> >> Address formating issue and code clean up feedback > > src/hotspot/share/gc/shared/gcLocker.cpp line 124: > >> 122: } >> 123: >> 124: elapsedTimer elapsed_timer; > > In GC code we tend to use the newer `Ticks` and `Tickspan` API, not `elapsedTimer`. Only Parallel GC uses it at this point afaict (or just `os::elapsedTime()`/`os::elapsed_counter()`). > > Maybe it's even worth to add a special class that can be used with scopes to hide all that including the manual call to `log_debug_jni` (automatically done in the destructor). Probably not really useful. Addressed in latest revision. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/20277#discussion_r1701133766 From nprasad at openjdk.org Fri Aug 2 02:58:40 2024 From: nprasad at openjdk.org (Neethu Prasad) Date: Fri, 2 Aug 2024 02:58:40 GMT Subject: RFR: 8336299: Improve GCLocker stall diagnostics [v3] In-Reply-To: References:

Message-ID: On Wed, 31 Jul 2024 08:10:43 GMT, Stefan Karlsson wrote: >> Neethu Prasad has updated the pull request incrementally with one additional commit since the last revision: >> >> Add missing imports and remove unused ones > > src/hotspot/share/gc/shared/gcLocker.hpp line 166: > >> 164: GCLockerTimingDebugLogger(const char* log_message); >> 165: ~GCLockerTimingDebugLogger(); >> 166: }; > > There should be no code after the include guard on line 153. This class should be moved above it. With that said, this class is only used in gcLocker.cpp, so there's really no need to expose it through the gcLocker.hpp file, AFAICT. > > Also, note that you are using `/* */` to add a comment about the class, but the rest of the code in this file uses `//`, so I'd prefer to see it changed. > > Also note that GitHub complains that your addition lacks a newline at the end of the file. We recently went over the GC code base and fixed issues like that. Maybe there's a way to configure your editor to add one when making edits to the end of a file? Thanks for the feedback. Addressed in new PR revision. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/20277#discussion_r1701134220 From ayang at openjdk.org Fri Aug 2 06:52:00 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Fri, 2 Aug 2024 06:52:00 GMT Subject: RFR: 8337721: G1: Remove unused G1CollectedHeap::young_collection_verify_type Message-ID: Trivial removing dead code. ------------- Commit messages: - g1-trivial Changes: https://git.openjdk.org/jdk/pull/20438/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=20438&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8337721 Stats: 11 lines in 2 files changed: 0 ins; 11 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/20438.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20438/head:pull/20438 PR: https://git.openjdk.org/jdk/pull/20438 From ayang at openjdk.org Fri Aug 2 06:55:32 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Fri, 2 Aug 2024 06:55:32 GMT Subject: RFR: 8336299: Improve GCLocker stall diagnostics [v4] In-Reply-To: <6WQVJcdTTpmLHN11SLuikvQGtPYOiB82dFd-cShd-Qk=.5a20a8f4-cab9-4fd2-9f00-93c064fe7ceb@github.com> References: <6WQVJcdTTpmLHN11SLuikvQGtPYOiB82dFd-cShd-Qk=.5a20a8f4-cab9-4fd2-9f00-93c064fe7ceb@github.com> Message-ID: On Thu, 1 Aug 2024 21:13:48 GMT, Neethu Prasad wrote: >> **Notes** >> Adding logs to get more visibility into how fast a thread resumes from allocation stall. >> >> **Testing** >> * tier 1, tier 2, hotspot_gc tests. >> >> Example log messages >> >> 1. Last thread exiting. Performing GC after exiting critical section. Thread "main" 0 locked. >> >> 2. Thread exiting critical region Thread "main" 0 locked. >> >> 3. Thread stalled by JNI critical section. Resumed after 586ms. Thread "Thread-0". >> >> 4. Thread blocked to enter critical region. Resumed after 1240ms. Thread "SIGINT handler". > > Neethu Prasad has updated the pull request incrementally with one additional commit since the last revision: > > Address formating issue and code clean up feedback src/hotspot/share/gc/shared/gcLocker.cpp line 56: > 54: > 55: ~GCLockerTimingDebugLogger() { > 56: const Tickspan elapsed_time = Ticks::now() - _start; Why is this outside the `if` logger-enabled check? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/20277#discussion_r1701374847 From tschatzl at openjdk.org Fri Aug 2 07:43:32 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 2 Aug 2024 07:43:32 GMT Subject: RFR: 8337642: Serial: Remove redundant counter update in DefNewGeneration::gc_epilogue In-Reply-To: <7-9_ZwueknC-L3QGr2xeMipto28jnnmtsLMGNKs3ouA=.19c3c6bd-2d1d-43e0-b48e-de6df2da3032@github.com> References:

<7-9_ZwueknC-L3QGr2xeMipto28jnnmtsLMGNKs3ouA=.19c3c6bd-2d1d-43e0-b48e-de6df2da3032@github.com> Message-ID: On Thu, 1 Aug 2024 09:50:50 GMT, Albert Mingkun Yang wrote: > > the implied contract seems to be that update_counters is called at appropriate locations > > Mabye we can remove it from the base class. Callers of this method always live in gc-specific location where the concrete policy-counter type is known. Imo the point of this and similar APIs to avoid the need to think about whether a given collector uses a particular implementation, so that would run counter to the intent of such generic API about handling? I.e. that regardless of type of `GCPolicyCounters` that is actually used, one can be sure that everything is fine as long as you call that `update` method, not needing to think about the concrete policy type. Is the single empty call that much of a (performance) issue? ------------- PR Comment: https://git.openjdk.org/jdk/pull/20416#issuecomment-2264771219 From tschatzl at openjdk.org Fri Aug 2 08:01:34 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 2 Aug 2024 08:01:34 GMT Subject: RFR: 8336299: Improve GCLocker stall diagnostics [v4] In-Reply-To: References: <6WQVJcdTTpmLHN11SLuikvQGtPYOiB82dFd-cShd-Qk=.5a20a8f4-cab9-4fd2-9f00-93c064fe7ceb@github.com> Message-ID: On Fri, 2 Aug 2024 06:52:53 GMT, Albert Mingkun Yang wrote: >> Neethu Prasad has updated the pull request incrementally with one additional commit since the last revision: >> >> Address formating issue and code clean up feedback > > src/hotspot/share/gc/shared/gcLocker.cpp line 56: > >> 54: >> 55: ~GCLockerTimingDebugLogger() { >> 56: const Tickspan elapsed_time = Ticks::now() - _start; > > Why is this outside the `if` logger-enabled check? Please move within the `if`. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/20277#discussion_r1701459168 From tschatzl at openjdk.org Fri Aug 2 08:01:33 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 2 Aug 2024 08:01:33 GMT Subject: RFR: 8336299: Improve GCLocker stall diagnostics [v4] In-Reply-To: <6WQVJcdTTpmLHN11SLuikvQGtPYOiB82dFd-cShd-Qk=.5a20a8f4-cab9-4fd2-9f00-93c064fe7ceb@github.com> References: <6WQVJcdTTpmLHN11SLuikvQGtPYOiB82dFd-cShd-Qk=.5a20a8f4-cab9-4fd2-9f00-93c064fe7ceb@github.com> Message-ID: On Thu, 1 Aug 2024 21:13:48 GMT, Neethu Prasad wrote: >> **Notes** >> Adding logs to get more visibility into how fast a thread resumes from allocation stall. >> >> **Testing** >> * tier 1, tier 2, hotspot_gc tests. >> >> Example log messages >> >> 1. Last thread exiting. Performing GC after exiting critical section. Thread "main" 0 locked. >> >> 2. Thread exiting critical region Thread "main" 0 locked. >> >> 3. Thread stalled by JNI critical section. Resumed after 586ms. Thread "Thread-0". >> >> 4. Thread blocked to enter critical region. Resumed after 1240ms. Thread "SIGINT handler". > > Neethu Prasad has updated the pull request incrementally with one additional commit since the last revision: > > Address formating issue and code clean up feedback Changes requested by tschatzl (Reviewer). src/hotspot/share/gc/shared/gcLocker.cpp line 50: > 48: public: > 49: GCLockerTimingDebugLogger(const char* log_message) : > 50: _log_message(log_message) { Indentation of the entire class is one level too deep; the first `private` visibility specifier can be ommitted. There are two spaces before `_log_message`. src/hotspot/share/gc/shared/gcLocker.cpp line 53: > 51: assert(_log_message != nullptr, "GC locker debug message must be set."); > 52: _start = Ticks::now(); > 53: } I think this `}` should align with the method name, i.e. the body of this constructor seems to be nested one level too deep. ------------- PR Review: https://git.openjdk.org/jdk/pull/20277#pullrequestreview-2214937023 PR Review Comment: https://git.openjdk.org/jdk/pull/20277#discussion_r1701456382 PR Review Comment: https://git.openjdk.org/jdk/pull/20277#discussion_r1701458868 From tschatzl at openjdk.org Fri Aug 2 08:17:30 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 2 Aug 2024 08:17:30 GMT Subject: RFR: 8337721: G1: Remove unused G1CollectedHeap::young_collection_verify_type In-Reply-To: References: Message-ID: <0SliAGiQTVkw13wR8sq3OVbeOUCpcYyOpiruvzMwfxY=.843fed21-1ae2-4890-a170-36d6e4934719@github.com> On Fri, 2 Aug 2024 06:47:17 GMT, Albert Mingkun Yang wrote: > Trivial removing dead code. Lgtm and trivial. ------------- Marked as reviewed by tschatzl (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20438#pullrequestreview-2214974144 From ayang at openjdk.org Fri Aug 2 10:56:38 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Fri, 2 Aug 2024 10:56:38 GMT Subject: RFR: 8337721: G1: Remove unused G1CollectedHeap::young_collection_verify_type In-Reply-To: References: Message-ID: <7M8bnjQ8rk7S8SeGPk-gGqKxDfNTIYhA--QnopL4eRI=.6ba40ec8-dd2a-49f8-b890-5c374099f654@github.com> On Fri, 2 Aug 2024 06:47:17 GMT, Albert Mingkun Yang wrote: > Trivial removing dead code. Thanks for review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/20438#issuecomment-2265101741 From ayang at openjdk.org Fri Aug 2 10:56:38 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Fri, 2 Aug 2024 10:56:38 GMT Subject: Integrated: 8337721: G1: Remove unused G1CollectedHeap::young_collection_verify_type In-Reply-To: References: Message-ID: On Fri, 2 Aug 2024 06:47:17 GMT, Albert Mingkun Yang wrote: > Trivial removing dead code. This pull request has now been integrated. Changeset: a89b5251 Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/a89b525189fbc0559be9edc0de9f4288ca676139 Stats: 11 lines in 2 files changed: 0 ins; 11 del; 0 mod 8337721: G1: Remove unused G1CollectedHeap::young_collection_verify_type Reviewed-by: tschatzl ------------- PR: https://git.openjdk.org/jdk/pull/20438 From kbarrett at openjdk.org Fri Aug 2 19:41:03 2024 From: kbarrett at openjdk.org (Kim Barrett) Date: Fri, 2 Aug 2024 19:41:03 GMT Subject: RFR: 8337709: Use allocated states for chunking large array processing Message-ID: Please review this change to the G1 young/mixed collector to use allocated states to encode partial array task chunking. States are allocated from per-worker-thread arena+free-list pairs, and released to the free-list for the worker that completed use. They are refcounted to track the number of refering tasks. Various other approaches (such as a single arena+FreeListAllocator) were tested, but found to have worse performance, though in some cases fewer allocations. The per-worker arena+free-list pair was the only option that didn't show a regression compared to the previous PartialArrayScanTask approach on a stress test. In addition to the changes to ScannerTask to support the new PartialArrayState, it temporarily continues to support PartialArrayScanTask. This is because ParallelGC will continue to use the latter until it is changed to use PartialArrayState. The intent is to update ParallelGC in a followup CR. Testing: mach5 tier1-5 G1 performance suite ------------- Commit messages: - G1 young update - add PartialArrayState - move chunk size inside stepper Changes: https://git.openjdk.org/jdk/pull/20445/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=20445&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8337709 Stats: 501 lines in 9 files changed: 356 ins; 57 del; 88 mod Patch: https://git.openjdk.org/jdk/pull/20445.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20445/head:pull/20445 PR: https://git.openjdk.org/jdk/pull/20445 From ayang at openjdk.org Mon Aug 5 08:04:30 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 5 Aug 2024 08:04:30 GMT Subject: RFR: 8337642: Serial: Remove redundant counter update in DefNewGeneration::gc_epilogue In-Reply-To: References:

<7-9_ZwueknC-L3QGr2xeMipto28jnnmtsLMGNKs3ouA=.19c3c6bd-2d1d-43e0-b48e-de6df2da3032@github.com> Message-ID: On Fri, 2 Aug 2024 07:41:14 GMT, Thomas Schatzl wrote: > so that would run counter to the intent of such generic API about handling One can view `GCPolicyCounters` as a plain data-structure with only getters. The only two non-getter methods are the empty `update_counters` and the unused `kind`. If both are removed, `GCPolicyCounters` doesn't expose any action-related APIs any more. > one can be sure that everything is fine as long as you call that update method, That's a false sense of security. The two actual vars, `_tenuring_threshold` and `_desired_survivor_size`, that requires updating, are updated in two diff places in Serial and G1, after and before young-gc. IOW, diff GCs differ enough so that not exposing an `update` API, i.e. treating `GCPolicyCounters` as plain-old-data, offers more flexibility, IMO. > Is the single empty call that much of a (performance) issue? It's more about removing effectively dead code to simplify the logic. ------------- PR Comment: https://git.openjdk.org/jdk/pull/20416#issuecomment-2268424338 From tschatzl at openjdk.org Mon Aug 5 09:18:31 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 5 Aug 2024 09:18:31 GMT Subject: RFR: 8337642: Serial: Remove redundant counter update in DefNewGeneration::gc_epilogue In-Reply-To: References:

<7-9_ZwueknC-L3QGr2xeMipto28jnnmtsLMGNKs3ouA=.19c3c6bd-2d1d-43e0-b48e-de6df2da3032@github.com>

Message-ID: On Mon, 5 Aug 2024 08:01:27 GMT, Albert Mingkun Yang wrote: > > so that would run counter to the intent of such generic API about handling > > One can view `GCPolicyCounters` as a plain data-structure with only getters. The only two non-getter methods are the empty `update_counters` and the unused `kind`. If both are removed, `GCPolicyCounters` doesn't expose any action-related APIs any more. Then let's do that instead of removing only the call. The `update_counters` API as it is used now does not seem to help at all. The `gc_overhead_limit_exceeded_counter` could also be moved into the Parallel specific class because it's only used there. I object to only remove the call and keep the bad API; removing them isn't that much more work. ------------- PR Comment: https://git.openjdk.org/jdk/pull/20416#issuecomment-2268573514 From ayang at openjdk.org Mon Aug 5 09:41:04 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 5 Aug 2024 09:41:04 GMT Subject: RFR: 8337642: Serial: Remove redundant counter update in DefNewGeneration::gc_epilogue [v2] In-Reply-To: References: Message-ID: > Trivial removing an empty method call. (Only subclasses have non-empty method body, which is not used by Serial.) Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision: - review - Merge branch 'master' into s1-trivial - s1-trivial ------------- Changes: - all: https://git.openjdk.org/jdk/pull/20416/files - new: https://git.openjdk.org/jdk/pull/20416/files/54f148df..86e46b66 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=20416&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=20416&range=00-01 Stats: 4398 lines in 184 files changed: 2047 ins; 1389 del; 962 mod Patch: https://git.openjdk.org/jdk/pull/20416.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20416/head:pull/20416 PR: https://git.openjdk.org/jdk/pull/20416 From tschatzl at openjdk.org Mon Aug 5 09:46:36 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 5 Aug 2024 09:46:36 GMT Subject: RFR: 8337642: Remove unused APIs of GCPolicyCounters [v2] In-Reply-To: References:

Message-ID: On Mon, 5 Aug 2024 09:41:04 GMT, Albert Mingkun Yang wrote: >> Trivial removing an empty method call. (Only subclasses have non-empty method body, which is not used by Serial.) > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision: > > - review > - Merge branch 'master' into s1-trivial > - s1-trivial Thanks for review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/20416#issuecomment-2270525653 From duke at openjdk.org Tue Aug 6 14:00:02 2024 From: duke at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Tue, 6 Aug 2024 14:00:02 GMT Subject: RFR: 8310675: Fix -Wconversion warnings in ZGC code Message-ID: <_eEPqKVsKunCsC5ogqfaHPfgndjqBoJKV1iOsZAYxio=.723085df-731a-4c98-b74a-91575860e1ec@github.com> Fixed `-Wconversion` warnings in ZGC code, either by adding an explicit type cast, changin the type of the variable or calling an equivalent method with other types. The largest change is the addition of `ZStatDurationSample`, which typecasts `Tickspan::value()` to a `uint64_t` and calls `ZStatSample` to make the code more readable. I isolated the `-Wconversion` warnings for ZGC by adding the flag to clangd and displaying the errors in my IDE and going through each file directly associated with ZGC one by one. Tested with tiers 1-3. ------------- Commit messages: - 8310675: Fixed -Wconversion warnings in ZGC Changes: https://git.openjdk.org/jdk/pull/20406/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=20406&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8310675 Stats: 120 lines in 33 files changed: 5 ins; 0 del; 115 mod Patch: https://git.openjdk.org/jdk/pull/20406.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20406/head:pull/20406 PR: https://git.openjdk.org/jdk/pull/20406 From stefank at openjdk.org Tue Aug 6 14:24:32 2024 From: stefank at openjdk.org (Stefan Karlsson) Date: Tue, 6 Aug 2024 14:24:32 GMT Subject: RFR: 8310675: Fix -Wconversion warnings in ZGC code In-Reply-To: <_eEPqKVsKunCsC5ogqfaHPfgndjqBoJKV1iOsZAYxio=.723085df-731a-4c98-b74a-91575860e1ec@github.com> References: <_eEPqKVsKunCsC5ogqfaHPfgndjqBoJKV1iOsZAYxio=.723085df-731a-4c98-b74a-91575860e1ec@github.com> Message-ID: <75lHlbAzUEAS7EkEbyGjQrIT2l4qB2_-oQKI6CYNX6k=.59849893-7e20-4594-b93a-5675a6943d97@github.com> On Wed, 31 Jul 2024 13:01:50 GMT, Joel Sikstr?m wrote: > Fixed `-Wconversion` warnings in ZGC code, either by adding an explicit type cast, changin the type of the variable or calling an equivalent method with other types. The largest change is the addition of `ZStatDurationSample`, which typecasts `Tickspan::value()` to a `uint64_t` and calls `ZStatSample` to make the code more readable. > > I isolated the `-Wconversion` warnings for ZGC by adding the flag to clangd and displaying the errors in my IDE and going through each file directly associated with ZGC one by one. > > Tested with tiers 1-3. This change looks good to me. I've looked through the code with Joel and I think that this is good set of changes to the ZGC code base. When fixing -Wconversion warnings there are always multiple ways to juggle around the types. Some of the added casts could probably be cleaned up by updating non-ZGC code instead (E.g. TimeHelper), but for Joel's first patch we wanted to limit the changes to the ZGC code base. Note that we're intentionally only fixing the Generational ZGC code and leaving the single-generation code left as is. ------------- Marked as reviewed by stefank (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20406#pullrequestreview-2221462496 From ayang at openjdk.org Tue Aug 6 14:28:32 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 6 Aug 2024 14:28:32 GMT Subject: RFR: 8310675: Fix -Wconversion warnings in ZGC code In-Reply-To: <_eEPqKVsKunCsC5ogqfaHPfgndjqBoJKV1iOsZAYxio=.723085df-731a-4c98-b74a-91575860e1ec@github.com> References: <_eEPqKVsKunCsC5ogqfaHPfgndjqBoJKV1iOsZAYxio=.723085df-731a-4c98-b74a-91575860e1ec@github.com> Message-ID: On Wed, 31 Jul 2024 13:01:50 GMT, Joel Sikstr?m wrote: > Fixed `-Wconversion` warnings in ZGC code, either by adding an explicit type cast, changin the type of the variable or calling an equivalent method with other types. The largest change is the addition of `ZStatDurationSample`, which typecasts `Tickspan::value()` to a `uint64_t` and calls `ZStatSample` to make the code more readable. > > I isolated the `-Wconversion` warnings for ZGC by adding the flag to clangd and displaying the errors in my IDE and going through each file directly associated with ZGC one by one. > > Tested with tiers 1-3. Marked as reviewed by ayang (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/20406#pullrequestreview-2221474896 From nprasad at openjdk.org Tue Aug 6 18:11:05 2024 From: nprasad at openjdk.org (Neethu Prasad) Date: Tue, 6 Aug 2024 18:11:05 GMT Subject: RFR: 8336742: Shenandoah: Add more verbose logging/stats for mark termination attempts [v2] In-Reply-To: References: Message-ID: > **Notes** > This PR adds the following > 1. info logging on number of SATB flush attempts > 2. total time spend on handshaking all threads requesting them to flush their SATB buffers. > > As suggested by William in [JDK-8336742 ](https://bugs.openjdk.org/browse/JDK-83367420), we can use handshake logging to get time spend and other stats for each handshake. > > [4.515s][info][handshake ] Handshake "Shenandoah Flush SATB Handshake", Targeted threads: 1036, Executed by requesting thread: 1035, Total completion time: 597004 ns > [4.517s][info][handshake ] Handshake "Shenandoah Flush SATB Handshake", Targeted threads: 1036, Executed by requesting thread: 1033, Total completion time: 207402 ns > > > **Testing** > 1. tier1, tier2 and hotspot_gc_shenandoah tests. > 2. **-Xlog:gc+stats=info** > > > [4.058s][info][gc,stats ] Concurrent Marking = 0.080 s (a = 5351 us) (n = 15) (lvls, us = 4746, 5000, 5156, 5684, 5988) > [4.058s][info][gc,stats ] SATB Flush Rendezvous = 0.013 s (a = 860 us) (n = 15) (lvls, us = 764, 814, 836, 885, 961) > [4.058s][info][gc,stats ] Pause Final Mark (G) = 0.058 s (a = 3839 us) (n = 15) (lvls, us = 3047, 3320, 3867, 4121, 4930) > [4.058s][info][gc,stats ] Pause Final Mark (N) = 0.054 s (a = 3592 us) (n = 15) (lvls, us = 2812, 3047, 3574, 3887, 4597) > [4.058s][info][gc,stats ] Finish Mark = 0.028 s (a = 1843 us) (n = 15) (lvls, us = 1602, 1641, 1816, 1934, 2045) > [4.058s][info][gc,stats ] Update Region States = 0.006 s (a = 386 us) (n = 15) (lvls, us = 375, 375, 381, 389, 413) > [4.058s][info][gc,stats ] Choose Collection Set = 0.018 s (a = 1186 us) (n = 15) (lvls, us = 609, 619, 1309, 1387, 2109) > [4.058s][info][gc,stats ] Rebuild Free Set = 0.001 s (a = 43 us) (n = 15) (lvls, us = 40, 41, 42, 43, 53) > [4.058s][info][gc,stats ] Concurrent Weak References = 0.007 s (a = 452 us) (n = 15) (lvls, us = 420, 438, 443, 455, 487) > > > on app termination > > > [5.299s][info][gc,stats] GC STATISTICS: > [5.299s][info][gc,stats] "(G)" (gross) pauses include VM time: time to notify and block threads, do the pre- > [5.299s][info][gc,stats] and post-safepoint housekee... Neethu Prasad has updated the pull request incrementally with one additional commit since the last revision: ShenandoahTimingsTracker to support aggregation of cycle times ------------- Changes: - all: https://git.openjdk.org/jdk/pull/20318/files - new: https://git.openjdk.org/jdk/pull/20318/files/7c3d4a84..6e7fdd5e Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=20318&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=20318&range=00-01 Stats: 31 lines in 5 files changed: 9 ins; 6 del; 16 mod Patch: https://git.openjdk.org/jdk/pull/20318.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20318/head:pull/20318 PR: https://git.openjdk.org/jdk/pull/20318 From shade at openjdk.org Tue Aug 6 18:24:35 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Tue, 6 Aug 2024 18:24:35 GMT Subject: RFR: 8336742: Shenandoah: Add more verbose logging/stats for mark termination attempts [v2] In-Reply-To: References:

Message-ID: On Tue, 6 Aug 2024 18:11:05 GMT, Neethu Prasad wrote: >> **Revision 2 Notes** >> 1. Added time spent on handshaking all threads requesting them to flush their SATB buffers as part of GC stats. >> 2. As mentioned in PR feedback, will raise separate PR to adding logging in ShenandoahTimingsTracker. >> >> **Revision 1 Notes** >> This PR adds the following >> 1. info logging on number of SATB flush attempts >> 3. total time spend on handshaking all threads requesting them to flush their SATB buffers. >> >> As suggested by William in [JDK-8336742 ](https://bugs.openjdk.org/browse/JDK-83367420), we can use handshake logging to get time spend and other stats for each handshake. >> >> [4.515s][info][handshake ] Handshake "Shenandoah Flush SATB Handshake", Targeted threads: 1036, Executed by requesting thread: 1035, Total completion time: 597004 ns >> [4.517s][info][handshake ] Handshake "Shenandoah Flush SATB Handshake", Targeted threads: 1036, Executed by requesting thread: 1033, Total completion time: 207402 ns >> >> >> **Testing** >> 1. tier1, tier2 and hotspot_gc_shenandoah tests. >> 2. **-Xlog:gc+stats=info** >> >> >> [37.087s][info][gc,stats] CMR: VM Strong Roots 413 us, workers (us): 64, 57, 52, 47, 38, 31, 30, 25, 20, 21, 17, 10, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, >> [37.087s][info][gc,stats] CMR: CLDG Roots 449 us, workers (us): 4, ---, ---, 406, ---, 15, ---, 4, 4, ---, ---, 17, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, >> [37.087s][info][gc,stats] Concurrent Marking 5002 us >> [37.087s][info][gc,stats] SATB Flush Rendezvous 1748 us >> [37.087s][info][gc,stats] Pause Final Mark (G) 57272 us >> [37.087s][info][gc,stats] Pause Final Mark (N) 56985 us >> [37.087s][info][gc,stats] Finish Mark 387 us >> [37.087s][info][gc,stats] Update Region States 109 us >> [37.087s][info][gc,stats] Choose Collection Set 56395 us >> [37.087s][info][gc,stats] Rebuild Free Set 40 us >> >> >> on app termination >> >> >> [40.640s][info][gc,stats] Concurrent Reset = 0.914 s (a = 65255 us) (n = 14) (lvls, us = 54883, 55859, 63867, 65234, 97096) >> [40.640s][info][gc,stats] Pause Init Mark (G) = 1.755 s (a = 125380 us) (n = 14) (lvls, us = 119141, 123047, 125000, 125000, 128042) >> [40.640s][info][gc,stats] Pause Init Mark (N) = 1.697 s (a = 121241 us... > > Neethu Prasad has updated the pull request incrementally with one additional commit since the last revision: > > ShenandoahTimingsTracker to support aggregation of cycle times Looks okay, only stylistic comments: src/hotspot/share/gc/shenandoah/shenandoahPhaseTimings.cpp line 142: > 140: void ShenandoahPhaseTimings::set_cycle_data(Phase phase, double time, bool should_aggregate_cycles) { > 141: if (should_aggregate_cycles) { > 142: _cycle_data[phase] = _cycle_data[phase] <= 0 ? time : _cycle_data[phase] + time; I *think* `<= 0` is too broad, and assumes things about the value of `uninitialized()`. Check for `uninitialized()` explicitly. src/hotspot/share/gc/shenandoah/shenandoahUtils.cpp line 127: > 125: const double end_time = os::elapsedTime(); > 126: const double phase_elapsed_time = end_time - _start; > 127: _timings->record_phase_time(_phase, phase_elapsed_time, _should_aggregate_cycles); No need to introduce local variables here, right? The expression can stay inlined. src/hotspot/share/gc/shenandoah/shenandoahUtils.hpp line 69: > 67: ShenandoahPhaseTimings::Phase _parent_phase; > 68: double _start; > 69: bool _should_aggregate_cycles; How about simplifying it to `_should_aggregate`? src/hotspot/share/gc/shenandoah/shenandoahUtils.hpp line 72: > 70: > 71: public: > 72: ShenandoahTimingsTracker(ShenandoahPhaseTimings::Phase phase, bool should_aggregate_cycles=false); Here and everywhere else, need whitespaces: `bool should_aggregate_cycles = false` ------------- PR Review: https://git.openjdk.org/jdk/pull/20318#pullrequestreview-2221968957 PR Review Comment: https://git.openjdk.org/jdk/pull/20318#discussion_r1705945023 PR Review Comment: https://git.openjdk.org/jdk/pull/20318#discussion_r1705943889 PR Review Comment: https://git.openjdk.org/jdk/pull/20318#discussion_r1705943175 PR Review Comment: https://git.openjdk.org/jdk/pull/20318#discussion_r1705943455 From nprasad at openjdk.org Tue Aug 6 19:23:46 2024 From: nprasad at openjdk.org (Neethu Prasad) Date: Tue, 6 Aug 2024 19:23:46 GMT Subject: RFR: 8336742: Shenandoah: Add more verbose logging/stats for mark termination attempts [v3] In-Reply-To: References: Message-ID: > **Revision 2 Notes** > 1. Added time spent on handshaking all threads requesting them to flush their SATB buffers as part of GC stats. > 2. As mentioned in PR feedback, will raise separate PR to adding logging in ShenandoahTimingsTracker. > > **Revision 1 Notes** > This PR adds the following > 1. info logging on number of SATB flush attempts > 3. total time spend on handshaking all threads requesting them to flush their SATB buffers. > > As suggested by William in [JDK-8336742 ](https://bugs.openjdk.org/browse/JDK-83367420), we can use handshake logging to get time spend and other stats for each handshake. > > [4.515s][info][handshake ] Handshake "Shenandoah Flush SATB Handshake", Targeted threads: 1036, Executed by requesting thread: 1035, Total completion time: 597004 ns > [4.517s][info][handshake ] Handshake "Shenandoah Flush SATB Handshake", Targeted threads: 1036, Executed by requesting thread: 1033, Total completion time: 207402 ns > > > **Testing** > 1. tier1, tier2 and hotspot_gc_shenandoah tests. > 2. **-Xlog:gc+stats=info** > > > [37.087s][info][gc,stats] CMR: VM Strong Roots 413 us, workers (us): 64, 57, 52, 47, 38, 31, 30, 25, 20, 21, 17, 10, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, > [37.087s][info][gc,stats] CMR: CLDG Roots 449 us, workers (us): 4, ---, ---, 406, ---, 15, ---, 4, 4, ---, ---, 17, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, > [37.087s][info][gc,stats] Concurrent Marking 5002 us > [37.087s][info][gc,stats] SATB Flush Rendezvous 1748 us > [37.087s][info][gc,stats] Pause Final Mark (G) 57272 us > [37.087s][info][gc,stats] Pause Final Mark (N) 56985 us > [37.087s][info][gc,stats] Finish Mark 387 us > [37.087s][info][gc,stats] Update Region States 109 us > [37.087s][info][gc,stats] Choose Collection Set 56395 us > [37.087s][info][gc,stats] Rebuild Free Set 40 us > > > on app termination > > > [40.640s][info][gc,stats] Concurrent Reset = 0.914 s (a = 65255 us) (n = 14) (lvls, us = 54883, 55859, 63867, 65234, 97096) > [40.640s][info][gc,stats] Pause Init Mark (G) = 1.755 s (a = 125380 us) (n = 14) (lvls, us = 119141, 123047, 125000, 125000, 128042) > [40.640s][info][gc,stats] Pause Init Mark (N) = 1.697 s (a = 121241 us) (n = 14) (lvls, us = 117188, 119141, 121094, 121094, 123880) > ... Neethu Prasad has updated the pull request incrementally with one additional commit since the last revision: Address feedback on code style and uninitialized check ------------- Changes: - all: https://git.openjdk.org/jdk/pull/20318/files - new: https://git.openjdk.org/jdk/pull/20318/files/6e7fdd5e..a7c0514a Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=20318&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=20318&range=01-02 Stats: 17 lines in 4 files changed: 1 ins; 3 del; 13 mod Patch: https://git.openjdk.org/jdk/pull/20318.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20318/head:pull/20318 PR: https://git.openjdk.org/jdk/pull/20318 From kbarrett at openjdk.org Wed Aug 7 04:45:38 2024 From: kbarrett at openjdk.org (Kim Barrett) Date: Wed, 7 Aug 2024 04:45:38 GMT Subject: RFR: 8335925: Serial: Move allocation API from Generation to subclasses [v3] In-Reply-To: References:

Message-ID: <7KXk5bzbb7ONFhufUey4cDGNuqMckpBbQTdclprrr1A=.fdff505e-e920-4b0c-a95b-03bf034a8ef2@github.com> On Fri, 26 Jul 2024 10:18:08 GMT, Albert Mingkun Yang wrote: >> Trivial moving methods from parent class to subclasses. The unused second arg is also removed along the way. The API names are descriptive enough so that the accompanying comments are dropped as well. > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains five additional commits since the last revision: > > - review > - Merge branch 'master' into s1-gen-alloc > - review > - Merge branch 'master' into s1-gen-alloc > - s1-gen-alloc Looks good. Probably the name "Generation" ought to be changed at some point, as part of the described future development. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20084#pullrequestreview-2222616549 From ayang at openjdk.org Wed Aug 7 07:50:38 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 7 Aug 2024 07:50:38 GMT Subject: RFR: 8335925: Serial: Move allocation API from Generation to subclasses [v3] In-Reply-To: References:

Message-ID: On Fri, 26 Jul 2024 10:18:08 GMT, Albert Mingkun Yang wrote: >> Trivial moving methods from parent class to subclasses. The unused second arg is also removed along the way. The API names are descriptive enough so that the accompanying comments are dropped as well. > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains five additional commits since the last revision: > > - review > - Merge branch 'master' into s1-gen-alloc > - review > - Merge branch 'master' into s1-gen-alloc > - s1-gen-alloc Thanks for review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/20084#issuecomment-2272835593 From ayang at openjdk.org Wed Aug 7 07:50:38 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 7 Aug 2024 07:50:38 GMT Subject: Integrated: 8335925: Serial: Move allocation API from Generation to subclasses In-Reply-To: References: Message-ID: On Mon, 8 Jul 2024 20:00:31 GMT, Albert Mingkun Yang wrote: > Trivial moving methods from parent class to subclasses. The unused second arg is also removed along the way. The API names are descriptive enough so that the accompanying comments are dropped as well. This pull request has now been integrated. Changeset: 41f784fe Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/41f784fe63f8e06a25e1fe00dc96e398874adf81 Stats: 57 lines in 7 files changed: 3 ins; 35 del; 19 mod 8335925: Serial: Move allocation API from Generation to subclasses Reviewed-by: gli, kbarrett, iwalulya ------------- PR: https://git.openjdk.org/jdk/pull/20084 From iwalulya at openjdk.org Wed Aug 7 08:21:34 2024 From: iwalulya at openjdk.org (Ivan Walulya) Date: Wed, 7 Aug 2024 08:21:34 GMT Subject: RFR: 8337709: Use allocated states for chunking large array processing In-Reply-To: References: Message-ID: On Fri, 2 Aug 2024 19:36:47 GMT, Kim Barrett wrote: > Please review this change to the G1 young/mixed collector to use allocated > states to encode partial array task chunking. > > States are allocated from per-worker-thread arena+free-list pairs, and > released to the free-list for the worker that completed use. They are > refcounted to track the number of refering tasks. > > Various other approaches (such as a single arena+FreeListAllocator) were > tested, but found to have worse performance, though in some cases fewer > allocations. The per-worker arena+free-list pair was the only option that > didn't show a regression compared to the previous PartialArrayScanTask > approach on a stress test. > > In addition to the changes to ScannerTask to support the new > PartialArrayState, it temporarily continues to support PartialArrayScanTask. > This is because ParallelGC will continue to use the latter until it is changed > to use PartialArrayState. The intent is to update ParallelGC in a followup CR. > > Testing: > mach5 tier1-5 > G1 performance suite LGTM! Was there any observable impact on G1 performance suite? ------------- Marked as reviewed by iwalulya (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20445#pullrequestreview-2223259036 From shade at openjdk.org Wed Aug 7 08:27:35 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 7 Aug 2024 08:27:35 GMT Subject: RFR: 8336742: Shenandoah: Add more verbose logging/stats for mark termination attempts [v3] In-Reply-To: References:

Message-ID: On Tue, 6 Aug 2024 19:23:46 GMT, Neethu Prasad wrote: >> **Revision 2 Notes** >> 1. Added time spent on handshaking all threads requesting them to flush their SATB buffers as part of GC stats. >> 2. As mentioned in PR feedback, will raise separate PR to adding logging in ShenandoahTimingsTracker. >> >> **Revision 1 Notes** >> This PR adds the following >> 1. info logging on number of SATB flush attempts >> 3. total time spend on handshaking all threads requesting them to flush their SATB buffers. >> >> As suggested by William in [JDK-8336742 ](https://bugs.openjdk.org/browse/JDK-83367420), we can use handshake logging to get time spend and other stats for each handshake. >> >> [4.515s][info][handshake ] Handshake "Shenandoah Flush SATB Handshake", Targeted threads: 1036, Executed by requesting thread: 1035, Total completion time: 597004 ns >> [4.517s][info][handshake ] Handshake "Shenandoah Flush SATB Handshake", Targeted threads: 1036, Executed by requesting thread: 1033, Total completion time: 207402 ns >> >> >> **Testing** >> 1. tier1, tier2 and hotspot_gc_shenandoah tests. >> 2. **-Xlog:gc+stats=info** >> >> >> [37.087s][info][gc,stats] CMR: VM Strong Roots 413 us, workers (us): 64, 57, 52, 47, 38, 31, 30, 25, 20, 21, 17, 10, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, >> [37.087s][info][gc,stats] CMR: CLDG Roots 449 us, workers (us): 4, ---, ---, 406, ---, 15, ---, 4, 4, ---, ---, 17, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, >> [37.087s][info][gc,stats] Concurrent Marking 5002 us >> [37.087s][info][gc,stats] SATB Flush Rendezvous 1748 us >> [37.087s][info][gc,stats] Pause Final Mark (G) 57272 us >> [37.087s][info][gc,stats] Pause Final Mark (N) 56985 us >> [37.087s][info][gc,stats] Finish Mark 387 us >> [37.087s][info][gc,stats] Update Region States 109 us >> [37.087s][info][gc,stats] Choose Collection Set 56395 us >> [37.087s][info][gc,stats] Rebuild Free Set 40 us >> >> >> on app termination >> >> >> [40.640s][info][gc,stats] Concurrent Reset = 0.914 s (a = 65255 us) (n = 14) (lvls, us = 54883, 55859, 63867, 65234, 97096) >> [40.640s][info][gc,stats] Pause Init Mark (G) = 1.755 s (a = 125380 us) (n = 14) (lvls, us = 119141, 123047, 125000, 125000, 128042) >> [40.640s][info][gc,stats] Pause Init Mark (N) = 1.697 s (a = 121241 us... > > Neethu Prasad has updated the pull request incrementally with one additional commit since the last revision: > > Address feedback on code style and uninitialized check Looks fine. Consider the remaining nits: src/hotspot/share/gc/shenandoah/shenandoahPhaseTimings.cpp line 143: > 141: const double cycle_data = _cycle_data[phase]; > 142: if (should_aggregate) { > 143: _cycle_data[phase] = (cycle_data == uninitialized()) ? time : cycle_data + time; Suggestion: _cycle_data[phase] = (cycle_data == uninitialized()) ? time : (cycle_data + time); src/hotspot/share/gc/shenandoah/shenandoahUtils.hpp line 69: > 67: ShenandoahPhaseTimings::Phase _parent_phase; > 68: double _start; > 69: bool _should_aggregate; Should probably be `const bool`? ------------- Marked as reviewed by shade (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20318#pullrequestreview-2223286637 PR Review Comment: https://git.openjdk.org/jdk/pull/20318#discussion_r1706594323 PR Review Comment: https://git.openjdk.org/jdk/pull/20318#discussion_r1706595588 From shade at openjdk.org Wed Aug 7 11:57:02 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 7 Aug 2024 11:57:02 GMT Subject: RFR: 8337981: ShenandoahHeap::is_in should check for alive regions Message-ID: The expected behavior of `CollectedHeap::is_in` is to check whether the object belongs to the committed parts of the heap. This is useful to check if object resides in the parts of the heap the GC knows are not dead. Yet, Shenandoah's check just verifies that oop is within the heap bounds. So `is_in` check for an object that is in trashed/empty region would pass by accident, and we will miss detecting bugs. This should be rectified. I also re-wired assertions/verification code to be clear whether we check for heap bounds or actual in-heap conditions. Additional testing: - [ ] Linux AArch64 server fastdebug, `all` with `-XX:+UseShenandoahGC` ------------- Commit messages: - Fix Changes: https://git.openjdk.org/jdk/pull/20492/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=20492&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8337981 Stats: 74 lines in 9 files changed: 35 ins; 0 del; 39 mod Patch: https://git.openjdk.org/jdk/pull/20492.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20492/head:pull/20492 PR: https://git.openjdk.org/jdk/pull/20492 From nprasad at openjdk.org Wed Aug 7 13:19:12 2024 From: nprasad at openjdk.org (Neethu Prasad) Date: Wed, 7 Aug 2024 13:19:12 GMT Subject: RFR: 8336742: Shenandoah: Add more verbose logging/stats for mark termination attempts [v4] In-Reply-To: References: Message-ID: > **Revision 2 Notes** > 1. Added time spent on handshaking all threads requesting them to flush their SATB buffers as part of GC stats. > 2. As mentioned in PR feedback, will raise separate PR to adding logging in ShenandoahTimingsTracker. > > **Revision 1 Notes** > This PR adds the following > 1. info logging on number of SATB flush attempts > 3. total time spend on handshaking all threads requesting them to flush their SATB buffers. > > As suggested by William in [JDK-8336742 ](https://bugs.openjdk.org/browse/JDK-83367420), we can use handshake logging to get time spend and other stats for each handshake. > > [4.515s][info][handshake ] Handshake "Shenandoah Flush SATB Handshake", Targeted threads: 1036, Executed by requesting thread: 1035, Total completion time: 597004 ns > [4.517s][info][handshake ] Handshake "Shenandoah Flush SATB Handshake", Targeted threads: 1036, Executed by requesting thread: 1033, Total completion time: 207402 ns > > > **Testing** > 1. tier1, tier2 and hotspot_gc_shenandoah tests. > 2. **-Xlog:gc+stats=info** > > > [37.087s][info][gc,stats] CMR: VM Strong Roots 413 us, workers (us): 64, 57, 52, 47, 38, 31, 30, 25, 20, 21, 17, 10, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, > [37.087s][info][gc,stats] CMR: CLDG Roots 449 us, workers (us): 4, ---, ---, 406, ---, 15, ---, 4, 4, ---, ---, 17, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, > [37.087s][info][gc,stats] Concurrent Marking 5002 us > [37.087s][info][gc,stats] SATB Flush Rendezvous 1748 us > [37.087s][info][gc,stats] Pause Final Mark (G) 57272 us > [37.087s][info][gc,stats] Pause Final Mark (N) 56985 us > [37.087s][info][gc,stats] Finish Mark 387 us > [37.087s][info][gc,stats] Update Region States 109 us > [37.087s][info][gc,stats] Choose Collection Set 56395 us > [37.087s][info][gc,stats] Rebuild Free Set 40 us > > > on app termination > > > [40.640s][info][gc,stats] Concurrent Reset = 0.914 s (a = 65255 us) (n = 14) (lvls, us = 54883, 55859, 63867, 65234, 97096) > [40.640s][info][gc,stats] Pause Init Mark (G) = 1.755 s (a = 125380 us) (n = 14) (lvls, us = 119141, 123047, 125000, 125000, 128042) > [40.640s][info][gc,stats] Pause Init Mark (N) = 1.697 s (a = 121241 us) (n = 14) (lvls, us = 117188, 119141, 121094, 121094, 123880) > ... Neethu Prasad has updated the pull request incrementally with one additional commit since the last revision: Address feedback on code style ------------- Changes: - all: https://git.openjdk.org/jdk/pull/20318/files - new: https://git.openjdk.org/jdk/pull/20318/files/a7c0514a..9649c2ca Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=20318&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=20318&range=02-03 Stats: 5 lines in 3 files changed: 1 ins; 2 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/20318.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20318/head:pull/20318 PR: https://git.openjdk.org/jdk/pull/20318 From duke at openjdk.org Wed Aug 7 13:42:06 2024 From: duke at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Wed, 7 Aug 2024 13:42:06 GMT Subject: RFR: 8310675: Fix -Wconversion warnings in ZGC code [v2] In-Reply-To: <_eEPqKVsKunCsC5ogqfaHPfgndjqBoJKV1iOsZAYxio=.723085df-731a-4c98-b74a-91575860e1ec@github.com> References: <_eEPqKVsKunCsC5ogqfaHPfgndjqBoJKV1iOsZAYxio=.723085df-731a-4c98-b74a-91575860e1ec@github.com> Message-ID: > Fixed `-Wconversion` warnings in ZGC code, either by adding an explicit type cast, changin the type of the variable or calling an equivalent method with other types. The largest change is the addition of `ZStatDurationSample`, which typecasts `Tickspan::value()` to a `uint64_t` and calls `ZStatSample` to make the code more readable. > > I isolated the `-Wconversion` warnings for ZGC by adding the flag to clangd and displaying the errors in my IDE and going through each file directly associated with ZGC one by one. > > Tested with tiers 1-3. Joel Sikstr?m has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: - Merge branch 'master' into JDK-8310675 - 8310675: Fixed -Wconversion warnings in ZGC ------------- Changes: - all: https://git.openjdk.org/jdk/pull/20406/files - new: https://git.openjdk.org/jdk/pull/20406/files/5c3206cd..2b28a82f Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=20406&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=20406&range=00-01 Stats: 13259 lines in 509 files changed: 6983 ins; 4162 del; 2114 mod Patch: https://git.openjdk.org/jdk/pull/20406.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20406/head:pull/20406 PR: https://git.openjdk.org/jdk/pull/20406 From stefank at openjdk.org Wed Aug 7 13:42:06 2024 From: stefank at openjdk.org (Stefan Karlsson) Date: Wed, 7 Aug 2024 13:42:06 GMT Subject: RFR: 8310675: Fix -Wconversion warnings in ZGC code [v2] In-Reply-To: References: <_eEPqKVsKunCsC5ogqfaHPfgndjqBoJKV1iOsZAYxio=.723085df-731a-4c98-b74a-91575860e1ec@github.com> Message-ID: On Wed, 7 Aug 2024 13:39:15 GMT, Joel Sikstr?m wrote: >> Fixed `-Wconversion` warnings in ZGC code, either by adding an explicit type cast, changin the type of the variable or calling an equivalent method with other types. The largest change is the addition of `ZStatDurationSample`, which typecasts `Tickspan::value()` to a `uint64_t` and calls `ZStatSample` to make the code more readable. >> >> I isolated the `-Wconversion` warnings for ZGC by adding the flag to clangd and displaying the errors in my IDE and going through each file directly associated with ZGC one by one. >> >> Tested with tiers 1-3. > > Joel Sikstr?m has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: > > - Merge branch 'master' into JDK-8310675 > - 8310675: Fixed -Wconversion warnings in ZGC Marked as reviewed by stefank (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/20406#pullrequestreview-2225135283 From duke at openjdk.org Wed Aug 7 13:59:36 2024 From: duke at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Wed, 7 Aug 2024 13:59:36 GMT Subject: RFR: 8310675: Fix -Wconversion warnings in ZGC code [v2] In-Reply-To: References: <_eEPqKVsKunCsC5ogqfaHPfgndjqBoJKV1iOsZAYxio=.723085df-731a-4c98-b74a-91575860e1ec@github.com> Message-ID: On Tue, 6 Aug 2024 14:26:17 GMT, Albert Mingkun Yang wrote: >> Joel Sikstr?m has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: >> >> - Merge branch 'master' into JDK-8310675 >> - 8310675: Fixed -Wconversion warnings in ZGC > > Marked as reviewed by ayang (Reviewer). Thank you for reviews! @albertnetymk @stefank ------------- PR Comment: https://git.openjdk.org/jdk/pull/20406#issuecomment-2273541083 From duke at openjdk.org Wed Aug 7 13:59:37 2024 From: duke at openjdk.org (duke) Date: Wed, 7 Aug 2024 13:59:37 GMT Subject: RFR: 8310675: Fix -Wconversion warnings in ZGC code [v2] In-Reply-To: References: <_eEPqKVsKunCsC5ogqfaHPfgndjqBoJKV1iOsZAYxio=.723085df-731a-4c98-b74a-91575860e1ec@github.com> Message-ID: On Wed, 7 Aug 2024 13:42:06 GMT, Joel Sikstr?m wrote: >> Fixed `-Wconversion` warnings in ZGC code, either by adding an explicit type cast, changin the type of the variable or calling an equivalent method with other types. The largest change is the addition of `ZStatDurationSample`, which typecasts `Tickspan::value()` to a `uint64_t` and calls `ZStatSample` to make the code more readable. >> >> I isolated the `-Wconversion` warnings for ZGC by adding the flag to clangd and displaying the errors in my IDE and going through each file directly associated with ZGC one by one. >> >> Tested with tiers 1-3. > > Joel Sikstr?m has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: > > - Merge branch 'master' into JDK-8310675 > - 8310675: Fixed -Wconversion warnings in ZGC @jsikstro Your change (at version 2b28a82f20ce24d33de4fbe90455aa2ed05249e0) is now ready to be sponsored by a Committer. ------------- PR Comment: https://git.openjdk.org/jdk/pull/20406#issuecomment-2273543069 From duke at openjdk.org Wed Aug 7 14:05:58 2024 From: duke at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Wed, 7 Aug 2024 14:05:58 GMT Subject: RFR: 8337939: ZGC: Make assertions and checks less convoluted and explicit Message-ID: There are currently cases where calls to type converters are made only to assert whether the conversion is reasonable or not and then discarding the result. For example, to_zaddress(...) is used to check if the pointer passed to it is a valid zaddress or not, whilst discarding the result of the conversion. Additionally, a call like oopDesc::is_oop(to_oop(o)) is convoluted since a similar check to is_oop() is already done inside to_oop(), which should be a separate operation in its entirety. Asserts/checks in affected places should be separated so that assertion/checking can be explicitly made and not done more than necessary. Tested with tiers 1-7 on linux64 and linux64-debug. ------------- Commit messages: - Update zPage.inline.hpp - 8337939: ZGC: Make assertions and checks less convoluted and explicit Changes: https://git.openjdk.org/jdk/pull/20478/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=20478&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8337939 Stats: 57 lines in 10 files changed: 32 ins; 8 del; 17 mod Patch: https://git.openjdk.org/jdk/pull/20478.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20478/head:pull/20478 PR: https://git.openjdk.org/jdk/pull/20478 From stefank at openjdk.org Wed Aug 7 14:17:32 2024 From: stefank at openjdk.org (Stefan Karlsson) Date: Wed, 7 Aug 2024 14:17:32 GMT Subject: RFR: 8337939: ZGC: Make assertions and checks less convoluted and explicit In-Reply-To: References: Message-ID: <_661fPm-naPJKMvyhdmi3r2SktldtzcS7ooXTRkhDwg=.de998533-26c7-4af2-ac6b-363632ad3378@github.com> On Tue, 6 Aug 2024 15:15:57 GMT, Joel Sikstr?m wrote: > There are currently cases where calls to type converters are made only to assert whether the conversion is reasonable or not and then discarding the result. For example, to_zaddress(...) is used to check if the pointer passed to it is a valid zaddress or not, whilst discarding the result of the conversion. > > Additionally, a call like oopDesc::is_oop(to_oop(o)) is convoluted since a similar check to is_oop() is already done inside to_oop(), which should be a separate operation in its entirety. > > Asserts/checks in affected places should be separated so that assertion/checking can be explicitly made and not done more than necessary. > > Tested with tiers 1-7 on linux64 and linux64-debug. Changes requested by stefank (Reviewer). src/hotspot/share/gc/z/zVerify.cpp line 122: > 120: const oop obj = cast_to_oop(o); > 121: guarantee(oopDesc::is_oop(obj), BAD_OOP_ARG(o, p)); > 122: } I pre-reviewed this part, but I realize now that I'd like to update the parameter name for the zaddress. Would you mind updating the code this? Suggestion: static void z_verify_root_oop_object(zaddress addr, void* p) { const oop obj = cast_to_oop(addr); guarantee(oopDesc::is_oop(obj), BAD_OOP_ARG(addr, p)); } ------------- PR Review: https://git.openjdk.org/jdk/pull/20478#pullrequestreview-2225333707 PR Review Comment: https://git.openjdk.org/jdk/pull/20478#discussion_r1707089369 From duke at openjdk.org Wed Aug 7 14:18:36 2024 From: duke at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Wed, 7 Aug 2024 14:18:36 GMT Subject: Integrated: 8310675: Fix -Wconversion warnings in ZGC code In-Reply-To: <_eEPqKVsKunCsC5ogqfaHPfgndjqBoJKV1iOsZAYxio=.723085df-731a-4c98-b74a-91575860e1ec@github.com> References: <_eEPqKVsKunCsC5ogqfaHPfgndjqBoJKV1iOsZAYxio=.723085df-731a-4c98-b74a-91575860e1ec@github.com> Message-ID: On Wed, 31 Jul 2024 13:01:50 GMT, Joel Sikstr?m wrote: > Fixed `-Wconversion` warnings in ZGC code, either by adding an explicit type cast, changin the type of the variable or calling an equivalent method with other types. The largest change is the addition of `ZStatDurationSample`, which typecasts `Tickspan::value()` to a `uint64_t` and calls `ZStatSample` to make the code more readable. > > I isolated the `-Wconversion` warnings for ZGC by adding the flag to clangd and displaying the errors in my IDE and going through each file directly associated with ZGC one by one. > > Tested with tiers 1-3. This pull request has now been integrated. Changeset: 21f710e7 Author: Joel Sikstr?m Committer: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/21f710e7f6698b12b06cc3685cefa31f5fcff2a2 Stats: 120 lines in 33 files changed: 5 ins; 0 del; 115 mod 8310675: Fix -Wconversion warnings in ZGC code Reviewed-by: stefank, ayang ------------- PR: https://git.openjdk.org/jdk/pull/20406 From shade at openjdk.org Wed Aug 7 14:28:32 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 7 Aug 2024 14:28:32 GMT Subject: RFR: 8336742: Shenandoah: Add more verbose logging/stats for mark termination attempts [v4] In-Reply-To: References:

Message-ID: On Wed, 7 Aug 2024 13:19:12 GMT, Neethu Prasad wrote: >> **Revision 2 Notes** >> 1. Added time spent on handshaking all threads requesting them to flush their SATB buffers as part of GC stats. >> 2. As mentioned in PR feedback, will raise separate PR to adding logging in ShenandoahTimingsTracker. >> >> **Revision 1 Notes** >> This PR adds the following >> 1. info logging on number of SATB flush attempts >> 3. total time spend on handshaking all threads requesting them to flush their SATB buffers. >> >> As suggested by William in [JDK-8336742 ](https://bugs.openjdk.org/browse/JDK-83367420), we can use handshake logging to get time spend and other stats for each handshake. >> >> [4.515s][info][handshake ] Handshake "Shenandoah Flush SATB Handshake", Targeted threads: 1036, Executed by requesting thread: 1035, Total completion time: 597004 ns >> [4.517s][info][handshake ] Handshake "Shenandoah Flush SATB Handshake", Targeted threads: 1036, Executed by requesting thread: 1033, Total completion time: 207402 ns >> >> >> **Testing** >> 1. tier1, tier2 and hotspot_gc_shenandoah tests. >> 2. **-Xlog:gc+stats=info** >> >> >> [37.087s][info][gc,stats] CMR: VM Strong Roots 413 us, workers (us): 64, 57, 52, 47, 38, 31, 30, 25, 20, 21, 17, 10, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, >> [37.087s][info][gc,stats] CMR: CLDG Roots 449 us, workers (us): 4, ---, ---, 406, ---, 15, ---, 4, 4, ---, ---, 17, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, ---, >> [37.087s][info][gc,stats] Concurrent Marking 5002 us >> [37.087s][info][gc,stats] SATB Flush Rendezvous 1748 us >> [37.087s][info][gc,stats] Pause Final Mark (G) 57272 us >> [37.087s][info][gc,stats] Pause Final Mark (N) 56985 us >> [37.087s][info][gc,stats] Finish Mark 387 us >> [37.087s][info][gc,stats] Update Region States 109 us >> [37.087s][info][gc,stats] Choose Collection Set 56395 us >> [37.087s][info][gc,stats] Rebuild Free Set 40 us >> >> >> on app termination >> >> >> [40.640s][info][gc,stats] Concurrent Reset = 0.914 s (a = 65255 us) (n = 14) (lvls, us = 54883, 55859, 63867, 65234, 97096) >> [40.640s][info][gc,stats] Pause Init Mark (G) = 1.755 s (a = 125380 us) (n = 14) (lvls, us = 119141, 123047, 125000, 125000, 128042) >> [40.640s][info][gc,stats] Pause Init Mark (N) = 1.697 s (a = 121241 us... > > Neethu Prasad has updated the pull request incrementally with one additional commit since the last revision: > > Address feedback on code style Marked as reviewed by shade (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/20318#pullrequestreview-2225370812 From shade at openjdk.org Wed Aug 7 14:57:35 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 7 Aug 2024 14:57:35 GMT Subject: RFR: 8337981: ShenandoahHeap::is_in should check for alive regions In-Reply-To: References: Message-ID: On Wed, 7 Aug 2024 11:51:25 GMT, Aleksey Shipilev wrote: > The expected behavior of `CollectedHeap::is_in` is to check whether the object belongs to the committed parts of the heap. This is useful to check if object resides in the parts of the heap the GC knows are not dead. Yet, Shenandoah's check just verifies that oop is within the heap bounds. So `is_in` check for an object that is in trashed/empty region would pass by accident, and we will miss detecting bugs. This should be rectified. > > I also re-wired assertions/verification code to be clear whether we check for heap bounds or actual in-heap conditions. > > Additional testing: > - [ ] Linux AArch64 server fastdebug, `all` with `-XX:+UseShenandoahGC` Test failures, there are verifier paths that touch dead Reference.referent, apparently. Figuring it out. ------------- PR Comment: https://git.openjdk.org/jdk/pull/20492#issuecomment-2273672661 From shade at openjdk.org Wed Aug 7 17:07:35 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 7 Aug 2024 17:07:35 GMT Subject: RFR: 8335865: Shenandoah: Improve THP pretouch after JDK-8315923 In-Reply-To: References: Message-ID: <1HLgUoF7ByaXQkgUB3UYK35VzxayzTXZl562fDBWKZ8=.3641cb43-c0f4-44c1-bbce-af168d02ead2@github.com> On Fri, 19 Jul 2024 14:28:24 GMT, Neethu Prasad wrote: > **Notes** > os::pretouch is now using madvice now when available and has a fall back to using vm page size [JDK-8315923](https://bugs.openjdk.org/browse/JDK-8315923) > Hence removing code that sets _pretouch_heap_page_size & _pretouch_bitmap_page_size in Shenandoah. > > **Testing** > > * Ran test in Linux 5.10 and Linux 6.x and confirmed that there is no regression. I could not replicate the issue or performance improvement though. [add results] > * Ran [TestTransparentHugePageUsage](https://github.com/openjdk/jdk/commit/a65a89522d2f24b1767e1c74f6689a22ea32ca6a) for Shenandoah and verified that test passed > * Ran tier 1, tier 2 , tier1_gc_shenandoah, tier2_gc_shenandoah, tier3_gc_shenandoah and hotspot_gc_shenandoah. I am approving, since the "problem" appears to be a kernel version between 5.8 and 5.14. So THP is broken there, and MADV_POPULATE_WRITE is still not available. Reading the JDK-8315923 code, it essentially does what this code was doing, so we do not actually regress anything. I think we only need to confirm using the one-liner I had above that >=5.14 really works, and <5.8 does not regress the speed with which we wire up `AnonHugePages`. src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 287: > 285: // Reserve aux bitmap for use in object_iterate(). We don't commit it here. > 286: size_t aux_bitmap_page_size = bitmap_page_size; > 287: I think this newline is unnecessary. ------------- Marked as reviewed by shade (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20254#pullrequestreview-2225728082 PR Review Comment: https://git.openjdk.org/jdk/pull/20254#discussion_r1707457414 From shade at openjdk.org Wed Aug 7 18:48:36 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 7 Aug 2024 18:48:36 GMT Subject: RFR: 8335865: Shenandoah: Improve THP pretouch after JDK-8315923 In-Reply-To: <1HLgUoF7ByaXQkgUB3UYK35VzxayzTXZl562fDBWKZ8=.3641cb43-c0f4-44c1-bbce-af168d02ead2@github.com> References: <1HLgUoF7ByaXQkgUB3UYK35VzxayzTXZl562fDBWKZ8=.3641cb43-c0f4-44c1-bbce-af168d02ead2@github.com> Message-ID: <-pE70sSWPv6wUCDfLwEp1f8ZSbV1N-Gn3lOlcINCxww=.9fe8a8e4-b69b-45cc-a891-7565a6ff8572@github.com> On Wed, 7 Aug 2024 17:04:38 GMT, Aleksey Shipilev wrote: > I think we only need to confirm using the one-liner I had above that >=5.14 really works I confirmed Shenandoah THP+Pretouch works well on my desktop with 5.15, either by default or with `-XX:-UseMadvPopulateWrite`. ------------- PR Comment: https://git.openjdk.org/jdk/pull/20254#issuecomment-2274119099 From shade at openjdk.org Wed Aug 7 18:50:47 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Wed, 7 Aug 2024 18:50:47 GMT Subject: RFR: 8337981: ShenandoahHeap::is_in should check for alive regions [v2] In-Reply-To: References: Message-ID: > The expected behavior of `CollectedHeap::is_in` is to check whether the object belongs to the committed parts of the heap: > https://github.com/openjdk/jdk/blob/d19ba81ce12a99de1114c1bfe67392f5aee2104e/src/hotspot/share/gc/shared/collectedHeap.hpp#L273-L276 > > This is useful to check if object resides in the parts of the heap the GC knows are not dead. Yet, Shenandoah's check just verifies that oop is within the heap bounds. So `is_in` check for an object that is in trashed/empty region would pass by accident, and we will miss detecting bugs. This should be rectified. I believe "committed" is too weak for the test as well, since we really want to know if we can touch the object, i.e. if it is in active region. > > I re-wired assertions/verification code to be clear whether we check for heap bounds or actual in-heap conditions. > > Deeper testing revealed that reference processing code potentially loads a dead referent, but only to null-check it, or ask bitmap about it. Still, more precise `in_heap` check fails asserts in `CompressedOops::decode`. That required a bit of touchup as well. > > Additional testing: > - [x] Linux AArch64 server fastdebug, `all` with `-XX:+UseShenandoahGC` Aleksey Shipilev has updated the pull request incrementally with three additional commits since the last revision: - Style touchups - Fixing ShenandoahReferenceProcessor - Verifier fix ------------- Changes: - all: https://git.openjdk.org/jdk/pull/20492/files - new: https://git.openjdk.org/jdk/pull/20492/files/dbab6d43..69c66853 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=20492&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=20492&range=00-01 Stats: 35 lines in 2 files changed: 22 ins; 7 del; 6 mod Patch: https://git.openjdk.org/jdk/pull/20492.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20492/head:pull/20492 PR: https://git.openjdk.org/jdk/pull/20492 From duke at openjdk.org Wed Aug 7 20:10:03 2024 From: duke at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Wed, 7 Aug 2024 20:10:03 GMT Subject: RFR: 8337939: ZGC: Make assertions and checks less convoluted and explicit [v2] In-Reply-To: References: Message-ID: > There are currently cases where calls to type converters are made only to assert whether the conversion is reasonable or not and then discarding the result. For example, to_zaddress(...) is used to check if the pointer passed to it is a valid zaddress or not, whilst discarding the result of the conversion. > > Additionally, a call like oopDesc::is_oop(to_oop(o)) is convoluted since a similar check to is_oop() is already done inside to_oop(), which should be a separate operation in its entirety. > > Asserts/checks in affected places should be separated so that assertion/checking can be explicitly made and not done more than necessary. > > Tested with tiers 1-7 on linux64 and linux64-debug. Joel Sikstr?m has updated the pull request incrementally with one additional commit since the last revision: Fix zaddress parameter name ------------- Changes: - all: https://git.openjdk.org/jdk/pull/20478/files - new: https://git.openjdk.org/jdk/pull/20478/files/70f13835..42044a86 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=20478&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=20478&range=00-01 Stats: 3 lines in 1 file changed: 0 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/20478.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20478/head:pull/20478 PR: https://git.openjdk.org/jdk/pull/20478 From stefank at openjdk.org Thu Aug 8 07:17:32 2024 From: stefank at openjdk.org (Stefan Karlsson) Date: Thu, 8 Aug 2024 07:17:32 GMT Subject: RFR: 8337939: ZGC: Make assertions and checks less convoluted and explicit [v2] In-Reply-To: References:

Message-ID: On Mon, 5 Aug 2024 14:42:45 GMT, Ivan Walulya wrote: >> Hi all, >> >> Please review this change to assign a single G1CardSet to all young regions. As young regions are collected at the same, and we do not have young-to-young remembered sets, we can maintain a single G1CardSet for all young regions. >> >> This reduces the memory overhead of the G1CardSets and the time taken to merge per region G1CardSets during GC pause. >> >> Testing: Tier 1-5 > > Ivan Walulya has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains seven commits: > > - Albert Review > - Merge remote-tracking branch 'upstream/master' into YoungOnlyCardSet > - Merge remote-tracking branch 'upstream/master' into YoungOnlyCardSet > - cleanup > - merge > - Merge remote-tracking branch 'upstream/master' into YoungOnlyCardSet > - init Marked as reviewed by ayang (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/20134#pullrequestreview-2227206638 From ayang at openjdk.org Thu Aug 8 08:41:08 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Thu, 8 Aug 2024 08:41:08 GMT Subject: RFR: 8338036: Serial: Remove Generation::update_counters Message-ID: Trivial removing redundant code. ------------- Commit messages: - s1-perf-counter Changes: https://git.openjdk.org/jdk/pull/20509/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=20509&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8338036 Stats: 2 lines in 1 file changed: 0 ins; 1 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/20509.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20509/head:pull/20509 PR: https://git.openjdk.org/jdk/pull/20509 From eosterlund at openjdk.org Thu Aug 8 13:29:34 2024 From: eosterlund at openjdk.org (Erik =?UTF-8?B?w5ZzdGVybHVuZA==?=) Date: Thu, 8 Aug 2024 13:29:34 GMT Subject: RFR: 8337939: ZGC: Make assertions and checks less convoluted and explicit [v2] In-Reply-To: References:

Message-ID: On Wed, 7 Aug 2024 20:10:03 GMT, Joel Sikstr?m wrote: >> There are currently cases where calls to type converters are made only to assert whether the conversion is reasonable or not and then discarding the result. For example, to_zaddress(...) is used to check if the pointer passed to it is a valid zaddress or not, whilst discarding the result of the conversion. >> >> Additionally, a call like oopDesc::is_oop(to_oop(o)) is convoluted since a similar check to is_oop() is already done inside to_oop(), which should be a separate operation in its entirety. >> >> Asserts/checks in affected places should be separated so that assertion/checking can be explicitly made and not done more than necessary. >> >> Tested with tiers 1-7 on linux64 and linux64-debug. > > Joel Sikstr?m has updated the pull request incrementally with one additional commit since the last revision: > > Fix zaddress parameter name Looks good. ------------- Marked as reviewed by eosterlund (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20478#pullrequestreview-2227880615 From rcastanedalo at openjdk.org Thu Aug 8 14:17:24 2024 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Thu, 8 Aug 2024 14:17:24 GMT Subject: RFR: 8334060: Implementation of Late Barrier Expansion for G1 [v3] In-Reply-To: References: Message-ID: > This changeset implements JEP 475 (Late Barrier Expansion for G1), including support for the x64 and aarch64 platforms. See the [JEP description](https://openjdk.org/jeps/475) for further detail. > > We aim to integrate this work in JDK 24. The purpose of this pull request is double-fold: > > - to allow maintainers of the arm (32-bit), ppc, riscv, s390, and x86 (32-bit) ports to contribute a port of these platforms in time for JDK 24; and > - to allow reviewers to review the platform-independent, x64 and aarch64, and test changes in parallel with the porting work. > > ## Summary of the Changes > > ### Platform-Independent Changes (`src/hotspot/share`) > > These consist mainly of: > > - a complete rewrite of `G1BarrierSetC2`, to instruct C2 to expand G1 barriers late instead of early; > - a few minor changes to C2 itself, to support removal of redundant decompression operations and to address an OopMap construction issue triggered by this JEP's increased usage of ADL `TEMP` operands; and > - temporary support for porting the JEP to the remaining platforms. > > The temporary support code (guarded by the pre-processor flag `G1_LATE_BARRIER_MIGRATION_SUPPORT`) will **not** be part of the final pull request, and hence does not need to be reviewed. > > ### Platform-Dependent Changes (`src/hotspot/cpu`) > > These include changes to the ADL instruction definitions and the `G1BarrierSetAssembler` class of the x64 and aarch64 platforms. > > #### ADL Changes > > The changeset uses ADL predicates to force C2 to implement memory accesses tagged with barrier information using G1-specific, barrier-aware instruction versions (e.g. `g1StoreP` instead of the GC-agnostic `storeP`). These new instruction versions generate machine code accordingly to the corresponding tagged barrier information, relying on the G1 barrier implementations provided by the `G1BarrierSetAssembler` class. In the aarch64 platform, the bulk of the ADL code is generated from a higher-level version using m4, to reduce redundancy. > > #### `G1BarrierSetAssembler` Changes > > Both platforms basically reuse the barrier implementation for the bytecode interpreter, with the different barrier tests and operations refactored into dedicated functions. Besides this, `G1BarrierSetAssembler` is extended with assembly-stub routines that implement the out-of-line, slow path of the barriers. These routines include calls from the barrier into the JVM, which require support for saving and restoring live registers, provided by the `SaveLiveRegisters` class. This c... Roberto Casta?eda Lozano has updated the pull request incrementally with one additional commit since the last revision: Flatten barrier assembly generation code by removing helpers individual barrier tests and operations ------------- Changes: - all: https://git.openjdk.org/jdk/pull/19746/files - new: https://git.openjdk.org/jdk/pull/19746/files/d722d4c7..20ef68c8 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=19746&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=19746&range=01-02 Stats: 263 lines in 2 files changed: 77 ins; 116 del; 70 mod Patch: https://git.openjdk.org/jdk/pull/19746.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/19746/head:pull/19746 PR: https://git.openjdk.org/jdk/pull/19746 From rcastanedalo at openjdk.org Thu Aug 8 14:23:36 2024 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Thu, 8 Aug 2024 14:23:36 GMT Subject: RFR: 8334060: Implementation of Late Barrier Expansion for G1 [v3] In-Reply-To: References:

Message-ID: On Wed, 19 Jun 2024 08:45:45 GMT, Albert Mingkun Yang wrote: >> Note that if we want to optimize the barrier code layout (see the [JEP description](https://openjdk.org/jeps/475), *Candidate optimizations* sub-section), splitting the assembly of each barrier in at least two blocks is necessary, since we need to separate the inline from the out-of-line (barrier stub) code. And since the assembly code has to be split into multiple functions anyway, I think it makes sense to group the code by logical blocks (different barrier tests, queue insertion, etc.), as proposed in this changeset. This also improves code reuse, e.g. the same `generate_queue_insertion` implementation is used for the pre- and post-barriers. >> If you still think there is value in grouping together the blocks that can be grouped together (e.g. `generate_single_region_test` + `generate_new_val_null_test` + `generate_card_young_test`), I can prototype the refactoring and let the G1 maintainers decide which alternative is more readable/maintainable. > >> This also improves code reuse > > In this area, I think code duplication is less of an issue -- it's more crucial that one can follow the asm flow as if reading real asm. (Ofc, this is subjective; feel free to keep as is.) I'm back from vacation now and resuming my work in this JEP. After some offline discussions, I have pushed a new version (commit 20ef68c81e) without helper functions, except for `generate_queue_insertion()` which is still included. @albertnetymk please have a look and let me know if you find the new style more readable. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/19746#discussion_r1709618766 From duke at openjdk.org Thu Aug 8 14:33:09 2024 From: duke at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Thu, 8 Aug 2024 14:33:09 GMT Subject: RFR: 8337939: ZGC: Make assertions and checks less convoluted and explicit [v3] In-Reply-To: References: Message-ID: > There are currently cases where calls to type converters are made only to assert whether the conversion is reasonable or not and then discarding the result. For example, to_zaddress(...) is used to check if the pointer passed to it is a valid zaddress or not, whilst discarding the result of the conversion. > > Additionally, a call like oopDesc::is_oop(to_oop(o)) is convoluted since a similar check to is_oop() is already done inside to_oop(), which should be a separate operation in its entirety. > > Asserts/checks in affected places should be separated so that assertion/checking can be explicitly made and not done more than necessary. > > Tested with tiers 1-7 on linux64 and linux64-debug. Joel Sikstr?m has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains five additional commits since the last revision: - Merge branch 'master' into zgc_assert_check_cleanup - Update copyright years - Fix zaddress parameter name - Update zPage.inline.hpp - 8337939: ZGC: Make assertions and checks less convoluted and explicit ------------- Changes: - all: https://git.openjdk.org/jdk/pull/20478/files - new: https://git.openjdk.org/jdk/pull/20478/files/42044a86..426c4be6 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=20478&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=20478&range=01-02 Stats: 7463 lines in 163 files changed: 2134 ins; 4812 del; 517 mod Patch: https://git.openjdk.org/jdk/pull/20478.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20478/head:pull/20478 PR: https://git.openjdk.org/jdk/pull/20478 From stefank at openjdk.org Thu Aug 8 15:12:34 2024 From: stefank at openjdk.org (Stefan Karlsson) Date: Thu, 8 Aug 2024 15:12:34 GMT Subject: RFR: 8337939: ZGC: Make assertions and checks less convoluted and explicit [v3] In-Reply-To: References:

Message-ID: On Thu, 8 Aug 2024 14:33:09 GMT, Joel Sikstr?m wrote: >> There are currently cases where calls to type converters are made only to assert whether the conversion is reasonable or not and then discarding the result. For example, to_zaddress(...) is used to check if the pointer passed to it is a valid zaddress or not, whilst discarding the result of the conversion. >> >> Additionally, a call like oopDesc::is_oop(to_oop(o)) is convoluted since a similar check to is_oop() is already done inside to_oop(), which should be a separate operation in its entirety. >> >> Asserts/checks in affected places should be separated so that assertion/checking can be explicitly made and not done more than necessary. >> >> Tested with tiers 1-7 on linux64 and linux64-debug. > > Joel Sikstr?m has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains five additional commits since the last revision: > > - Merge branch 'master' into zgc_assert_check_cleanup > - Update copyright years > - Fix zaddress parameter name > - Update zPage.inline.hpp > - 8337939: ZGC: Make assertions and checks less convoluted and explicit Marked as reviewed by stefank (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/20478#pullrequestreview-2228179678 From rcastanedalo at openjdk.org Thu Aug 8 15:37:19 2024 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Thu, 8 Aug 2024 15:37:19 GMT Subject: RFR: 8334060: Implementation of Late Barrier Expansion for G1 [v4] In-Reply-To: References: Message-ID: > This changeset implements JEP 475 (Late Barrier Expansion for G1), including support for the x64 and aarch64 platforms. See the [JEP description](https://openjdk.org/jeps/475) for further detail. > > We aim to integrate this work in JDK 24. The purpose of this pull request is double-fold: > > - to allow maintainers of the arm (32-bit), ppc, riscv, s390, and x86 (32-bit) ports to contribute a port of these platforms in time for JDK 24; and > - to allow reviewers to review the platform-independent, x64 and aarch64, and test changes in parallel with the porting work. > > ## Summary of the Changes > > ### Platform-Independent Changes (`src/hotspot/share`) > > These consist mainly of: > > - a complete rewrite of `G1BarrierSetC2`, to instruct C2 to expand G1 barriers late instead of early; > - a few minor changes to C2 itself, to support removal of redundant decompression operations and to address an OopMap construction issue triggered by this JEP's increased usage of ADL `TEMP` operands; and > - temporary support for porting the JEP to the remaining platforms. > > The temporary support code (guarded by the pre-processor flag `G1_LATE_BARRIER_MIGRATION_SUPPORT`) will **not** be part of the final pull request, and hence does not need to be reviewed. > > ### Platform-Dependent Changes (`src/hotspot/cpu`) > > These include changes to the ADL instruction definitions and the `G1BarrierSetAssembler` class of the x64 and aarch64 platforms. > > #### ADL Changes > > The changeset uses ADL predicates to force C2 to implement memory accesses tagged with barrier information using G1-specific, barrier-aware instruction versions (e.g. `g1StoreP` instead of the GC-agnostic `storeP`). These new instruction versions generate machine code accordingly to the corresponding tagged barrier information, relying on the G1 barrier implementations provided by the `G1BarrierSetAssembler` class. In the aarch64 platform, the bulk of the ADL code is generated from a higher-level version using m4, to reduce redundancy. > > #### `G1BarrierSetAssembler` Changes > > Both platforms basically reuse the barrier implementation for the bytecode interpreter, with the different barrier tests and operations refactored into dedicated functions. Besides this, `G1BarrierSetAssembler` is extended with assembly-stub routines that implement the out-of-line, slow path of the barriers. These routines include calls from the barrier into the JVM, which require support for saving and restoring live registers, provided by the `SaveLiveRegisters` class. This c... Roberto Casta?eda Lozano has updated the pull request incrementally with one additional commit since the last revision: Also include HOTSPOT_TARGET_CPU_ARCH-based G1 ADL source file ------------- Changes: - all: https://git.openjdk.org/jdk/pull/19746/files - new: https://git.openjdk.org/jdk/pull/19746/files/20ef68c8..47079ea1 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=19746&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=19746&range=02-03 Stats: 1 line in 1 file changed: 1 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/19746.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/19746/head:pull/19746 PR: https://git.openjdk.org/jdk/pull/19746 From rcastanedalo at openjdk.org Thu Aug 8 15:37:19 2024 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Thu, 8 Aug 2024 15:37:19 GMT Subject: RFR: 8334060: Implementation of Late Barrier Expansion for G1 [v2] In-Reply-To: References:

Message-ID: On Sat, 29 Jun 2024 03:51:29 GMT, Amit Kumar wrote: >> make/hotspot/gensrc/GensrcAdlc.gmk line 205: >> >>> 203: ifeq ($(call check-jvm-feature, g1gc), true) >>> 204: AD_SRC_FILES += $(call uniq, $(wildcard $(foreach d, $(AD_SRC_ROOTS), \ >>> 205: $d/cpu/$(HOTSPOT_TARGET_CPU_ARCH)/gc/g1/g1_$(HOTSPOT_TARGET_CPU).ad \ >> >> on s390, `g1_s390.ad` file is not compiled with current code. >> >> Suggestion: >> >> $d/cpu/$(HOTSPOT_TARGET_CPU_ARCH)/gc/g1/g1_$(HOTSPOT_TARGET_CPU_ARCH).ad \ > > I guess this one might be better: > > diff --git a/make/hotspot/gensrc/GensrcAdlc.gmk b/make/hotspot/gensrc/GensrcAdlc.gmk > index e34f0725397..ef9c15b2975 100644 > --- a/make/hotspot/gensrc/GensrcAdlc.gmk > +++ b/make/hotspot/gensrc/GensrcAdlc.gmk > @@ -203,6 +203,7 @@ ifeq ($(call check-jvm-feature, compiler2), true) > ifeq ($(call check-jvm-feature, g1gc), true) > AD_SRC_FILES += $(call uniq, $(wildcard $(foreach d, $(AD_SRC_ROOTS), \ > $d/cpu/$(HOTSPOT_TARGET_CPU_ARCH)/gc/g1/g1_$(HOTSPOT_TARGET_CPU).ad \ > + $d/cpu/$(HOTSPOT_TARGET_CPU_ARCH)/gc/g1/g1_$(HOTSPOT_TARGET_CPU_ARCH).ad \ > ))) > endif > > > Build is fine with both changes, (tested on Mac-M1) Thanks! I went with the second option (commit 47079ea1) for consistency with other collectors. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/19746#discussion_r1709781421 From shade at openjdk.org Thu Aug 8 16:23:39 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Thu, 8 Aug 2024 16:23:39 GMT Subject: RFR: 8336742: Shenandoah: Add more verbose logging/stats for mark termination attempts [v4] In-Reply-To: References:

Message-ID: On Thu, 8 Aug 2024 15:37:19 GMT, Roberto Casta?eda Lozano wrote: >> This changeset implements JEP 475 (Late Barrier Expansion for G1), including support for the x64 and aarch64 platforms. See the [JEP description](https://openjdk.org/jeps/475) for further detail. >> >> We aim to integrate this work in JDK 24. The purpose of this pull request is double-fold: >> >> - to allow maintainers of the arm (32-bit), ppc, riscv, s390, and x86 (32-bit) ports to contribute a port of these platforms in time for JDK 24; and >> - to allow reviewers to review the platform-independent, x64 and aarch64, and test changes in parallel with the porting work. >> >> ## Summary of the Changes >> >> ### Platform-Independent Changes (`src/hotspot/share`) >> >> These consist mainly of: >> >> - a complete rewrite of `G1BarrierSetC2`, to instruct C2 to expand G1 barriers late instead of early; >> - a few minor changes to C2 itself, to support removal of redundant decompression operations and to address an OopMap construction issue triggered by this JEP's increased usage of ADL `TEMP` operands; and >> - temporary support for porting the JEP to the remaining platforms. >> >> The temporary support code (guarded by the pre-processor flag `G1_LATE_BARRIER_MIGRATION_SUPPORT`) will **not** be part of the final pull request, and hence does not need to be reviewed. >> >> ### Platform-Dependent Changes (`src/hotspot/cpu`) >> >> These include changes to the ADL instruction definitions and the `G1BarrierSetAssembler` class of the x64 and aarch64 platforms. >> >> #### ADL Changes >> >> The changeset uses ADL predicates to force C2 to implement memory accesses tagged with barrier information using G1-specific, barrier-aware instruction versions (e.g. `g1StoreP` instead of the GC-agnostic `storeP`). These new instruction versions generate machine code accordingly to the corresponding tagged barrier information, relying on the G1 barrier implementations provided by the `G1BarrierSetAssembler` class. In the aarch64 platform, the bulk of the ADL code is generated from a higher-level version using m4, to reduce redundancy. >> >> #### `G1BarrierSetAssembler` Changes >> >> Both platforms basically reuse the barrier implementation for the bytecode interpreter, with the different barrier tests and operations refactored into dedicated functions. Besides this, `G1BarrierSetAssembler` is extended with assembly-stub routines that implement the out-of-line, slow path of the barriers. These routines include calls from the barrier into the JVM, which require support for saving and restoring live ... > > Roberto Casta?eda Lozano has updated the pull request incrementally with one additional commit since the last revision: > > Also include HOTSPOT_TARGET_CPU_ARCH-based G1 ADL source file Some naming comments/suggestions, up to you. g1_write_barrier_post_c2 generate_c2_post_barrier_stub The latter is the "next" step if slower path is taken. I wonder if it can be renamed to sth like "...write_barrier_post_c2_stub" to make it obvious that they are related. Both "write_barrier_pre" and "pre_write_barrier" exist. It's not obvious whether that is intended (to highlight some diff) or not. ------------- Marked as reviewed by ayang (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/19746#pullrequestreview-2228393022 From duke at openjdk.org Thu Aug 8 17:44:33 2024 From: duke at openjdk.org (duke) Date: Thu, 8 Aug 2024 17:44:33 GMT Subject: RFR: 8337939: ZGC: Make assertions and checks less convoluted and explicit [v3] In-Reply-To: References:

Message-ID: On Wed, 7 Aug 2024 08:18:50 GMT, Ivan Walulya wrote: > Was there any observable impact on G1 performance suite? No, it looked like just the usual random noise. ------------- PR Comment: https://git.openjdk.org/jdk/pull/20445#issuecomment-2276365582 From kbarrett at openjdk.org Fri Aug 9 07:07:35 2024 From: kbarrett at openjdk.org (Kim Barrett) Date: Fri, 9 Aug 2024 07:07:35 GMT Subject: RFR: 8338036: Serial: Remove Generation::update_counters In-Reply-To: References: Message-ID: <6z4eo6-KCKUKXeZv23ifKvLX1ZQACNFZBBcMxO5BC34=.fb5403be-03ea-4aaf-a3dc-ad40f5666275@github.com> On Thu, 8 Aug 2024 08:35:40 GMT, Albert Mingkun Yang wrote: > Trivial removing redundant code. Looks good, and trivial. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20509#pullrequestreview-2229461965 From duke at openjdk.org Fri Aug 9 07:32:41 2024 From: duke at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Fri, 9 Aug 2024 07:32:41 GMT Subject: RFR: 8337939: ZGC: Make assertions and checks less convoluted and explicit [v3] In-Reply-To: References:

Message-ID: On Thu, 8 Aug 2024 15:10:24 GMT, Stefan Karlsson wrote: >> Joel Sikstr?m has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains five additional commits since the last revision: >> >> - Merge branch 'master' into zgc_assert_check_cleanup >> - Update copyright years >> - Fix zaddress parameter name >> - Update zPage.inline.hpp >> - 8337939: ZGC: Make assertions and checks less convoluted and explicit > > Marked as reviewed by stefank (Reviewer). Thank you for the reviews! @stefank, @albertnetymk, @fisk ------------- PR Comment: https://git.openjdk.org/jdk/pull/20478#issuecomment-2276343869 From duke at openjdk.org Fri Aug 9 07:32:41 2024 From: duke at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Fri, 9 Aug 2024 07:32:41 GMT Subject: Integrated: 8337939: ZGC: Make assertions and checks less convoluted and explicit In-Reply-To: References: Message-ID: On Tue, 6 Aug 2024 15:15:57 GMT, Joel Sikstr?m wrote: > There are currently cases where calls to type converters are made only to assert whether the conversion is reasonable or not and then discarding the result. For example, to_zaddress(...) is used to check if the pointer passed to it is a valid zaddress or not, whilst discarding the result of the conversion. > > Additionally, a call like oopDesc::is_oop(to_oop(o)) is convoluted since a similar check to is_oop() is already done inside to_oop(), which should be a separate operation in its entirety. > > Asserts/checks in affected places should be separated so that assertion/checking can be explicitly made and not done more than necessary. > > Tested with tiers 1-7 on linux64 and linux64-debug. This pull request has now been integrated. Changeset: f74109bd Author: Joel Sikstr?m URL: https://git.openjdk.org/jdk/commit/f74109bd178c92a9dff1ca6fce03b25f51a0384f Stats: 63 lines in 10 files changed: 32 ins; 8 del; 23 mod 8337939: ZGC: Make assertions and checks less convoluted and explicit Reviewed-by: stefank, ayang, eosterlund ------------- PR: https://git.openjdk.org/jdk/pull/20478 From ayang at openjdk.org Fri Aug 9 08:28:42 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Fri, 9 Aug 2024 08:28:42 GMT Subject: RFR: 8338036: Serial: Remove Generation::update_counters In-Reply-To: References: Message-ID: On Thu, 8 Aug 2024 08:35:40 GMT, Albert Mingkun Yang wrote: > Trivial removing redundant code. Thanks for review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/20509#issuecomment-2277422911 From ayang at openjdk.org Fri Aug 9 08:28:42 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Fri, 9 Aug 2024 08:28:42 GMT Subject: Integrated: 8338036: Serial: Remove Generation::update_counters In-Reply-To: References: Message-ID: On Thu, 8 Aug 2024 08:35:40 GMT, Albert Mingkun Yang wrote: > Trivial removing redundant code. This pull request has now been integrated. Changeset: 6ebd5d74 Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/6ebd5d74d57b334e7cf0b1282d7bb469a56fb3d6 Stats: 2 lines in 1 file changed: 0 ins; 1 del; 1 mod 8338036: Serial: Remove Generation::update_counters Reviewed-by: kbarrett ------------- PR: https://git.openjdk.org/jdk/pull/20509 From tschatzl at openjdk.org Fri Aug 9 09:43:32 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 9 Aug 2024 09:43:32 GMT Subject: RFR: 8337709: Use allocated states for chunking large array processing In-Reply-To: References: Message-ID: On Fri, 2 Aug 2024 19:36:47 GMT, Kim Barrett wrote: > Please review this change to the G1 young/mixed collector to use allocated > states to encode partial array task chunking. > > States are allocated from per-worker-thread arena+free-list pairs, and > released to the free-list for the worker that completed use. They are > refcounted to track the number of refering tasks. > > Various other approaches (such as a single arena+FreeListAllocator) were > tested, but found to have worse performance, though in some cases fewer > allocations. The per-worker arena+free-list pair was the only option that > didn't show a regression compared to the previous PartialArrayScanTask > approach on a stress test. > > In addition to the changes to ScannerTask to support the new > PartialArrayState, it temporarily continues to support PartialArrayScanTask. > This is because ParallelGC will continue to use the latter until it is changed > to use PartialArrayState. The intent is to update ParallelGC in a followup CR. > > Testing: > mach5 tier1-5 > G1 performance suite Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/20445#pullrequestreview-2229794656 From rcastanedalo at openjdk.org Fri Aug 9 11:48:17 2024 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Fri, 9 Aug 2024 11:48:17 GMT Subject: RFR: 8334060: Implementation of Late Barrier Expansion for G1 [v5] In-Reply-To: References: Message-ID: > This changeset implements JEP 475 (Late Barrier Expansion for G1), including support for the x64 and aarch64 platforms. See the [JEP description](https://openjdk.org/jeps/475) for further detail. > > We aim to integrate this work in JDK 24. The purpose of this pull request is double-fold: > > - to allow maintainers of the arm (32-bit), ppc, riscv, s390, and x86 (32-bit) ports to contribute a port of these platforms in time for JDK 24; and > - to allow reviewers to review the platform-independent, x64 and aarch64, and test changes in parallel with the porting work. > > ## Summary of the Changes > > ### Platform-Independent Changes (`src/hotspot/share`) > > These consist mainly of: > > - a complete rewrite of `G1BarrierSetC2`, to instruct C2 to expand G1 barriers late instead of early; > - a few minor changes to C2 itself, to support removal of redundant decompression operations and to address an OopMap construction issue triggered by this JEP's increased usage of ADL `TEMP` operands; and > - temporary support for porting the JEP to the remaining platforms. > > The temporary support code (guarded by the pre-processor flag `G1_LATE_BARRIER_MIGRATION_SUPPORT`) will **not** be part of the final pull request, and hence does not need to be reviewed. > > ### Platform-Dependent Changes (`src/hotspot/cpu`) > > These include changes to the ADL instruction definitions and the `G1BarrierSetAssembler` class of the x64 and aarch64 platforms. > > #### ADL Changes > > The changeset uses ADL predicates to force C2 to implement memory accesses tagged with barrier information using G1-specific, barrier-aware instruction versions (e.g. `g1StoreP` instead of the GC-agnostic `storeP`). These new instruction versions generate machine code accordingly to the corresponding tagged barrier information, relying on the G1 barrier implementations provided by the `G1BarrierSetAssembler` class. In the aarch64 platform, the bulk of the ADL code is generated from a higher-level version using m4, to reduce redundancy. > > #### `G1BarrierSetAssembler` Changes > > Both platforms basically reuse the barrier implementation for the bytecode interpreter, with the different barrier tests and operations refactored into dedicated functions. Besides this, `G1BarrierSetAssembler` is extended with assembly-stub routines that implement the out-of-line, slow path of the barriers. These routines include calls from the barrier into the JVM, which require support for saving and restoring live registers, provided by the `SaveLiveRegisters` class. This c... Roberto Casta?eda Lozano has updated the pull request incrementally with one additional commit since the last revision: Give barrier generation helper functions a more consistent name ------------- Changes: - all: https://git.openjdk.org/jdk/pull/19746/files - new: https://git.openjdk.org/jdk/pull/19746/files/47079ea1..1834bf41 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=19746&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=19746&range=03-04 Stats: 455 lines in 3 files changed: 0 ins; 0 del; 455 mod Patch: https://git.openjdk.org/jdk/pull/19746.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/19746/head:pull/19746 PR: https://git.openjdk.org/jdk/pull/19746 From rcastanedalo at openjdk.org Fri Aug 9 11:52:35 2024 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Fri, 9 Aug 2024 11:52:35 GMT Subject: RFR: 8334060: Implementation of Late Barrier Expansion for G1 [v5] In-Reply-To: References:

Message-ID: On Fri, 9 Aug 2024 11:48:17 GMT, Roberto Casta?eda Lozano wrote: >> This changeset implements JEP 475 (Late Barrier Expansion for G1), including support for the x64 and aarch64 platforms. See the [JEP description](https://openjdk.org/jeps/475) for further detail. >> >> We aim to integrate this work in JDK 24. The purpose of this pull request is double-fold: >> >> - to allow maintainers of the arm (32-bit), ppc, riscv, s390, and x86 (32-bit) ports to contribute a port of these platforms in time for JDK 24; and >> - to allow reviewers to review the platform-independent, x64 and aarch64, and test changes in parallel with the porting work. >> >> ## Summary of the Changes >> >> ### Platform-Independent Changes (`src/hotspot/share`) >> >> These consist mainly of: >> >> - a complete rewrite of `G1BarrierSetC2`, to instruct C2 to expand G1 barriers late instead of early; >> - a few minor changes to C2 itself, to support removal of redundant decompression operations and to address an OopMap construction issue triggered by this JEP's increased usage of ADL `TEMP` operands; and >> - temporary support for porting the JEP to the remaining platforms. >> >> The temporary support code (guarded by the pre-processor flag `G1_LATE_BARRIER_MIGRATION_SUPPORT`) will **not** be part of the final pull request, and hence does not need to be reviewed. >> >> ### Platform-Dependent Changes (`src/hotspot/cpu`) >> >> These include changes to the ADL instruction definitions and the `G1BarrierSetAssembler` class of the x64 and aarch64 platforms. >> >> #### ADL Changes >> >> The changeset uses ADL predicates to force C2 to implement memory accesses tagged with barrier information using G1-specific, barrier-aware instruction versions (e.g. `g1StoreP` instead of the GC-agnostic `storeP`). These new instruction versions generate machine code accordingly to the corresponding tagged barrier information, relying on the G1 barrier implementations provided by the `G1BarrierSetAssembler` class. In the aarch64 platform, the bulk of the ADL code is generated from a higher-level version using m4, to reduce redundancy. >> >> #### `G1BarrierSetAssembler` Changes >> >> Both platforms basically reuse the barrier implementation for the bytecode interpreter, with the different barrier tests and operations refactored into dedicated functions. Besides this, `G1BarrierSetAssembler` is extended with assembly-stub routines that implement the out-of-line, slow path of the barriers. These routines include calls from the barrier into the JVM, which require support for saving and restoring live ... > > Roberto Casta?eda Lozano has updated the pull request incrementally with one additional commit since the last revision: > > Give barrier generation helper functions a more consistent name Thanks for reviewing, Albert! > ``` > g1_write_barrier_post_c2 > generate_c2_post_barrier_stub > ``` > > The latter is the "next" step if slower path is taken. I wonder if it can be renamed to sth like "...write_barrier_post_c2_stub" to make it obvious that they are related. I agree with your suggestion, but will postpone it to a follow-up task to avoid interfering with the ongoing port work (the names are dictated by the platform-independent `G1PreBarrierStubC2::emit_code()` and `G1PostBarrierStubC2::emit_code()` functions, so a name change would affect every platform). > Both "write_barrier_pre" and "pre_write_barrier" exist. It's not obvious whether that is intended (to highlight some diff) or not. This is accidental, as far as I can see. `write_barrier_pre` is the pre-existing name for the interpreter barrier generation functions, I would rather leave it as-is to avoid making this changeset even larger. Instead, I have renamed the helper functions `g1_pre_write_barrier()` and `g1_post_write_barrier()` to `write_barrier_pre()` and `write_barrier_post()`, for consistency (and dropped `g1_` since it is obvious from the context) in commit 1834bf4. ------------- PR Comment: https://git.openjdk.org/jdk/pull/19746#issuecomment-2277770042 From rcastanedalo at openjdk.org Fri Aug 9 12:03:37 2024 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Fri, 9 Aug 2024 12:03:37 GMT Subject: RFR: 8334060: Implementation of Late Barrier Expansion for G1 [v2] In-Reply-To: <4c-MLXwKcNcSnloSkYkuk3gnv3ux5i5beS51Fd9Z8MQ=.cd0a7eba-ff26-4855-a01c-d1ae5182100b@github.com> References:

<4c-MLXwKcNcSnloSkYkuk3gnv3ux5i5beS51Fd9Z8MQ=.cd0a7eba-ff26-4855-a01c-d1ae5182100b@github.com> Message-ID: <5Q8PqULlpKfoPLXRqI0ua0dVWAy3zPBqtFpycNwBg0Y=.f2830c84-63ba-43cd-85e3-2245e4ac8917@github.com> On Sun, 21 Jul 2024 08:21:39 GMT, Martin Doerr wrote: >> Roberto Casta?eda Lozano has updated the pull request incrementally with one additional commit since the last revision: >> >> Build barrier data in G1BarrierSetC2::get_store_barrier() by adding, rather than removing, barrier tags > > src/hotspot/cpu/x86/gc/g1/g1_x86_64.ad line 86: > >> 84: // an indirect memory operand) to reduce C2's scheduling and register >> 85: // allocation pressure (fewer Mach nodes). The same holds for g1StoreN and >> 86: // g1EncodePAndStoreN. > > I'm not convinced that this is beneficial. We're wasting a temp register just for an addition? I agree that using indirect memory operands is the most readable choice, and is slightly less wasteful from a register usage perspective. However, when I tried this choice a couple of months ago, I observed timeouts in some CTW runs, which as far as I remember were caused when LCM processed huge basic blocks with lots of memory writes (e.g. arising from static initializations of large String arrays such as in [here](https://github.com/apache/lucene/blob/ea562f6ef2b32fe6eadf57c6381d9a69acb043c7/lucene/analysis/common/src/java/org/apache/lucene/analysis/en/KStemData1.java#L47-L748)), in combination with C2 stress options. In these scenarios, the large number of additional Mach nodes seemed to cause the timeouts. I settled for materializing the store address internally to guard against such corner cases. I did not see any significant performance difference between the two choices in my benchmark results. I would like to study whether LCM can be made more robust in this scenario, which would enable using indirect memory operands here, but I think this would be best addressed in a separate RFE. Would it be OK by now to extend the code comment with the details provided in the above explanation? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/19746#discussion_r1711337413 From duke at openjdk.org Fri Aug 9 12:57:44 2024 From: duke at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Fri, 9 Aug 2024 12:57:44 GMT Subject: RFR: 8337938: ZUtils::alloc_aligned allocates without reporting to NMT Message-ID: Replaces usage of posix_memalign/_aligned_malloc with os::malloc and manual alignment to report memory usage to NMT. Manually aligning the memory makes the returned address unfreeable by malloc (as clarified by the added comment), which is reasonable since the memory used by ZUtils::alloc_aligned is never freed. Tested with tiers 1-3. ------------- Commit messages: - Remove trailing whitespace - 8337938: ZUtils::alloc_aligned allocates without reporting to NMT Changes: https://git.openjdk.org/jdk/pull/20523/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=20523&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8337938 Stats: 101 lines in 6 files changed: 13 ins; 83 del; 5 mod Patch: https://git.openjdk.org/jdk/pull/20523.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20523/head:pull/20523 PR: https://git.openjdk.org/jdk/pull/20523 From stefank at openjdk.org Fri Aug 9 13:30:32 2024 From: stefank at openjdk.org (Stefan Karlsson) Date: Fri, 9 Aug 2024 13:30:32 GMT Subject: RFR: 8337938: ZUtils::alloc_aligned allocates without reporting to NMT In-Reply-To: References: Message-ID: On Fri, 9 Aug 2024 12:47:18 GMT, Joel Sikstr?m wrote: > Replaces usage of posix_memalign/_aligned_malloc with os::malloc and manual alignment to report memory usage to NMT. Manually aligning the memory makes the returned address unfreeable by malloc (as clarified by the added comment), which is reasonable since the memory used by ZUtils::alloc_aligned is never freed. > > Tested with tiers 1-3. Looks good. ------------- Marked as reviewed by stefank (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20523#pullrequestreview-2230191554 From mdoerr at openjdk.org Fri Aug 9 14:08:34 2024 From: mdoerr at openjdk.org (Martin Doerr) Date: Fri, 9 Aug 2024 14:08:34 GMT Subject: RFR: 8334060: Implementation of Late Barrier Expansion for G1 [v2] In-Reply-To: <5Q8PqULlpKfoPLXRqI0ua0dVWAy3zPBqtFpycNwBg0Y=.f2830c84-63ba-43cd-85e3-2245e4ac8917@github.com> References:

<4c-MLXwKcNcSnloSkYkuk3gnv3ux5i5beS51Fd9Z8MQ=.cd0a7eba-ff26-4855-a01c-d1ae5182100b@github.com> <5Q8PqULlpKfoPLXRqI0ua0dVWAy3zPBqtFpycNwBg0Y=.f2830c84-63ba-43cd-85e3-2245e4ac8917@github.com> Message-ID: On Fri, 9 Aug 2024 12:00:26 GMT, Roberto Casta?eda Lozano wrote: >> src/hotspot/cpu/x86/gc/g1/g1_x86_64.ad line 86: >> >>> 84: // an indirect memory operand) to reduce C2's scheduling and register >>> 85: // allocation pressure (fewer Mach nodes). The same holds for g1StoreN and >>> 86: // g1EncodePAndStoreN. >> >> I'm not convinced that this is beneficial. We're wasting a temp register just for an addition? > > I agree that using indirect memory operands is the most readable choice, and is slightly less wasteful from a register usage perspective. However, when I tried this choice a couple of months ago, I observed timeouts in some CTW runs, which as far as I remember were caused when LCM processed huge basic blocks with lots of memory writes (e.g. arising from static initializations of large String arrays such as in [here](https://github.com/apache/lucene/blob/ea562f6ef2b32fe6eadf57c6381d9a69acb043c7/lucene/analysis/common/src/java/org/apache/lucene/analysis/en/KStemData1.java#L47-L748)), in combination with C2 stress options. In these scenarios, the large number of additional Mach nodes seemed to cause the timeouts. I settled for materializing the store address internally to guard against such corner cases. I did not see any significant performance difference between the two choices in my benchmark results. > > I would like to study whether LCM can be made more robust in this scenario, which would enable using indirect memory operands here, but I think this would be best addressed in a separate RFE. Would it be OK by now to extend the code comment with the details provided in the above explanation? Ok, doing it in a separate RFE is fine with me. This sounds like a C2 problem which should get investigated. It may cause other performance problems, too. Maybe a native profiler can show what takes too much time. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/19746#discussion_r1711536279 From nprasad at openjdk.org Fri Aug 9 14:54:26 2024 From: nprasad at openjdk.org (Neethu Prasad) Date: Fri, 9 Aug 2024 14:54:26 GMT Subject: RFR: 8336299: Improve GCLocker stall diagnostics [v5] In-Reply-To: References: Message-ID: > **Notes** > Adding logs to get more visibility into how fast a thread resumes from allocation stall. > > **Testing** > * tier 1, tier 2, hotspot_gc tests. > > Example log messages > > 1. Last thread exiting. Performing GC after exiting critical section. Thread "main" 0 locked. > > 2. Thread exiting critical region Thread "main" 0 locked. > > 3. Thread stalled by JNI critical section. Resumed after 586ms. Thread "Thread-0". > > 4. Thread blocked to enter critical region. Resumed after 1240ms. Thread "SIGINT handler". Neethu Prasad has updated the pull request incrementally with one additional commit since the last revision: address code style feedback ------------- Changes: - all: https://git.openjdk.org/jdk/pull/20277/files - new: https://git.openjdk.org/jdk/pull/20277/files/c53dc9cf..77fd9d55 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=20277&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=20277&range=03-04 Stats: 21 lines in 1 file changed: 4 ins; 7 del; 10 mod Patch: https://git.openjdk.org/jdk/pull/20277.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20277/head:pull/20277 PR: https://git.openjdk.org/jdk/pull/20277 From btaylor at openjdk.org Fri Aug 9 17:58:58 2024 From: btaylor at openjdk.org (Ben Taylor) Date: Fri, 9 Aug 2024 17:58:58 GMT Subject: RFR: 8337815: Relax G1EvacStats atomic operations Message-ID: This PR should slightly improve the performance of G1EvacStats by using `memory_order_relaxed` instead of the default `memory_order_conservative`. Since the original bug report says >I doubt it would show on benchmarks, this is a paper-cut issue. I haven't benchmarked this change for performance. The change passes all tests in `gc/g1` locally on x86_64 linux. ------------- Commit messages: - 8337815: Relax G1EvacStats atomic operations Changes: https://git.openjdk.org/jdk/pull/20529/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=20529&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8337815 Stats: 7 lines in 1 file changed: 0 ins; 0 del; 7 mod Patch: https://git.openjdk.org/jdk/pull/20529.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20529/head:pull/20529 PR: https://git.openjdk.org/jdk/pull/20529 From kbarrett at openjdk.org Sun Aug 11 16:40:30 2024 From: kbarrett at openjdk.org (Kim Barrett) Date: Sun, 11 Aug 2024 16:40:30 GMT Subject: RFR: 8337815: Relax G1EvacStats atomic operations In-Reply-To: References: Message-ID: On Fri, 9 Aug 2024 17:46:43 GMT, Ben Taylor wrote: > This PR should slightly improve the performance of G1EvacStats by using `memory_order_relaxed` instead of the default `memory_order_conservative`. > Since the original bug report says > >>I doubt it would show on benchmarks, this is a paper-cut issue. > > I haven't benchmarked this change for performance. > > The change passes all tests in `gc/g1` locally on x86_64 linux. Looks good. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20529#pullrequestreview-2231854153 From kbarrett at openjdk.org Sun Aug 11 18:27:35 2024 From: kbarrett at openjdk.org (Kim Barrett) Date: Sun, 11 Aug 2024 18:27:35 GMT Subject: RFR: 8337709: Use allocated states for chunking large array processing In-Reply-To: References:

Message-ID: On Wed, 7 Aug 2024 08:18:50 GMT, Ivan Walulya wrote: >> Please review this change to the G1 young/mixed collector to use allocated >> states to encode partial array task chunking. >> >> States are allocated from per-worker-thread arena+free-list pairs, and >> released to the free-list for the worker that completed use. They are >> refcounted to track the number of refering tasks. >> >> Various other approaches (such as a single arena+FreeListAllocator) were >> tested, but found to have worse performance, though in some cases fewer >> allocations. The per-worker arena+free-list pair was the only option that >> didn't show a regression compared to the previous PartialArrayScanTask >> approach on a stress test. >> >> In addition to the changes to ScannerTask to support the new >> PartialArrayState, it temporarily continues to support PartialArrayScanTask. >> This is because ParallelGC will continue to use the latter until it is changed >> to use PartialArrayState. The intent is to update ParallelGC in a followup CR. >> >> Testing: >> mach5 tier1-5 >> G1 performance suite > > LGTM! > > Was there any observable impact on G1 performance suite? Thanks for reviews @walulyai and @tschatzl ------------- PR Comment: https://git.openjdk.org/jdk/pull/20445#issuecomment-2282846972 From kbarrett at openjdk.org Sun Aug 11 18:36:38 2024 From: kbarrett at openjdk.org (Kim Barrett) Date: Sun, 11 Aug 2024 18:36:38 GMT Subject: Integrated: 8337709: Use allocated states for chunking large array processing In-Reply-To: References: Message-ID: On Fri, 2 Aug 2024 19:36:47 GMT, Kim Barrett wrote: > Please review this change to the G1 young/mixed collector to use allocated > states to encode partial array task chunking. > > States are allocated from per-worker-thread arena+free-list pairs, and > released to the free-list for the worker that completed use. They are > refcounted to track the number of refering tasks. > > Various other approaches (such as a single arena+FreeListAllocator) were > tested, but found to have worse performance, though in some cases fewer > allocations. The per-worker arena+free-list pair was the only option that > didn't show a regression compared to the previous PartialArrayScanTask > approach on a stress test. > > In addition to the changes to ScannerTask to support the new > PartialArrayState, it temporarily continues to support PartialArrayScanTask. > This is because ParallelGC will continue to use the latter until it is changed > to use PartialArrayState. The intent is to update ParallelGC in a followup CR. > > Testing: > mach5 tier1-5 > G1 performance suite This pull request has now been integrated. Changeset: 6a3d0452 Author: Kim Barrett URL: https://git.openjdk.org/jdk/commit/6a3d045221c338fefec9bd59245324eae60b156b Stats: 501 lines in 9 files changed: 356 ins; 57 del; 88 mod 8337709: Use allocated states for chunking large array processing Reviewed-by: iwalulya, tschatzl ------------- PR: https://git.openjdk.org/jdk/pull/20445 From kbarrett at openjdk.org Mon Aug 12 05:21:32 2024 From: kbarrett at openjdk.org (Kim Barrett) Date: Mon, 12 Aug 2024 05:21:32 GMT Subject: RFR: 8337938: ZUtils::alloc_aligned allocates without reporting to NMT In-Reply-To: References: Message-ID: On Fri, 9 Aug 2024 12:47:18 GMT, Joel Sikstr?m wrote: > Replaces usage of posix_memalign/_aligned_malloc with os::malloc and manual alignment to report memory usage to NMT. Manually aligning the memory makes the returned address unfreeable by malloc (as clarified by the added comment), which is reasonable since the memory used by ZUtils::alloc_aligned is never freed. > > Tested with tiers 1-3. Looks good, except for some copyrights needing update. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20523#pullrequestreview-2232062157 From amitkumar at openjdk.org Mon Aug 12 05:25:33 2024 From: amitkumar at openjdk.org (Amit Kumar) Date: Mon, 12 Aug 2024 05:25:33 GMT Subject: RFR: 8334060: Implementation of Late Barrier Expansion for G1 [v5] In-Reply-To: References:

Message-ID: On Fri, 9 Aug 2024 11:48:17 GMT, Roberto Casta?eda Lozano wrote: >> This changeset implements JEP 475 (Late Barrier Expansion for G1), including support for the x64 and aarch64 platforms. See the [JEP description](https://openjdk.org/jeps/475) for further detail. >> >> We aim to integrate this work in JDK 24. The purpose of this pull request is double-fold: >> >> - to allow maintainers of the arm (32-bit), ppc, riscv, s390, and x86 (32-bit) ports to contribute a port of these platforms in time for JDK 24; and >> - to allow reviewers to review the platform-independent, x64 and aarch64, and test changes in parallel with the porting work. >> >> ## Summary of the Changes >> >> ### Platform-Independent Changes (`src/hotspot/share`) >> >> These consist mainly of: >> >> - a complete rewrite of `G1BarrierSetC2`, to instruct C2 to expand G1 barriers late instead of early; >> - a few minor changes to C2 itself, to support removal of redundant decompression operations and to address an OopMap construction issue triggered by this JEP's increased usage of ADL `TEMP` operands; and >> - temporary support for porting the JEP to the remaining platforms. >> >> The temporary support code (guarded by the pre-processor flag `G1_LATE_BARRIER_MIGRATION_SUPPORT`) will **not** be part of the final pull request, and hence does not need to be reviewed. >> >> ### Platform-Dependent Changes (`src/hotspot/cpu`) >> >> These include changes to the ADL instruction definitions and the `G1BarrierSetAssembler` class of the x64 and aarch64 platforms. >> >> #### ADL Changes >> >> The changeset uses ADL predicates to force C2 to implement memory accesses tagged with barrier information using G1-specific, barrier-aware instruction versions (e.g. `g1StoreP` instead of the GC-agnostic `storeP`). These new instruction versions generate machine code accordingly to the corresponding tagged barrier information, relying on the G1 barrier implementations provided by the `G1BarrierSetAssembler` class. In the aarch64 platform, the bulk of the ADL code is generated from a higher-level version using m4, to reduce redundancy. >> >> #### `G1BarrierSetAssembler` Changes >> >> Both platforms basically reuse the barrier implementation for the bytecode interpreter, with the different barrier tests and operations refactored into dedicated functions. Besides this, `G1BarrierSetAssembler` is extended with assembly-stub routines that implement the out-of-line, slow path of the barriers. These routines include calls from the barrier into the JVM, which require support for saving and restoring live ... > > Roberto Casta?eda Lozano has updated the pull request incrementally with one additional commit since the last revision: > > Give barrier generation helper functions a more consistent name is there issue if we replace this code: if (in_bytes(SATBMarkQueue::byte_width_of_active()) == 4) { __ ldrw(rscratch1, in_progress); } else { assert(in_bytes(SATBMarkQueue::byte_width_of_active()) == 1, "Assumption"); __ ldrb(rscratch1, in_progress); } in method `G1BarrierSetAssembler::gen_write_ref_array_pre_barrier` with `generate_queue_test_and_insertion(masm, rthread, rscratch1)` ? Though you have to move the `gen_write_ref_array_pre_barrier` on top otherwise compiler wouldn't be able to find it. ------------- PR Review: https://git.openjdk.org/jdk/pull/19746#pullrequestreview-2232065079 From duke at openjdk.org Mon Aug 12 06:29:48 2024 From: duke at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Mon, 12 Aug 2024 06:29:48 GMT Subject: RFR: 8337938: ZUtils::alloc_aligned allocates without reporting to NMT [v2] In-Reply-To: References: Message-ID: > Replaces usage of posix_memalign/_aligned_malloc with os::malloc and manual alignment to report memory usage to NMT. Manually aligning the memory makes the returned address unfreeable by malloc (as clarified by the added comment), which is reasonable since the memory used by ZUtils::alloc_aligned is never freed. > > Tested with tiers 1-3. Joel Sikstr?m has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: - Merge branch 'master' into zgc_zutils_alloc_aligned - Updated copyright years - Remove trailing whitespace - 8337938: ZUtils::alloc_aligned allocates without reporting to NMT ------------- Changes: - all: https://git.openjdk.org/jdk/pull/20523/files - new: https://git.openjdk.org/jdk/pull/20523/files/cb942b1b..d227e0de Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=20523&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=20523&range=00-01 Stats: 551 lines in 25 files changed: 357 ins; 77 del; 117 mod Patch: https://git.openjdk.org/jdk/pull/20523.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20523/head:pull/20523 PR: https://git.openjdk.org/jdk/pull/20523 From kbarrett at openjdk.org Mon Aug 12 07:12:37 2024 From: kbarrett at openjdk.org (Kim Barrett) Date: Mon, 12 Aug 2024 07:12:37 GMT Subject: RFR: 8337938: ZUtils::alloc_aligned allocates without reporting to NMT [v2] In-Reply-To: References:

Message-ID: <6QCBXfFDwLUTkzz9hezTYFbRwUExGs3HOyNrXEDshks=.08f22e57-c52b-4374-9925-4f66ceed25a0@github.com> On Mon, 12 Aug 2024 06:29:48 GMT, Joel Sikstr?m wrote: >> Replaces usage of posix_memalign/_aligned_malloc with os::malloc and manual alignment to report memory usage to NMT. Manually aligning the memory makes the returned address unfreeable by malloc (as clarified by the added comment), which is reasonable since the memory used by ZUtils::alloc_aligned is never freed. >> >> Tested with tiers 1-3. > > Joel Sikstr?m has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision: > > - Merge branch 'master' into zgc_zutils_alloc_aligned > - Updated copyright years > - Remove trailing whitespace > - 8337938: ZUtils::alloc_aligned allocates without reporting to NMT Looks good. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20523#pullrequestreview-2232190875 From tschatzl at openjdk.org Mon Aug 12 07:44:35 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 12 Aug 2024 07:44:35 GMT Subject: RFR: 8337815: Relax G1EvacStats atomic operations In-Reply-To: References: Message-ID: On Fri, 9 Aug 2024 17:46:43 GMT, Ben Taylor wrote: > This PR should slightly improve the performance of G1EvacStats by using `memory_order_relaxed` instead of the default `memory_order_conservative`. > Since the original bug report says > >>I doubt it would show on benchmarks, this is a paper-cut issue. > > I haven't benchmarked this change for performance. > > The change passes all tests in `gc/g1` locally on x86_64 linux. Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/20529#pullrequestreview-2232247488 From tschatzl at openjdk.org Mon Aug 12 07:45:33 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 12 Aug 2024 07:45:33 GMT Subject: RFR: 8336299: Improve GCLocker stall diagnostics [v5] In-Reply-To: References:

Message-ID: On Fri, 9 Aug 2024 14:54:26 GMT, Neethu Prasad wrote: >> **Notes** >> Adding logs to get more visibility into how fast a thread resumes from allocation stall. >> >> **Testing** >> * tier 1, tier 2, hotspot_gc tests. >> >> Example log messages >> >> 1. Last thread exiting. Performing GC after exiting critical section. Thread "main" 0 locked. >> >> 2. Thread exiting critical region Thread "main" 0 locked. >> >> 3. Thread stalled by JNI critical section. Resumed after 586ms. Thread "Thread-0". >> >> 4. Thread blocked to enter critical region. Resumed after 1240ms. Thread "SIGINT handler". > > Neethu Prasad has updated the pull request incrementally with one additional commit since the last revision: > > address code style feedback src/hotspot/share/gc/shared/gcLocker.cpp line 139: > 137: // Wait for _needs_gc to be cleared > 138: while (needs_gc()) { > 139: GCLockerTimingDebugLogger logger("Thread stalled by JNI critical section."); If a spurious wakeup occurs, the logger will be instantiated multiple times, this can lead to confusing log msgs, right? If so, I wonder whether it makes sense to extract `logger` out of the while-iteration. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/20277#discussion_r1713314902 From shade at openjdk.org Mon Aug 12 08:24:32 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 12 Aug 2024 08:24:32 GMT Subject: RFR: 8337815: Relax G1EvacStats atomic operations In-Reply-To: References: Message-ID: On Fri, 9 Aug 2024 17:46:43 GMT, Ben Taylor wrote: > This PR should slightly improve the performance of G1EvacStats by using `memory_order_relaxed` instead of the default `memory_order_conservative`. > Since the original bug report says > >>I doubt it would show on benchmarks, this is a paper-cut issue. > > I haven't benchmarked this change for performance. > > The change passes all tests in `gc/g1` locally on x86_64 linux. Marked as reviewed by shade (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/20529#pullrequestreview-2232324912 From rcastanedalo at openjdk.org Mon Aug 12 08:38:37 2024 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Mon, 12 Aug 2024 08:38:37 GMT Subject: RFR: 8334060: Implementation of Late Barrier Expansion for G1 [v5] In-Reply-To: References:

Message-ID: On Mon, 12 Aug 2024 05:23:06 GMT, Amit Kumar wrote: > is there issue if we replace this code: > > ``` > if (in_bytes(SATBMarkQueue::byte_width_of_active()) == 4) { > __ ldrw(rscratch1, in_progress); > } else { > assert(in_bytes(SATBMarkQueue::byte_width_of_active()) == 1, "Assumption"); > __ ldrb(rscratch1, in_progress); > } > ``` > > in method `G1BarrierSetAssembler::gen_write_ref_array_pre_barrier` with `generate_queue_test_and_insertion(masm, rthread, rscratch1)` ? > > Though you have to move the `gen_write_ref_array_pre_barrier` on top otherwise compiler wouldn't be able to find it. Thanks for the suggestion Amit! this refactoring would work (assuming you mean `generate_pre_barrier_fast_path` instead of `generate_queue_test_and_insertion`), however I am hesitant to apply it because 1) it would further increase the size of the changelog and hence the burden of reviewing it and 2) it is not a clear maintainability win: some engineers prefer a little bit of code duplication to preserve the assembly code flow (see discussion [here](https://github.com/openjdk/jdk/pull/19746#discussion_r1645713269)). ------------- PR Comment: https://git.openjdk.org/jdk/pull/19746#issuecomment-2283395013 From rcastanedalo at openjdk.org Mon Aug 12 08:46:16 2024 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Mon, 12 Aug 2024 08:46:16 GMT Subject: RFR: 8334060: Implementation of Late Barrier Expansion for G1 [v6] In-Reply-To: References: Message-ID: > This changeset implements JEP 475 (Late Barrier Expansion for G1), including support for the x64 and aarch64 platforms. See the [JEP description](https://openjdk.org/jeps/475) for further detail. > > We aim to integrate this work in JDK 24. The purpose of this pull request is double-fold: > > - to allow maintainers of the arm (32-bit), ppc, riscv, s390, and x86 (32-bit) ports to contribute a port of these platforms in time for JDK 24; and > - to allow reviewers to review the platform-independent, x64 and aarch64, and test changes in parallel with the porting work. > > ## Summary of the Changes > > ### Platform-Independent Changes (`src/hotspot/share`) > > These consist mainly of: > > - a complete rewrite of `G1BarrierSetC2`, to instruct C2 to expand G1 barriers late instead of early; > - a few minor changes to C2 itself, to support removal of redundant decompression operations and to address an OopMap construction issue triggered by this JEP's increased usage of ADL `TEMP` operands; and > - temporary support for porting the JEP to the remaining platforms. > > The temporary support code (guarded by the pre-processor flag `G1_LATE_BARRIER_MIGRATION_SUPPORT`) will **not** be part of the final pull request, and hence does not need to be reviewed. > > ### Platform-Dependent Changes (`src/hotspot/cpu`) > > These include changes to the ADL instruction definitions and the `G1BarrierSetAssembler` class of the x64 and aarch64 platforms. > > #### ADL Changes > > The changeset uses ADL predicates to force C2 to implement memory accesses tagged with barrier information using G1-specific, barrier-aware instruction versions (e.g. `g1StoreP` instead of the GC-agnostic `storeP`). These new instruction versions generate machine code accordingly to the corresponding tagged barrier information, relying on the G1 barrier implementations provided by the `G1BarrierSetAssembler` class. In the aarch64 platform, the bulk of the ADL code is generated from a higher-level version using m4, to reduce redundancy. > > #### `G1BarrierSetAssembler` Changes > > Both platforms basically reuse the barrier implementation for the bytecode interpreter, with the different barrier tests and operations refactored into dedicated functions. Besides this, `G1BarrierSetAssembler` is extended with assembly-stub routines that implement the out-of-line, slow path of the barriers. These routines include calls from the barrier into the JVM, which require support for saving and restoring live registers, provided by the `SaveLiveRegisters` class. This c... Roberto Casta?eda Lozano has updated the pull request incrementally with one additional commit since the last revision: Further motivate the choice of internal store address materialization in x64 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/19746/files - new: https://git.openjdk.org/jdk/pull/19746/files/1834bf41..d21104ca Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=19746&range=05 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=19746&range=04-05 Stats: 4 lines in 1 file changed: 1 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/19746.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/19746/head:pull/19746 PR: https://git.openjdk.org/jdk/pull/19746 From rcastanedalo at openjdk.org Mon Aug 12 08:46:16 2024 From: rcastanedalo at openjdk.org (Roberto =?UTF-8?B?Q2FzdGHDsWVkYQ==?= Lozano) Date: Mon, 12 Aug 2024 08:46:16 GMT Subject: RFR: 8334060: Implementation of Late Barrier Expansion for G1 [v2] In-Reply-To: References:

Message-ID: On Mon, 12 Aug 2024 08:35:57 GMT, Roberto Casta?eda Lozano wrote: > > is there issue if we replace this code: > > ``` > > if (in_bytes(SATBMarkQueue::byte_width_of_active()) == 4) { > > __ ldrw(rscratch1, in_progress); > > } else { > > assert(in_bytes(SATBMarkQueue::byte_width_of_active()) == 1, "Assumption"); > > __ ldrb(rscratch1, in_progress); > > } > > ``` > > > > > > > > > > > > > > > > > > > > > > > > in method `G1BarrierSetAssembler::gen_write_ref_array_pre_barrier` with `generate_queue_test_and_insertion(masm, rthread, rscratch1)` ? > > Though you have to move the `gen_write_ref_array_pre_barrier` on top otherwise compiler wouldn't be able to find it. > > Thanks for the suggestion Amit! this refactoring would work (assuming you mean `generate_pre_barrier_fast_path` instead of `generate_queue_test_and_insertion`), however I am hesitant to apply it because 1) it would further increase the size of the changelog and hence the burden of reviewing it and 2) it is not a clear maintainability win: some engineers prefer a little bit of code duplication to preserve the assembly code flow (see discussion [here](https://github.com/openjdk/jdk/pull/19746#discussion_r1645713269)). Ha! makes sense. Are you planning to rebase it with master ? Nothing important, but there were couple of failures which are fixed after this PR. So will make test result a bit clean for us ?. ------------- PR Comment: https://git.openjdk.org/jdk/pull/19746#issuecomment-2283418237 From shade at openjdk.org Mon Aug 12 09:55:40 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 12 Aug 2024 09:55:40 GMT Subject: RFR: 8336299: Improve GCLocker stall diagnostics [v5] In-Reply-To: References: