From zgu at openjdk.org Fri Nov 1 13:06:33 2024 From: zgu at openjdk.org (Zhengyu Gu) Date: Fri, 1 Nov 2024 13:06:33 GMT Subject: RFR: 8343333: Parallel: Cleanup comment referring Solaris in MutableNUMASpace In-Reply-To: References: Message-ID: On Thu, 31 Oct 2024 02:07:29 GMT, Zhengyu Gu wrote: > A trivial cleanup that removes comment referring Solaris. Thanks, @tschatzl ------------- PR Comment: https://git.openjdk.org/jdk/pull/21796#issuecomment-2451838741 From zgu at openjdk.org Fri Nov 1 13:06:34 2024 From: zgu at openjdk.org (Zhengyu Gu) Date: Fri, 1 Nov 2024 13:06:34 GMT Subject: Integrated: 8343333: Parallel: Cleanup comment referring Solaris in MutableNUMASpace In-Reply-To: References: Message-ID: On Thu, 31 Oct 2024 02:07:29 GMT, Zhengyu Gu wrote: > A trivial cleanup that removes comment referring Solaris. This pull request has now been integrated. Changeset: da0e9e38 Author: Zhengyu Gu URL: https://git.openjdk.org/jdk/commit/da0e9e38e378ad14ddf4577924597462d9b0595f Stats: 7 lines in 1 file changed: 0 ins; 2 del; 5 mod 8343333: Parallel: Cleanup comment referring Solaris in MutableNUMASpace Reviewed-by: tschatzl ------------- PR: https://git.openjdk.org/jdk/pull/21796 From ayang at openjdk.org Mon Nov 4 05:39:58 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 4 Nov 2024 05:39:58 GMT Subject: RFR: 8343507: Parallel: Fail if verify_complete finds incorrect states Message-ID: Trivial change of replacing `log_warning` with `fatal`, because incorrect `destination_count` always indicate some problem. Test: tier1-3 ------------- Commit messages: - pgc-fatal Changes: https://git.openjdk.org/jdk/pull/21865/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=21865&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8343507 Stats: 4 lines in 1 file changed: 0 ins; 0 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/21865.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21865/head:pull/21865 PR: https://git.openjdk.org/jdk/pull/21865 From ayang at openjdk.org Mon Nov 4 06:21:59 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 4 Nov 2024 06:21:59 GMT Subject: RFR: 8343508: Parallel: Use ordinary klass accessor in verify_filler_in_dense_prefix Message-ID: One line change to use the common API to make the caller logic less obtrusive. Test: tier1-3 ------------- Commit messages: - pgc-klass-accessor Changes: https://git.openjdk.org/jdk/pull/21866/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=21866&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8343508 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/21866.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21866/head:pull/21866 PR: https://git.openjdk.org/jdk/pull/21866 From tschatzl at openjdk.org Mon Nov 4 07:42:29 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 4 Nov 2024 07:42:29 GMT Subject: RFR: 8343507: Parallel: Fail if verify_complete finds incorrect states In-Reply-To: References: Message-ID: On Mon, 4 Nov 2024 05:31:26 GMT, Albert Mingkun Yang wrote: > Trivial change of replacing `log_warning` with `fatal`, because incorrect `destination_count` always indicate some problem. > > Test: tier1-3 Changes requested by tschatzl (Reviewer). src/hotspot/share/gc/parallel/psParallelCompact.cpp line 1917: > 1915: if (!c->completed()) { > 1916: fatal("region %zu not filled: destination_count=%u", > 1917: cur_region, c->destination_count()); I would prefer to use `assert(c->completed(), ...)` in both cases similar to other failures due to verification (like the one above). ------------- PR Review: https://git.openjdk.org/jdk/pull/21865#pullrequestreview-2412338308 PR Review Comment: https://git.openjdk.org/jdk/pull/21865#discussion_r1827295363 From tschatzl at openjdk.org Mon Nov 4 07:44:28 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 4 Nov 2024 07:44:28 GMT Subject: RFR: 8343508: Parallel: Use ordinary klass accessor in verify_filler_in_dense_prefix In-Reply-To: References: Message-ID: <8jvzF7sIyWQJdLuOouswz4uDiKWT95-x3iDJWmh7fZ0=.4e33866c-8ba9-4272-89ac-3a50f86b4c8f@github.com> On Mon, 4 Nov 2024 06:16:15 GMT, Albert Mingkun Yang wrote: > One line change to use the common API to make the caller logic less obtrusive. > > Test: tier1-3 Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/21866#pullrequestreview-2412348941 From ayang at openjdk.org Mon Nov 4 07:55:04 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 4 Nov 2024 07:55:04 GMT Subject: RFR: 8343507: Parallel: Fail if verify_complete finds incorrect states [v2] In-Reply-To: References: Message-ID: <7SGR3x5jFTxIVogdh8N9gyaMBS9J1HWDDV8sGlxliwc=.abd6c8b3-2157-4e55-8afa-1cf087e0302c@github.com> > Trivial change of replacing `log_warning` with `fatal`, because incorrect `destination_count` always indicate some problem. > > Test: tier1-3 Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: review ------------- Changes: - all: https://git.openjdk.org/jdk/pull/21865/files - new: https://git.openjdk.org/jdk/pull/21865/files/f1e2d474..f089f3df Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=21865&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=21865&range=00-01 Stats: 8 lines in 1 file changed: 0 ins; 4 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/21865.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21865/head:pull/21865 PR: https://git.openjdk.org/jdk/pull/21865 From kbarrett at openjdk.org Mon Nov 4 09:15:28 2024 From: kbarrett at openjdk.org (Kim Barrett) Date: Mon, 4 Nov 2024 09:15:28 GMT Subject: RFR: 8343507: Parallel: Fail if verify_complete finds incorrect states [v2] In-Reply-To: <7SGR3x5jFTxIVogdh8N9gyaMBS9J1HWDDV8sGlxliwc=.abd6c8b3-2157-4e55-8afa-1cf087e0302c@github.com> References: <7SGR3x5jFTxIVogdh8N9gyaMBS9J1HWDDV8sGlxliwc=.abd6c8b3-2157-4e55-8afa-1cf087e0302c@github.com> Message-ID: On Mon, 4 Nov 2024 07:55:04 GMT, Albert Mingkun Yang wrote: >> Trivial change of replacing `log_warning` with `fatal`, because incorrect `destination_count` always indicate some problem. >> >> Test: tier1-3 > > Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: > > review Looks good. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/21865#pullrequestreview-2412532188 From simonis at openjdk.org Mon Nov 4 09:49:01 2024 From: simonis at openjdk.org (Volker Simonis) Date: Mon, 4 Nov 2024 09:49:01 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers Message-ID: Currently `BlockLocationPrinter::print_location()` checks for a pointer if it points into the heap and if that's true, it either prints it as an oop if `is_valid_obj()` is true or it tries to find the the start address of an oop for that pointer by calling `CollectedHeapT::heap()->block_start()`. However, the `block_start()` functionality is not fully implemented for all GCs (e.g. the young generation of `ParallelScavengeHeap`) and for these cases `block_start()` returns NULL. Because of this NULL return value `os::print_location()` will finally qualify the corresponding pointer as pointing "into unknown readable memory" although we already know that it actually points into an invalid heap area. In such cases, print at least that the pointer is pointing into an unknown part of the heap instead of just saying that it points into unknown memory. I've manually tested the new functionality in GDB. ------------- Commit messages: - 8343531: Improve print_location for invalid heap pointers Changes: https://git.openjdk.org/jdk/pull/21870/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=21870&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8343531 Stats: 8 lines in 1 file changed: 7 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/21870.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21870/head:pull/21870 PR: https://git.openjdk.org/jdk/pull/21870 From tschatzl at openjdk.org Mon Nov 4 09:52:29 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 4 Nov 2024 09:52:29 GMT Subject: RFR: 8343507: Parallel: Fail if verify_complete finds incorrect states [v2] In-Reply-To: <7SGR3x5jFTxIVogdh8N9gyaMBS9J1HWDDV8sGlxliwc=.abd6c8b3-2157-4e55-8afa-1cf087e0302c@github.com> References: <7SGR3x5jFTxIVogdh8N9gyaMBS9J1HWDDV8sGlxliwc=.abd6c8b3-2157-4e55-8afa-1cf087e0302c@github.com> Message-ID: On Mon, 4 Nov 2024 07:55:04 GMT, Albert Mingkun Yang wrote: >> Trivial change of replacing `log_warning` with `fatal`, because incorrect `destination_count` always indicate some problem. >> >> Test: tier1-3 > > Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: > > review Marked as reviewed by tschatzl (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/21865#pullrequestreview-2412609729 From ayang at openjdk.org Mon Nov 4 10:06:28 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 4 Nov 2024 10:06:28 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers In-Reply-To: References: Message-ID: On Mon, 4 Nov 2024 09:43:18 GMT, Volker Simonis wrote: > However, the block_start() functionality is not fully implemented for all GCs (e.g. the young generation of ParallelScavengeHeap) and for these cases block_start() returns NULL. Can we implement it properly for all gcs, instead of working around the issue in the caller? ------------- PR Comment: https://git.openjdk.org/jdk/pull/21870#issuecomment-2454270214 From tschatzl at openjdk.org Mon Nov 4 10:06:29 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 4 Nov 2024 10:06:29 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers In-Reply-To: References: Message-ID: On Mon, 4 Nov 2024 09:43:18 GMT, Volker Simonis wrote: > Currently `BlockLocationPrinter::print_location()` checks for a pointer if it points into the heap and if that's true, it either prints it as an oop if `is_valid_obj()` is true or it tries to find the the start address of an oop for that pointer by calling `CollectedHeapT::heap()->block_start()`. > > However, the `block_start()` functionality is not fully implemented for all GCs (e.g. the young generation of `ParallelScavengeHeap`) and for these cases `block_start()` returns NULL. Because of this NULL return value `os::print_location()` will finally qualify the corresponding pointer as pointing "into unknown readable memory" although we already know that it actually points into an invalid heap area. > > In such cases, print at least that the pointer is pointing into an unknown part of the heap instead of just saying that it points into unknown memory. > > I've manually tested the new functionality in GDB. src/hotspot/share/gc/shared/locationPrinter.inline.hpp line 57: > 55: // Check if addr points into Java heap. > 56: if (CollectedHeapT::heap()->is_in(addr)) { > 57: // base_oop_or_null() might be unimplemented and return NULL for some GCs/generations In such cases where the flag that we later set is dependent on the complete condition, it seems nicer to assign the result of the condition to it right away. That saves the assignment later too, having only a single assignment to it. Ymmv. Suggestion: // Check if addr points into Java heap. bool in_heap = CollectedHeapT::heap()->is_in(addr); if (in_heap) { // base_oop_or_null() might be unimplemented and return NULL for some GCs/generations. (And drop the assignment to `in_heap` later). ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/21870#discussion_r1827475425 From ayang at openjdk.org Mon Nov 4 10:34:37 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 4 Nov 2024 10:34:37 GMT Subject: RFR: 8343507: Parallel: Fail if verify_complete finds incorrect states [v2] In-Reply-To: <7SGR3x5jFTxIVogdh8N9gyaMBS9J1HWDDV8sGlxliwc=.abd6c8b3-2157-4e55-8afa-1cf087e0302c@github.com> References: <7SGR3x5jFTxIVogdh8N9gyaMBS9J1HWDDV8sGlxliwc=.abd6c8b3-2157-4e55-8afa-1cf087e0302c@github.com> Message-ID: On Mon, 4 Nov 2024 07:55:04 GMT, Albert Mingkun Yang wrote: >> Trivial change of replacing `log_warning` with `fatal`, because incorrect `destination_count` always indicate some problem. >> >> Test: tier1-3 > > Albert Mingkun Yang has updated the pull request incrementally with one additional commit since the last revision: > > review Thanks for review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/21865#issuecomment-2454345797 From ayang at openjdk.org Mon Nov 4 10:34:37 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 4 Nov 2024 10:34:37 GMT Subject: Integrated: 8343507: Parallel: Fail if verify_complete finds incorrect states In-Reply-To: References: Message-ID: On Mon, 4 Nov 2024 05:31:26 GMT, Albert Mingkun Yang wrote: > Trivial change of replacing `log_warning` with `fatal`, because incorrect `destination_count` always indicate some problem. > > Test: tier1-3 This pull request has now been integrated. Changeset: 452a5fbd Author: Albert Mingkun Yang URL: https://git.openjdk.org/jdk/commit/452a5fbd9c29e0991758ab97ed5bdbf1922b6a11 Stats: 8 lines in 1 file changed: 0 ins; 4 del; 4 mod 8343507: Parallel: Fail if verify_complete finds incorrect states Reviewed-by: tschatzl, kbarrett ------------- PR: https://git.openjdk.org/jdk/pull/21865 From simonis at openjdk.org Mon Nov 4 10:46:02 2024 From: simonis at openjdk.org (Volker Simonis) Date: Mon, 4 Nov 2024 10:46:02 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers [v2] In-Reply-To: References: Message-ID: > Currently `BlockLocationPrinter::print_location()` checks for a pointer if it points into the heap and if that's true, it either prints it as an oop if `is_valid_obj()` is true or it tries to find the the start address of an oop for that pointer by calling `CollectedHeapT::heap()->block_start()`. > > However, the `block_start()` functionality is not fully implemented for all GCs (e.g. the young generation of `ParallelScavengeHeap`) and for these cases `block_start()` returns NULL. Because of this NULL return value `os::print_location()` will finally qualify the corresponding pointer as pointing "into unknown readable memory" although we already know that it actually points into an invalid heap area. > > In such cases, print at least that the pointer is pointing into an unknown part of the heap instead of just saying that it points into unknown memory. > > I've manually tested the new functionality in GDB. Volker Simonis has updated the pull request incrementally with one additional commit since the last revision: Small refactoring based on tschatzl's review ------------- Changes: - all: https://git.openjdk.org/jdk/pull/21870/files - new: https://git.openjdk.org/jdk/pull/21870/files/80cc0ee7..f5886102 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=21870&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=21870&range=00-01 Stats: 4 lines in 1 file changed: 1 ins; 2 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/21870.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21870/head:pull/21870 PR: https://git.openjdk.org/jdk/pull/21870 From simonis at openjdk.org Mon Nov 4 11:00:31 2024 From: simonis at openjdk.org (Volker Simonis) Date: Mon, 4 Nov 2024 11:00:31 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers [v2] In-Reply-To: References:

Message-ID: On Mon, 4 Nov 2024 10:03:34 GMT, Thomas Schatzl wrote: >> Volker Simonis has updated the pull request incrementally with one additional commit since the last revision: >> >> Small refactoring based on tschatzl's review > > src/hotspot/share/gc/shared/locationPrinter.inline.hpp line 57: > >> 55: // Check if addr points into Java heap. >> 56: if (CollectedHeapT::heap()->is_in(addr)) { >> 57: // base_oop_or_null() might be unimplemented and return NULL for some GCs/generations > > In such cases where the flag that we later set is dependent on the complete condition, it seems nicer to assign the result of the condition to it right away. That saves the assignment later too, having only a single assignment to it. Ymmv. > Suggestion: > > // Check if addr points into Java heap. > bool in_heap = CollectedHeapT::heap()->is_in(addr); > if (in_heap) { > // base_oop_or_null() might be unimplemented and return NULL for some GCs/generations. > > > (And drop the assignment to `in_heap` later). Thanks for looking at this PR. Your suggestion sound like a reasonable simplification. I've updated the code accordingly. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/21870#discussion_r1827548054 From ayang at openjdk.org Mon Nov 4 11:01:40 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 4 Nov 2024 11:01:40 GMT Subject: RFR: 8339162: [REDO] JDK-8338440 Parallel: Improve fragmentation mitigation in Full GC Message-ID: This PR consists of two commits, the original and bug-fix. The original patch calculates the dest-count for the preceding live words incorrectly -- `preceding_destination` can be on region-boundary. Test: TEST=gc/TestSoftReferencesBehaviorOnOOME.java fails ~4/100 without the fix but passes with the fix. ------------- Commit messages: - fix - original Changes: https://git.openjdk.org/jdk/pull/21872/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=21872&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8339162 Stats: 568 lines in 2 files changed: 209 ins; 143 del; 216 mod Patch: https://git.openjdk.org/jdk/pull/21872.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21872/head:pull/21872 PR: https://git.openjdk.org/jdk/pull/21872 From ayang at openjdk.org Mon Nov 4 11:01:40 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 4 Nov 2024 11:01:40 GMT Subject: RFR: 8339162: [REDO] JDK-8338440 Parallel: Improve fragmentation mitigation in Full GC In-Reply-To: References: Message-ID: On Mon, 4 Nov 2024 10:55:45 GMT, Albert Mingkun Yang wrote: > This PR consists of two commits, the original and bug-fix. > > The original patch calculates the dest-count for the preceding live words incorrectly -- `preceding_destination` can be on region-boundary. > > Test: TEST=gc/TestSoftReferencesBehaviorOnOOME.java fails ~4/100 without the fix but passes with the fix. @lgxbslgx @zhengyu123 @walulyai Could you take a look? ------------- PR Comment: https://git.openjdk.org/jdk/pull/21872#issuecomment-2454400845 From simonis at openjdk.org Mon Nov 4 11:03:29 2024 From: simonis at openjdk.org (Volker Simonis) Date: Mon, 4 Nov 2024 11:03:29 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers In-Reply-To: References:

Message-ID: On Mon, 4 Nov 2024 10:01:39 GMT, Albert Mingkun Yang wrote: > > However, the block_start() functionality is not fully implemented for all GCs (e.g. the young generation of ParallelScavengeHeap) and for these cases block_start() returns NULL. > > Can we implement it properly for all gcs, instead of working around the issue in the caller? Everything is possible :) but I think it is not trivial. The problem is that we can crash at any time. In order to implement it reliably, we would have to make the heap walkable but I don't think we want to do this in the crash handler. Any suggestions? ------------- PR Comment: https://git.openjdk.org/jdk/pull/21870#issuecomment-2454412489 From ayang at openjdk.org Mon Nov 4 11:25:28 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 4 Nov 2024 11:25:28 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers In-Reply-To: References:

Message-ID: On Mon, 4 Nov 2024 11:01:21 GMT, Volker Simonis wrote: > but I think it is not trivial. I was thinking copying the Serial impl into `ParallelScavengeHeap::block_start`; nothing sophisticated. I suspect the following oddly looking code is used to workaround the unimplemented branch of block_start. if (DebuggingContext::is_enabled() || VMError::is_error_reported()) { return nullptr; } ------------- PR Comment: https://git.openjdk.org/jdk/pull/21870#issuecomment-2454455414 From shade at openjdk.org Mon Nov 4 12:04:29 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 4 Nov 2024 12:04:29 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers [v2] In-Reply-To: References:

Message-ID: <18YEfgfqK5a_YG8Noc_NKRFMfRYBhU8vcspBrli63CM=.d46c87e1-1e4f-4fce-90a0-4281a773bcda@github.com> On Mon, 4 Nov 2024 10:46:02 GMT, Volker Simonis wrote: >> Currently `BlockLocationPrinter::print_location()` checks for a pointer if it points into the heap and if that's true, it either prints it as an oop if `is_valid_obj()` is true or it tries to find the the start address of an oop for that pointer by calling `CollectedHeapT::heap()->block_start()`. >> >> However, the `block_start()` functionality is not fully implemented for all GCs (e.g. the young generation of `ParallelScavengeHeap`) and for these cases `block_start()` returns NULL. Because of this NULL return value `os::print_location()` will finally qualify the corresponding pointer as pointing "into unknown readable memory" although we already know that it actually points into an invalid heap area. >> >> In such cases, print at least that the pointer is pointing into an unknown part of the heap instead of just saying that it points into unknown memory. >> >> I've manually tested the new functionality in GDB. > > Volker Simonis has updated the pull request incrementally with one additional commit since the last revision: > > Small refactoring based on tschatzl's review I think this is a fine papercut fix, and implementing block information for all GCs could be tackled separately. I do have a question, though: src/hotspot/share/gc/shared/locationPrinter.inline.hpp line 90: > 88: if (in_heap) { > 89: st->print_cr(PTR_FORMAT " is an unknown heap location", p2i(addr)); > 90: return true; So why not put this block as `else` branch in `base_oop_or_null` check at L67? This would also remove any ambiguity whether the in-heap pointer would look like a compressed pointer to object, which would be accidentally handled by the block at L64..L86? ------------- PR Review: https://git.openjdk.org/jdk/pull/21870#pullrequestreview-2412881446 PR Review Comment: https://git.openjdk.org/jdk/pull/21870#discussion_r1827621215 From simonis at openjdk.org Mon Nov 4 14:37:34 2024 From: simonis at openjdk.org (Volker Simonis) Date: Mon, 4 Nov 2024 14:37:34 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers [v2] In-Reply-To: <18YEfgfqK5a_YG8Noc_NKRFMfRYBhU8vcspBrli63CM=.d46c87e1-1e4f-4fce-90a0-4281a773bcda@github.com> References:

<18YEfgfqK5a_YG8Noc_NKRFMfRYBhU8vcspBrli63CM=.d46c87e1-1e4f-4fce-90a0-4281a773bcda@github.com> Message-ID: On Mon, 4 Nov 2024 12:00:38 GMT, Aleksey Shipilev wrote: >> Volker Simonis has updated the pull request incrementally with one additional commit since the last revision: >> >> Small refactoring based on tschatzl's review > > src/hotspot/share/gc/shared/locationPrinter.inline.hpp line 90: > >> 88: if (in_heap) { >> 89: st->print_cr(PTR_FORMAT " is an unknown heap location", p2i(addr)); >> 90: return true; > > So why not put this block as `else` branch in `base_oop_or_null` check at L67? This would also remove any ambiguity whether the in-heap pointer would look like a compressed pointer to object, which would be accidentally handled by the block at L64..L86? That was actually the first thing I did. But then I thought that (especially with zero-based compressed oops) we might get quite some valid compressed oops pointers unnecessarily printed as "unknown heap location". On the other hand, I don't think that there's a high probability for a real invalid heap pointer to be classified as compressed oops pointer because the compressed oops detection code uses `is_valid_obj()` anyway. So this change is conservative in the sense that it doesn't change any behavior except that pointers which have been printed as pointing "into unknown readable memory" can now be detect as "invalid heap pointers". If you still think we should prioritize the detection steps differently, please let me know. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/21870#discussion_r1827834445 From simonis at openjdk.org Mon Nov 4 16:28:29 2024 From: simonis at openjdk.org (Volker Simonis) Date: Mon, 4 Nov 2024 16:28:29 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers [v2] In-Reply-To: <1qZ2t5EIMp-DpFdvnOJwd5o5D4g74fbgGG4VBy5frq4=.aff55256-6f89-4fd5-9eda-3f5ed1540fa8@github.com> References:

<18YEfgfqK5a_YG8Noc_NKRFMfRYBhU8vcspBrli63CM=.d46c87e1-1e4f-4fce-90a0-4281a773bcda@github.com> <1qZ2t5EIMp-DpFdvnOJwd5o5D4g74fbgGG4VBy5frq4=.aff55256-6f89-4fd5-9eda-3f5ed1540fa8@github.com> Message-ID: On Mon, 4 Nov 2024 15:52:59 GMT, Aleksey Shipilev wrote: > Oh, OK. Compressed pointers make this whole thing a bit messy. I think current code is not handling the case of compressed interior pointers all that well; IDK if we even have those in Hotspot. > Yes, that's true. I first thought about calling `BlockLocationPrinter::print_location()` recursively for the compressed oops case to avoid code duplication and get the same handling for regular and compressed oops, but that would have been a much larger change. I think we can have compressed oops in registers, e.g. when GC iterates the heap or when compiled code loads a field but the cases are probably more rare than regular oops. > I think there is an ambiguity between compressed pointers and regular pointers at this level, which we cannot reasonably resolve. E.g. if we have zero-based compressed oops with 2-bit shift and 16 GB heap, passing `0x1000000` as the `addr` here cannot distinguish between cases of "regular pointer, points to `0x1000000`" and "compressed pointer, decodes as `0x4000000`". I guess we would like to print both interpretations. But this is way beyond the scope for this PR. That's also true, but remember that `is_valid_obj()` does quite some checks. So in your example, in order to make it really ambiguous, it would require that at address `0x1000000 + 8` as well as at address `0x4000000 + 8` we have properly aligned, valid (possibly compressed) pointers into MetaSpace pointing to a valid `Klass` object (which is probably not so common for most adresses). > This version would do meanwhile. Thanks for the review. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/21870#discussion_r1828017494 From shade at openjdk.org Mon Nov 4 15:55:29 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 4 Nov 2024 15:55:29 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers [v2] In-Reply-To: References:

Message-ID: <0tmSW_c-jMzTApXLMSo06DCBrjZFBLjGQAAxOYx-rS8=.1ec378eb-11e4-44fb-a34a-185c74724631@github.com> On Mon, 4 Nov 2024 10:46:02 GMT, Volker Simonis wrote: >> Currently `BlockLocationPrinter::print_location()` checks for a pointer if it points into the heap and if that's true, it either prints it as an oop if `is_valid_obj()` is true or it tries to find the the start address of an oop for that pointer by calling `CollectedHeapT::heap()->block_start()`. >> >> However, the `block_start()` functionality is not fully implemented for all GCs (e.g. the young generation of `ParallelScavengeHeap`) and for these cases `block_start()` returns NULL. Because of this NULL return value `os::print_location()` will finally qualify the corresponding pointer as pointing "into unknown readable memory" although we already know that it actually points into an invalid heap area. >> >> In such cases, print at least that the pointer is pointing into an unknown part of the heap instead of just saying that it points into unknown memory. >> >> I've manually tested the new functionality in GDB. > > Volker Simonis has updated the pull request incrementally with one additional commit since the last revision: > > Small refactoring based on tschatzl's review Marked as reviewed by shade (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/21870#pullrequestreview-2413449596 From shade at openjdk.org Mon Nov 4 15:55:30 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 4 Nov 2024 15:55:30 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers [v2] In-Reply-To: References:

<18YEfgfqK5a_YG8Noc_NKRFMfRYBhU8vcspBrli63CM=.d46c87e1-1e4f-4fce-90a0-4281a773bcda@github.com> Message-ID: <1qZ2t5EIMp-DpFdvnOJwd5o5D4g74fbgGG4VBy5frq4=.aff55256-6f89-4fd5-9eda-3f5ed1540fa8@github.com> On Mon, 4 Nov 2024 14:34:28 GMT, Volker Simonis wrote: >> src/hotspot/share/gc/shared/locationPrinter.inline.hpp line 90: >> >>> 88: if (in_heap) { >>> 89: st->print_cr(PTR_FORMAT " is an unknown heap location", p2i(addr)); >>> 90: return true; >> >> So why not put this block as `else` branch in `base_oop_or_null` check at L67? This would also remove any ambiguity whether the in-heap pointer would look like a compressed pointer to object, which would be accidentally handled by the block at L64..L86? > > That was actually the first thing I did. But then I thought that (especially with zero-based compressed oops) we might get quite some valid compressed oops pointers unnecessarily printed as "unknown heap location". > On the other hand, I don't think that there's a high probability for a real invalid heap pointer to be classified as compressed oops pointer because the compressed oops detection code uses `is_valid_obj()` anyway. > So this change is conservative in the sense that it doesn't change any behavior except that pointers which have been printed as pointing "into unknown readable memory" can now be detect as "invalid heap pointers". > > If you still think we should prioritize the detection steps differently, please let me know. Oh, OK. Compressed pointers make this whole thing a bit messy. I think current code is not handling the case of compressed interior pointers all that well; IDK if we even have those in Hotspot. I think there is an ambiguity between compressed pointers and regular pointers at this level, which we cannot reasonably resolve. E.g. if we have zero-based compressed oops with 2-bit shift and 16 GB heap, passing `0x1000000` as the `addr` here cannot distinguish between cases of "regular pointer, points to `0x1000000`" and "compressed pointer, decodes as `0x4000000`". I guess we would like to print both interpretations. But this is way beyond the scope for this PR. This version would do meanwhile. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/21870#discussion_r1827965996 From tschatzl at openjdk.org Mon Nov 4 15:11:02 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 4 Nov 2024 15:11:02 GMT Subject: RFR: 8343189: [REDO] JDK-8295269 G1: Improve slow startup due to predictor initialization Message-ID: Hi all, please review this redo of [JDK-8295269](https://bugs.openjdk.org/browse/JDK-8295269) G1: Improve slow startup due to predictor initialization. The cause are issues with the `runtime/cds/DeterministicDump.java` test, that is currently being fixed in #21871. There has been no change in these changes. Testing: running a few thousand times with the fixed `runtime/cds/DeterministicDump.java` test Thanks, Thomas ------------- Depends on: https://git.openjdk.org/jdk/pull/21871 Commit messages: - Revert "8343086: [BACKOUT] JDK-8295269 G1: Improve slow startup due to predictor initialization" Changes: https://git.openjdk.org/jdk/pull/21876/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=21876&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8343189 Stats: 7 lines in 2 files changed: 6 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/21876.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21876/head:pull/21876 PR: https://git.openjdk.org/jdk/pull/21876 From simonis at openjdk.org Mon Nov 4 15:10:28 2024 From: simonis at openjdk.org (Volker Simonis) Date: Mon, 4 Nov 2024 15:10:28 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers In-Reply-To: References:

Message-ID: On Mon, 4 Nov 2024 11:22:47 GMT, Albert Mingkun Yang wrote: > > but I think it is not trivial. > > I was thinking copying the Serial impl into `ParallelScavengeHeap::block_start`; nothing sophisticated. > Unfortunately, the Serial implementation doesn't really work reliably if running with `-XX:+UseTLAB` (which is the default). If called with a pointer which points into unallocated TLAB buffer, `ContiguousSpace::block_start_const()` will just crash with a SIGSEGV (or a secondary crash during error reporting when called from `VMError`): #0 0x00007ffff57d78ce in oopDesc::size_given_klass (this=0x7ffde5616c70, klass=0x7ffda2000000) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/oops/oop.inline.hpp:196 #1 0x00007ffff57d7756 in oopDesc::size (this=0x7ffde5616c70) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/oops/oop.inline.hpp:153 #2 0x00007ffff689a421 in ContiguousSpace::block_start_const (this=0x7ffff004c880, p=0x7ffde5616ca0) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/shared/space.cpp:565 #3 0x00007ffff689b7ba in Space::block_start (this=0x7ffff004c880, p=0x7ffde5616ca0) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/shared/space.inline.hpp:43 #4 0x00007ffff60f4144 in GenerationBlockStartClosure::do_space (this=0x7ffff530ef30, s=0x7ffff004c880) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/shared/generation.cpp:191 #5 0x00007ffff5e560c5 in DefNewGeneration::space_iterate (this=0x7ffff004b9c0, blk=0x7ffff530ef30, usedOnly=false) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/serial/defNewGeneration.cpp:674 #6 0x00007ffff60f3527 in Generation::block_start (this=0x7ffff004b9c0, p=0x7ffde5616ca0) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/shared/generation.cpp:200 #7 0x00007ffff60e36e9 in GenCollectedHeap::block_start (this=0x7ffff0038450, addr=0x7ffde5616ca0) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/shared/genCollectedHeap.cpp:884 #8 0x00007ffff60e5b97 in BlockLocationPrinter::base_oop_or_null (addr=0x7ffde5616ca0) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/shared/locationPrinter.inline.hpp:41 #9 0x00007ffff60e592b in BlockLocationPrinter::print_location (st=0x7ffff0000b60, addr=0x7ffde5616ca0) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/shared/locationPrinter.inline.hpp:56 #10 0x00007ffff60e43bd in GenCollectedHeap::print_location (this=0x7ffff0038450, st=0x7ffff0000b60, addr=0x7ffde5616ca0) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/shared/genCollectedHeap.cpp:1046 #11 0x00007ffff66acb22 in os::print_location (st=0x7ffff0000b60, x=140728451820704, verbose=false) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/runtime/os.cpp:1190 And that's again because the heap is in general not *walkable* when we call this function. Making it walkable will fill the remaining TLAB spaces with a dummy int array, but without that, we will just trying to interpret random memory (or NULL if running with `-XX:+ZeroTLAB`) as a `Klass` pointer which is seldomly successful :) > I suspect the following oddly looking code is used to workaround the unimplemented branch of block_start. > > ``` > if (DebuggingContext::is_enabled() || VMError::is_error_reported()) { > return nullptr; > } > ``` That "oddly looking code" is actually the proof that `block_start()` only gets called from `VMError` or manually, when natively debugging the VM. ------------- PR Comment: https://git.openjdk.org/jdk/pull/21870#issuecomment-2454964142 From syan at openjdk.org Tue Nov 5 01:51:37 2024 From: syan at openjdk.org (SendaoYan) Date: Tue, 5 Nov 2024 01:51:37 GMT Subject: RFR: 8343490: Update copyright year for JDK-8341692 Message-ID: <2BwWuKdm5FwggsXPwo3P2xRD6CGr5QDdn3gVG5x5fo0=.41d944e6-6737-4d7d-8654-986149b41c9d@github.com> Hi all, The copyright year of some files which has been changed by [JDK-8341692](https://bugs.openjdk.org/browse/JDK-8341692) wasn't update correctly. This PR update the copyright year of [JDK-8341692](https://bugs.openjdk.org/browse/JDK-8341692). Trivial fix, no risk. ------------- Commit messages: - delete tail whitespace of test/hotspot/jtreg/serviceability/dcmd/gc/HeapDumpCompressedTest.java - 8343490: Update copyright year for JDK-8341692 Changes: https://git.openjdk.org/jdk/pull/21891/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=21891&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8343490 Stats: 66 lines in 66 files changed: 2 ins; 0 del; 64 mod Patch: https://git.openjdk.org/jdk/pull/21891.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21891/head:pull/21891 PR: https://git.openjdk.org/jdk/pull/21891 From ayang at openjdk.org Tue Nov 5 05:38:27 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Tue, 5 Nov 2024 05:38:27 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers In-Reply-To: References:

Message-ID: <9WJcuKHuAyqcP1vaCwtvsJqBWLptNyG2kFHFqp_Xl04=.bf12af6b-ad13-4a21-8259-1b19d770ec71@github.com> On Mon, 4 Nov 2024 15:08:00 GMT, Volker Simonis wrote: > And that's again because the heap is in general not walkable when we call this function. It depends on exactly when this function can be called, and with what arg. I wonder whether it can be called with a pointer to a obj that has not been properly initialized (with klass); if so, the heap is almost never walkable, since allocation is not atomic. > the Serial implementation doesn't really work reliably I am curious if other GCs' impl work (more) reliably, with regarding to the tlab example. ------------- PR Comment: https://git.openjdk.org/jdk/pull/21870#issuecomment-2456274894 From tschatzl at openjdk.org Tue Nov 5 09:50:15 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Tue, 5 Nov 2024 09:50:15 GMT Subject: RFR: 8343189: [REDO] JDK-8295269 G1: Improve slow startup due to predictor initialization [v2] In-Reply-To: References: Message-ID: > Hi all, > > please review this redo of [JDK-8295269](https://bugs.openjdk.org/browse/JDK-8295269) G1: Improve slow startup due to predictor initialization. The cause are issues with the `runtime/cds/DeterministicDump.java` test, that is currently being fixed in #21871. > > There has been no change in these changes. > > Testing: running a few thousand times with the fixed `runtime/cds/DeterministicDump.java` test > > Thanks, > Thomas Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: - Merge branch 'master' into 8343189-redo-slow-startup - Revert "8343086: [BACKOUT] JDK-8295269 G1: Improve slow startup due to predictor initialization" This reverts commit f1cc890ddfe2e472cf786856dc7d01645f61b054. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/21876/files - new: https://git.openjdk.org/jdk/pull/21876/files/4ee7c256..f728668c Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=21876&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=21876&range=00-01 Stats: 116898 lines in 366 files changed: 93329 ins; 7618 del; 15951 mod Patch: https://git.openjdk.org/jdk/pull/21876.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21876/head:pull/21876 PR: https://git.openjdk.org/jdk/pull/21876 From iwalulya at openjdk.org Tue Nov 5 10:09:34 2024 From: iwalulya at openjdk.org (Ivan Walulya) Date: Tue, 5 Nov 2024 10:09:34 GMT Subject: RFR: 8343189: [REDO] JDK-8295269 G1: Improve slow startup due to predictor initialization [v2] In-Reply-To: References:

Message-ID: On Tue, 5 Nov 2024 09:50:15 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review this redo of [JDK-8295269](https://bugs.openjdk.org/browse/JDK-8295269) G1: Improve slow startup due to predictor initialization. The cause are issues with the `runtime/cds/DeterministicDump.java` test, that is currently being fixed in #21871. >> >> There has been no change in these changes. >> >> Testing: running a few thousand times with the fixed `runtime/cds/DeterministicDump.java` test >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: > > - Merge branch 'master' into 8343189-redo-slow-startup > - Revert "8343086: [BACKOUT] JDK-8295269 G1: Improve slow startup due to predictor initialization" > > This reverts commit f1cc890ddfe2e472cf786856dc7d01645f61b054. Still good! ------------- Marked as reviewed by iwalulya (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/21876#pullrequestreview-2415173439 From aboldtch at openjdk.org Tue Nov 5 14:18:52 2024 From: aboldtch at openjdk.org (Axel Boldt-Christmas) Date: Tue, 5 Nov 2024 14:18:52 GMT Subject: RFR: 8343460: ZGC: Crash in ZRemembered::scan_page_and_clear_remset Message-ID: `free_page` may concurrently delete the remset while `scan_page_and_clear_remset` is scanning the page. Move it to after the `_safe_recycle.register_and_clone_if_activated`. Doing the deletion on the new cloned page will not occur as it not old. And the registered page's remset will be deleted by the destructor when the `_safe_recycle` scope quest up the `safe_destroy`. To be able to push the deletion all the way into `prepare_to_recycle` the unnecessary use of this mechanism had to be removed. `free_pages_alloc_failed` does not need to protect the pages, as they are not yet present in the PageTable. We have simply taken them out of the cache, but failed to commit or map some memory, so we are putting these pages back into the cache. See bed9c260bbc9bd208b03d7eedd4e2cfa151b58f2 The fix works without this last commit. So we must be careful to check that these pages cannot be reached by some other means. The FoundOld bitmap iteration goes through the PageTable so even if an old page was registered, we would not find these pages. There is a scary lack of a fence between the removal of the page from the PageTable and the lock in `register_and_clone_if_activated`. The stress test will deterministically crash with this modified code 0756e0056b44ee16bee81256f556c8df981ceaf9 and using these options `-XX:+UseZGC -XX:+UseNewCode -XX:ZCollectionIntervalMinor=0.1 -XX:ZCollectionIntervalMajor=1 -XX:ZFragmentationLimit=0 -XX:-CreateCoredumpOnCrash`, and no longer does after with this patch. ------------- Commit messages: - Tie remset deletion to recycle - 8343460: ZGC: Crash in ZRemembered::scan_page_and_clear_remset Changes: https://git.openjdk.org/jdk/pull/21905/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=21905&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8343460 Stats: 26 lines in 2 files changed: 5 ins; 19 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/21905.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21905/head:pull/21905 PR: https://git.openjdk.org/jdk/pull/21905 From jsikstro at openjdk.org Tue Nov 5 15:19:31 2024 From: jsikstro at openjdk.org (Joel =?UTF-8?B?U2lrc3Ryw7Zt?=) Date: Tue, 5 Nov 2024 15:19:31 GMT Subject: RFR: 8343460: ZGC: Crash in ZRemembered::scan_page_and_clear_remset In-Reply-To: References: Message-ID: On Tue, 5 Nov 2024 14:10:47 GMT, Axel Boldt-Christmas wrote: > `free_page` may concurrently delete the remset while `scan_page_and_clear_remset` is scanning the page. Move it to after the `_safe_recycle.register_and_clone_if_activated`. Doing the deletion on the new cloned page will not occur as it not old. And the registered page's remset will be deleted by the destructor when the `_safe_recycle` scope quest up the `safe_destroy`. > > To be able to push the deletion all the way into `prepare_to_recycle` the unnecessary use of this mechanism had to be removed. `free_pages_alloc_failed` does not need to protect the pages, as they are not yet present in the PageTable. We have simply taken them out of the cache, but failed to commit or map some memory, so we are putting these pages back into the cache. See bed9c260bbc9bd208b03d7eedd4e2cfa151b58f2 > > The fix works without this last commit. So we must be careful to check that these pages cannot be reached by some other means. The FoundOld bitmap iteration goes through the PageTable so even if an old page was registered, we would not find these pages. > > There is a scary lack of a fence between the removal of the page from the PageTable and the lock in `register_and_clone_if_activated`. > > The stress test will deterministically crash with this modified code 0756e0056b44ee16bee81256f556c8df981ceaf9 and using these options `-XX:+UseZGC -XX:+UseNewCode -XX:ZCollectionIntervalMinor=0.1 -XX:ZCollectionIntervalMajor=1 -XX:ZFragmentationLimit=0 -XX:-CreateCoredumpOnCrash`, and no longer does after with this patch. Looks good! ------------- Marked as reviewed by jsikstro (Committer). PR Review: https://git.openjdk.org/jdk/pull/21905#pullrequestreview-2415938963 From iwalulya at openjdk.org Tue Nov 5 16:28:27 2024 From: iwalulya at openjdk.org (Ivan Walulya) Date: Tue, 5 Nov 2024 16:28:27 GMT Subject: RFR: 8339162: [REDO] JDK-8338440 Parallel: Improve fragmentation mitigation in Full GC In-Reply-To: References: Message-ID: On Mon, 4 Nov 2024 10:55:45 GMT, Albert Mingkun Yang wrote: > This PR consists of two commits, the original and bug-fix. > > The original patch calculates the dest-count for the preceding live words incorrectly -- `preceding_destination` can be on region-boundary. > > Test: TEST=gc/TestSoftReferencesBehaviorOnOOME.java fails ~4/100 without the fix but passes with the fix. LGTM! Shouldn't the contributors in the original be added to this redo? ------------- Marked as reviewed by iwalulya (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/21872#pullrequestreview-2416131096 From stuefe at openjdk.org Tue Nov 5 16:40:54 2024 From: stuefe at openjdk.org (Thomas Stuefe) Date: Tue, 5 Nov 2024 16:40:54 GMT Subject: RFR: 8305895: Implement JEP 450: Compact Object Headers (Experimental) [v53] In-Reply-To: References:

Message-ID: <5EgL-mJp75JLOxEccrrGVxbfS6QdUywRSfsOcgx4zl8=.3c283bf3-3e2e-4fe2-bce5-c30d7d4e2da4@github.com> On Thu, 24 Oct 2024 21:04:51 GMT, Roman Kennke wrote: >> This is the main body of the JEP 450: Compact Object Headers (Experimental). >> >> It is also a follow-up to #20640, which now also includes (and supersedes) #20603 and #20605, plus the Tiny Class-Pointers parts that have been previously missing. >> >> Main changes: >> - Introduction of the (experimental) flag UseCompactObjectHeaders. All changes in this PR are protected by this flag. The purpose of the flag is to provide a fallback, in case that users unexpectedly observe problems with the new implementation. The intention is that this flag will remain experimental and opt-in for at least one release, then make it on-by-default and diagnostic (?), and eventually deprecate and obsolete it. However, there are a few unknowns in that plan, specifically, we may want to further improve compact headers to 4 bytes, we are planning to enhance the Klass* encoding to support virtually unlimited number of Klasses, at which point we could also obsolete UseCompressedClassPointers. >> - The compressed Klass* can now be stored in the mark-word of objects. In order to be able to do this, we are add some changes to GC forwarding (see below) to protect the relevant (upper 22) bits of the mark-word. Significant parts of this PR deal with loading the compressed Klass* from the mark-word. This PR also changes some code paths (mostly in GCs) to be more careful when accessing Klass* (or mark-word or size) to be able to fetch it from the forwardee in case the object is forwarded. >> - Self-forwarding in GCs (which is used to deal with promotion failure) now uses a bit to indicate 'self-forwarding'. This is needed to preserve the crucial Klass* bits in the header. This also allows to get rid of preserved-header machinery in SerialGC and G1 (Parallel GC abuses preserved-marks to also find all other relevant oops). >> - Full GC forwarding now uses an encoding similar to compressed-oops. We have 40 bits for that, and can encode up to 8TB of heap. When exceeding 8TB, we turn off UseCompressedClassPointers (except in ZGC, which doesn't use the GC forwarding at all). >> - Instances can now have their base-offset (the offset where the field layouter starts to place fields) at offset 8 (instead of 12 or 16). >> - Arrays will now store their length at offset 8. >> - CDS can now write and read archives with the compressed header. However, it is not possible to read an archive that has been written with an opposite setting of UseCompactObjectHeaders. Some build machinery is added so that _co... > > Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: > > Enable riscv in CompressedClassPointersEncodingScheme test Went again through all the changes, with focus on runtime code. Still good. ------------- Marked as reviewed by stuefe (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20677#pullrequestreview-2416155892 From amitkumar at openjdk.org Tue Nov 5 16:49:01 2024 From: amitkumar at openjdk.org (Amit Kumar) Date: Tue, 5 Nov 2024 16:49:01 GMT Subject: RFR: 8305895: Implement JEP 450: Compact Object Headers (Experimental) [v50] In-Reply-To: References:

Message-ID: On Tue, 5 Nov 2024 16:43:35 GMT, Roman Kennke wrote: >Hi Amit, sorry I only now get to reply to this, I have been traveling. What does the change do? Is it critical? Would it be possible to fix it after I intergrated the JEP? Because any change that I do now invalidates existing reviews, and might delay integration, and we're already running pretty close to RDP1. If at all possible, I would prefer to take it after I intergrated the JEP - we can have fixes well after RDP1, but not new features. If you agree, then please file a follow-up issue. That's perfectly fine. I will do it with separate RFE :-) ------------- PR Comment: https://git.openjdk.org/jdk/pull/20677#issuecomment-2457680086 From rkennke at openjdk.org Tue Nov 5 16:49:01 2024 From: rkennke at openjdk.org (Roman Kennke) Date: Tue, 5 Nov 2024 16:49:01 GMT Subject: RFR: 8305895: Implement JEP 450: Compact Object Headers (Experimental) [v50] In-Reply-To: References:

Message-ID: On Tue, 22 Oct 2024 16:22:20 GMT, Roman Kennke wrote: >> Roman Kennke has updated the pull request incrementally with two additional commits since the last revision: >> >> - Update copyright >> - Avoid assert/endless-loop in JFR code > > @egahlin / @mgronlun could you please review the JFR parts of this PR? One change is for getting the right prototype header, the other is for avoiding an endless loop/assert in a corner case. > @rkennke can you include this small update for s390x as well: > > ```diff > diff --git a/src/hotspot/cpu/s390/c1_MacroAssembler_s390.cpp b/src/hotspot/cpu/s390/c1_MacroAssembler_s390.cpp > index 0f7e5c9f457..476e3d5daa4 100644 > --- a/src/hotspot/cpu/s390/c1_MacroAssembler_s390.cpp > +++ b/src/hotspot/cpu/s390/c1_MacroAssembler_s390.cpp > @@ -174,8 +174,11 @@ void C1_MacroAssembler::try_allocate( > void C1_MacroAssembler::initialize_header(Register obj, Register klass, Register len, Register Rzero, Register t1) { > assert_different_registers(obj, klass, len, t1, Rzero); > if (UseCompactObjectHeaders) { > - z_lg(t1, Address(klass, in_bytes(Klass::prototype_header_offset()))); > - z_stg(t1, Address(obj, oopDesc::mark_offset_in_bytes())); > + z_mvc( > + Address(obj, oopDesc::mark_offset_in_bytes()), /* move to */ > + Address(klass, in_bytes(Klass::prototype_header_offset())), /* move from */ > + sizeof(markWord) /* how much to move */ > + ); > } else { > load_const_optimized(t1, (intx)markWord::prototype().value()); > z_stg(t1, Address(obj, oopDesc::mark_offset_in_bytes())); > diff --git a/src/hotspot/cpu/s390/c2_MacroAssembler_s390.cpp b/src/hotspot/cpu/s390/c2_MacroAssembler_s390.cpp > index 378d5e4cfe1..c5713161bf9 100644 > --- a/src/hotspot/cpu/s390/c2_MacroAssembler_s390.cpp > +++ b/src/hotspot/cpu/s390/c2_MacroAssembler_s390.cpp > @@ -46,7 +46,7 @@ void C2_MacroAssembler::load_narrow_klass_compact_c2(Register dst, Address src) > // The incoming address is pointing into obj-start + klass_offset_in_bytes. We need to extract > // obj-start, so that we can load from the object's mark-word instead. > z_lg(dst, src.plus_disp(-oopDesc::klass_offset_in_bytes())); > - z_srlg(dst, dst, markWord::klass_shift); // TODO: could be z_sra > + z_srlg(dst, dst, markWord::klass_shift); > } > > //------------------------------------------------------ > diff --git a/src/hotspot/cpu/s390/templateTable_s390.cpp b/src/hotspot/cpu/s390/templateTable_s390.cpp > index 3cb1aba810d..5b8f7a20478 100644 > --- a/src/hotspot/cpu/s390/templateTable_s390.cpp > +++ b/src/hotspot/cpu/s390/templateTable_s390.cpp > @@ -3980,8 +3980,11 @@ void TemplateTable::_new() { > // Initialize object header only. > __ bind(initialize_header); > if (UseCompactObjectHeaders) { > - __ z_lg(tmp, Address(iklass, in_bytes(Klass::prototype_header_offset()))); > - __ z_stg(tmp, Address(RallocatedObject, oopDesc::mark_offset_in_bytes())); > + __ z_mvc( > + Address(RallocatedObject, oopDesc::mark_offset_in_bytes()), // move to > + Address(iklass, in_bytes(Klass::prototype_header_offset())), // move from > + sizeof(markWord) // how much to move > + ); > } else { > __ store_const(Address(RallocatedObject, oopDesc::mark_offset_in_bytes()), > (long) markWord::prototype().value()); > ``` Hi Amit, sorry I only now get to reply to this, I have been traveling. What does the change do? Is it critical? Would it be possible to fix it after I intergrated the JEP? Because any change that I do now invalidates existing reviews, and might delay integration, and we're already running pretty close to RDP1. If at all possible, I would prefer to take it after I intergrated the JEP - we can have fixes well after RDP1, but not new features. If you agree, then please file a follow-up issue. ------------- PR Comment: https://git.openjdk.org/jdk/pull/20677#issuecomment-2457674486 From kvn at openjdk.org Tue Nov 5 18:28:33 2024 From: kvn at openjdk.org (Vladimir Kozlov) Date: Tue, 5 Nov 2024 18:28:33 GMT Subject: RFR: 8343173: Remove ZGC-specific non-JVMCI test groups [v2] In-Reply-To: References:

Message-ID: On Thu, 31 Oct 2024 16:13:53 GMT, Leonid Mesnik wrote: >> The JVMCI should be supported by all GCs and specific >> hotspot_compiler_all_gcs >> group is not needed anymore. >> >> There are few failures of JVMCI tests with ZGC happened, the bug >> https://bugs.openjdk.org/browse/JDK-8343233 >> is filed and corresponding tests are problemlisted. > > Leonid Mesnik has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains three commits: > > - typo fixed > - Merge branch 'master' of https://github.com/openjdk/jdk into 8343173 > - 8343173: Remove ZGC-specific non-JVMCI test groups Good. ------------- Marked as reviewed by kvn (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/21774#pullrequestreview-2416399409 From rkennke at openjdk.org Tue Nov 5 20:00:31 2024 From: rkennke at openjdk.org (Roman Kennke) Date: Tue, 5 Nov 2024 20:00:31 GMT Subject: RFR: 8305895: Implement JEP 450: Compact Object Headers (Experimental) [v54] In-Reply-To: References: Message-ID: > This is the main body of the JEP 450: Compact Object Headers (Experimental). > > It is also a follow-up to #20640, which now also includes (and supersedes) #20603 and #20605, plus the Tiny Class-Pointers parts that have been previously missing. > > Main changes: > - Introduction of the (experimental) flag UseCompactObjectHeaders. All changes in this PR are protected by this flag. The purpose of the flag is to provide a fallback, in case that users unexpectedly observe problems with the new implementation. The intention is that this flag will remain experimental and opt-in for at least one release, then make it on-by-default and diagnostic (?), and eventually deprecate and obsolete it. However, there are a few unknowns in that plan, specifically, we may want to further improve compact headers to 4 bytes, we are planning to enhance the Klass* encoding to support virtually unlimited number of Klasses, at which point we could also obsolete UseCompressedClassPointers. > - The compressed Klass* can now be stored in the mark-word of objects. In order to be able to do this, we are add some changes to GC forwarding (see below) to protect the relevant (upper 22) bits of the mark-word. Significant parts of this PR deal with loading the compressed Klass* from the mark-word. This PR also changes some code paths (mostly in GCs) to be more careful when accessing Klass* (or mark-word or size) to be able to fetch it from the forwardee in case the object is forwarded. > - Self-forwarding in GCs (which is used to deal with promotion failure) now uses a bit to indicate 'self-forwarding'. This is needed to preserve the crucial Klass* bits in the header. This also allows to get rid of preserved-header machinery in SerialGC and G1 (Parallel GC abuses preserved-marks to also find all other relevant oops). > - Full GC forwarding now uses an encoding similar to compressed-oops. We have 40 bits for that, and can encode up to 8TB of heap. When exceeding 8TB, we turn off UseCompressedClassPointers (except in ZGC, which doesn't use the GC forwarding at all). > - Instances can now have their base-offset (the offset where the field layouter starts to place fields) at offset 8 (instead of 12 or 16). > - Arrays will now store their length at offset 8. > - CDS can now write and read archives with the compressed header. However, it is not possible to read an archive that has been written with an opposite setting of UseCompactObjectHeaders. Some build machinery is added so that _coh variants of CDS archiv... Roman Kennke has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 104 commits: - Merge tag 'jdk-24+22' into JDK-8305895-v4 Added tag jdk-24+22 for changeset 388d44fb - Enable riscv in CompressedClassPointersEncodingScheme test - s390 port - Conditionalize platform specific parts of CompressedClassPointersEncodingScheme test - Update copyright - Avoid assert/endless-loop in JFR code - Update copyright headers - Merge tag 'jdk-24+20' into JDK-8305895-v4 Added tag jdk-24+20 for changeset 7a64fbbb - Fix needle copying in indexOf intrinsic for smaller headers - Compact header riscv (#3) Implement compact headers on RISCV --------- Co-authored-by: hamlin - ... and 94 more: https://git.openjdk.org/jdk/compare/388d44fb...b945822a ------------- Changes: https://git.openjdk.org/jdk/pull/20677/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=20677&range=53 Stats: 5214 lines in 218 files changed: 3587 ins; 864 del; 763 mod Patch: https://git.openjdk.org/jdk/pull/20677.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20677/head:pull/20677 PR: https://git.openjdk.org/jdk/pull/20677 From lmesnik at openjdk.org Tue Nov 5 20:55:35 2024 From: lmesnik at openjdk.org (Leonid Mesnik) Date: Tue, 5 Nov 2024 20:55:35 GMT Subject: Integrated: 8343173: Remove ZGC-specific non-JVMCI test groups In-Reply-To: References: Message-ID: <-1bZpI933zmujmTibsiiOkDdxnlxnKEGVGAPlqfvYik=.a0981eca-c8da-466c-a209-b266afea8513@github.com> On Tue, 29 Oct 2024 22:01:08 GMT, Leonid Mesnik wrote: > The JVMCI should be supported by all GCs and specific > hotspot_compiler_all_gcs > group is not needed anymore. > > There are few failures of JVMCI tests with ZGC happened, the bug > https://bugs.openjdk.org/browse/JDK-8343233 > is filed and corresponding tests are problemlisted. This pull request has now been integrated. Changeset: 847cc5eb Author: Leonid Mesnik URL: https://git.openjdk.org/jdk/commit/847cc5ebac43b83746d8f238c5f9ecf2972a2796 Stats: 12 lines in 2 files changed: 8 ins; 4 del; 0 mod 8343173: Remove ZGC-specific non-JVMCI test groups Reviewed-by: kvn ------------- PR: https://git.openjdk.org/jdk/pull/21774 From gli at openjdk.org Wed Nov 6 03:54:28 2024 From: gli at openjdk.org (Guoxiong Li) Date: Wed, 6 Nov 2024 03:54:28 GMT Subject: RFR: 8339162: [REDO] JDK-8338440 Parallel: Improve fragmentation mitigation in Full GC In-Reply-To: References: Message-ID: On Mon, 4 Nov 2024 10:55:45 GMT, Albert Mingkun Yang wrote: > This PR consists of two commits, the original and bug-fix. > > The original patch calculates the dest-count for the preceding live words incorrectly -- `preceding_destination` can be on region-boundary. > > Test: TEST=gc/TestSoftReferencesBehaviorOnOOME.java fails ~4/100 without the fix but passes with the fix. Looks good. Nice found. The `region_align_up` is not the `next_region_start_address`. ------------- Marked as reviewed by gli (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/21872#pullrequestreview-2417193301 From gli at openjdk.org Wed Nov 6 04:01:33 2024 From: gli at openjdk.org (Guoxiong Li) Date: Wed, 6 Nov 2024 04:01:33 GMT Subject: RFR: 8339162: [REDO] JDK-8338440 Parallel: Improve fragmentation mitigation in Full GC In-Reply-To: References:

Message-ID: On Wed, 6 Nov 2024 03:51:55 GMT, Guoxiong Li wrote: > The `region_align_up` is not the `next_region_start_address`. Even an experienced developer would misuse the function `region_align_up`, it may be good to add comment (in another PR?) to `region_align_up` to clarify its usage. ------------- PR Comment: https://git.openjdk.org/jdk/pull/21872#issuecomment-2458679167 From ayang at openjdk.org Wed Nov 6 08:10:06 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 6 Nov 2024 08:10:06 GMT Subject: RFR: 8343658: Parallel: Implement block_start for Young generation Message-ID: Simple block_start implementation for Parallel young-gen. Related to https://github.com/openjdk/jdk/pull/21870 Test: tier1-3 ------------- Commit messages: - pgc-block-start Changes: https://git.openjdk.org/jdk/pull/21919/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=21919&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8343658 Stats: 28 lines in 3 files changed: 24 ins; 1 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/21919.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21919/head:pull/21919 PR: https://git.openjdk.org/jdk/pull/21919 From rkennke at openjdk.org Wed Nov 6 09:13:46 2024 From: rkennke at openjdk.org (Roman Kennke) Date: Wed, 6 Nov 2024 09:13:46 GMT Subject: RFR: 8305895: Implement JEP 450: Compact Object Headers (Experimental) [v55] In-Reply-To: References: Message-ID: > This is the main body of the JEP 450: Compact Object Headers (Experimental). > > It is also a follow-up to #20640, which now also includes (and supersedes) #20603 and #20605, plus the Tiny Class-Pointers parts that have been previously missing. > > Main changes: > - Introduction of the (experimental) flag UseCompactObjectHeaders. All changes in this PR are protected by this flag. The purpose of the flag is to provide a fallback, in case that users unexpectedly observe problems with the new implementation. The intention is that this flag will remain experimental and opt-in for at least one release, then make it on-by-default and diagnostic (?), and eventually deprecate and obsolete it. However, there are a few unknowns in that plan, specifically, we may want to further improve compact headers to 4 bytes, we are planning to enhance the Klass* encoding to support virtually unlimited number of Klasses, at which point we could also obsolete UseCompressedClassPointers. > - The compressed Klass* can now be stored in the mark-word of objects. In order to be able to do this, we are add some changes to GC forwarding (see below) to protect the relevant (upper 22) bits of the mark-word. Significant parts of this PR deal with loading the compressed Klass* from the mark-word. This PR also changes some code paths (mostly in GCs) to be more careful when accessing Klass* (or mark-word or size) to be able to fetch it from the forwardee in case the object is forwarded. > - Self-forwarding in GCs (which is used to deal with promotion failure) now uses a bit to indicate 'self-forwarding'. This is needed to preserve the crucial Klass* bits in the header. This also allows to get rid of preserved-header machinery in SerialGC and G1 (Parallel GC abuses preserved-marks to also find all other relevant oops). > - Full GC forwarding now uses an encoding similar to compressed-oops. We have 40 bits for that, and can encode up to 8TB of heap. When exceeding 8TB, we turn off UseCompressedClassPointers (except in ZGC, which doesn't use the GC forwarding at all). > - Instances can now have their base-offset (the offset where the field layouter starts to place fields) at offset 8 (instead of 12 or 16). > - Arrays will now store their length at offset 8. > - CDS can now write and read archives with the compressed header. However, it is not possible to read an archive that has been written with an opposite setting of UseCompactObjectHeaders. Some build machinery is added so that _coh variants of CDS archiv... Roman Kennke has updated the pull request incrementally with one additional commit since the last revision: Fix gen-ZGC removal ------------- Changes: - all: https://git.openjdk.org/jdk/pull/20677/files - new: https://git.openjdk.org/jdk/pull/20677/files/b945822a..1ea4de16 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=20677&range=54 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=20677&range=53-54 Stats: 2 lines in 1 file changed: 0 ins; 2 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/20677.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20677/head:pull/20677 PR: https://git.openjdk.org/jdk/pull/20677 From stuefe at openjdk.org Wed Nov 6 09:13:47 2024 From: stuefe at openjdk.org (Thomas Stuefe) Date: Wed, 6 Nov 2024 09:13:47 GMT Subject: RFR: 8305895: Implement JEP 450: Compact Object Headers (Experimental) [v50] In-Reply-To: References:

Message-ID: On Tue, 5 Nov 2024 16:43:35 GMT, Roman Kennke wrote: >> @egahlin / @mgronlun could you please review the JFR parts of this PR? One change is for getting the right prototype header, the other is for avoiding an endless loop/assert in a corner case. > >> @rkennke can you include this small update for s390x as well: >> >> ```diff >> diff --git a/src/hotspot/cpu/s390/c1_MacroAssembler_s390.cpp b/src/hotspot/cpu/s390/c1_MacroAssembler_s390.cpp >> index 0f7e5c9f457..476e3d5daa4 100644 >> --- a/src/hotspot/cpu/s390/c1_MacroAssembler_s390.cpp >> +++ b/src/hotspot/cpu/s390/c1_MacroAssembler_s390.cpp >> @@ -174,8 +174,11 @@ void C1_MacroAssembler::try_allocate( >> void C1_MacroAssembler::initialize_header(Register obj, Register klass, Register len, Register Rzero, Register t1) { >> assert_different_registers(obj, klass, len, t1, Rzero); >> if (UseCompactObjectHeaders) { >> - z_lg(t1, Address(klass, in_bytes(Klass::prototype_header_offset()))); >> - z_stg(t1, Address(obj, oopDesc::mark_offset_in_bytes())); >> + z_mvc( >> + Address(obj, oopDesc::mark_offset_in_bytes()), /* move to */ >> + Address(klass, in_bytes(Klass::prototype_header_offset())), /* move from */ >> + sizeof(markWord) /* how much to move */ >> + ); >> } else { >> load_const_optimized(t1, (intx)markWord::prototype().value()); >> z_stg(t1, Address(obj, oopDesc::mark_offset_in_bytes())); >> diff --git a/src/hotspot/cpu/s390/c2_MacroAssembler_s390.cpp b/src/hotspot/cpu/s390/c2_MacroAssembler_s390.cpp >> index 378d5e4cfe1..c5713161bf9 100644 >> --- a/src/hotspot/cpu/s390/c2_MacroAssembler_s390.cpp >> +++ b/src/hotspot/cpu/s390/c2_MacroAssembler_s390.cpp >> @@ -46,7 +46,7 @@ void C2_MacroAssembler::load_narrow_klass_compact_c2(Register dst, Address src) >> // The incoming address is pointing into obj-start + klass_offset_in_bytes. We need to extract >> // obj-start, so that we can load from the object's mark-word instead. >> z_lg(dst, src.plus_disp(-oopDesc::klass_offset_in_bytes())); >> - z_srlg(dst, dst, markWord::klass_shift); // TODO: could be z_sra >> + z_srlg(dst, dst, markWord::klass_shift); >> } >> >> //------------------------------------------------------ >> diff --git a/src/hotspot/cpu/s390/templateTable_s390.cpp b/src/hotspot/cpu/s390/templateTable_s390.cpp >> index 3cb1aba810d..5b8f7a20478 100644 >> --- a/src/hotspot/cpu/s390/templateTable_s390.cpp >> +++ b/src/hotspot/cpu/s390/templateTable_s390.cpp >> @@ -3980,8 +3980,11 @@ void TemplateTable::_new() { >> // Initialize object header only. >> __ bind(initialize_header); >> if (UseCompactObjectHeaders) { >> - __ z_lg(tmp, Address(iklass, in_bytes(Klass::prototype_header_offset()))); >> - __ z_stg(tmp, Address(RallocatedObject, oo... Merge is good. @rkennke patch for the new test errors due to removal of non-generational ZGC: https://gist.github.com/tstuefe/321b769d3b281198b767b68e18bb7271 ------------- PR Comment: https://git.openjdk.org/jdk/pull/20677#issuecomment-2459069232 From simonis at openjdk.org Wed Nov 6 14:17:32 2024 From: simonis at openjdk.org (Volker Simonis) Date: Wed, 6 Nov 2024 14:17:32 GMT Subject: RFR: 8343658: Parallel: Implement block_start for Young generation In-Reply-To: References: Message-ID: <0JdxCwM0B41HAGRr1A5ch4WdP8Gmi2uAd2mgjqW5gNM=.9a485a86-bce3-4a41-b5fb-e17f48992182@github.com> On Wed, 6 Nov 2024 08:04:17 GMT, Albert Mingkun Yang wrote: > Simple block_start implementation for Parallel young-gen. Related to https://github.com/openjdk/jdk/pull/21870 > > Test: tier1-3 src/hotspot/share/gc/parallel/mutableSpace.cpp line 239: > 237: > 238: HeapWord* cur_addr = bottom(); > 239: while (cur_addr <= addr) { As already described in https://github.com/openjdk/jdk/pull/21870#issuecomment-2454964142, this will not work in the general case, if the heap is not walkable. In a debug build you'll run into the assertion once you arrive in the unallcoated TLAB area (which you don't want during error reporting). Even worse, in the product build you can crash or run this loop infinitely, depending on what data `obj->size()` will find in the unallocated TLAB space. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/21919#discussion_r1831091387 From simonis at openjdk.org Wed Nov 6 14:24:31 2024 From: simonis at openjdk.org (Volker Simonis) Date: Wed, 6 Nov 2024 14:24:31 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers In-Reply-To: <9WJcuKHuAyqcP1vaCwtvsJqBWLptNyG2kFHFqp_Xl04=.bf12af6b-ad13-4a21-8259-1b19d770ec71@github.com> References:

Message-ID: On Mon, 4 Nov 2024 10:46:02 GMT, Volker Simonis wrote: >> Currently `BlockLocationPrinter::print_location()` checks for a pointer if it points into the heap and if that's true, it either prints it as an oop if `is_valid_obj()` is true or it tries to find the the start address of an oop for that pointer by calling `CollectedHeapT::heap()->block_start()`. >> >> However, the `block_start()` functionality is not fully implemented for all GCs (e.g. the young generation of `ParallelScavengeHeap`) and for these cases `block_start()` returns NULL. Because of this NULL return value `os::print_location()` will finally qualify the corresponding pointer as pointing "into unknown readable memory" although we already know that it actually points into an invalid heap area. >> >> In such cases, print at least that the pointer is pointing into an unknown part of the heap instead of just saying that it points into unknown memory. >> >> I've manually tested the new functionality in GDB. > > Volker Simonis has updated the pull request incrementally with one additional commit since the last revision: > > Small refactoring based on tschatzl's review I think this is good on its own. ------------- Marked as reviewed by ayang (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/21870#pullrequestreview-2419079944 From ayang at openjdk.org Wed Nov 6 18:11:29 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Wed, 6 Nov 2024 18:11:29 GMT Subject: RFR: 8343658: Parallel: Implement block_start for Young generation In-Reply-To: <0JdxCwM0B41HAGRr1A5ch4WdP8Gmi2uAd2mgjqW5gNM=.9a485a86-bce3-4a41-b5fb-e17f48992182@github.com> References: <0JdxCwM0B41HAGRr1A5ch4WdP8Gmi2uAd2mgjqW5gNM=.9a485a86-bce3-4a41-b5fb-e17f48992182@github.com> Message-ID: <2DQOnQWFzESExHcKdsf2xdbDw46GDuyYLEQ_OTEHpc8=.e96b5363-65f2-4f73-afe3-85f13d928c62@github.com> On Wed, 6 Nov 2024 14:14:14 GMT, Volker Simonis wrote: > this will not work in the general case, if the heap is not walkable. True, but this is the best-effort approach used in other GCs, as far as I can tell. Is there a real use case that warrants a more sophisticated variant? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/21919#discussion_r1831500873 From stuefe at openjdk.org Thu Nov 7 10:50:48 2024 From: stuefe at openjdk.org (Thomas Stuefe) Date: Thu, 7 Nov 2024 10:50:48 GMT Subject: RFR: 8343658: Parallel: Implement block_start for Young generation In-Reply-To: <2DQOnQWFzESExHcKdsf2xdbDw46GDuyYLEQ_OTEHpc8=.e96b5363-65f2-4f73-afe3-85f13d928c62@github.com> References: <0JdxCwM0B41HAGRr1A5ch4WdP8Gmi2uAd2mgjqW5gNM=.9a485a86-bce3-4a41-b5fb-e17f48992182@github.com> <2DQOnQWFzESExHcKdsf2xdbDw46GDuyYLEQ_OTEHpc8=.e96b5363-65f2-4f73-afe3-85f13d928c62@github.com> Message-ID: On Wed, 6 Nov 2024 18:08:43 GMT, Albert Mingkun Yang wrote: >> src/hotspot/share/gc/parallel/mutableSpace.cpp line 239: >> >>> 237: >>> 238: HeapWord* cur_addr = bottom(); >>> 239: while (cur_addr <= addr) { >> >> As already described in https://github.com/openjdk/jdk/pull/21870#issuecomment-2454964142, this will not work in the general case, if the heap is not walkable. In a debug build you'll run into the assertion once you arrive in the unallcoated TLAB area (which you don't want during error reporting). >> Even worse, in the product build you can crash or run this loop infinitely, depending on what data `obj->size()` will find in the unallocated TLAB space. > >> this will not work in the general case, if the heap is not walkable. > > True, but this is the best-effort approach used in other GCs, as far as I can tell. Is there a real use case that warrants a more sophisticated variant? What would be nice would be something like `oopDesc::safe_klass_or_null()` or similar, feeding into a corresponding `oopDesc::size_given_klass_safe_or_0()`. The former would check the klass word for validity before dereferencing - `CompressedKlassPointers::is_encodable(p)` and then the load of layouthelper etc should happen with SafeFetch. Alternatively (and a bit more unsafe), check the readability of Klass* with SafeFetch beforehand, then call normal size_given_klass. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/21919#discussion_r1832464222 From simonis at openjdk.org Thu Nov 7 12:13:47 2024 From: simonis at openjdk.org (Volker Simonis) Date: Thu, 7 Nov 2024 12:13:47 GMT Subject: Integrated: 8343531: Improve print_location for invalid heap pointers In-Reply-To: References: Message-ID: On Mon, 4 Nov 2024 09:43:18 GMT, Volker Simonis wrote: > Currently `BlockLocationPrinter::print_location()` checks for a pointer if it points into the heap and if that's true, it either prints it as an oop if `is_valid_obj()` is true or it tries to find the the start address of an oop for that pointer by calling `CollectedHeapT::heap()->block_start()`. > > However, the `block_start()` functionality is not fully implemented for all GCs (e.g. the young generation of `ParallelScavengeHeap`) and for these cases `block_start()` returns NULL. Because of this NULL return value `os::print_location()` will finally qualify the corresponding pointer as pointing "into unknown readable memory" although we already know that it actually points into an invalid heap area. > > In such cases, print at least that the pointer is pointing into an unknown part of the heap instead of just saying that it points into unknown memory. > > I've manually tested the new functionality in GDB. This pull request has now been integrated. Changeset: f0b251d7 Author: Volker Simonis URL: https://git.openjdk.org/jdk/commit/f0b251d76078e8d5b47e967b0449c4cbdcb5a005 Stats: 8 lines in 1 file changed: 6 ins; 0 del; 2 mod 8343531: Improve print_location for invalid heap pointers Reviewed-by: shade, tschatzl, ayang ------------- PR: https://git.openjdk.org/jdk/pull/21870 From simonis at openjdk.org Thu Nov 7 12:13:47 2024 From: simonis at openjdk.org (Volker Simonis) Date: Thu, 7 Nov 2024 12:13:47 GMT Subject: RFR: 8343531: Improve print_location for invalid heap pointers In-Reply-To: <9WJcuKHuAyqcP1vaCwtvsJqBWLptNyG2kFHFqp_Xl04=.bf12af6b-ad13-4a21-8259-1b19d770ec71@github.com> References:

<9WJcuKHuAyqcP1vaCwtvsJqBWLptNyG2kFHFqp_Xl04=.bf12af6b-ad13-4a21-8259-1b19d770ec71@github.com> Message-ID: On Tue, 5 Nov 2024 05:35:57 GMT, Albert Mingkun Yang wrote: >>> > but I think it is not trivial. >>> >>> I was thinking copying the Serial impl into `ParallelScavengeHeap::block_start`; nothing sophisticated. >>> >> >> Unfortunately, the Serial implementation doesn't really work reliably if running with `-XX:+UseTLAB` (which is the default). If called with a pointer which points into unallocated TLAB buffer, `ContiguousSpace::block_start_const()` will just crash with a SIGSEGV (or a secondary crash during error reporting when called from `VMError`): >> >> #0 0x00007ffff57d78ce in oopDesc::size_given_klass (this=0x7ffde5616c70, klass=0x7ffda2000000) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/oops/oop.inline.hpp:196 >> #1 0x00007ffff57d7756 in oopDesc::size (this=0x7ffde5616c70) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/oops/oop.inline.hpp:153 >> #2 0x00007ffff689a421 in ContiguousSpace::block_start_const (this=0x7ffff004c880, p=0x7ffde5616ca0) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/shared/space.cpp:565 >> #3 0x00007ffff689b7ba in Space::block_start (this=0x7ffff004c880, p=0x7ffde5616ca0) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/shared/space.inline.hpp:43 >> #4 0x00007ffff60f4144 in GenerationBlockStartClosure::do_space (this=0x7ffff530ef30, s=0x7ffff004c880) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/shared/generation.cpp:191 >> #5 0x00007ffff5e560c5 in DefNewGeneration::space_iterate (this=0x7ffff004b9c0, blk=0x7ffff530ef30, usedOnly=false) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/serial/defNewGeneration.cpp:674 >> #6 0x00007ffff60f3527 in Generation::block_start (this=0x7ffff004b9c0, p=0x7ffde5616ca0) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/shared/generation.cpp:200 >> #7 0x00007ffff60e36e9 in GenCollectedHeap::block_start (this=0x7ffff0038450, addr=0x7ffde5616ca0) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/shared/genCollectedHeap.cpp:884 >> #8 0x00007ffff60e5b97 in BlockLocationPrinter::base_oop_or_null (addr=0x7ffde5616ca0) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/shared/locationPrinter.inline.hpp:41 >> #9 0x00007ffff60e592b in BlockLocationPrinter::print_location (st=0x7ffff0000b60, addr=0x7ffde5616ca0) at /priv/simonisv/OpenJDK/Git/jdk21u-dev/src/hotspot/share/gc/shared/locationPrinter.inline.hpp:56 >> #10 0x00007ffff60e43bd in GenCollectedHeap::print_location (this=0x7ffff0038450, st=0x7ffff0000b60, a... > >> And that's again because the heap is in general not walkable when we call this function. > > It depends on exactly when this function can be called, and with what arg. I wonder whether it can be called with a pointer to a obj that has not been properly initialized (with klass); if so, the heap is almost never walkable, since allocation is not atomic. > >> the Serial implementation doesn't really work reliably > > I am curious if other GCs' impl work (more) reliably, with regarding to the tlab example. Thanks @albertnetymk, @tschatzl and @shipilev for your reviews. ------------- PR Comment: https://git.openjdk.org/jdk/pull/21870#issuecomment-2462078214 From simonis at openjdk.org Thu Nov 7 12:40:43 2024 From: simonis at openjdk.org (Volker Simonis) Date: Thu, 7 Nov 2024 12:40:43 GMT Subject: RFR: 8343658: Parallel: Implement block_start for Young generation In-Reply-To: References: <0JdxCwM0B41HAGRr1A5ch4WdP8Gmi2uAd2mgjqW5gNM=.9a485a86-bce3-4a41-b5fb-e17f48992182@github.com> <2DQOnQWFzESExHcKdsf2xdbDw46GDuyYLEQ_OTEHpc8=.e96b5363-65f2-4f73-afe3-85f13d928c62@github.com> Message-ID: <-3P95fiUgh8CavaWr6qd1uMHTGqq4KrzbyO7YfIkCZc=.cf1aad7c-77b0-4151-9a91-7676216dbecd@github.com> On Thu, 7 Nov 2024 10:48:25 GMT, Thomas Stuefe wrote: >>> this will not work in the general case, if the heap is not walkable. >> >> True, but this is the best-effort approach used in other GCs, as far as I can tell. Is there a real use case that warrants a more sophisticated variant? > > What would be nice would be something like `oopDesc::safe_klass_or_null()` or similar, feeding into a corresponding `oopDesc::size_given_klass_safe_or_0()`. The former would check the klass word for validity before dereferencing - `CompressedKlassPointers::is_encodable(p)` and then the load of layouthelper etc should happen with SafeFetch. Alternatively (and a bit more unsafe), check the readability of Klass* with SafeFetch beforehand, then call normal size_given_klass. > > this will not work in the general case, if the heap is not walkable. > > True, but this is the best-effort approach used in other GCs, as far as I can tell. Is there a real use case that warrants a more sophisticated variant? The **only** use case for this code during hs_err reporting for heap-addresses not pointing at the beginning of an oop. I think we should be conservative here, because a secondary crash will cut the information available in the hs_err file and will therefor do more harm then being helpful. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/21919#discussion_r1832612499 From simonis at openjdk.org Thu Nov 7 12:48:43 2024 From: simonis at openjdk.org (Volker Simonis) Date: Thu, 7 Nov 2024 12:48:43 GMT Subject: RFR: 8343658: Parallel: Implement block_start for Young generation In-Reply-To: <-3P95fiUgh8CavaWr6qd1uMHTGqq4KrzbyO7YfIkCZc=.cf1aad7c-77b0-4151-9a91-7676216dbecd@github.com> References: <0JdxCwM0B41HAGRr1A5ch4WdP8Gmi2uAd2mgjqW5gNM=.9a485a86-bce3-4a41-b5fb-e17f48992182@github.com> <2DQOnQWFzESExHcKdsf2xdbDw46GDuyYLEQ_OTEHpc8=.e96b5363-65f2-4f73-afe3-85f13d928c62@github.com> <-3P95fiUgh8CavaWr6qd1uMHTGqq4KrzbyO7YfIkCZc=.cf1aad7c-77b0-4151-9a91-7676216dbecd@github.com> Message-ID: On Thu, 7 Nov 2024 12:38:03 GMT, Volker Simonis wrote: >> What would be nice would be something like `oopDesc::safe_klass_or_null()` or similar, feeding into a corresponding `oopDesc::size_given_klass_safe_or_0()`. The former would check the klass word for validity before dereferencing - `CompressedKlassPointers::is_encodable(p)` and then the load of layouthelper etc should happen with SafeFetch. Alternatively (and a bit more unsafe), check the readability of Klass* with SafeFetch beforehand, then call normal size_given_klass. > >> > this will not work in the general case, if the heap is not walkable. >> >> True, but this is the best-effort approach used in other GCs, as far as I can tell. Is there a real use case that warrants a more sophisticated variant? > > The **only** use case for this code during hs_err reporting for heap-addresses not pointing at the beginning of an oop. I think we should be conservative here, because a secondary crash will cut the information available in the hs_err file and will therefor do more harm then being helpful. > What would be nice would be something like `oopDesc::safe_klass_or_null()` or similar, feeding into a corresponding `oopDesc::size_given_klass_safe_or_0()`. The former would check the klass word for validity before dereferencing - `CompressedKlassPointers::is_encodable(p)` and then the load of layouthelper etc should happen with SafeFetch. Alternatively (and a bit more unsafe), check the readability of Klass* with SafeFetch beforehand, then call normal size_given_klass. We already have [LocationPrinter::is_valid_obj()](https://github.com/openjdk/jdk/blob/ac82a8f89c7066fb1d379b12bcfd68053cb39ba4/src/hotspot/share/gc/shared/locationPrinter.cpp#L33) which uses [Klass::is_valid()](https://github.com/openjdk/jdk/blob/ac82a8f89c7066fb1d379b12bcfd68053cb39ba4/src/hotspot/share/oops/klass.cpp#L1038) to check the validity of an oop. I don't think we need `SafeFetch` here. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/21919#discussion_r1832623326 From thomas.stuefe at gmail.com Thu Nov 7 13:51:40 2024 From: thomas.stuefe at gmail.com (=?UTF-8?Q?Thomas_St=C3=BCfe?=) Date: Thu, 7 Nov 2024 14:51:40 +0100 Subject: ParallelGC, large old generation when optimizing for footprint goal Message-ID: Hi, I have a question about some odd behavior I observe when ParallelGC optimizes for footprint. If I omit giving a pause time goal and relax the throughput goal enough, the JVM should optimize for the footprint goal. But if the JVM was started with a small young gen (e.g. because the initial heap size was small), it seems to go into a tailspin where the young gen stays tiny or even shrinks more and more, resulting in lots of promotions, old gen grows until it hits the ceiling, Full GC, then the cycle repeats. That maximizes RAM use and thus runs counter to the footprint goal. Example: I run heapothesys/hyperalloc [1] with JDK 21. I run with an allocation pressure of 512MB/sec and a live set size of 128MB. `java -Xlog:gc* -Xmx8g -Xms512m -XX:+UseParallelGC -XX:GCTimeRatio=1 -jar ./target/HyperAlloc-1.0.jar -h 8192 -a 512 -s 128` One can observe how young gen starts at ?150MB, shrinks to ?60MB, and old gen grows till it hits the ceiling at ?5.5GB. Increasing the initial heap size mitigates the problem: Eden still shrinks but settles at a larger size. We still get very frequent young GCs, though. Ironically, the problem is more likely on containers with little RAM. Eden size depends on initial heap size, which depends on total RAM (even if -Xmx was set). Little RAM -> tiny Eden. Therefore, less RAM can cause the JVM to use more memory. That behavior can easily be observed with different values for MaxRAM: calling above program with -XX:MaxRAM=10g will cause the JVM to enter the tailspin immediately, the process peaks at >5GB RSS. The same program with -XX:MaxRAM=128g causes the process to use just ~1.2GB RSS since the young gen stays sensibly large and thus total heap size never grows that much. I looked into the tuning guide [2] but did not find information about how exactly the footprint goal is reached. For ParallelGC, it just states: "Footprint: The maximum heap footprint is specified using the option -Xmx. In addition, the collector has an implicit goal of minimizing the size of the heap as long as the other goals are being met." which looks to me like it should work with default settings, out of the box. Am I making a thinking error somewhere? Is this a bug or is this behavior expected? Thank you, Thomas [1] https://github.com/corretto/heapothesys/tree/master/HyperAlloc [2] https://docs.oracle.com/en/java/javase/11/gctuning/parallel-collector1.html#GUID-DCDD6E46-0406-41D1-AB49-FB96A50EB9CE -------------- next part -------------- An HTML attachment was scrubbed... URL: From rkennke at openjdk.org Thu Nov 7 16:58:36 2024 From: rkennke at openjdk.org (Roman Kennke) Date: Thu, 7 Nov 2024 16:58:36 GMT Subject: RFR: 8305895: Implement JEP 450: Compact Object Headers (Experimental) [v56] In-Reply-To: References: Message-ID: > This is the main body of the JEP 450: Compact Object Headers (Experimental). > > It is also a follow-up to #20640, which now also includes (and supersedes) #20603 and #20605, plus the Tiny Class-Pointers parts that have been previously missing. > > Main changes: > - Introduction of the (experimental) flag UseCompactObjectHeaders. All changes in this PR are protected by this flag. The purpose of the flag is to provide a fallback, in case that users unexpectedly observe problems with the new implementation. The intention is that this flag will remain experimental and opt-in for at least one release, then make it on-by-default and diagnostic (?), and eventually deprecate and obsolete it. However, there are a few unknowns in that plan, specifically, we may want to further improve compact headers to 4 bytes, we are planning to enhance the Klass* encoding to support virtually unlimited number of Klasses, at which point we could also obsolete UseCompressedClassPointers. > - The compressed Klass* can now be stored in the mark-word of objects. In order to be able to do this, we are add some changes to GC forwarding (see below) to protect the relevant (upper 22) bits of the mark-word. Significant parts of this PR deal with loading the compressed Klass* from the mark-word. This PR also changes some code paths (mostly in GCs) to be more careful when accessing Klass* (or mark-word or size) to be able to fetch it from the forwardee in case the object is forwarded. > - Self-forwarding in GCs (which is used to deal with promotion failure) now uses a bit to indicate 'self-forwarding'. This is needed to preserve the crucial Klass* bits in the header. This also allows to get rid of preserved-header machinery in SerialGC and G1 (Parallel GC abuses preserved-marks to also find all other relevant oops). > - Full GC forwarding now uses an encoding similar to compressed-oops. We have 40 bits for that, and can encode up to 8TB of heap. When exceeding 8TB, we turn off UseCompressedClassPointers (except in ZGC, which doesn't use the GC forwarding at all). > - Instances can now have their base-offset (the offset where the field layouter starts to place fields) at offset 8 (instead of 12 or 16). > - Arrays will now store their length at offset 8. > - CDS can now write and read archives with the compressed header. However, it is not possible to read an archive that has been written with an opposite setting of UseCompactObjectHeaders. Some build machinery is added so that _coh variants of CDS archiv... Roman Kennke has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 106 commits: - Merge tag 'jdk-25+23' into JDK-8305895-v4 Added tag jdk-24+23 for changeset c0e6c3b9 - Fix gen-ZGC removal - Merge tag 'jdk-24+22' into JDK-8305895-v4 Added tag jdk-24+22 for changeset 388d44fb - Enable riscv in CompressedClassPointersEncodingScheme test - s390 port - Conditionalize platform specific parts of CompressedClassPointersEncodingScheme test - Update copyright - Avoid assert/endless-loop in JFR code - Update copyright headers - Merge tag 'jdk-24+20' into JDK-8305895-v4 Added tag jdk-24+20 for changeset 7a64fbbb - ... and 96 more: https://git.openjdk.org/jdk/compare/c0e6c3b9...4d282247 ------------- Changes: https://git.openjdk.org/jdk/pull/20677/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=20677&range=55 Stats: 5212 lines in 218 files changed: 3585 ins; 864 del; 763 mod Patch: https://git.openjdk.org/jdk/pull/20677.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20677/head:pull/20677 PR: https://git.openjdk.org/jdk/pull/20677 From rkennke at openjdk.org Thu Nov 7 17:25:40 2024 From: rkennke at openjdk.org (Roman Kennke) Date: Thu, 7 Nov 2024 17:25:40 GMT Subject: RFR: 8305895: Implement JEP 450: Compact Object Headers (Experimental) [v57] In-Reply-To: References: Message-ID: > This is the main body of the JEP 450: Compact Object Headers (Experimental). > > It is also a follow-up to #20640, which now also includes (and supersedes) #20603 and #20605, plus the Tiny Class-Pointers parts that have been previously missing. > > Main changes: > - Introduction of the (experimental) flag UseCompactObjectHeaders. All changes in this PR are protected by this flag. The purpose of the flag is to provide a fallback, in case that users unexpectedly observe problems with the new implementation. The intention is that this flag will remain experimental and opt-in for at least one release, then make it on-by-default and diagnostic (?), and eventually deprecate and obsolete it. However, there are a few unknowns in that plan, specifically, we may want to further improve compact headers to 4 bytes, we are planning to enhance the Klass* encoding to support virtually unlimited number of Klasses, at which point we could also obsolete UseCompressedClassPointers. > - The compressed Klass* can now be stored in the mark-word of objects. In order to be able to do this, we are add some changes to GC forwarding (see below) to protect the relevant (upper 22) bits of the mark-word. Significant parts of this PR deal with loading the compressed Klass* from the mark-word. This PR also changes some code paths (mostly in GCs) to be more careful when accessing Klass* (or mark-word or size) to be able to fetch it from the forwardee in case the object is forwarded. > - Self-forwarding in GCs (which is used to deal with promotion failure) now uses a bit to indicate 'self-forwarding'. This is needed to preserve the crucial Klass* bits in the header. This also allows to get rid of preserved-header machinery in SerialGC and G1 (Parallel GC abuses preserved-marks to also find all other relevant oops). > - Full GC forwarding now uses an encoding similar to compressed-oops. We have 40 bits for that, and can encode up to 8TB of heap. When exceeding 8TB, we turn off UseCompressedClassPointers (except in ZGC, which doesn't use the GC forwarding at all). > - Instances can now have their base-offset (the offset where the field layouter starts to place fields) at offset 8 (instead of 12 or 16). > - Arrays will now store their length at offset 8. > - CDS can now write and read archives with the compressed header. However, it is not possible to read an archive that has been written with an opposite setting of UseCompactObjectHeaders. Some build machinery is added so that _coh variants of CDS archiv... Roman Kennke has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 107 commits: - Merge branch 'master' into JDK-8305895-v4 - Merge tag 'jdk-25+23' into JDK-8305895-v4 Added tag jdk-24+23 for changeset c0e6c3b9 - Fix gen-ZGC removal - Merge tag 'jdk-24+22' into JDK-8305895-v4 Added tag jdk-24+22 for changeset 388d44fb - Enable riscv in CompressedClassPointersEncodingScheme test - s390 port - Conditionalize platform specific parts of CompressedClassPointersEncodingScheme test - Update copyright - Avoid assert/endless-loop in JFR code - Update copyright headers - ... and 97 more: https://git.openjdk.org/jdk/compare/d3c042f9...c1a6323b ------------- Changes: https://git.openjdk.org/jdk/pull/20677/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=20677&range=56 Stats: 5212 lines in 218 files changed: 3585 ins; 864 del; 763 mod Patch: https://git.openjdk.org/jdk/pull/20677.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20677/head:pull/20677 PR: https://git.openjdk.org/jdk/pull/20677 From rkennke at openjdk.org Thu Nov 7 17:33:11 2024 From: rkennke at openjdk.org (Roman Kennke) Date: Thu, 7 Nov 2024 17:33:11 GMT Subject: RFR: 8305895: Implement JEP 450: Compact Object Headers (Experimental) [v57] In-Reply-To: References:

Message-ID: <2xoAD2r5G_6IHT9gt8-uSkN_hPiRmIkJ6VhkB1GarfI=.4e3c65db-3aab-4926-b1fc-fc78599b2885@github.com> On Thu, 7 Nov 2024 17:25:40 GMT, Roman Kennke wrote: >> This is the main body of the JEP 450: Compact Object Headers (Experimental). >> >> It is also a follow-up to #20640, which now also includes (and supersedes) #20603 and #20605, plus the Tiny Class-Pointers parts that have been previously missing. >> >> Main changes: >> - Introduction of the (experimental) flag UseCompactObjectHeaders. All changes in this PR are protected by this flag. The purpose of the flag is to provide a fallback, in case that users unexpectedly observe problems with the new implementation. The intention is that this flag will remain experimental and opt-in for at least one release, then make it on-by-default and diagnostic (?), and eventually deprecate and obsolete it. However, there are a few unknowns in that plan, specifically, we may want to further improve compact headers to 4 bytes, we are planning to enhance the Klass* encoding to support virtually unlimited number of Klasses, at which point we could also obsolete UseCompressedClassPointers. >> - The compressed Klass* can now be stored in the mark-word of objects. In order to be able to do this, we are add some changes to GC forwarding (see below) to protect the relevant (upper 22) bits of the mark-word. Significant parts of this PR deal with loading the compressed Klass* from the mark-word. This PR also changes some code paths (mostly in GCs) to be more careful when accessing Klass* (or mark-word or size) to be able to fetch it from the forwardee in case the object is forwarded. >> - Self-forwarding in GCs (which is used to deal with promotion failure) now uses a bit to indicate 'self-forwarding'. This is needed to preserve the crucial Klass* bits in the header. This also allows to get rid of preserved-header machinery in SerialGC and G1 (Parallel GC abuses preserved-marks to also find all other relevant oops). >> - Full GC forwarding now uses an encoding similar to compressed-oops. We have 40 bits for that, and can encode up to 8TB of heap. When exceeding 8TB, we turn off UseCompressedClassPointers (except in ZGC, which doesn't use the GC forwarding at all). >> - Instances can now have their base-offset (the offset where the field layouter starts to place fields) at offset 8 (instead of 12 or 16). >> - Arrays will now store their length at offset 8. >> - CDS can now write and read archives with the compressed header. However, it is not possible to read an archive that has been written with an opposite setting of UseCompactObjectHeaders. Some build machinery is added so that _co... > > Roman Kennke has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 107 commits: > > - Merge branch 'master' into JDK-8305895-v4 > - Merge tag 'jdk-25+23' into JDK-8305895-v4 > > Added tag jdk-24+23 for changeset c0e6c3b9 > - Fix gen-ZGC removal > - Merge tag 'jdk-24+22' into JDK-8305895-v4 > > Added tag jdk-24+22 for changeset 388d44fb > - Enable riscv in CompressedClassPointersEncodingScheme test > - s390 port > - Conditionalize platform specific parts of CompressedClassPointersEncodingScheme test > - Update copyright > - Avoid assert/endless-loop in JFR code > - Update copyright headers > - ... and 97 more: https://git.openjdk.org/jdk/compare/d3c042f9...c1a6323b GHA failures look like one unrelated timeout and one unrelated infra problem. Please confirm. I also run tier1 on x86_64 x aarch64 x -UCOH x + UCOH, with nothing sticking out (same timeout observed, though). ------------- PR Comment: https://git.openjdk.org/jdk/pull/20677#issuecomment-2463245179 From stuefe at openjdk.org Fri Nov 8 07:02:51 2024 From: stuefe at openjdk.org (Thomas Stuefe) Date: Fri, 8 Nov 2024 07:02:51 GMT Subject: RFR: 8305895: Implement JEP 450: Compact Object Headers (Experimental) [v57] In-Reply-To: References:

Message-ID: On Thu, 7 Nov 2024 17:25:40 GMT, Roman Kennke wrote: >> This is the main body of the JEP 450: Compact Object Headers (Experimental). >> >> It is also a follow-up to #20640, which now also includes (and supersedes) #20603 and #20605, plus the Tiny Class-Pointers parts that have been previously missing. >> >> Main changes: >> - Introduction of the (experimental) flag UseCompactObjectHeaders. All changes in this PR are protected by this flag. The purpose of the flag is to provide a fallback, in case that users unexpectedly observe problems with the new implementation. The intention is that this flag will remain experimental and opt-in for at least one release, then make it on-by-default and diagnostic (?), and eventually deprecate and obsolete it. However, there are a few unknowns in that plan, specifically, we may want to further improve compact headers to 4 bytes, we are planning to enhance the Klass* encoding to support virtually unlimited number of Klasses, at which point we could also obsolete UseCompressedClassPointers. >> - The compressed Klass* can now be stored in the mark-word of objects. In order to be able to do this, we are add some changes to GC forwarding (see below) to protect the relevant (upper 22) bits of the mark-word. Significant parts of this PR deal with loading the compressed Klass* from the mark-word. This PR also changes some code paths (mostly in GCs) to be more careful when accessing Klass* (or mark-word or size) to be able to fetch it from the forwardee in case the object is forwarded. >> - Self-forwarding in GCs (which is used to deal with promotion failure) now uses a bit to indicate 'self-forwarding'. This is needed to preserve the crucial Klass* bits in the header. This also allows to get rid of preserved-header machinery in SerialGC and G1 (Parallel GC abuses preserved-marks to also find all other relevant oops). >> - Full GC forwarding now uses an encoding similar to compressed-oops. We have 40 bits for that, and can encode up to 8TB of heap. When exceeding 8TB, we turn off UseCompressedClassPointers (except in ZGC, which doesn't use the GC forwarding at all). >> - Instances can now have their base-offset (the offset where the field layouter starts to place fields) at offset 8 (instead of 12 or 16). >> - Arrays will now store their length at offset 8. >> - CDS can now write and read archives with the compressed header. However, it is not possible to read an archive that has been written with an opposite setting of UseCompactObjectHeaders. Some build machinery is added so that _co... > > Roman Kennke has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 107 commits: > > - Merge branch 'master' into JDK-8305895-v4 > - Merge tag 'jdk-25+23' into JDK-8305895-v4 > > Added tag jdk-24+23 for changeset c0e6c3b9 > - Fix gen-ZGC removal > - Merge tag 'jdk-24+22' into JDK-8305895-v4 > > Added tag jdk-24+22 for changeset 388d44fb > - Enable riscv in CompressedClassPointersEncodingScheme test > - s390 port > - Conditionalize platform specific parts of CompressedClassPointersEncodingScheme test > - Update copyright > - Avoid assert/endless-loop in JFR code > - Update copyright headers > - ... and 97 more: https://git.openjdk.org/jdk/compare/d3c042f9...c1a6323b Merge looks good. build errors on MacOS unrelated. ------------- PR Review: https://git.openjdk.org/jdk/pull/20677#pullrequestreview-2422830379 From albert.m.yang at oracle.com Fri Nov 8 09:08:51 2024 From: albert.m.yang at oracle.com (Albert Yang) Date: Fri, 8 Nov 2024 09:08:51 +0000 Subject: ParallelGC, large old generation when optimizing for footprint goal In-Reply-To: References: Message-ID: > I run with an allocation pressure of 512MB/sec If the alloc-rate is 512M/s and init-heap-size is 512M, it's indeed expected that young-gc is frequent -- the default eden-size is ~150M. > One can observe how young gen starts at ?150MB, shrinks to ?60MB, and old gen grows till it hits the ceiling at ?5.5GB. This is definitely undesirable, and as you put it, "runs counter to the footprint goal". I have been working on JDK-8338977, and the current prototype maintains heap-capacity under ~600M. Thank you for providing this bm (and the config); I will include the result for this bm when I send out the PR. /Albert ________________________________________ From: hotspot-gc-dev on behalf of Thomas St?fe Sent: Thursday, November 7, 2024 14:51 To: hotspot-gc-dev at openjdk.java.net Subject: ParallelGC, large old generation when optimizing for footprint goal Hi, I have a question about some odd behavior I observe when ParallelGC optimizes for footprint. If I omit giving a pause time goal and relax the throughput goal enough, the JVM should optimize for the footprint goal. But if the JVM was started with a small young gen (e.g. because the initial heap size was small), it seems to go into a tailspin where the young gen stays tiny or even shrinks more and more, resulting in lots of promotions, old gen grows until it hits the ceiling, Full GC, then the cycle repeats. That maximizes RAM use and thus runs counter to the footprint goal. Example: I run heapothesys/hyperalloc [1] with JDK 21. I run with an allocation pressure of 512MB/sec and a live set size of 128MB. `java -Xlog:gc* -Xmx8g -Xms512m -XX:+UseParallelGC -XX:GCTimeRatio=1 -jar ./target/HyperAlloc-1.0.jar -h 8192 -a 512 -s 128` One can observe how young gen starts at ?150MB, shrinks to ?60MB, and old gen grows till it hits the ceiling at ?5.5GB. Increasing the initial heap size mitigates the problem: Eden still shrinks but settles at a larger size. We still get very frequent young GCs, though. Ironically, the problem is more likely on containers with little RAM. Eden size depends on initial heap size, which depends on total RAM (even if -Xmx was set). Little RAM -> tiny Eden. Therefore, less RAM can cause the JVM to use more memory. That behavior can easily be observed with different values for MaxRAM: calling above program with -XX:MaxRAM=10g will cause the JVM to enter the tailspin immediately, the process peaks at >5GB RSS. The same program with -XX:MaxRAM=128g causes the process to use just ~1.2GB RSS since the young gen stays sensibly large and thus total heap size never grows that much. I looked into the tuning guide [2] but did not find information about how exactly the footprint goal is reached. For ParallelGC, it just states: "Footprint: The maximum heap footprint is specified using the option -Xmx. In addition, the collector has an implicit goal of minimizing the size of the heap as long as the other goals are being met." which looks to me like it should work with default settings, out of the box. Am I making a thinking error somewhere? Is this a bug or is this behavior expected? Thank you, Thomas [1] https://github.com/corretto/heapothesys/tree/master/HyperAlloc [2] https://docs.oracle.com/en/java/javase/11/gctuning/parallel-collector1.html#GUID-DCDD6E46-0406-41D1-AB49-FB96A50EB9CE From sjohanss at openjdk.org Fri Nov 8 09:22:41 2024 From: sjohanss at openjdk.org (Stefan Johansson) Date: Fri, 8 Nov 2024 09:22:41 GMT Subject: RFR: 8343189: [REDO] JDK-8295269 G1: Improve slow startup due to predictor initialization [v2] In-Reply-To: References:

Message-ID: On Tue, 5 Nov 2024 10:06:34 GMT, Ivan Walulya wrote: >> Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains two additional commits since the last revision: >> >> - Merge branch 'master' into 8343189-redo-slow-startup >> - Revert "8343086: [BACKOUT] JDK-8295269 G1: Improve slow startup due to predictor initialization" >> >> This reverts commit f1cc890ddfe2e472cf786856dc7d01645f61b054. > > Still good! Thanks @walulyai @kstefanj for your reviews. ------------- PR Comment: https://git.openjdk.org/jdk/pull/21876#issuecomment-2464252609 From tschatzl at openjdk.org Fri Nov 8 09:46:39 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 8 Nov 2024 09:46:39 GMT Subject: Integrated: 8343189: [REDO] JDK-8295269 G1: Improve slow startup due to predictor initialization In-Reply-To: References: Message-ID: On Mon, 4 Nov 2024 15:06:04 GMT, Thomas Schatzl wrote: > Hi all, > > please review this redo of [JDK-8295269](https://bugs.openjdk.org/browse/JDK-8295269) G1: Improve slow startup due to predictor initialization. The cause are issues with the `runtime/cds/DeterministicDump.java` test, that is currently being fixed in #21871. > > There has been no change in these changes. > > Testing: running a few thousand times with the fixed `runtime/cds/DeterministicDump.java` test > > Thanks, > Thomas This pull request has now been integrated. Changeset: c7f071cf Author: Thomas Schatzl URL: https://git.openjdk.org/jdk/commit/c7f071cf36a6f064e293e82e7e5bb0abcc76ad70 Stats: 7 lines in 2 files changed: 6 ins; 0 del; 1 mod 8343189: [REDO] JDK-8295269 G1: Improve slow startup due to predictor initialization Reviewed-by: iwalulya, sjohanss ------------- PR: https://git.openjdk.org/jdk/pull/21876 From stuefe at openjdk.org Fri Nov 8 16:10:56 2024 From: stuefe at openjdk.org (Thomas Stuefe) Date: Fri, 8 Nov 2024 16:10:56 GMT Subject: RFR: 8305895: Implement JEP 450: Compact Object Headers (Experimental) [v57] In-Reply-To: References:

Message-ID: On Thu, 7 Nov 2024 17:25:40 GMT, Roman Kennke wrote: >> This is the main body of the JEP 450: Compact Object Headers (Experimental). >> >> It is also a follow-up to #20640, which now also includes (and supersedes) #20603 and #20605, plus the Tiny Class-Pointers parts that have been previously missing. >> >> Main changes: >> - Introduction of the (experimental) flag UseCompactObjectHeaders. All changes in this PR are protected by this flag. The purpose of the flag is to provide a fallback, in case that users unexpectedly observe problems with the new implementation. The intention is that this flag will remain experimental and opt-in for at least one release, then make it on-by-default and diagnostic (?), and eventually deprecate and obsolete it. However, there are a few unknowns in that plan, specifically, we may want to further improve compact headers to 4 bytes, we are planning to enhance the Klass* encoding to support virtually unlimited number of Klasses, at which point we could also obsolete UseCompressedClassPointers. >> - The compressed Klass* can now be stored in the mark-word of objects. In order to be able to do this, we are add some changes to GC forwarding (see below) to protect the relevant (upper 22) bits of the mark-word. Significant parts of this PR deal with loading the compressed Klass* from the mark-word. This PR also changes some code paths (mostly in GCs) to be more careful when accessing Klass* (or mark-word or size) to be able to fetch it from the forwardee in case the object is forwarded. >> - Self-forwarding in GCs (which is used to deal with promotion failure) now uses a bit to indicate 'self-forwarding'. This is needed to preserve the crucial Klass* bits in the header. This also allows to get rid of preserved-header machinery in SerialGC and G1 (Parallel GC abuses preserved-marks to also find all other relevant oops). >> - Full GC forwarding now uses an encoding similar to compressed-oops. We have 40 bits for that, and can encode up to 8TB of heap. When exceeding 8TB, we turn off UseCompressedClassPointers (except in ZGC, which doesn't use the GC forwarding at all). >> - Instances can now have their base-offset (the offset where the field layouter starts to place fields) at offset 8 (instead of 12 or 16). >> - Arrays will now store their length at offset 8. >> - CDS can now write and read archives with the compressed header. However, it is not possible to read an archive that has been written with an opposite setting of UseCompactObjectHeaders. Some build machinery is added so that _co... > > Roman Kennke has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 107 commits: > > - Merge branch 'master' into JDK-8305895-v4 > - Merge tag 'jdk-25+23' into JDK-8305895-v4 > > Added tag jdk-24+23 for changeset c0e6c3b9 > - Fix gen-ZGC removal > - Merge tag 'jdk-24+22' into JDK-8305895-v4 > > Added tag jdk-24+22 for changeset 388d44fb > - Enable riscv in CompressedClassPointersEncodingScheme test > - s390 port > - Conditionalize platform specific parts of CompressedClassPointersEncodingScheme test > - Update copyright > - Avoid assert/endless-loop in JFR code > - Update copyright headers > - ... and 97 more: https://git.openjdk.org/jdk/compare/d3c042f9...c1a6323b Still looks good. Nice work! ------------- Marked as reviewed by coleenp (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/20677#pullrequestreview-2424274474 From tschatzl at openjdk.org Fri Nov 8 16:56:05 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 8 Nov 2024 16:56:05 GMT Subject: RFR: 8297692: Avoid sending per-region GCPhaseParallel JFR events in G1ScanCollectionSetRegionClosure Message-ID: <0vG-VYZ2aKoCjTB6bxD8aTbqfB7LotSmJBL1LHrcLw8=.5cb24f1f-09aa-44e3-81e0-90badc70ee10@github.com> Hi all, please review this change that significantly reduces the amount of "Code Roots" and "Optional Roots" JFR events to reduce default recording sizes significantly. E.g. a 10min BigRamTester run creates a 23MB recording without this change, with like hundreds of thousands of these events (#gcs * #gc threads * #regions in collection set). With this change, the recording is reduced to 4MB (#gcs * #gc threads) Testing: gha, tier1-3 Thanks, Thomas ------------- Commit messages: - Update src/hotspot/share/gc/g1/g1RemSet.cpp - 8297692 Changes: https://git.openjdk.org/jdk/pull/21984/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=21984&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8297692 Stats: 162 lines in 3 files changed: 84 ins; 52 del; 26 mod Patch: https://git.openjdk.org/jdk/pull/21984.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21984/head:pull/21984 PR: https://git.openjdk.org/jdk/pull/21984 From tschatzl at openjdk.org Fri Nov 8 16:56:05 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 8 Nov 2024 16:56:05 GMT Subject: RFR: 8297692: Avoid sending per-region GCPhaseParallel JFR events in G1ScanCollectionSetRegionClosure In-Reply-To: <0vG-VYZ2aKoCjTB6bxD8aTbqfB7LotSmJBL1LHrcLw8=.5cb24f1f-09aa-44e3-81e0-90badc70ee10@github.com> References: <0vG-VYZ2aKoCjTB6bxD8aTbqfB7LotSmJBL1LHrcLw8=.5cb24f1f-09aa-44e3-81e0-90badc70ee10@github.com> Message-ID: On Fri, 8 Nov 2024 15:20:21 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that significantly reduces the amount of "Code Roots" and "Optional Roots" JFR events to reduce default recording sizes significantly. > > E.g. a 10min BigRamTester run creates a 23MB recording without this change, with like hundreds of thousands of these events (#gcs * #gc threads * #regions in collection set). With this change, the recording is reduced to 4MB (#gcs * #gc threads) > > Testing: gha, tier1-3 > > Thanks, > Thomas src/hotspot/share/gc/g1/g1RemSet.cpp line 842: > 840: _opt_refs_scanned(0), > 841: _opt_refs_memory_used(0) { } > 842: Suggestion: _opt_refs_memory_used(0) { } ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/21984#discussion_r1834729833 From rkennke at openjdk.org Fri Nov 8 17:24:05 2024 From: rkennke at openjdk.org (Roman Kennke) Date: Fri, 8 Nov 2024 17:24:05 GMT Subject: Integrated: 8305895: Implement JEP 450: Compact Object Headers (Experimental) In-Reply-To: References: Message-ID: On Thu, 22 Aug 2024 13:35:08 GMT, Roman Kennke wrote: > This is the main body of the JEP 450: Compact Object Headers (Experimental). > > It is also a follow-up to #20640, which now also includes (and supersedes) #20603 and #20605, plus the Tiny Class-Pointers parts that have been previously missing. > > Main changes: > - Introduction of the (experimental) flag UseCompactObjectHeaders. All changes in this PR are protected by this flag. The purpose of the flag is to provide a fallback, in case that users unexpectedly observe problems with the new implementation. The intention is that this flag will remain experimental and opt-in for at least one release, then make it on-by-default and diagnostic (?), and eventually deprecate and obsolete it. However, there are a few unknowns in that plan, specifically, we may want to further improve compact headers to 4 bytes, we are planning to enhance the Klass* encoding to support virtually unlimited number of Klasses, at which point we could also obsolete UseCompressedClassPointers. > - The compressed Klass* can now be stored in the mark-word of objects. In order to be able to do this, we are add some changes to GC forwarding (see below) to protect the relevant (upper 22) bits of the mark-word. Significant parts of this PR deal with loading the compressed Klass* from the mark-word. This PR also changes some code paths (mostly in GCs) to be more careful when accessing Klass* (or mark-word or size) to be able to fetch it from the forwardee in case the object is forwarded. > - Self-forwarding in GCs (which is used to deal with promotion failure) now uses a bit to indicate 'self-forwarding'. This is needed to preserve the crucial Klass* bits in the header. This also allows to get rid of preserved-header machinery in SerialGC and G1 (Parallel GC abuses preserved-marks to also find all other relevant oops). > - Full GC forwarding now uses an encoding similar to compressed-oops. We have 40 bits for that, and can encode up to 8TB of heap. When exceeding 8TB, we turn off UseCompressedClassPointers (except in ZGC, which doesn't use the GC forwarding at all). > - Instances can now have their base-offset (the offset where the field layouter starts to place fields) at offset 8 (instead of 12 or 16). > - Arrays will now store their length at offset 8. > - CDS can now write and read archives with the compressed header. However, it is not possible to read an archive that has been written with an opposite setting of UseCompactObjectHeaders. Some build machinery is added so that _coh variants of CDS archiv... This pull request has now been integrated. Changeset: 44ec501a Author: Roman Kennke URL: https://git.openjdk.org/jdk/commit/44ec501a41f4794259dd03cd168838e79334890e Stats: 5212 lines in 218 files changed: 3585 ins; 864 del; 763 mod 8305895: Implement JEP 450: Compact Object Headers (Experimental) Co-authored-by: Sandhya Viswanathan Co-authored-by: Martin Doerr Co-authored-by: Hamlin Li Co-authored-by: Thomas Stuefe Co-authored-by: Amit Kumar Co-authored-by: Stefan Karlsson Co-authored-by: Coleen Phillimore Co-authored-by: Axel Boldt-Christmas Reviewed-by: coleenp, stefank, stuefe, phh, ihse, lmesnik, tschatzl, matsaave, rcastanedalo, vpaprotski, yzheng, egahlin ------------- PR: https://git.openjdk.org/jdk/pull/20677 From rkennke at openjdk.org Fri Nov 8 17:45:40 2024 From: rkennke at openjdk.org (Roman Kennke) Date: Fri, 8 Nov 2024 17:45:40 GMT Subject: RFR: 8305895: Implement JEP 450: Compact Object Headers (Experimental) [v19] In-Reply-To: References:

Message-ID: On Wed, 18 Sep 2024 12:22:34 GMT, Yudi Zheng wrote: >> Roman Kennke has updated the pull request incrementally with two additional commits since the last revision: >> >> - CompressedKlassPointers::is_encodable shall be callable with -UseCCP >> - Johan review feedback > > Could you please cherry pick https://github.com/mur47x111/jdk/commit/c45ebc2a89d0b25a3dd8cc46386e37a635ff9af2 for the JVMCI support? @mur47x111 it's now intergrated in jdk24. do your magic in Graal ;-) ------------- PR Comment: https://git.openjdk.org/jdk/pull/20677#issuecomment-2465413222 From yzheng at openjdk.org Fri Nov 8 17:52:05 2024 From: yzheng at openjdk.org (Yudi Zheng) Date: Fri, 8 Nov 2024 17:52:05 GMT Subject: RFR: 8305895: Implement JEP 450: Compact Object Headers (Experimental) [v19] In-Reply-To: References:

Message-ID: On Fri, 8 Nov 2024 17:42:24 GMT, Roman Kennke wrote: >> Could you please cherry pick https://github.com/mur47x111/jdk/commit/c45ebc2a89d0b25a3dd8cc46386e37a635ff9af2 for the JVMCI support? > > @mur47x111 it's now intergrated in jdk24. do your magic in Graal ;-) @rkennke It is in the merge queue ------------- PR Comment: https://git.openjdk.org/jdk/pull/20677#issuecomment-2465423342 From tschatzl at openjdk.org Fri Nov 8 20:01:16 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Fri, 8 Nov 2024 20:01:16 GMT Subject: RFR: 8297692: Avoid sending per-region GCPhaseParallel JFR events in G1ScanCollectionSetRegionClosure In-Reply-To: <0vG-VYZ2aKoCjTB6bxD8aTbqfB7LotSmJBL1LHrcLw8=.5cb24f1f-09aa-44e3-81e0-90badc70ee10@github.com> References: <0vG-VYZ2aKoCjTB6bxD8aTbqfB7LotSmJBL1LHrcLw8=.5cb24f1f-09aa-44e3-81e0-90badc70ee10@github.com> Message-ID: On Fri, 8 Nov 2024 15:20:21 GMT, Thomas Schatzl wrote: > Hi all, > > please review this change that significantly reduces the amount of "Code Roots" and "Optional Roots" JFR events to reduce default recording sizes significantly. > > E.g. a 10min BigRamTester run creates a 23MB recording without this change, with like hundreds of thousands of these events (#gcs * #gc threads * #regions in collection set). With this change, the recording is reduced to 4MB (#gcs * #gc threads) > > Testing: gha, tier1-3 > > Thanks, > Thomas Fwiw, I went with splitting code root scan and optional root scan into two iterations that are each bracketed by a single per-thread JFR event now. This also allowed a minor optimization: in the initial evacuation there can be no optional roots, so that iteration over all regions can be skipped. ------------- PR Comment: https://git.openjdk.org/jdk/pull/21984#issuecomment-2465653093 From ayang at openjdk.org Sat Nov 9 10:51:47 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Sat, 9 Nov 2024 10:51:47 GMT Subject: RFR: 8339162: [REDO] JDK-8338440 Parallel: Improve fragmentation mitigation in Full GC [v2] In-Reply-To: References: Message-ID: > This PR consists of two commits, the original and bug-fix. > > The original patch calculates the dest-count for the preceding live words incorrectly -- `preceding_destination` can be on region-boundary. > > Test: TEST=gc/TestSoftReferencesBehaviorOnOOME.java fails ~4/100 without the fix but passes with the fix. Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains three commits: - Merge branch 'master' into pgc-redo - fix - original ------------- Changes: https://git.openjdk.org/jdk/pull/21872/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=21872&range=01 Stats: 568 lines in 2 files changed: 209 ins; 143 del; 216 mod Patch: https://git.openjdk.org/jdk/pull/21872.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21872/head:pull/21872 PR: https://git.openjdk.org/jdk/pull/21872 From zgu at openjdk.org Sat Nov 9 15:56:21 2024 From: zgu at openjdk.org (Zhengyu Gu) Date: Sat, 9 Nov 2024 15:56:21 GMT Subject: RFR: 8339162: [REDO] JDK-8338440 Parallel: Improve fragmentation mitigation in Full GC [v2] In-Reply-To: References:

Message-ID: On Sat, 9 Nov 2024 10:51:47 GMT, Albert Mingkun Yang wrote: >> This PR consists of two commits, the original and bug-fix. >> >> The original patch calculates the dest-count for the preceding live words incorrectly -- `preceding_destination` can be on region-boundary. >> >> Test: TEST=gc/TestSoftReferencesBehaviorOnOOME.java fails ~4/100 without the fix but passes with the fix. > > Albert Mingkun Yang has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains three commits: > > - Merge branch 'master' into pgc-redo > - fix > - original Thanks for review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/21872#issuecomment-2466688932 From tschatzl at openjdk.org Mon Nov 11 10:06:04 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 11 Nov 2024 10:06:04 GMT Subject: RFR: 8343929: Remove PreservedMarksSet::createTask() after JDK-8305895 Message-ID: Hi all, please review this trivial cleanup after pushing the Compact Object Header JEP (JDK-8305895). The method mentioned is unused. Testing: gha, local compilation Thanks, Thomas ------------- Commit messages: - 8343929 Changes: https://git.openjdk.org/jdk/pull/22006/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=22006&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8343929 Stats: 6 lines in 2 files changed: 0 ins; 6 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/22006.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22006/head:pull/22006 PR: https://git.openjdk.org/jdk/pull/22006 From ayang at openjdk.org Mon Nov 11 10:18:41 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 11 Nov 2024 10:18:41 GMT Subject: RFR: 8343929: Remove PreservedMarksSet::createTask() after JDK-8305895 In-Reply-To: References: Message-ID: On Mon, 11 Nov 2024 10:01:56 GMT, Thomas Schatzl wrote: > Hi all, > > please review this trivial cleanup after pushing the Compact Object Header JEP (JDK-8305895). > > The method mentioned is unused. > > Testing: gha, local compilation > > Thanks, > Thomas Trivial. ------------- Marked as reviewed by ayang (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/22006#pullrequestreview-2426727355 From ayang at openjdk.org Mon Nov 11 10:43:28 2024 From: ayang at openjdk.org (Albert Mingkun Yang) Date: Mon, 11 Nov 2024 10:43:28 GMT Subject: RFR: 8343658: Parallel: Implement block_start for Young generation In-Reply-To: References: <0JdxCwM0B41HAGRr1A5ch4WdP8Gmi2uAd2mgjqW5gNM=.9a485a86-bce3-4a41-b5fb-e17f48992182@github.com> <2DQOnQWFzESExHcKdsf2xdbDw46GDuyYLEQ_OTEHpc8=.e96b5363-65f2-4f73-afe3-85f13d928c62@github.com> <-3P95fiUgh8CavaWr6qd1uMHTGqq4KrzbyO7YfIkCZc=.cf1aad7c-77b0-4151-9a91-7676216dbecd@github.com> Message-ID: <7TCNU0SJQumJksp5tdVaXFpz6sGNkgH3_O-j2Q3TRpI=.cdcbcdd3-8d25-4d52-8ac9-f2de55b0f99f@github.com> On Thu, 7 Nov 2024 12:46:25 GMT, Volker Simonis wrote: >>> > this will not work in the general case, if the heap is not walkable. >>> >>> True, but this is the best-effort approach used in other GCs, as far as I can tell. Is there a real use case that warrants a more sophisticated variant? >> >> The **only** use case for this code during hs_err reporting for heap-addresses not pointing at the beginning of an oop. I think we should be conservative here, because a secondary crash will cut the information available in the hs_err file and will therefor do more harm then being helpful. > >> What would be nice would be something like `oopDesc::safe_klass_or_null()` or similar, feeding into a corresponding `oopDesc::size_given_klass_safe_or_0()`. The former would check the klass word for validity before dereferencing - `CompressedKlassPointers::is_encodable(p)` and then the load of layouthelper etc should happen with SafeFetch. Alternatively (and a bit more unsafe), check the readability of Klass* with SafeFetch beforehand, then call normal size_given_klass. > > We already have [LocationPrinter::is_valid_obj()](https://github.com/openjdk/jdk/blob/ac82a8f89c7066fb1d379b12bcfd68053cb39ba4/src/hotspot/share/gc/shared/locationPrinter.cpp#L33) which uses [Klass::is_valid()](https://github.com/openjdk/jdk/blob/ac82a8f89c7066fb1d379b12bcfd68053cb39ba4/src/hotspot/share/oops/klass.cpp#L1038) to check the validity of an oop. I don't think we need `SafeFetch` here. > I think we should be conservative here, because a secondary crash will cut the information available in the hs_err file Have you ever seen "a secondary crash" in practice (for other GCs)? I am a bit concerned that we might be adding complex code that is never exercised. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/21919#discussion_r1836428995 From shade at openjdk.org Mon Nov 11 10:50:17 2024 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 11 Nov 2024 10:50:17 GMT Subject: RFR: 8343929: Remove PreservedMarksSet::createTask() after JDK-8305895 In-Reply-To: References: Message-ID: On Mon, 11 Nov 2024 10:01:56 GMT, Thomas Schatzl wrote: > Hi all, > > please review this trivial cleanup after pushing the Compact Object Header JEP (JDK-8305895). > > The method mentioned is unused. > > Testing: gha, local compilation > > Thanks, > Thomas Marked as reviewed by shade (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/22006#pullrequestreview-2426929910 From tschatzl at openjdk.org Mon Nov 11 11:34:55 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 11 Nov 2024 11:34:55 GMT Subject: RFR: 8343929: Remove PreservedMarksSet::createTask() after JDK-8305895 In-Reply-To: References:

Message-ID: On Mon, 11 Nov 2024 10:16:33 GMT, Albert Mingkun Yang wrote: >> Hi all, >> >> please review this trivial cleanup after pushing the Compact Object Header JEP (JDK-8305895). >> >> The method mentioned is unused. >> >> Testing: gha, local compilation >> >> Thanks, >> Thomas > > Trivial. Thanks @albertnetymk @shipilev for your reviews ------------- PR Comment: https://git.openjdk.org/jdk/pull/22006#issuecomment-2467951109 From tschatzl at openjdk.org Mon Nov 11 11:34:56 2024 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 11 Nov 2024 11:34:56 GMT Subject: Integrated: 8343929: Remove PreservedMarksSet::createTask() after JDK-8305895 In-Reply-To: References: Message-ID: On Mon, 11 Nov 2024 10:01:56 GMT, Thomas Schatzl wrote: > Hi all, > > please review this trivial cleanup after pushing the Compact Object Header JEP (JDK-8305895). > > The method mentioned is unused. > > Testing: gha, local compilation > > Thanks, > Thomas This pull request has now been integrated. Changeset: 36e12955 Author: Thomas Schatzl URL: https://git.openjdk.org/jdk/commit/36e12955b2129f2075a203a0b39198f256083a24 Stats: 6 lines in 2 files changed: 0 ins; 6 del; 0 mod 8343929: Remove PreservedMarksSet::createTask() after JDK-8305895 Reviewed-by: ayang, shade ------------- PR: https://git.openjdk.org/jdk/pull/22006 From iwalulya at openjdk.org Mon Nov 11 15:35:30 2024 From: iwalulya at openjdk.org (Ivan Walulya) Date: Mon, 11 Nov 2024 15:35:30 GMT Subject: RFR: 8343782: G1: Use one G1CardSet instance for multiple old gen regions Message-ID: Hi all, Please review this change to assign multiple collection candidate regions to a single instance of a G1CardSet. Currently, we maintain a 1:1 mapping of old-gen regions and G1CardSet instances, assuming these regions are collected independently. However, regions are collected in batches for performance reasons to meet the G1MixedGCCountTarget. In this change, at the end of the Remark phase, we batch regions that we anticipate will be collected together into a collection group while selecting remembered set rebuild candidates. Regions in a collection group should be evacuated at the same time because they are assigned to the same G1CardSet instances. This implies that we do not need to maintain cross-region remembered set entries for regions within the same collection group. The benefit is a reduction in the memory overhead of the remembered set and the remembered set merge time during the collection pause. One disadvantage is that this approach decreases the flexibility during evacuation: you can only evacuate all regions that share a particular G1CardSet at the same time. Another downside is that pinned regions that are part of a collection group have to be partially evacuated when the collection group is selected for evacuation. This removes the optimization in the mainline implementation where the pinned regions are skipped to allow for potential unpinning before evacuation. In this change, we make significant changes to the collection set implementation as we switch to group selection instead of region selection. Consequently, many of the changes in the PR are about switching from region-centered collection set selection to a group-centered approach. Note: The batching is based on the sort order by reclaimable bytes which may change the evacuation order in which regions would have been evacuated when sorted by gc efficiency. We have not observed any regressions on internal performance testing platforms. Memory comparisons for the Cachestress benchmark for different heap sizes are attached below. Testing: Mach5 Tier1-6 ![16GB](https://github.com/user-attachments/assets/3224c2f1-172d-4d76-ba28-bf483b1b1c95) ![32G](https://github.com/user-attachments/assets/abd10537-41a9-4cf9-b668-362af12fe949) ![64GB](https://github.com/user-attachments/assets/fa87eefc-cf8a-4fb5-9fc4-e7151498bf73) ![128GB](https://github.com/user-attachments/assets/c3a59e32-6bd7-43e3-a3e4-c472f71aa544) ------------- Commit messages: - Merge remote-tracking branch 'upstream/master' into OldGenRemsetGroupsV1 - add logging - more cleanups - Merge remote-tracking branch 'upstream/master' into OldGenRemsetGroupsV1 - cleanup - remove MarkingSkipEvents - Merge remote-tracking branch 'upstream/master' into OldGenRemsetGroupsV1 - revamp with retained regions added to groups Changes: https://git.openjdk.org/jdk/pull/22015/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=22015&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8343782 Stats: 969 lines in 19 files changed: 438 ins; 269 del; 262 mod Patch: https://git.openjdk.org/jdk/pull/22015.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22015/head:pull/22015 PR: https://git.openjdk.org/jdk/pull/22015 From aboldtch at openjdk.org Tue Nov 12 12:57:27 2024 From: aboldtch at openjdk.org (Axel Boldt-Christmas) Date: Tue, 12 Nov 2024 12:57:27 GMT Subject: RFR: 8343460: ZGC: Crash in ZRemembered::scan_page_and_clear_remset [v2] In-Reply-To: References: Message-ID: > `free_page` may concurrently delete the remset while `scan_page_and_clear_remset` is scanning the page. Move it to after the `_safe_recycle.register_and_clone_if_activated`. Doing the deletion on the new cloned page will not occur as it not old. And the registered page's remset will be deleted by the destructor when the `_safe_recycle` scope quest up the `safe_destroy`. > > To be able to push the deletion all the way into `prepare_to_recycle` the unnecessary use of this mechanism had to be removed. `free_pages_alloc_failed` does not need to protect the pages, as they are not yet present in the PageTable. We have simply taken them out of the cache, but failed to commit or map some memory, so we are putting these pages back into the cache. See bed9c260bbc9bd208b03d7eedd4e2cfa151b58f2 > > The fix works without this last commit. So we must be careful to check that these pages cannot be reached by some other means. The FoundOld bitmap iteration goes through the PageTable so even if an old page was registered, we would not find these pages. > > There is a scary lack of a fence between the removal of the page from the PageTable and the lock in `register_and_clone_if_activated`. > > The stress test will deterministically crash with this modified code 0756e0056b44ee16bee81256f556c8df981ceaf9 and using these options `-XX:+UseZGC -XX:+UseNewCode -XX:ZCollectionIntervalMinor=0.1 -XX:ZCollectionIntervalMajor=1 -XX:ZFragmentationLimit=0 -XX:-CreateCoredumpOnCrash`, and no longer does after with this patch. Axel Boldt-Christmas has updated the pull request incrementally with two additional commits since the last revision: - Add comment about prepare_to_recycle - Revert recycle_page call, still update last_used ------------- Changes: - all: https://git.openjdk.org/jdk/pull/21905/files - new: https://git.openjdk.org/jdk/pull/21905/files/bed9c260..5e1042dd Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=21905&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=21905&range=00-01 Stats: 4 lines in 1 file changed: 3 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/21905.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/21905/head:pull/21905 PR: https://git.openjdk.org/jdk/pull/21905 From eosterlund at openjdk.org Tue Nov 12 13:04:32 2024 From: eosterlund at openjdk.org (Erik =?UTF-8?B?w5ZzdGVybHVuZA==?=) Date: Tue, 12 Nov 2024 13:04:32 GMT Subject: RFR: 8343460: ZGC: Crash in ZRemembered::scan_page_and_clear_remset [v2] In-Reply-To: References:

Message-ID: On Tue, 12 Nov 2024 12:57:27 GMT, Axel Boldt-Christmas wrote: >> `free_page` may concurrently delete the remset while `scan_page_and_clear_remset` is scanning the page. Move it to after the `_safe_recycle.register_and_clone_if_activated`. Doing the deletion on the new cloned page will not occur as it not old. And the registered page's remset will be deleted by the destructor when the `_safe_recycle` scope quest up the `safe_destroy`. >> >> To be able to push the deletion all the way into `prepare_to_recycle` the unnecessary use of this mechanism had to be removed. `free_pages_alloc_failed` does not need to protect the pages, as they are not yet present in the PageTable. We have simply taken them out of the cache, but failed to commit or map some memory, so we are putting these pages back into the cache. See bed9c260bbc9bd208b03d7eedd4e2cfa151b58f2 >> >> The fix works without this last commit. So we must be careful to check that these pages cannot be reached by some other means. The FoundOld bitmap iteration goes through the PageTable so even if an old page was registered, we would not find these pages. >> >> There is a scary lack of a fence between the removal of the page from the PageTable and the lock in `register_and_clone_if_activated`. >> >> The stress test will deterministically crash with this modified code 0756e0056b44ee16bee81256f556c8df981ceaf9 and using these options `-XX:+UseZGC -XX:+UseNewCode -XX:ZCollectionIntervalMinor=0.1 -XX:ZCollectionIntervalMajor=1 -XX:ZFragmentationLimit=0 -XX:-CreateCoredumpOnCrash`, and no longer does after with this patch. > > Axel Boldt-Christmas has updated the pull request incrementally with two additional commits since the last revision: > > - Add comment about prepare_to_recycle > - Revert recycle_page call, still update last_used Marked as reviewed by stefank (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/21905#pullrequestreview-2429602801 From wkemper at openjdk.org Tue Nov 12 17:32:00 2024 From: wkemper at openjdk.org (William Kemper) Date: Tue, 12 Nov 2024 17:32:00 GMT Subject: RFR: 8342444: Shenandoah: Uncommit regions from a separate, STS aware thread Message-ID: Currently, Shenandoah uncommits regions from its control thread. The control thread is responsible for starting GC cycles in a timely fashion. Uncommitting memory from this thread may introduce unwanted delays in the control thread's response to GC pressure. ------------- Commit messages: - Check for safepoint when stopping (stopping thread is java thread) - Fix ridiculous typo - Merge remote-tracking branch 'jdk/master' into shen-uncommit-thread - Fix shutdown protocol - Take heap lock when uncommitting bitmaps, uncommit thread joins STS. - Little bit of cleanup - WIP: checkpoint before sync up - WIP: checkpoint Changes: https://git.openjdk.org/jdk/pull/22019/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=22019&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8342444 Stats: 319 lines in 6 files changed: 229 ins; 74 del; 16 mod Patch: https://git.openjdk.org/jdk/pull/22019.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22019/head:pull/22019 PR: https://git.openjdk.org/jdk/pull/22019 From wkemper at openjdk.org Tue Nov 12 17:32:00 2024 From: wkemper at openjdk.org (William Kemper) Date: Tue, 12 Nov 2024 17:32:00 GMT Subject: RFR: 8342444: Shenandoah: Uncommit regions from a separate, STS aware thread In-Reply-To: References: Message-ID: On Mon, 11 Nov 2024 17:31:58 GMT, William Kemper wrote: > Currently, Shenandoah uncommits regions from its control thread. The control thread is responsible for starting GC cycles in a timely fashion. Uncommitting memory from this thread may introduce unwanted delays in the control thread's response to GC pressure. I modified the testing pipelines to set `-Xms4g -Xmx10g -XX:+ShenandoahUncommit`. All performance and stress tests completed successfully on x86 and aarch64. Marking this as ready for review. ------------- PR Comment: https://git.openjdk.org/jdk/pull/22019#issuecomment-2471152157 From kdnilsen at openjdk.org Tue Nov 12 17:32:00 2024 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Tue, 12 Nov 2024 17:32:00 GMT Subject: RFR: 8342444: Shenandoah: Uncommit regions from a separate, STS aware thread In-Reply-To: References: Message-ID: