From zgu at openjdk.org Thu Jan 2 16:22:35 2025 From: zgu at openjdk.org (Zhengyu Gu) Date: Thu, 2 Jan 2025 16:22:35 GMT Subject: RFR: 8346569: Shenandoah: Worker initializes ShenandoahThreadLocalData twice results in memory leak In-Reply-To: References:

Message-ID: On Thu, 19 Dec 2024 23:33:04 GMT, William Kemper wrote: > Good catch! How'd you find this? Thank you for the review. I have a script to capture allocations that have not seen before, I guess it is largely obsoleted by --enable-lsan. ------------- PR Comment: https://git.openjdk.org/jdk/pull/22812#issuecomment-2568026495 From zgu at openjdk.org Thu Jan 2 16:22:36 2025 From: zgu at openjdk.org (Zhengyu Gu) Date: Thu, 2 Jan 2025 16:22:36 GMT Subject: RFR: 8346569: Shenandoah: Worker initializes ShenandoahThreadLocalData twice results in memory leak In-Reply-To: References: Message-ID: <_g_TssIoBU2kwwm1XvAO-4noeadthqn1pPuVnNtW8jg=.dfec6482-ce98-4f8b-9e1b-c56a195cd309@github.com> On Wed, 18 Dec 2024 14:46:57 GMT, Zhengyu Gu wrote: > Worker thread initializes ShenandoahThreadLocalData twice, from Thread's constructor and ShenandoahWorkerThreads::on_create_worker(), that results in leaking ShenandoahEvacuationStats. Can I have a (R)eview? @rkennke and @shipilev? ------------- PR Comment: https://git.openjdk.org/jdk/pull/22812#issuecomment-2568028950 From coleenp at openjdk.org Fri Jan 3 14:38:22 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Fri, 3 Jan 2025 14:38:22 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros Message-ID: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. ------------- Commit messages: - 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros Changes: https://git.openjdk.org/jdk/pull/22916/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=22916&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8346990 Stats: 339 lines in 83 files changed: 0 ins; 13 del; 326 mod Patch: https://git.openjdk.org/jdk/pull/22916.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22916/head:pull/22916 PR: https://git.openjdk.org/jdk/pull/22916 From jwaters at openjdk.org Fri Jan 3 15:43:41 2025 From: jwaters at openjdk.org (Julian Waters) Date: Fri, 3 Jan 2025 15:43:41 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros In-Reply-To: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: On Fri, 3 Jan 2025 14:32:39 GMT, Coleen Phillimore wrote: > There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. > > Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. Speaking of %z, there is a non Standard %Ix in os_windows.cpp tty->print_cr("reserve_memory of %Ix bytes took " JLONG_FORMAT " ms (" JLONG_FORMAT " ticks)", bytes, reserveTimer.milliseconds(), reserveTimer.ticks()); Could changing that to %zu be trivial enough to fit into this change? ------------- PR Comment: https://git.openjdk.org/jdk/pull/22916#issuecomment-2569435948 From coleenp at openjdk.org Fri Jan 3 16:23:31 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Fri, 3 Jan 2025 16:23:31 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v2] In-Reply-To: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: > There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. > > Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: Fix %Ix to %zx. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/22916/files - new: https://git.openjdk.org/jdk/pull/22916/files/6d6fbfa7..1748797a Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=22916&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=22916&range=00-01 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/22916.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22916/head:pull/22916 PR: https://git.openjdk.org/jdk/pull/22916 From coleenp at openjdk.org Fri Jan 3 16:23:32 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Fri, 3 Jan 2025 16:23:32 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros In-Reply-To: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: On Fri, 3 Jan 2025 14:32:39 GMT, Coleen Phillimore wrote: > There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. > > Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. I was going to take on the other FORMAT ones in separate PRs. Sorry I see what you're saying. yes, I'll fix that too. ------------- PR Comment: https://git.openjdk.org/jdk/pull/22916#issuecomment-2569484010 From kbarrett at openjdk.org Sat Jan 4 10:04:46 2025 From: kbarrett at openjdk.org (Kim Barrett) Date: Sat, 4 Jan 2025 10:04:46 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v2] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: On Fri, 3 Jan 2025 16:23:31 GMT, Coleen Phillimore wrote: >> There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. >> >> Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. > > Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: > > Fix %Ix to %zx. Uses of `[U]INTX_FORMAT_X` have been replaced with `0x%zx`. I mentioned the possibility of instead using `%#zx`. I don't know if we really want to use some of the (to me) more obscure flag options though. src/hotspot/cpu/x86/vm_version_x86.cpp line 1725: > 1723: ArrayOperationPartialInlineSize = MaxVectorSize >= 16 ? MaxVectorSize : 0; > 1724: if (ArrayOperationPartialInlineSize) { > 1725: warning("Setting ArrayOperationPartialInlineSize as MaxVectorSize%zd)", MaxVectorSize); pre-existing: seems like there should be a separator of some kind between "MaxVectorSize" and the value, either a space or an "=" would be okay. src/hotspot/os/linux/os_linux.cpp line 1370: > 1368: > 1369: #define _UFM "%zu" > 1370: #define _DFM "%zd" Why not get rid of these? src/hotspot/share/ci/ciMethodData.cpp line 788: > 786: // which makes comparing it with the SA version of this output > 787: // harder. data()'s element type is intptr_t. > 788: out->print(" 0x%zx", data()[i]); Could instead use " %#zx". src/hotspot/share/compiler/disassembler.cpp line 600: > 598: st->print("Stub::%s", desc->name()); > 599: if (desc->begin() != adr) { > 600: st->print("%+zd " PTR_FORMAT, adr - desc->begin(), p2i(adr)); Oh, that's an interesting "abuse" of the `_W` variant. src/hotspot/share/gc/shared/ageTable.cpp line 38: > 36: #include "logging/logStream.hpp" > 37: > 38: /* Copyright (c) 1992, 2025, Oracle and/or its affiliates, and Stanford University. Well this is weird. An atypical copyright down inside the file? src/hotspot/share/oops/instanceKlass.cpp line 3695: > 3693: > 3694: st->print(BULLET"hash_slot: %d", hash_slot()); st->cr(); > 3695: st->print(BULLET"secondary bitmap: " LP64_ONLY("0x%016zu") NOT_LP64("0x%08zu"), _secondary_supers_bitmap); st->cr(); Should be using "zx" rather than "zu". I think this could be written as `"%#0*zx", (2 * BytesPerWord + 2), _secondary_supers_bitmap` That's looking a lot like line noise though. I think this and ones like it probably ought not be changed at all. src/hotspot/share/oops/klass.cpp line 1308: > 1306: if (secondary_supers() != nullptr) { > 1307: st->print(" - "); st->print("%d elements;", _secondary_supers->length()); > 1308: st->print_cr(" bitmap: " LP64_ONLY("0x%016zu") NOT_LP64("0x%08zu"), _secondary_supers_bitmap); Same as in instanceKlass - maybe this shouldn't be changed at all. src/hotspot/share/utilities/globalDefinitions.hpp line 156: > 154: #define UINTX_FORMAT_X_0 "0x%016" PRIxPTR > 155: #else > 156: #define UINTX_FORMAT_X_0 "0x%08" PRIxPTR As noted in places where it's used, I'm not sure we should remove and replace UINTX_FORMAT_X_0. test/hotspot/gtest/utilities/test_globalDefinitions.cpp line 281: > 279: > 280: check_format("%zd", (intx)123, "123"); > 281: check_format("0x%zx", (intx)0x123, "0x123"); Could be "%#zx". test/hotspot/gtest/utilities/test_globalDefinitions.cpp line 286: > 284: > 285: check_format("%zu", (uintx)123u, "123"); > 286: check_format("0x%zx", (uintx)0x123u, "0x123"); Could be "%#zx". ------------- Changes requested by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/22916#pullrequestreview-2530503795 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1902879593 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1902886743 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1902972028 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1902912020 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1902916165 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1902944144 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1902945394 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1902960940 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1902965078 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1902966477 From shade at openjdk.org Mon Jan 6 09:57:35 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 6 Jan 2025 09:57:35 GMT Subject: RFR: 8346569: Shenandoah: Worker initializes ShenandoahThreadLocalData twice results in memory leak In-Reply-To: References: Message-ID: On Wed, 18 Dec 2024 14:46:57 GMT, Zhengyu Gu wrote: > Worker thread initializes ShenandoahThreadLocalData twice, from Thread's constructor and ShenandoahWorkerThreads::on_create_worker(), that results in leaking ShenandoahEvacuationStats. This makes sense, thanks. I see that in all other implementations, `BarrierSet` is responsible for creating thread-local data. AFAICS, this only becomes a problem when we run with generational mode that leaks `ShenandoahEvacuationStats`. ------------- Marked as reviewed by shade (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/22812#pullrequestreview-2531783206 From shade at openjdk.org Mon Jan 6 12:05:12 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 6 Jan 2025 12:05:12 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port Message-ID: **NOTE: This is work-in-progress draft for interested parties. The JEP is not even submitted, let alone targeted.** My plan is to to get this done in a quiet time in mainline to limit the ongoing conflicts with mainline. Feel free to comment in this PR, if you see something ahead of time. These comments might adjust the trajectory we take to implement this removal and/or allows us submit and work out more RFEs ahead of this removal. I plan to re-open a clean PR after this preliminary PR is done, maybe after the round of preliminary reviews. This removes the 32-bit x86 port and does a deeper cleaning in Hotspot. The following paragraphs describe what and why was being done. Easy stuff first: all files named `*_x86_32` are gone. Those are only built when build system knows we are compiling for x86_32. There is therefore no impact on x86_64. The code under `!LP64`, `!AMD64` and `IA32` is removed in `x86`-specific files. There is quite a bit of the code, especially around `Assembler` and `MacroAssembler`. I think these removals make the whole thing cleaner. The downside is that some of the `MacroAssembler::*ptr` functions that were used to select the "machine pointer" instructions either from x86_64 or x86_32 are now exclusively for x86_64. I don't think we want to rewrite `*ptr` -> `*q` at this point. I think we gradually morph the code base to use `*q`-flavored methods in new code. x86_32 is the only platform that has special cases for x87 FPU. C1 even implements the whole separate thing to deal with x87 FPU: the parts of regalloc treat it specially, there is `FpuStackSim`, there is `VerifyFPU` family of flags, etc. There are also peculiarities with FP conversions that use FPU, that's why x86_32 used to have template interpreter stubs for FP conversion methods. None of that is needed anymore without x86_32. This cleans up some arch-specific code as well. Both C1 and C2 implement the workarounds for non-IEEE compliant rounding of x87 FPU. After x86_32 is gone, these are not needed anymore. This removes some C2 nodes, removes the rounding instructions in C1. x86_64 is baselined on SSE2+, the VM would not even start if SSE2 is not supported. Most of the checks that we have for `UseSSE < 2` are for the benefit of x86_32. Because of this I folded redundant `UseSSE` checks around Hotspot. The one thing I _deliberately_ avoided doing is merging `x86.ad` and `x86_64.ad`. It would likely introduce uncomfortable amount of conflicts with pending work in mainline, so I would like to do it separately as the follow-up. ------------- Commit messages: - More cleanups/reversals - More FPU cleanups in C1 regalloc - More touchups - Fix more backsliding LP64 in Assembler - Revert accidental removal in C1 regalloc - C1: Cleanup dead lir_f stack ops - Cleanup more FPU-related stuff - Remove rounding code from C1 and template interpreter - Purge 32-bit specific rounding mode - OS cleanup - ... and 9 more: https://git.openjdk.org/jdk/compare/f1d85ab3...b55fc750 Changes: https://git.openjdk.org/jdk/pull/22567/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=22567&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8345169 Stats: 40692 lines in 213 files changed: 33 ins; 39906 del; 753 mod Patch: https://git.openjdk.org/jdk/pull/22567.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22567/head:pull/22567 PR: https://git.openjdk.org/jdk/pull/22567 From zgu at openjdk.org Mon Jan 6 13:47:42 2025 From: zgu at openjdk.org (Zhengyu Gu) Date: Mon, 6 Jan 2025 13:47:42 GMT Subject: RFR: 8346569: Shenandoah: Worker initializes ShenandoahThreadLocalData twice results in memory leak In-Reply-To: References: Message-ID: <5yrh2oRRSs-L4QZTgyFUTxd-jS0hDSkgWp-Uke5Cg4U=.41fadf56-e372-4b9d-a966-f4803fb6a235@github.com> On Wed, 18 Dec 2024 14:46:57 GMT, Zhengyu Gu wrote: > Worker thread initializes ShenandoahThreadLocalData twice, from Thread's constructor and ShenandoahWorkerThreads::on_create_worker(), that results in leaking ShenandoahEvacuationStats. Thanks, @shipilev ------------- PR Comment: https://git.openjdk.org/jdk/pull/22812#issuecomment-2573140414 From zgu at openjdk.org Mon Jan 6 13:47:42 2025 From: zgu at openjdk.org (Zhengyu Gu) Date: Mon, 6 Jan 2025 13:47:42 GMT Subject: Integrated: 8346569: Shenandoah: Worker initializes ShenandoahThreadLocalData twice results in memory leak In-Reply-To: References: Message-ID: <-O5aGBTWtR__shbWdwHgYg-vWEmktBh59kxQhss9O88=.4e238976-5e83-4fb1-8b3e-5c28f7b2340f@github.com> On Wed, 18 Dec 2024 14:46:57 GMT, Zhengyu Gu wrote: > Worker thread initializes ShenandoahThreadLocalData twice, from Thread's constructor and ShenandoahWorkerThreads::on_create_worker(), that results in leaking ShenandoahEvacuationStats. This pull request has now been integrated. Changeset: dfaa8916 Author: Zhengyu Gu URL: https://git.openjdk.org/jdk/commit/dfaa89162a35acd20b1ed35e147f9626a181510a Stats: 2 lines in 1 file changed: 1 ins; 1 del; 0 mod 8346569: Shenandoah: Worker initializes ShenandoahThreadLocalData twice results in memory leak Reviewed-by: wkemper, shade ------------- PR: https://git.openjdk.org/jdk/pull/22812 From coleenp at openjdk.org Mon Jan 6 15:09:18 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Mon, 6 Jan 2025 15:09:18 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v3] In-Reply-To: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: > There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. > > Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: Fixed some code review comments. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/22916/files - new: https://git.openjdk.org/jdk/pull/22916/files/1748797a..15b1052a Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=22916&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=22916&range=01-02 Stats: 16 lines in 5 files changed: 0 ins; 9 del; 7 mod Patch: https://git.openjdk.org/jdk/pull/22916.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22916/head:pull/22916 PR: https://git.openjdk.org/jdk/pull/22916 From coleenp at openjdk.org Mon Jan 6 15:09:19 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Mon, 6 Jan 2025 15:09:19 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v2] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: <3vQ-kxRahCEhGLRshu6KE_0ZkWCnrgtnyx8cbXsPIeE=.24a34a54-28b0-4202-8ea3-6bd2b7325ce3@github.com> On Fri, 3 Jan 2025 16:23:31 GMT, Coleen Phillimore wrote: >> There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. >> >> Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. > > Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: > > Fix %Ix to %zx. Kim, thanks for slogging through this change. I've updated the patch with your suggested changes. ------------- PR Comment: https://git.openjdk.org/jdk/pull/22916#issuecomment-2573301941 From coleenp at openjdk.org Mon Jan 6 15:09:19 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Mon, 6 Jan 2025 15:09:19 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v2] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com>

Message-ID: On Sat, 4 Jan 2025 09:02:34 GMT, Kim Barrett wrote: >> Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: >> >> Fix %Ix to %zx. > > src/hotspot/os/linux/os_linux.cpp line 1370: > >> 1368: >> 1369: #define _UFM "%zu" >> 1370: #define _DFM "%zd" > > Why not get rid of these? Fixed. > src/hotspot/share/gc/shared/ageTable.cpp line 38: > >> 36: #include "logging/logStream.hpp" >> 37: >> 38: /* Copyright (c) 1992, 2025, Oracle and/or its affiliates, and Stanford University. > > Well this is weird. An atypical copyright down inside the file? This is a relic and not the legal copyright that got updated since nobody noticed. Until you did. Removed. > src/hotspot/share/oops/instanceKlass.cpp line 3695: > >> 3693: >> 3694: st->print(BULLET"hash_slot: %d", hash_slot()); st->cr(); >> 3695: st->print(BULLET"secondary bitmap: " LP64_ONLY("0x%016zu") NOT_LP64("0x%08zu"), _secondary_supers_bitmap); st->cr(); > > Should be using "zx" rather than "zu". I think this could be written as > `"%#0*zx", (2 * BytesPerWord + 2), _secondary_supers_bitmap` > That's looking a lot like line noise though. I think this and ones like it probably ought not be > changed at all. I have to confess that I have no idea what this is trying to show. I'd rather have all the UINTX_FORMAT purged and not leave a remnant for these two special cases. A function whose name describes what this is trying to show would be better. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1904264225 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1904264062 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1904263162 From coleenp at openjdk.org Mon Jan 6 15:24:18 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Mon, 6 Jan 2025 15:24:18 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v4] In-Reply-To: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: > There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. > > Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: Use INTPTR_FORMAT instead of zu for secondary_supers_bitmap. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/22916/files - new: https://git.openjdk.org/jdk/pull/22916/files/15b1052a..6e8b2702 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=22916&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=22916&range=02-03 Stats: 2 lines in 2 files changed: 0 ins; 0 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/22916.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22916/head:pull/22916 PR: https://git.openjdk.org/jdk/pull/22916 From coleenp at openjdk.org Mon Jan 6 15:24:19 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Mon, 6 Jan 2025 15:24:19 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v2] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com>

Message-ID: On Mon, 6 Jan 2025 15:03:34 GMT, Coleen Phillimore wrote: >> src/hotspot/share/oops/instanceKlass.cpp line 3695: >> >>> 3693: >>> 3694: st->print(BULLET"hash_slot: %d", hash_slot()); st->cr(); >>> 3695: st->print(BULLET"secondary bitmap: " LP64_ONLY("0x%016zu") NOT_LP64("0x%08zu"), _secondary_supers_bitmap); st->cr(); >> >> Should be using "zx" rather than "zu". I think this could be written as >> `"%#0*zx", (2 * BytesPerWord + 2), _secondary_supers_bitmap` >> That's looking a lot like line noise though. I think this and ones like it probably ought not be >> changed at all. > > I have to confess that I have no idea what this is trying to show. I'd rather have all the UINTX_FORMAT purged and not leave a remnant for these two special cases. A function whose name describes what this is trying to show would be better. @theRealAph added this with the secondary super cache work, but I think it may have also been meant to be zx because of the leading 0x. So INTPTR_FORMAT would also work. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1904284828 From coleenp at openjdk.org Mon Jan 6 16:02:36 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Mon, 6 Jan 2025 16:02:36 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v2] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com>

Message-ID: On Sat, 4 Jan 2025 09:52:00 GMT, Kim Barrett wrote: >> Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: >> >> Fix %Ix to %zx. > > test/hotspot/gtest/utilities/test_globalDefinitions.cpp line 281: > >> 279: >> 280: check_format("%zd", (intx)123, "123"); >> 281: check_format("0x%zx", (intx)0x123, "0x123"); > > Could be "%#zx". I fixed this. This seems ok. I didn't know about this format option tbh but if it's standard, why not? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1904331779 From kvn at openjdk.org Mon Jan 6 17:49:41 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Mon, 6 Jan 2025 17:49:41 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References: Message-ID: On Thu, 5 Dec 2024 08:26:10 GMT, Aleksey Shipilev wrote: > **NOTE: This is work-in-progress draft for interested parties. The JEP is not even submitted, let alone targeted.** > > My plan is to to get this done in a quiet time in mainline to limit the ongoing conflicts with mainline. Feel free to comment in this PR, if you see something ahead of time. These comments might adjust the trajectory we take to implement this removal and/or allows us submit and work out more RFEs ahead of this removal. I plan to re-open a clean PR after this preliminary PR is done, maybe after the round of preliminary reviews. > > This removes the 32-bit x86 port and does a deeper cleaning in Hotspot. The following paragraphs describe what and why was being done. > > Easy stuff first: all files named `*_x86_32` are gone. Those are only built when build system knows we are compiling for x86_32. There is therefore no impact on x86_64. > > The code under `!LP64`, `!AMD64` and `IA32` is removed in `x86`-specific files. There is quite a bit of the code, especially around `Assembler` and `MacroAssembler`. I think these removals make the whole thing cleaner. The downside is that some of the `MacroAssembler::*ptr` functions that were used to select the "machine pointer" instructions either from x86_64 or x86_32 are now exclusively for x86_64. I don't think we want to rewrite `*ptr` -> `*q` at this point. I think we gradually morph the code base to use `*q`-flavored methods in new code. > > x86_32 is the only platform that has special cases for x87 FPU. > > C1 even implements the whole separate thing to deal with x87 FPU: the parts of regalloc treat it specially, there is `FpuStackSim`, there is `VerifyFPU` family of flags, etc. There are also peculiarities with FP conversions that use FPU, that's why x86_32 used to have template interpreter stubs for FP conversion methods. None of that is needed anymore without x86_32. This cleans up some arch-specific code as well. > > Both C1 and C2 implement the workarounds for non-IEEE compliant rounding of x87 FPU. After x86_32 is gone, these are not needed anymore. This removes some C2 nodes, removes the rounding instructions in C1. > > x86_64 is baselined on SSE2+, the VM would not even start if SSE2 is not supported. Most of the checks that we have for `UseSSE < 2` are for the benefit of x86_32. Because of this I folded redundant `UseSSE` checks around Hotspot. > > The one thing I _deliberately_ avoided doing is merging `x86.ad` and `x86_64.ad`. It would likely introduce uncomfortable amount of conflicts with pending work in mainli... > The one thing I deliberately avoided doing is merging x86.ad and x86_64.ad. I think we can keep them separate (big .ad files is difficult to navigate). `x86.ad` is mostly used for vector instructions. We can rename it to ``x86_vect.ad`. And `x86_64.ad` to `x86.ad`. As followup changes. ------------- PR Comment: https://git.openjdk.org/jdk/pull/22567#issuecomment-2573606824 From kvn at openjdk.org Mon Jan 6 17:53:35 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Mon, 6 Jan 2025 17:53:35 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References: Message-ID: On Thu, 5 Dec 2024 08:26:10 GMT, Aleksey Shipilev wrote: > **NOTE: This is work-in-progress draft for interested parties. The JEP is not even submitted, let alone targeted.** > > My plan is to to get this done in a quiet time in mainline to limit the ongoing conflicts with mainline. Feel free to comment in this PR, if you see something ahead of time. These comments might adjust the trajectory we take to implement this removal and/or allows us submit and work out more RFEs ahead of this removal. I plan to re-open a clean PR after this preliminary PR is done, maybe after the round of preliminary reviews. > > This removes the 32-bit x86 port and does a deeper cleaning in Hotspot. The following paragraphs describe what and why was being done. > > Easy stuff first: all files named `*_x86_32` are gone. Those are only built when build system knows we are compiling for x86_32. There is therefore no impact on x86_64. > > The code under `!LP64`, `!AMD64` and `IA32` is removed in `x86`-specific files. There is quite a bit of the code, especially around `Assembler` and `MacroAssembler`. I think these removals make the whole thing cleaner. The downside is that some of the `MacroAssembler::*ptr` functions that were used to select the "machine pointer" instructions either from x86_64 or x86_32 are now exclusively for x86_64. I don't think we want to rewrite `*ptr` -> `*q` at this point. I think we gradually morph the code base to use `*q`-flavored methods in new code. > > x86_32 is the only platform that has special cases for x87 FPU. > > C1 even implements the whole separate thing to deal with x87 FPU: the parts of regalloc treat it specially, there is `FpuStackSim`, there is `VerifyFPU` family of flags, etc. There are also peculiarities with FP conversions that use FPU, that's why x86_32 used to have template interpreter stubs for FP conversion methods. None of that is needed anymore without x86_32. This cleans up some arch-specific code as well. > > Both C1 and C2 implement the workarounds for non-IEEE compliant rounding of x87 FPU. After x86_32 is gone, these are not needed anymore. This removes some C2 nodes, removes the rounding instructions in C1. > > x86_64 is baselined on SSE2+, the VM would not even start if SSE2 is not supported. Most of the checks that we have for `UseSSE < 2` are for the benefit of x86_32. Because of this I folded redundant `UseSSE` checks around Hotspot. > > The one thing I _deliberately_ avoided doing is merging `x86.ad` and `x86_64.ad`. It would likely introduce uncomfortable amount of conflicts with pending work in mainli... I don't see make files changes. ------------- PR Comment: https://git.openjdk.org/jdk/pull/22567#issuecomment-2573613772 From kvn at openjdk.org Mon Jan 6 18:01:50 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Mon, 6 Jan 2025 18:01:50 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References: Message-ID: On Thu, 5 Dec 2024 08:26:10 GMT, Aleksey Shipilev wrote: > **NOTE: This is work-in-progress draft for interested parties. The JEP is not even submitted, let alone targeted.** > > My plan is to to get this done in a quiet time in mainline to limit the ongoing conflicts with mainline. Feel free to comment in this PR, if you see something ahead of time. These comments might adjust the trajectory we take to implement this removal and/or allows us submit and work out more RFEs ahead of this removal. I plan to re-open a clean PR after this preliminary PR is done, maybe after the round of preliminary reviews. > > This removes the 32-bit x86 port and does a deeper cleaning in Hotspot. The following paragraphs describe what and why was being done. > > Easy stuff first: all files named `*_x86_32` are gone. Those are only built when build system knows we are compiling for x86_32. There is therefore no impact on x86_64. > > The code under `!LP64`, `!AMD64` and `IA32` is removed in `x86`-specific files. There is quite a bit of the code, especially around `Assembler` and `MacroAssembler`. I think these removals make the whole thing cleaner. The downside is that some of the `MacroAssembler::*ptr` functions that were used to select the "machine pointer" instructions either from x86_64 or x86_32 are now exclusively for x86_64. I don't think we want to rewrite `*ptr` -> `*q` at this point. I think we gradually morph the code base to use `*q`-flavored methods in new code. > > x86_32 is the only platform that has special cases for x87 FPU. > > C1 even implements the whole separate thing to deal with x87 FPU: the parts of regalloc treat it specially, there is `FpuStackSim`, there is `VerifyFPU` family of flags, etc. There are also peculiarities with FP conversions that use FPU, that's why x86_32 used to have template interpreter stubs for FP conversion methods. None of that is needed anymore without x86_32. This cleans up some arch-specific code as well. > > Both C1 and C2 implement the workarounds for non-IEEE compliant rounding of x87 FPU. After x86_32 is gone, these are not needed anymore. This removes some C2 nodes, removes the rounding instructions in C1. > > x86_64 is baselined on SSE2+, the VM would not even start if SSE2 is not supported. Most of the checks that we have for `UseSSE < 2` are for the benefit of x86_32. Because of this I folded redundant `UseSSE` checks around Hotspot. > > The one thing I _deliberately_ avoided doing is merging `x86.ad` and `x86_64.ad`. It would likely introduce uncomfortable amount of conflicts with pending work in mainli... It would be nice to split this into separate PRs for easy review. Removing "rounding of x87 FPU" could be definitely done separately. ------------- PR Comment: https://git.openjdk.org/jdk/pull/22567#issuecomment-2573626448 From wkemper at openjdk.org Mon Jan 6 18:08:08 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 6 Jan 2025 18:08:08 GMT Subject: [jdk24] RFR: 8345970: pthread_getcpuclockid related crashes in shenandoah tests Message-ID: <3queiTTYxaqjTtFWIMIQ6AMERNOr4BF4iLpp_5iVvRs=.506092ae-2458-4e41-97af-4e90630456fb@github.com> Clean backport. Fixes acute issue with musl libc (used by Alpine Linux). ------------- Commit messages: - Backport 2ce53e88481659734bc5424c643c5e31c116bc5d Changes: https://git.openjdk.org/jdk/pull/22933/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=22933&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8345970 Stats: 18 lines in 4 files changed: 15 ins; 3 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/22933.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22933/head:pull/22933 PR: https://git.openjdk.org/jdk/pull/22933 From shade at openjdk.org Mon Jan 6 18:19:40 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 6 Jan 2025 18:19:40 GMT Subject: [jdk24] RFR: 8345970: pthread_getcpuclockid related crashes in shenandoah tests In-Reply-To: <3queiTTYxaqjTtFWIMIQ6AMERNOr4BF4iLpp_5iVvRs=.506092ae-2458-4e41-97af-4e90630456fb@github.com> References: <3queiTTYxaqjTtFWIMIQ6AMERNOr4BF4iLpp_5iVvRs=.506092ae-2458-4e41-97af-4e90630456fb@github.com> Message-ID: On Mon, 6 Jan 2025 18:03:20 GMT, William Kemper wrote: > Clean backport. Fixes acute issue with musl libc (used by Alpine Linux). Marked as reviewed by shade (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/22933#pullrequestreview-2532708736 From wkemper at openjdk.org Mon Jan 6 18:27:41 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 6 Jan 2025 18:27:41 GMT Subject: [jdk24] Integrated: 8345970: pthread_getcpuclockid related crashes in shenandoah tests In-Reply-To: <3queiTTYxaqjTtFWIMIQ6AMERNOr4BF4iLpp_5iVvRs=.506092ae-2458-4e41-97af-4e90630456fb@github.com> References: <3queiTTYxaqjTtFWIMIQ6AMERNOr4BF4iLpp_5iVvRs=.506092ae-2458-4e41-97af-4e90630456fb@github.com> Message-ID: On Mon, 6 Jan 2025 18:03:20 GMT, William Kemper wrote: > Clean backport. Fixes acute issue with musl libc (used by Alpine Linux). This pull request has now been integrated. Changeset: cc7c293b Author: William Kemper URL: https://git.openjdk.org/jdk/commit/cc7c293bce8a564943606dbbcad64db96909d68a Stats: 18 lines in 4 files changed: 15 ins; 3 del; 0 mod 8345970: pthread_getcpuclockid related crashes in shenandoah tests Reviewed-by: shade Backport-of: 2ce53e88481659734bc5424c643c5e31c116bc5d ------------- PR: https://git.openjdk.org/jdk/pull/22933 From kdnilsen at openjdk.org Mon Jan 6 19:02:35 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Mon, 6 Jan 2025 19:02:35 GMT Subject: RFR: 8344049: Shenandoah: Eliminate init-update-refs safepoint In-Reply-To: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> References: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> Message-ID: On Wed, 11 Dec 2024 19:08:08 GMT, William Kemper wrote: > Shenandoah typically takes 4 safepoints per GC cycle. Although Shenandoah itself does not spend much time on these safepoints, it may still take quite some time for all of the mutator threads to reach the safepoint. The occasionally long time-to-safepoint increases latency in the higher percentiles. > > The `init-update-refs` safepoint is responsible for retiring GCLABs (and PLABs) used during evacuation. Once evacuation is complete, no threads will access these LABs. This need not be done on a safepoint. `init-update-refs` is also where the global and thread local copies of the `gc_state` are updated. However, here we are turning off the `WEAK_ROOTS` flag _after_ all of the unmarked weak referents have been `nulled` out, so this does not need to happen atomically with respect to the mutators. Neither is it necessary to change the other state flags (EVACUATION, UPDATE_REFS) atomically across all mutators. > > Note that the `init-update-refs` safepoint is still taken if either verification or `ShenandoahPacing` are enabled. Thanks. Looks very clean for how significant the change is in behavior... ------------- Marked as reviewed by kdnilsen (Author). PR Review: https://git.openjdk.org/jdk/pull/22688#pullrequestreview-2532779642 From dholmes at openjdk.org Tue Jan 7 06:24:41 2025 From: dholmes at openjdk.org (David Holmes) Date: Tue, 7 Jan 2025 06:24:41 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References: Message-ID: On Thu, 5 Dec 2024 08:26:10 GMT, Aleksey Shipilev wrote: > **NOTE: This is work-in-progress draft for interested parties. The JEP is not even submitted, let alone targeted.** > > My plan is to to get this done in a quiet time in mainline to limit the ongoing conflicts with mainline. Feel free to comment in this PR, if you see something ahead of time. These comments might adjust the trajectory we take to implement this removal and/or allows us submit and work out more RFEs ahead of this removal. I plan to re-open a clean PR after this preliminary PR is done, maybe after the round of preliminary reviews. > > This removes the 32-bit x86 port and does a deeper cleaning in Hotspot. The following paragraphs describe what and why was being done. > > Easy stuff first: all files named `*_x86_32` are gone. Those are only built when build system knows we are compiling for x86_32. There is therefore no impact on x86_64. > > The code under `!LP64`, `!AMD64` and `IA32` is removed in `x86`-specific files. There is quite a bit of the code, especially around `Assembler` and `MacroAssembler`. I think these removals make the whole thing cleaner. The downside is that some of the `MacroAssembler::*ptr` functions that were used to select the "machine pointer" instructions either from x86_64 or x86_32 are now exclusively for x86_64. I don't think we want to rewrite `*ptr` -> `*q` at this point. I think we gradually morph the code base to use `*q`-flavored methods in new code. > > x86_32 is the only platform that has special cases for x87 FPU. > > C1 even implements the whole separate thing to deal with x87 FPU: the parts of regalloc treat it specially, there is `FpuStackSim`, there is `VerifyFPU` family of flags, etc. There are also peculiarities with FP conversions that use FPU, that's why x86_32 used to have template interpreter stubs for FP conversion methods. None of that is needed anymore without x86_32. This cleans up some arch-specific code as well. > > Both C1 and C2 implement the workarounds for non-IEEE compliant rounding of x87 FPU. After x86_32 is gone, these are not needed anymore. This removes some C2 nodes, removes the rounding instructions in C1. > > x86_64 is baselined on SSE2+, the VM would not even start if SSE2 is not supported. Most of the checks that we have for `UseSSE < 2` are for the benefit of x86_32. Because of this I folded redundant `UseSSE` checks around Hotspot. > > The one thing I _deliberately_ avoided doing is merging `x86.ad` and `x86_64.ad`. It would likely introduce uncomfortable amount of conflicts with pending work in mainli... src/hotspot/share/interpreter/abstractInterpreter.cpp line 137: > 135: case vmIntrinsics::_floatToRawIntBits: return java_lang_Float_floatToRawIntBits; > 136: case vmIntrinsics::_longBitsToDouble: return java_lang_Double_longBitsToDouble; > 137: case vmIntrinsics::_doubleToRawLongBits: return java_lang_Double_doubleToRawLongBits; Why are these intrinsics for the Java methods disappearing? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22567#discussion_r1904957718 From kbarrett at openjdk.org Tue Jan 7 08:34:41 2025 From: kbarrett at openjdk.org (Kim Barrett) Date: Tue, 7 Jan 2025 08:34:41 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v2] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com>

Message-ID: On Mon, 6 Jan 2025 15:04:19 GMT, Coleen Phillimore wrote: >> src/hotspot/share/gc/shared/ageTable.cpp line 38: >> >>> 36: #include "logging/logStream.hpp" >>> 37: >>> 38: /* Copyright (c) 1992, 2025, Oracle and/or its affiliates, and Stanford University. >> >> Well this is weird. An atypical copyright down inside the file? > > This is a relic and not the legal copyright that got updated since nobody noticed. Until you did. Removed. Not sure we're allowed to remove a copyright statement, even if not in the usual place. >> test/hotspot/gtest/utilities/test_globalDefinitions.cpp line 281: >> >>> 279: >>> 280: check_format("%zd", (intx)123, "123"); >>> 281: check_format("0x%zx", (intx)0x123, "0x123"); >> >> Could be "%#zx". > > I fixed this. This seems ok. I didn't know about this format option tbh but if it's standard, why not? I'd forgotten about that format option too, which is why I'm not enamored of it. Also, written that way the prefix gets included in the width when dealing with field width, which might not be great either. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1905081061 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1905079637 From kbarrett at openjdk.org Tue Jan 7 08:34:42 2025 From: kbarrett at openjdk.org (Kim Barrett) Date: Tue, 7 Jan 2025 08:34:42 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v2] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com>

Message-ID: On Mon, 6 Jan 2025 15:21:14 GMT, Coleen Phillimore wrote: >> I have to confess that I have no idea what this is trying to show. I'd rather have all the UINTX_FORMAT purged and not leave a remnant for these two special cases. A function whose name describes what this is trying to show would be better. > > @theRealAph added this with the secondary super cache work, but I think it may have also been meant to be zx because of the leading 0x. So INTPTR_FORMAT would also work. I don't think we should be mixing uintx types and UINTPTR_FORMAT like that. As I said earlier, this is one that I think probably ought not be changed at all. I think some of the FORMAT macros are useful to avoid inline format directives that resemble line noise, or ugly conditionals like that. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1905076840 From kbarrett at openjdk.org Tue Jan 7 08:52:42 2025 From: kbarrett at openjdk.org (Kim Barrett) Date: Tue, 7 Jan 2025 08:52:42 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v2] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com>

Message-ID: On Tue, 7 Jan 2025 08:28:32 GMT, Kim Barrett wrote: >> @theRealAph added this with the secondary super cache work, but I think it may have also been meant to be zx because of the leading 0x. So INTPTR_FORMAT would also work. > > I don't think we should be mixing uintx types and UINTPTR_FORMAT like that. As I said earlier, this is one that > I think probably ought not be changed at all. I think some of the FORMAT macros are useful to avoid inline > format directives that resemble line noise, or ugly conditionals like that. Improving on my prior suggestion `"%#.*zx", (2 * BytesPerWord), _secondary_supers_bitmap` Using precision rather than field width, to avoid needing to account for the prefix in the width calculation. But still looking a lot like line noise, and still think it shouldn't be changed. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1905101674 From kbarrett at openjdk.org Tue Jan 7 08:52:43 2025 From: kbarrett at openjdk.org (Kim Barrett) Date: Tue, 7 Jan 2025 08:52:43 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v2] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com>

Message-ID: On Tue, 7 Jan 2025 08:31:13 GMT, Kim Barrett wrote: >> I fixed this. This seems ok. I didn't know about this format option tbh but if it's standard, why not? > > I'd forgotten about that format option too, which is why I'm not enamored of it. Also, written that way the > prefix gets included in the width when dealing with field width, which might not be great either. The problem of accounting for the prefix in the field width calculation can be dealt with by using precision rather than field width. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1905104428 From shade at openjdk.org Tue Jan 7 09:15:41 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Tue, 7 Jan 2025 09:15:41 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References:

Message-ID: On Tue, 7 Jan 2025 06:21:50 GMT, David Holmes wrote: >> **NOTE: This is work-in-progress draft for interested parties. The JEP is not even submitted, let alone targeted.** >> >> My plan is to to get this done in a quiet time in mainline to limit the ongoing conflicts with mainline. Feel free to comment in this PR, if you see something ahead of time. These comments might adjust the trajectory we take to implement this removal and/or allows us submit and work out more RFEs ahead of this removal. I plan to re-open a clean PR after this preliminary PR is done, maybe after the round of preliminary reviews. >> >> This removes the 32-bit x86 port and does a deeper cleaning in Hotspot. The following paragraphs describe what and why was being done. >> >> Easy stuff first: all files named `*_x86_32` are gone. Those are only built when build system knows we are compiling for x86_32. There is therefore no impact on x86_64. >> >> The code under `!LP64`, `!AMD64` and `IA32` is removed in `x86`-specific files. There is quite a bit of the code, especially around `Assembler` and `MacroAssembler`. I think these removals make the whole thing cleaner. The downside is that some of the `MacroAssembler::*ptr` functions that were used to select the "machine pointer" instructions either from x86_64 or x86_32 are now exclusively for x86_64. I don't think we want to rewrite `*ptr` -> `*q` at this point. I think we gradually morph the code base to use `*q`-flavored methods in new code. >> >> x86_32 is the only platform that has special cases for x87 FPU. >> >> C1 even implements the whole separate thing to deal with x87 FPU: the parts of regalloc treat it specially, there is `FpuStackSim`, there is `VerifyFPU` family of flags, etc. There are also peculiarities with FP conversions that use FPU, that's why x86_32 used to have template interpreter stubs for FP conversion methods. None of that is needed anymore without x86_32. This cleans up some arch-specific code as well. >> >> Both C1 and C2 implement the workarounds for non-IEEE compliant rounding of x87 FPU. After x86_32 is gone, these are not needed anymore. This removes some C2 nodes, removes the rounding instructions in C1. >> >> x86_64 is baselined on SSE2+, the VM would not even start if SSE2 is not supported. Most of the checks that we have for `UseSSE < 2` are for the benefit of x86_32. Because of this I folded redundant `UseSSE` checks around Hotspot. >> >> The one thing I _deliberately_ avoided doing is merging `x86.ad` and `x86_64.ad`. It would likely introduce uncomfortable amount of... > > src/hotspot/share/interpreter/abstractInterpreter.cpp line 137: > >> 135: case vmIntrinsics::_floatToRawIntBits: return java_lang_Float_floatToRawIntBits; >> 136: case vmIntrinsics::_longBitsToDouble: return java_lang_Double_longBitsToDouble; >> 137: case vmIntrinsics::_doubleToRawLongBits: return java_lang_Double_doubleToRawLongBits; > > Why are these intrinsics for the Java methods disappearing? These are interpreter "intrinsics" that are only implemented on x86_32 to handle x87 FPU pecularities. Look around for `TemplateInterpreterGenerator::generate_Float_intBitsToFloat_entry`, for example. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22567#discussion_r1905134973 From coleenp at openjdk.org Tue Jan 7 12:36:46 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Tue, 7 Jan 2025 12:36:46 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v2] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com>

Message-ID: On Tue, 7 Jan 2025 08:50:04 GMT, Kim Barrett wrote: >> I'd forgotten about that format option too, which is why I'm not enamored of it. Also, written that way the >> prefix gets included in the width when dealing with field width, which might not be great either. > > The problem of accounting for the prefix in the field width calculation can be dealt with by using precision > rather than field width. Well then that leaves the fun of dealing with these format specifiers when you're trying to do your own formatting. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1905390045 From coleenp at openjdk.org Tue Jan 7 12:51:33 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Tue, 7 Jan 2025 12:51:33 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v5] In-Reply-To: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: > There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. > > Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: Restore copyright and macro. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/22916/files - new: https://git.openjdk.org/jdk/pull/22916/files/6e8b2702..ae9d9f6f Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=22916&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=22916&range=03-04 Stats: 8 lines in 4 files changed: 5 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/22916.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22916/head:pull/22916 PR: https://git.openjdk.org/jdk/pull/22916 From coleenp at openjdk.org Tue Jan 7 12:51:33 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Tue, 7 Jan 2025 12:51:33 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v2] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com>

Message-ID: On Tue, 7 Jan 2025 08:48:08 GMT, Kim Barrett wrote: >> I don't think we should be mixing uintx types and UINTPTR_FORMAT like that. As I said earlier, this is one that >> I think probably ought not be changed at all. I think some of the FORMAT macros are useful to avoid inline >> format directives that resemble line noise, or ugly conditionals like that. > > Improving on my prior suggestion > `"%#.*zx", (2 * BytesPerWord), _secondary_supers_bitmap` > Using precision rather than field width, to avoid needing to account for the prefix in the width calculation. > But still looking a lot like line noise, and still think it shouldn't be changed. Yes, this looks horrible. The macro that I was trying to remove is better. I restored but moved it. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1905405696 From ysr at openjdk.org Wed Jan 8 16:43:02 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 8 Jan 2025 16:43:02 GMT Subject: RFR: 8344049: Shenandoah: Eliminate init-update-refs safepoint In-Reply-To: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> References: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> Message-ID: On Wed, 11 Dec 2024 19:08:08 GMT, William Kemper wrote: > Shenandoah typically takes 4 safepoints per GC cycle. Although Shenandoah itself does not spend much time on these safepoints, it may still take quite some time for all of the mutator threads to reach the safepoint. The occasionally long time-to-safepoint increases latency in the higher percentiles. > > The `init-update-refs` safepoint is responsible for retiring GCLABs (and PLABs) used during evacuation. Once evacuation is complete, no threads will access these LABs. This need not be done on a safepoint. `init-update-refs` is also where the global and thread local copies of the `gc_state` are updated. However, here we are turning off the `WEAK_ROOTS` flag _after_ all of the unmarked weak referents have been `nulled` out, so this does not need to happen atomically with respect to the mutators. Neither is it necessary to change the other state flags (EVACUATION, UPDATE_REFS) atomically across all mutators. > > Note that the `init-update-refs` safepoint is still taken if either verification or `ShenandoahPacing` are enabled. Looks good... Left a few documentation request comments. I haven't fully wrapped my head around the correctness of this yet (sorry, slow start to the new year :-), and will go over it again and complete it a bit later today after I get to the office. src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 1166: > 1164: } > 1165: > 1166: if (VerifyAfterGC) { What are the conventions of when to use Verify{Before,After,During}GC on the one hand, vs ShenandoahVerify, G1Verify* etc., on the other? src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 2650: > 2648: bool ShenandoahHeap::is_gc_state(GCState state) const { > 2649: return _gc_state_changed ? _gc_state.is_set(state) : ShenandoahThreadLocalData::is_gc_state(state); > 2650: } This needs a documentation comment, please; e.g. why we check `_gc_state_changed` before we check the global state. Is the transition of the local and global states wrt the phase described in a comment somewhere else already? src/hotspot/share/gc/shenandoah/shenandoahHeap.hpp line 371: > 369: public: > 370: char gc_state() const; > 371: bool is_gc_state(GCState state) const; Can you write a 1-line documentation comment for this method? It would make its implementation clearer. (See my comment in the method's implementation.) src/hotspot/share/gc/shenandoah/shenandoahHeap.hpp line 374: > 372: > 373: // This copies the global gc state into a thread local variable for all threads. > 374: // It is primarily intended to support quick access at barriers. All threads are Instead of "It ..." say "The thread local gc state ..." src/hotspot/share/gc/shenandoah/shenandoahHeapRegionCounters.cpp line 150: > 148: return 3; > 149: } > 150: if (heap->is_concurrent_mark_in_progress() || heap->is_concurrent_weak_root_in_progress() || heap->is_full_gc_in_progress()) { naive question: where are the counters/encoding used? ------------- PR Review: https://git.openjdk.org/jdk/pull/22688#pullrequestreview-2536114812 PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1907457433 PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1906544298 PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1906551170 PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1906548845 PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1906541100 From ysr at openjdk.org Wed Jan 8 16:43:02 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Wed, 8 Jan 2025 16:43:02 GMT Subject: RFR: 8344049: Shenandoah: Eliminate init-update-refs safepoint In-Reply-To: References: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> Message-ID: On Wed, 8 Jan 2025 06:30:45 GMT, Y. Srinivas Ramakrishna wrote: >> Shenandoah typically takes 4 safepoints per GC cycle. Although Shenandoah itself does not spend much time on these safepoints, it may still take quite some time for all of the mutator threads to reach the safepoint. The occasionally long time-to-safepoint increases latency in the higher percentiles. >> >> The `init-update-refs` safepoint is responsible for retiring GCLABs (and PLABs) used during evacuation. Once evacuation is complete, no threads will access these LABs. This need not be done on a safepoint. `init-update-refs` is also where the global and thread local copies of the `gc_state` are updated. However, here we are turning off the `WEAK_ROOTS` flag _after_ all of the unmarked weak referents have been `nulled` out, so this does not need to happen atomically with respect to the mutators. Neither is it necessary to change the other state flags (EVACUATION, UPDATE_REFS) atomically across all mutators. >> >> Note that the `init-update-refs` safepoint is still taken if either verification or `ShenandoahPacing` are enabled. > > src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 2650: > >> 2648: bool ShenandoahHeap::is_gc_state(GCState state) const { >> 2649: return _gc_state_changed ? _gc_state.is_set(state) : ShenandoahThreadLocalData::is_gc_state(state); >> 2650: } > > This needs a documentation comment, please; e.g. why we check `_gc_state_changed` before we check the global state. Is the transition of the local and global states wrt the phase described in a comment somewhere else already? Or is this a common idiom used elsewhere as well, and already well-documented? > src/hotspot/share/gc/shenandoah/shenandoahHeap.hpp line 371: > >> 369: public: >> 370: char gc_state() const; >> 371: bool is_gc_state(GCState state) const; > > Can you write a 1-line documentation comment for this method? It would make its implementation clearer. (See my comment in the method's implementation.) (e.g. that, unlike comment at line 366, this must return the "right" value even at non-safepoints.) ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1907477306 PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1906552223 From wkemper at openjdk.org Wed Jan 8 20:28:25 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 8 Jan 2025 20:28:25 GMT Subject: RFR: 8338737: Shenandoah: Reset marking bitmaps after the cycle [v3] In-Reply-To: References: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com> <__kORuPC0guQED9-jn2Xg9CFIJ15wVRojwZoy_VqcPs=.0e5c812f-9e4e-4396-8acd-1e84a5e598c5@github.com>

Message-ID: On Sat, 21 Dec 2024 01:48:10 GMT, Xiaolong Peng wrote: >> The old cycle may be preempted by young collections, but it is only really _cancelled_ by global cycles or full GCs. Control thread will resume old marking, but this operates independently from young bitmap regions. I think we can reset young region bitmaps even when concurrent old marking is on going. > > I think we are taking about the same thing, old gen could be preempted by young gc and resumed after the cycle. I have seem crash from gc verification caused by this, an old gc was bootstrapped but it was preempted/canceled multiple times right after the old gc started, eventually caused a crash from verifier because it expected the object in young is marked. I will share the gc log on slack later. That sounds like an issue with the verifier then? Once a young cycle is complete, nothing should depend on the state of the bitmaps for young regions (if, for no other reason, evacuation could have moved objects so that the bitmaps no longer represent the addresses of marked objects that were evacuated). ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22778#discussion_r1907802968 From wkemper at openjdk.org Wed Jan 8 20:41:47 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 8 Jan 2025 20:41:47 GMT Subject: RFR: 8344049: Shenandoah: Eliminate init-update-refs safepoint In-Reply-To: References: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> Message-ID: On Wed, 8 Jan 2025 06:26:38 GMT, Y. Srinivas Ramakrishna wrote: >> Shenandoah typically takes 4 safepoints per GC cycle. Although Shenandoah itself does not spend much time on these safepoints, it may still take quite some time for all of the mutator threads to reach the safepoint. The occasionally long time-to-safepoint increases latency in the higher percentiles. >> >> The `init-update-refs` safepoint is responsible for retiring GCLABs (and PLABs) used during evacuation. Once evacuation is complete, no threads will access these LABs. This need not be done on a safepoint. `init-update-refs` is also where the global and thread local copies of the `gc_state` are updated. However, here we are turning off the `WEAK_ROOTS` flag _after_ all of the unmarked weak referents have been `nulled` out, so this does not need to happen atomically with respect to the mutators. Neither is it necessary to change the other state flags (EVACUATION, UPDATE_REFS) atomically across all mutators. >> >> Note that the `init-update-refs` safepoint is still taken if either verification or `ShenandoahPacing` are enabled. > > src/hotspot/share/gc/shenandoah/shenandoahHeapRegionCounters.cpp line 150: > >> 148: return 3; >> 149: } >> 150: if (heap->is_concurrent_mark_in_progress() || heap->is_concurrent_weak_root_in_progress() || heap->is_full_gc_in_progress()) { > > naive question: where are the counters/encoding used? They get put in `PerfData` variables. They also may be serialized in a log. The [Shenandoah Visualizer](https://github.com/openjdk/shenandoah-visualizer) is able to render them. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1907839891 From wkemper at openjdk.org Wed Jan 8 20:45:48 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 8 Jan 2025 20:45:48 GMT Subject: RFR: 8344049: Shenandoah: Eliminate init-update-refs safepoint In-Reply-To: References: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> Message-ID: On Wed, 8 Jan 2025 16:25:23 GMT, Y. Srinivas Ramakrishna wrote: >> Shenandoah typically takes 4 safepoints per GC cycle. Although Shenandoah itself does not spend much time on these safepoints, it may still take quite some time for all of the mutator threads to reach the safepoint. The occasionally long time-to-safepoint increases latency in the higher percentiles. >> >> The `init-update-refs` safepoint is responsible for retiring GCLABs (and PLABs) used during evacuation. Once evacuation is complete, no threads will access these LABs. This need not be done on a safepoint. `init-update-refs` is also where the global and thread local copies of the `gc_state` are updated. However, here we are turning off the `WEAK_ROOTS` flag _after_ all of the unmarked weak referents have been `nulled` out, so this does not need to happen atomically with respect to the mutators. Neither is it necessary to change the other state flags (EVACUATION, UPDATE_REFS) atomically across all mutators. >> >> Note that the `init-update-refs` safepoint is still taken if either verification or `ShenandoahPacing` are enabled. > > src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 1166: > >> 1164: } >> 1165: >> 1166: if (VerifyAfterGC) { > > What are the conventions of when to use Verify{Before,After,During}GC on the one hand, vs ShenandoahVerify, G1Verify* etc., on the other? I don't really think there is a convention. In this particular case, it was "verifying" before concurrent reference processing was complete, which could lead to erroneous verification failures. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1907844225 From kdnilsen at openjdk.org Wed Jan 8 20:52:44 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 8 Jan 2025 20:52:44 GMT Subject: RFR: 8338737: Shenandoah: Reset marking bitmaps after the cycle [v3] In-Reply-To: References: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com> Message-ID: On Mon, 30 Dec 2024 22:54:27 GMT, Xiaolong Peng wrote: >> Reset marking bitmaps after collection cycle; for GenShen only do this for young generation, also choose not do this for Degen and full GC since both are running at safepoint, we should leave safepoint as ASAP. >> >> I have run same workload for 30s with Shenandoah in generational mode and classic mode, average average time of concurrent reset dropped significantly since in most case bitmap for young gen should have been reset after pervious concurrent cycle finishes if there is no need to preserve bitmap states. >> >> GenShen: >> Before: >> >> [33.342s][info][gc,stats ] Concurrent Reset = 0.023 s (a = 1921 us) (n = 12) (lvls, us = 133, 385, 1191, 1836, 8878) >> >> >> After: >> >> [33.597s][info][gc,stats ] Concurrent Reset = 0.004 s (a = 317 us) (n = 13) (lvls, us = 58, 119, 217, 410, 670) >> [33.597s][info][gc,stats ] Concurrent Reset After Collect = 0.018 s (a = 1365 us) (n = 13) (lvls, us = 91, 186, 818, 1836, 3872) >> >> >> Shenandoah: >> Before: >> >> [33.144s][info][gc,stats ] Concurrent Reset = 0.014 s (a = 1067 us) (n = 13) (lvls, us = 139, 277, 898, 1328, 2118) >> >> After: >> >> [33.128s][info][gc,stats ] Concurrent Reset = 0.003 s (a = 225 us) (n = 13) (lvls, us = 32, 92, 137, 295, 542) >> [33.128s][info][gc,stats ] Concurrent Reset After Collect = 0.009 s (a = 661 us) (n = 13) (lvls, us = 92, 160, 594, 896, 1661) >> >> >> Additional changes: >> * Remove `ShenandoahResetBitmapClosure` and `ShenandoahPrepareForMarkClosure`, merge the code with `ShenandoahResetBitmapClosure`, saving one iteration over all the regions. >> * Use API `ShenandoahGeneration::parallel_heap_region_iterate_free` to iterate the regions, two benefits from this: >> - Underneath it calls `ShenandoahHeap::parallel_heap_region_iterate`, which is faster for very light tasks, see https://bugs.openjdk.org/browse/JDK-8337154 >> - `ShenandoahGeneration::parallel_heap_region_iterate_free` decorate the closure with `ShenandoahExcludeRegionClosure`, which simplifies the code in closure. >> * When `_do_old_gc_bootstrap is true`, instead of reset mark bitmap for old gen separately, simply reset the global generations, so we don't need walk the all regions twice. >> * Clean up FullGC code, remove duplicate code. >> >> ... > > Xiaolong Peng has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 17 additional commits since the last revision: > > - Merge branch 'openjdk:master' into reset-bitmap > - Address review comments > - Merge branch 'openjdk:master' into reset-bitmap > - Remove ShenandoahResetUpdateRegionStateClosure > - Always set_mark_incomplete when reset mark bitmap > - Fix > - Add comments > - fix > - Not reset_mark_bitmap after cycle when is_concurrent_old_mark_in_progress or is_prepare_for_old_mark_in_progress > - Not invoke set_mark_incomplete when reset bitmap after cycle > - ... and 7 more: https://git.openjdk.org/jdk/compare/82c2f771...f82fdfaa src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 242: > 240: // Instead of always reset before collect, some reset can be done after collect to save > 241: // the time before before the cycle so the cycle can be started as soon as possible. > 242: entry_reset_after_collect(); For comment, I would say: "Instead of always resetting immediately before the start of a new GC, we can often reset at the end of the previous GC. This allows us to start the next GC cycle more quickly after a trigger condition is detected, reducing the likelihood that GC will degenerate." src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 592: > 590: // If it is old GC bootstrap cycle, always clear bitmap for global gen > 591: // to ensure bitmap for old gen is clear for old GC cycle after this. > 592: if (_do_old_gc_bootstrap) { This may deserve a comment. It seems we ought to clear the old-gen mark bitmap at the end of coalesce-and-fill. But that does not allow us to avoid clearing old-gen mark bitmaps at start of bootstrap because when young-gen regions are promoted in place, the mark bitmap is preserved for those regions, and since they are considered old at the end of the GC cycle during which they were promoted, those bitmaps will not be cleared by op_reset_after_collect(). Is there a way to improve this behavior? For example, in op_reset_after_collect(), maybe we should clear old-gen bitmaps also (at least for recently promoted in place regions) unless old marking is in process and/or mixed evacuations are in progress. Maybe this can be tackled in a separate PR, but would be good to file JBS ticket now if there is agreement on the approach. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22778#discussion_r1907828996 PR Review Comment: https://git.openjdk.org/jdk/pull/22778#discussion_r1907845214 From kdnilsen at openjdk.org Wed Jan 8 20:52:45 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Wed, 8 Jan 2025 20:52:45 GMT Subject: RFR: 8338737: Shenandoah: Reset marking bitmaps after the cycle [v3] In-Reply-To: References: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com> <__kORuPC0guQED9-jn2Xg9CFIJ15wVRojwZoy_VqcPs=.0e5c812f-9e4e-4396-8acd-1e84a5e598c5@github.com>

Message-ID: <6kG3_NLd3D4G9fYnrmdKw-s25Fsmu1rLwOV_6eRDrfI=.cd23eb62-96b9-459f-a313-2d4ae7762284@github.com> On Wed, 8 Jan 2025 20:15:30 GMT, William Kemper wrote: >> I think we are taking about the same thing, old gen could be preempted by young gc and resumed after the cycle. I have seem crash from gc verification caused by this, an old gc was bootstrapped but it was preempted/canceled multiple times right after the old gc started, eventually caused a crash from verifier because it expected the object in young is marked. I will share the gc log on slack later. > > That sounds like an issue with the verifier then? Once a young cycle is complete, nothing should depend on the state of the bitmaps for young regions (if, for no other reason, evacuation could have moved objects so that the bitmaps no longer represent the addresses of marked objects that were evacuated). I agree with @earthling-amzn that we should be able to reset young-generation mark bitmap even if this is old_gc_bootstrap and even if old marking is in progress. We should dive deeper to figure out the crash you observed. It seems we don't fully understand the root cause. I also suggest rewording the comment. trigged? (See other comments about increasing generality of this approach.) ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22778#discussion_r1907848997 From xpeng at openjdk.org Wed Jan 8 21:40:59 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 8 Jan 2025 21:40:59 GMT Subject: RFR: 8338737: Shenandoah: Reset marking bitmaps after the cycle [v3] In-Reply-To: References: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com>

Message-ID: On Wed, 8 Jan 2025 20:43:36 GMT, Kelvin Nilsen wrote: >> Xiaolong Peng has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 17 additional commits since the last revision: >> >> - Merge branch 'openjdk:master' into reset-bitmap >> - Address review comments >> - Merge branch 'openjdk:master' into reset-bitmap >> - Remove ShenandoahResetUpdateRegionStateClosure >> - Always set_mark_incomplete when reset mark bitmap >> - Fix >> - Add comments >> - fix >> - Not reset_mark_bitmap after cycle when is_concurrent_old_mark_in_progress or is_prepare_for_old_mark_in_progress >> - Not invoke set_mark_incomplete when reset bitmap after cycle >> - ... and 7 more: https://git.openjdk.org/jdk/compare/5c258fa2...f82fdfaa > > src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 592: > >> 590: // If it is old GC bootstrap cycle, always clear bitmap for global gen >> 591: // to ensure bitmap for old gen is clear for old GC cycle after this. >> 592: if (_do_old_gc_bootstrap) { > > This may deserve a comment. It seems we ought to clear the old-gen mark bitmap at the end of coalesce-and-fill. But that does not allow us to avoid clearing old-gen mark bitmaps at start of bootstrap because when young-gen regions are promoted in place, the mark bitmap is preserved for those regions, and since they are considered old at the end of the GC cycle during which they were promoted, those bitmaps will not be cleared by op_reset_after_collect(). Is there a way to improve this behavior? > > For example, in op_reset_after_collect(), maybe we should clear old-gen bitmaps also (at least for recently promoted in place regions) unless old marking is in process and/or mixed evacuations are in progress. > > Maybe this can be tackled in a separate PR, but would be good to file JBS ticket now if there is agreement on the approach. Yes, We can reset bimap of old region when there in place promotion and all old regions after coalesce-and-fill for old gen. Thanks Kelvin, I'll create a JBS ticket for this. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22778#discussion_r1907893752 From xpeng at openjdk.org Wed Jan 8 22:58:30 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Wed, 8 Jan 2025 22:58:30 GMT Subject: RFR: 8338737: Shenandoah: Reset marking bitmaps after the cycle [v3] In-Reply-To: References: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com>

Message-ID: On Wed, 8 Jan 2025 20:30:38 GMT, Kelvin Nilsen wrote: >> Xiaolong Peng has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 17 additional commits since the last revision: >> >> - Merge branch 'openjdk:master' into reset-bitmap >> - Address review comments >> - Merge branch 'openjdk:master' into reset-bitmap >> - Remove ShenandoahResetUpdateRegionStateClosure >> - Always set_mark_incomplete when reset mark bitmap >> - Fix >> - Add comments >> - fix >> - Not reset_mark_bitmap after cycle when is_concurrent_old_mark_in_progress or is_prepare_for_old_mark_in_progress >> - Not invoke set_mark_incomplete when reset bitmap after cycle >> - ... and 7 more: https://git.openjdk.org/jdk/compare/099e4ed4...f82fdfaa > > src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 242: > >> 240: // Instead of always reset before collect, some reset can be done after collect to save >> 241: // the time before before the cycle so the cycle can be started as soon as possible. >> 242: entry_reset_after_collect(); > > For comment, I would say: "Instead of always resetting immediately before the start of a new GC, we can often reset at the end of the previous GC. This allows us to start the next GC cycle more quickly after a trigger condition is detected, reducing the likelihood that GC will degenerate." I'll update comments, thanks Kelvin! ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22778#discussion_r1907947027 From wkemper at openjdk.org Wed Jan 8 23:31:51 2025 From: wkemper at openjdk.org (William Kemper) Date: Wed, 8 Jan 2025 23:31:51 GMT Subject: [jdk24] RFR: 8346737: GenShen: Generational memory pools should not report zero for maximum capacity Message-ID: Clean backport. Fixes many SA tests. ------------- Commit messages: - Backport 249f141211c94afcce70d9d536d84e108e07b4e5 Changes: https://git.openjdk.org/jdk/pull/22984/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=22984&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8346737 Stats: 6 lines in 2 files changed: 0 ins; 6 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/22984.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22984/head:pull/22984 PR: https://git.openjdk.org/jdk/pull/22984 From kdnilsen at openjdk.org Thu Jan 9 00:22:35 2025 From: kdnilsen at openjdk.org (Kelvin Nilsen) Date: Thu, 9 Jan 2025 00:22:35 GMT Subject: [jdk24] RFR: 8346737: GenShen: Generational memory pools should not report zero for maximum capacity In-Reply-To: References: Message-ID: On Wed, 8 Jan 2025 23:26:53 GMT, William Kemper wrote: > Clean backport. Fixes many SA tests. Marked as reviewed by kdnilsen (Author). ------------- PR Review: https://git.openjdk.org/jdk/pull/22984#pullrequestreview-2538518161 From ysr at openjdk.org Thu Jan 9 00:59:42 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Thu, 9 Jan 2025 00:59:42 GMT Subject: [jdk24] RFR: 8346737: GenShen: Generational memory pools should not report zero for maximum capacity In-Reply-To: References: Message-ID: <2SlOW_Bx_aU_KyYjRVjHmZcmkgp0M1qjNV29EULbdtk=.41d35bd0-7a53-43cb-89fb-7fa820a49538@github.com> On Wed, 8 Jan 2025 23:26:53 GMT, William Kemper wrote: > Clean backport. Fixes many SA tests. Marked as reviewed by ysr (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/22984#pullrequestreview-2538573609 From dholmes at openjdk.org Thu Jan 9 01:21:15 2025 From: dholmes at openjdk.org (David Holmes) Date: Thu, 9 Jan 2025 01:21:15 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References:

Message-ID: On Tue, 7 Jan 2025 09:13:06 GMT, Aleksey Shipilev wrote: >> src/hotspot/share/interpreter/abstractInterpreter.cpp line 137: >> >>> 135: case vmIntrinsics::_floatToRawIntBits: return java_lang_Float_floatToRawIntBits; >>> 136: case vmIntrinsics::_longBitsToDouble: return java_lang_Double_longBitsToDouble; >>> 137: case vmIntrinsics::_doubleToRawLongBits: return java_lang_Double_doubleToRawLongBits; >> >> Why are these intrinsics for the Java methods disappearing? > > These are interpreter "intrinsics" that are only implemented on x86_32 to handle x87 FPU pecularities. Look around for `TemplateInterpreterGenerator::generate_Float_intBitsToFloat_entry`, for example. Hmmm ... okay ... I see something "special" is done only on x86_32, but what is done seems to have nothing to do with x87 code. Just to be clear these Java methods still get intrinsified, it is just handled in a different way - right? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22567#discussion_r1908061750 From shade at openjdk.org Thu Jan 9 09:38:38 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Thu, 9 Jan 2025 09:38:38 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References:

Message-ID: On Thu, 9 Jan 2025 01:17:49 GMT, David Holmes wrote: >> These are interpreter "intrinsics" that are only implemented on x86_32 to handle x87 FPU pecularities. Look around for `TemplateInterpreterGenerator::generate_Float_intBitsToFloat_entry`, for example. > > Hmmm ... okay ... I see something "special" is done only on x86_32, but what is done seems to have nothing to do with x87 code. > > Just to be clear these Java methods still get intrinsified, it is just handled in a different way - right? It *is* about x87 handling of NaNs, a common problem for x86_32 code in Hotspot, you can read about this mess in [JDK-8076373](https://bugs.openjdk.org/browse/JDK-8076373), if you are interested. If we allow to use native implementations of these conversion methods, we get into trouble with NaNs. What these interpreter intrinsics do on x86_32: going for SSE if available, thus avoiding x87. Since this is a correctness problem, these intrinsics go all the way down to interpreter as well. There is still a gaping hole when SSE is not available, but then we have no choice than to use x87 and have all the relevant issues. But all of this is only a headache for x86_32, all other platforms do not have these interpreter intrinsics implemented. With x86_32 going away, we can finally yank these and relevant scaffolding out. The C1/C2 intrinsics are still up and enabled for supported platforms: those are for performance :) ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22567#discussion_r1908436596 From coleenp at openjdk.org Thu Jan 9 13:34:39 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Thu, 9 Jan 2025 13:34:39 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v2] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com>

Message-ID: On Tue, 7 Jan 2025 08:32:27 GMT, Kim Barrett wrote: >> This is a relic and not the legal copyright that got updated since nobody noticed. Until you did. Removed. > > Not sure we're allowed to remove a copyright statement, even if not in the usual place. put copyright back. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1908806441 From wkemper at openjdk.org Thu Jan 9 17:09:41 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 9 Jan 2025 17:09:41 GMT Subject: [jdk24] Integrated: 8346737: GenShen: Generational memory pools should not report zero for maximum capacity In-Reply-To: References: Message-ID: <0za45SpKfNjBbXsh5lBROU3kBFsbCZ-Cyh8uPoG7Mto=.9bff78a4-6a0a-4824-8757-0ce15eab06f9@github.com> On Wed, 8 Jan 2025 23:26:53 GMT, William Kemper wrote: > Clean backport. Fixes many SA tests. This pull request has now been integrated. Changeset: ff9b8e46 Author: William Kemper URL: https://git.openjdk.org/jdk/commit/ff9b8e4607e28cf2b165f3ff170b17e6b6d8a8a5 Stats: 6 lines in 2 files changed: 0 ins; 6 del; 0 mod 8346737: GenShen: Generational memory pools should not report zero for maximum capacity Reviewed-by: kdnilsen, ysr Backport-of: 249f141211c94afcce70d9d536d84e108e07b4e5 ------------- PR: https://git.openjdk.org/jdk/pull/22984 From wkemper at openjdk.org Thu Jan 9 17:47:26 2025 From: wkemper at openjdk.org (William Kemper) Date: Thu, 9 Jan 2025 17:47:26 GMT Subject: RFR: 8344049: Shenandoah: Eliminate init-update-refs safepoint [v2] In-Reply-To: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> References: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> Message-ID: > Shenandoah typically takes 4 safepoints per GC cycle. Although Shenandoah itself does not spend much time on these safepoints, it may still take quite some time for all of the mutator threads to reach the safepoint. The occasionally long time-to-safepoint increases latency in the higher percentiles. > > The `init-update-refs` safepoint is responsible for retiring GCLABs (and PLABs) used during evacuation. Once evacuation is complete, no threads will access these LABs. This need not be done on a safepoint. `init-update-refs` is also where the global and thread local copies of the `gc_state` are updated. However, here we are turning off the `WEAK_ROOTS` flag _after_ all of the unmarked weak referents have been `nulled` out, so this does not need to happen atomically with respect to the mutators. Neither is it necessary to change the other state flags (EVACUATION, UPDATE_REFS) atomically across all mutators. > > Note that the `init-update-refs` safepoint is still taken if either verification or `ShenandoahPacing` are enabled. William Kemper has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 32 additional commits since the last revision: - Improve comments - Merge remote-tracking branch 'jdk/master' into remove-init-update-refs-safepoint - Fix comments - Fix comment, revert unnecessary change - Merge remote-tracking branch 'jdk/master' into remove-init-update-refs-safepoint - Fix phase encoding to handle weak roots - WIP: Use Threads::threads_do for propagating gc state (consolidated) - WIP: Use Threads::threads_do for propagating gc state - Remove unnecessary gc state propagations - Encapsulate gc state - ... and 22 more: https://git.openjdk.org/jdk/compare/967c77a7...83ac7b49 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/22688/files - new: https://git.openjdk.org/jdk/pull/22688/files/9aaef708..83ac7b49 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=22688&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=22688&range=00-01 Stats: 31726 lines in 2167 files changed: 20848 ins; 5389 del; 5489 mod Patch: https://git.openjdk.org/jdk/pull/22688.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22688/head:pull/22688 PR: https://git.openjdk.org/jdk/pull/22688 From matsaave at openjdk.org Thu Jan 9 19:04:51 2025 From: matsaave at openjdk.org (Matias Saavedra Silva) Date: Thu, 9 Jan 2025 19:04:51 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v5] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: On Tue, 7 Jan 2025 12:51:33 GMT, Coleen Phillimore wrote: >> There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. >> >> Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. > > Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: > > Restore copyright and macro. >From what I've looked at so far it looks good! I noticed there are several cases where you mix format specifiers with macros. I understand that replacing other macros may not be in the scope of this change but I find it inconsistent in places where we have both. I listed out some of the cases below, but if you don't believe this to be necessary you can ignore me. src/hotspot/os/bsd/os_bsd.cpp line 2527: > 2525: "\n\n" > 2526: "Do you want to debug the problem?\n\n" > 2527: "To debug, run 'gdb /proc/%d/exe %d'; then switch to thread %zd (" INTPTR_FORMAT ")\n" There is both `%zd` and `INTPTR_FORMAT` in this line. I think it would be more consistent to convert both to format specifiers here. src/hotspot/os/linux/os_linux.cpp line 5276: > 5274: "\n\n" > 5275: "Do you want to debug the problem?\n\n" > 5276: "To debug, run 'gdb /proc/%d/exe %d'; then switch to thread %zu (" INTPTR_FORMAT ")\n" Same as above src/hotspot/os/windows/os_windows.cpp line 533: > 531: } > 532: > 533: log_info(os, thread)("Thread is alive (tid: %zu, stacksize: " SIZE_FORMAT "k).", os::current_thread_id(), thread->stack_size() / K); Same as above, this time with `SIZE_FORMAT` src/hotspot/os/windows/os_windows.cpp line 618: > 616: thread->set_osthread(osthread); > 617: > 618: log_info(os, thread)("Thread attached (tid: %zu, stack: " This line also mixes format specifiers and macros src/hotspot/os/windows/os_windows.cpp line 3340: > 3338: if (Verbose && PrintMiscellaneous) { > 3339: reserveTimer.stop(); > 3340: tty->print_cr("reserve_memory of %zx bytes took " JLONG_FORMAT " ms (" JLONG_FORMAT " ticks)", bytes, Here too src/hotspot/share/classfile/classLoaderStats.cpp line 115: > 113: Klass* parent_klass = (cls._parent == nullptr ? nullptr : cls._parent->klass()); > 114: > 115: _out->print(INTPTR_FORMAT " " INTPTR_FORMAT " " INTPTR_FORMAT " %6zu " SIZE_FORMAT_W(8) " " SIZE_FORMAT_W(8) " ", Here too src/hotspot/share/classfile/classLoaderStats.cpp line 126: > 124: _out->cr(); > 125: if (cls._hidden_classes_count > 0) { > 126: _out->print_cr(SPACE SPACE SPACE " %6zu " SIZE_FORMAT_W(8) " " SIZE_FORMAT_W(8) " + hidden classes", And here src/hotspot/share/classfile/classLoaderStats.cpp line 140: > 138: _out->print("Total = %-6zu", _total_loaders); > 139: _out->print(SPACE SPACE SPACE " ", "", "", ""); > 140: _out->print_cr("%6zu " SIZE_FORMAT_W(8) " " SIZE_FORMAT_W(8) " ", And here src/hotspot/share/code/vtableStubs.cpp line 82: > 80: > 81: void VtableStub::print_on(outputStream* st) const { > 82: st->print("vtable stub (index = %d, receiver_location = %zd, code = [" INTPTR_FORMAT ", " INTPTR_FORMAT "])", And here ------------- Changes requested by matsaave (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/22916#pullrequestreview-2540706941 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1909299619 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1909300550 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1909300883 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1909301552 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1909301678 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1909303066 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1909303216 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1909303480 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1909303991 From xpeng at openjdk.org Thu Jan 9 19:10:37 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Thu, 9 Jan 2025 19:10:37 GMT Subject: RFR: 8338737: Shenandoah: Reset marking bitmaps after the cycle [v3] In-Reply-To: <6kG3_NLd3D4G9fYnrmdKw-s25Fsmu1rLwOV_6eRDrfI=.cd23eb62-96b9-459f-a313-2d4ae7762284@github.com> References: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com> <__kORuPC0guQED9-jn2Xg9CFIJ15wVRojwZoy_VqcPs=.0e5c812f-9e4e-4396-8acd-1e84a5e598c5@github.com>

<6kG3_NLd3D4G9fYnrmdKw-s25Fsmu1rLwOV_6eRDrfI=.cd23eb62-96b9-459f-a313-2d4ae7762284@github.com> Message-ID: On Wed, 8 Jan 2025 20:47:44 GMT, Kelvin Nilsen wrote: >> That sounds like an issue with the verifier then? Once a young cycle is complete, nothing should depend on the state of the bitmaps for young regions (if, for no other reason, evacuation could have moved objects so that the bitmaps no longer represent the addresses of marked objects that were evacuated). > > I agree with @earthling-amzn that we should be able to reset young-generation mark bitmap even if this is old_gc_bootstrap and even if old marking is in progress. We should dive deeper to figure out the crash you observed. It seems we don't fully understand the root cause. > > I also suggest rewording the comment. trigged? (See other comments about increasing generality of this approach.) I have tested it after removing `if (!_do_old_gc_bootstrap && !heap->is_concurrent_old_mark_in_progress())`, and always get crash in stress test like: # # A fatal error has been detected by the Java Runtime Environment: # # Internal Error (/codebuild/output/src48/src/s3/00/src/hotspot/share/gc/shenandoah/shenandoahVerifier.cpp:1270), pid=1578, tid=1595 # Error: Remembered set violation at init-update-references; clean card should be dirty Referenced from: interior location: 0x00000007f8000008 inside Java heap not in collection set region: | 2528|R |O|BTE 7f8000000, 7f8400000, 7f8400000|TAMS 7f8400000|UWM 7f8400000|U 4096K|T 0B|G 4096K|P 0B|S 0B|L 672B|CP 0 Object: 0x00000007f5dc8b58 - klass 0x0000078000249400 java.lang.invoke.MethodType not allocated after mark start not after update watermark marked strong not marked weak not in collection set age: 8 mark: mark(is_unlocked no_hash age=8) region: | 2519|R |Y|BTE 7f5c00000, 7f6000000, 7f6000000|TAMS 7f6000000|UWM 7f6000000|U 4096K|T 0B|G 4096K|P 0B|S 0B|L 4091K|CP 0 It could be something wrong in remembered set scan, resetting young region bitmaps somehow tickles the issue. I have created another [JBS ticket](https://bugs.openjdk.org/browse/JDK-8347371) to track the issue in remembered set scan, and keep this test for now. I'll update the comments in code. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22778#discussion_r1909312883 From xpeng at openjdk.org Thu Jan 9 19:28:08 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Thu, 9 Jan 2025 19:28:08 GMT Subject: RFR: 8338737: Shenandoah: Reset marking bitmaps after the cycle [v4] In-Reply-To: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com> References: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com> Message-ID: <5u5owTlpSq3Y69GNr2LGLerK6uTR0i0_-rYZ1Q6wrnc=.1af37f2c-00ad-4589-a3d9-666a216fb1af@github.com> > Reset marking bitmaps after collection cycle; for GenShen only do this for young generation, also choose not do this for Degen and full GC since both are running at safepoint, we should leave safepoint as ASAP. > > I have run same workload for 30s with Shenandoah in generational mode and classic mode, average average time of concurrent reset dropped significantly since in most case bitmap for young gen should have been reset after pervious concurrent cycle finishes if there is no need to preserve bitmap states. > > GenShen: > Before: > > [33.342s][info][gc,stats ] Concurrent Reset = 0.023 s (a = 1921 us) (n = 12) (lvls, us = 133, 385, 1191, 1836, 8878) > > > After: > > [33.597s][info][gc,stats ] Concurrent Reset = 0.004 s (a = 317 us) (n = 13) (lvls, us = 58, 119, 217, 410, 670) > [33.597s][info][gc,stats ] Concurrent Reset After Collect = 0.018 s (a = 1365 us) (n = 13) (lvls, us = 91, 186, 818, 1836, 3872) > > > Shenandoah: > Before: > > [33.144s][info][gc,stats ] Concurrent Reset = 0.014 s (a = 1067 us) (n = 13) (lvls, us = 139, 277, 898, 1328, 2118) > > After: > > [33.128s][info][gc,stats ] Concurrent Reset = 0.003 s (a = 225 us) (n = 13) (lvls, us = 32, 92, 137, 295, 542) > [33.128s][info][gc,stats ] Concurrent Reset After Collect = 0.009 s (a = 661 us) (n = 13) (lvls, us = 92, 160, 594, 896, 1661) > > > Additional changes: > * Remove `ShenandoahResetBitmapClosure` and `ShenandoahPrepareForMarkClosure`, merge the code with `ShenandoahResetBitmapClosure`, saving one iteration over all the regions. > * Use API `ShenandoahGeneration::parallel_heap_region_iterate_free` to iterate the regions, two benefits from this: > - Underneath it calls `ShenandoahHeap::parallel_heap_region_iterate`, which is faster for very light tasks, see https://bugs.openjdk.org/browse/JDK-8337154 > - `ShenandoahGeneration::parallel_heap_region_iterate_free` decorate the closure with `ShenandoahExcludeRegionClosure`, which simplifies the code in closure. > * When `_do_old_gc_bootstrap is true`, instead of reset mark bitmap for old gen separately, simply reset the global generations, so we don't need walk the all regions twice. > * Clean up FullGC code, remove duplicate code. > > Additional tests: > - [x] CONF=macosx-aarch64-server-fastdebug make test T... Xiaolong Peng has updated the pull request incrementally with three additional commits since the last revision: - Adding condition "!_do_old_gc_bootstrap && !heap->is_concurrent_old_mark_in_progress()" back and address some PR comments - Remove entry_reset_after_collect from ShenandoahOldGC - Remove condition check !_do_old_gc_bootstrap && !heap->is_concurrent_old_mark_in_progress() from op_reset_after_collect ------------- Changes: - all: https://git.openjdk.org/jdk/pull/22778/files - new: https://git.openjdk.org/jdk/pull/22778/files/f82fdfaa..04299a76 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=22778&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=22778&range=02-03 Stats: 13 lines in 2 files changed: 6 ins; 5 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/22778.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22778/head:pull/22778 PR: https://git.openjdk.org/jdk/pull/22778 From xpeng at openjdk.org Thu Jan 9 19:28:08 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Thu, 9 Jan 2025 19:28:08 GMT Subject: RFR: 8338737: Shenandoah: Reset marking bitmaps after the cycle [v3] In-Reply-To: References: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com>

Message-ID: On Wed, 8 Jan 2025 22:46:12 GMT, Xiaolong Peng wrote: >> src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 242: >> >>> 240: // Instead of always reset before collect, some reset can be done after collect to save >>> 241: // the time before before the cycle so the cycle can be started as soon as possible. >>> 242: entry_reset_after_collect(); >> >> For comment, I would say: "Instead of always resetting immediately before the start of a new GC, we can often reset at the end of the previous GC. This allows us to start the next GC cycle more quickly after a trigger condition is detected, reducing the likelihood that GC will degenerate." > > I'll update comments, thanks Kelvin! Fixed. thanks! ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22778#discussion_r1909330187 From xpeng at openjdk.org Thu Jan 9 19:40:45 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Thu, 9 Jan 2025 19:40:45 GMT Subject: RFR: 8338737: Shenandoah: Reset marking bitmaps after the cycle [v3] In-Reply-To: References: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com>

Message-ID: On Wed, 8 Jan 2025 21:38:16 GMT, Xiaolong Peng wrote: >> src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 592: >> >>> 590: // If it is old GC bootstrap cycle, always clear bitmap for global gen >>> 591: // to ensure bitmap for old gen is clear for old GC cycle after this. >>> 592: if (_do_old_gc_bootstrap) { >> >> This may deserve a comment. It seems we ought to clear the old-gen mark bitmap at the end of coalesce-and-fill. But that does not allow us to avoid clearing old-gen mark bitmaps at start of bootstrap because when young-gen regions are promoted in place, the mark bitmap is preserved for those regions, and since they are considered old at the end of the GC cycle during which they were promoted, those bitmaps will not be cleared by op_reset_after_collect(). Is there a way to improve this behavior? >> >> For example, in op_reset_after_collect(), maybe we should clear old-gen bitmaps also (at least for recently promoted in place regions) unless old marking is in process and/or mixed evacuations are in progress. >> >> Maybe this can be tackled in a separate PR, but would be good to file JBS ticket now if there is agreement on the approach. > > Yes, We can reset bimap of old region when there in place promotion and all old regions after coalesce-and-fill for old gen. > > Thanks Kelvin, I'll create a JBS ticket for this. This is ShenandoahConcurrentGC::op_reset(), it is executed when a cycle starts. I have removed line 361 to 371, which traverse all regions and apply reset for old regions when `_do_old_gc_bootstrap` is true, so it used to iterate regions twice when `_do_old_gc_bootstrap` is true. With this change, it only iterate once and reset bitmap for all regions when `_do_old_gc_bootstrap` is true. Here is the ticket https://bugs.openjdk.org/browse/JDK-8347372 to follow up the possible improvements on old GC. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22778#discussion_r1909344212 From xpeng at openjdk.org Thu Jan 9 19:53:05 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Thu, 9 Jan 2025 19:53:05 GMT Subject: [jdk24] RFR: 8345423: Shenandoah: Parallelize concurrent cleanup Message-ID: Clean backport, improve performance of concurrent cleanup of Shenandoah and GenShen, remove the use of heap lock from concurrent cleanup. ------------- Commit messages: - Backport 4da6fd4283a13be1711e7ad948f1d05a0a9148a5 Changes: https://git.openjdk.org/jdk/pull/22991/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=22991&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8345423 Stats: 228 lines in 13 files changed: 79 ins; 56 del; 93 mod Patch: https://git.openjdk.org/jdk/pull/22991.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22991/head:pull/22991 PR: https://git.openjdk.org/jdk/pull/22991 From coleenp at openjdk.org Thu Jan 9 20:39:41 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Thu, 9 Jan 2025 20:39:41 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v5] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: <3rsYHTsq8K_5SIPzeMJQJFM6HMWNTz7OdCBgVBwUUD8=.f3b67c30-8ecb-4034-b0b7-8396c5f8b531@github.com> On Tue, 7 Jan 2025 12:51:33 GMT, Coleen Phillimore wrote: >> There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. >> >> Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. > > Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: > > Restore copyright and macro. The intention is to keep INTPTR_FORMAT and some of the other format specifiers that vary by platform. I have another issue to remove the SIZE_FORMAT ones but that's a bigger change. So this mixture is intentional. JLONG_FORMAT might be something we can remove too but I didn't want to do it all at once. ------------- PR Comment: https://git.openjdk.org/jdk/pull/22916#issuecomment-2581199763 From matsaave at openjdk.org Thu Jan 9 21:52:47 2025 From: matsaave at openjdk.org (Matias Saavedra Silva) Date: Thu, 9 Jan 2025 21:52:47 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v5] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: <2NN6jS-4TNxlwq8K0ovl2o9A3ZdCsTVJJ6NcOWDh-P8=.069b6da4-4c08-4cc6-9532-2b1f96a1793a@github.com> On Tue, 7 Jan 2025 12:51:33 GMT, Coleen Phillimore wrote: >> There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. >> >> Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. > > Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: > > Restore copyright and macro. Looks good! I saw the discussion on `UINTPTR_FORMAT_X_0` so I left it alone. src/hotspot/share/runtime/objectMonitor.cpp line 2500: > 2498: // The minimal things to print for markWord printing, more can be added for debugging and logging. > 2499: st->print("{contentions=0x%08x,waiters=0x%08x" > 2500: ",recursions=%zd,owner=" INT64_FORMAT "}", Is `INT64_FORMAT` different from `INTX_FORMAT`? ------------- Marked as reviewed by matsaave (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/22916#pullrequestreview-2540981143 PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1909469703 From kbarrett at openjdk.org Thu Jan 9 22:00:59 2025 From: kbarrett at openjdk.org (Kim Barrett) Date: Thu, 9 Jan 2025 22:00:59 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v5] In-Reply-To: <2NN6jS-4TNxlwq8K0ovl2o9A3ZdCsTVJJ6NcOWDh-P8=.069b6da4-4c08-4cc6-9532-2b1f96a1793a@github.com> References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> <2NN6jS-4TNxlwq8K0ovl2o9A3ZdCsTVJJ6NcOWDh-P8=.069b6da4-4c08-4cc6-9532-2b1f96a1793a@github.com> Message-ID: On Thu, 9 Jan 2025 21:47:47 GMT, Matias Saavedra Silva wrote: >> Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: >> >> Restore copyright and macro. > > src/hotspot/share/runtime/objectMonitor.cpp line 2500: > >> 2498: // The minimal things to print for markWord printing, more can be added for debugging and logging. >> 2499: st->print("{contentions=0x%08x,waiters=0x%08x" >> 2500: ",recursions=%zd,owner=" INT64_FORMAT "}", > > Is `INT64_FORMAT` different from `INTX_FORMAT`? Currently yes. The type underlying [u]intx varies by platform, being a 32-bit type on 32-bit platforms and a 64-bit type on 64-bit platforms. We've been trimming the set of supported 32-bit platforms though, so maybe someday we won't need that distinction any more. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1909478987 From xpeng at openjdk.org Thu Jan 9 22:44:45 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Thu, 9 Jan 2025 22:44:45 GMT Subject: [jdk24] Withdrawn: 8345423: Shenandoah: Parallelize concurrent cleanup In-Reply-To: References: Message-ID: On Wed, 8 Jan 2025 23:57:36 GMT, Xiaolong Peng wrote: > Clean backport, improve performance of concurrent cleanup of Shenandoah and GenShen, remove the use of heap lock from concurrent cleanup. This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.org/jdk/pull/22991 From ysr at openjdk.org Fri Jan 10 00:42:47 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Fri, 10 Jan 2025 00:42:47 GMT Subject: RFR: 8344049: Shenandoah: Eliminate init-update-refs safepoint [v2] In-Reply-To: References: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> Message-ID: On Thu, 9 Jan 2025 17:47:26 GMT, William Kemper wrote: >> Shenandoah typically takes 4 safepoints per GC cycle. Although Shenandoah itself does not spend much time on these safepoints, it may still take quite some time for all of the mutator threads to reach the safepoint. The occasionally long time-to-safepoint increases latency in the higher percentiles. >> >> The `init-update-refs` safepoint is responsible for retiring GCLABs (and PLABs) used during evacuation. Once evacuation is complete, no threads will access these LABs. This need not be done on a safepoint. `init-update-refs` is also where the global and thread local copies of the `gc_state` are updated. However, here we are turning off the `WEAK_ROOTS` flag _after_ all of the unmarked weak referents have been `nulled` out, so this does not need to happen atomically with respect to the mutators. Neither is it necessary to change the other state flags (EVACUATION, UPDATE_REFS) atomically across all mutators. >> >> Note that the `init-update-refs` safepoint is still taken if either verification or `ShenandoahPacing` are enabled. > > William Kemper has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 32 additional commits since the last revision: > > - Improve comments > - Merge remote-tracking branch 'jdk/master' into remove-init-update-refs-safepoint > - Fix comments > - Fix comment, revert unnecessary change > - Merge remote-tracking branch 'jdk/master' into remove-init-update-refs-safepoint > - Fix phase encoding to handle weak roots > - WIP: Use Threads::threads_do for propagating gc state (consolidated) > - WIP: Use Threads::threads_do for propagating gc state > - Remove unnecessary gc state propagations > - Encapsulate gc state > - ... and 22 more: https://git.openjdk.org/jdk/compare/e0773235...83ac7b49 src/hotspot/share/gc/shenandoah/shenandoahHeap.hpp line 368: > 366: // This updates the singular, global gc state. This call must happen on a safepoint. > 367: // However, in some cases (init update refs, e.g.), the gc state may change concurrently > 368: // and will be propagated to all threads by a handshake operation. I am a little bit confused by the statement starting at "However, ...". Did you mean that the "local copy of the global state" may be changed outside of a safepoint but not the global state itself? I notice that `set_gc_state()` still asserts that we are at a safepoint: https://github.com/openjdk/jdk/blob/83ac7b49d34081beb3ff58f1c159d22faacd077a/src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp#L2000 Ah, now I see that you use a different API for setting the global gc state outside of a safepoint. If my understanding is correct, then we should probably rename the APIs such that the one that is expected to be set at a safepoint uses `set_gc_state_at_safepoint()` and the one that doesn't might use `set_gc_state_concurrent()` or something like that. That would be less confusing. It also brings up the issue of what specific state predicates it's safe to test when. E.g. whether `is_gc_state()` can be safely tested any time during a safepoint or concurrently. I think it is safe, but explicitly stating this might be useful, not least because we seem to have one state change API that still asserts that we should be at a safepoint. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1909623411 From dholmes at openjdk.org Fri Jan 10 07:21:42 2025 From: dholmes at openjdk.org (David Holmes) Date: Fri, 10 Jan 2025 07:21:42 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References:

Message-ID: On Thu, 9 Jan 2025 09:36:07 GMT, Aleksey Shipilev wrote: >> Hmmm ... okay ... I see something "special" is done only on x86_32, but what is done seems to have nothing to do with x87 code. >> >> Just to be clear these Java methods still get intrinsified, it is just handled in a different way - right? > > It *is* about x87 handling of NaNs, a common problem for x86_32 code in Hotspot, you can read about this mess in [JDK-8076373](https://bugs.openjdk.org/browse/JDK-8076373), if you are interested. If we allow to use native implementations of these conversion methods, we get into trouble with NaNs. What these interpreter intrinsics do on x86_32: going for SSE if available, thus avoiding x87. Since this is a correctness problem, these intrinsics go all the way down to interpreter as well. There is still a gaping hole when SSE is not available, but then we have no choice than to use x87 and have all the relevant issues. > > But all of this is only a headache for x86_32, all other platforms do not have these interpreter intrinsics implemented. With x86_32 going away, we can finally yank these and relevant scaffolding out. > > The C1/C2 intrinsics are still up and enabled for supported platforms: those are for performance :) Okay now I get. Thanks ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22567#discussion_r1909928264 From coleenp at openjdk.org Fri Jan 10 12:57:51 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Fri, 10 Jan 2025 12:57:51 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v2] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com>

Message-ID: On Sat, 4 Jan 2025 09:41:29 GMT, Kim Barrett wrote: >> Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: >> >> Fix %Ix to %zx. > > src/hotspot/share/oops/klass.cpp line 1308: > >> 1306: if (secondary_supers() != nullptr) { >> 1307: st->print(" - "); st->print("%d elements;", _secondary_supers->length()); >> 1308: st->print_cr(" bitmap: " LP64_ONLY("0x%016zu") NOT_LP64("0x%08zu"), _secondary_supers_bitmap); > > Same as in instanceKlass - maybe this shouldn't be changed at all. I restored this. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22916#discussion_r1910340969 From coleenp at openjdk.org Fri Jan 10 13:32:40 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Fri, 10 Jan 2025 13:32:40 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References: Message-ID: On Thu, 5 Dec 2024 08:26:10 GMT, Aleksey Shipilev wrote: > **NOTE: This is work-in-progress draft for interested parties. The JEP is not even submitted, let alone targeted.** > > My plan is to to get this done in a quiet time in mainline to limit the ongoing conflicts with mainline. Feel free to comment in this PR, if you see something ahead of time. These comments might adjust the trajectory we take to implement this removal and/or allows us submit and work out more RFEs ahead of this removal. I plan to re-open a clean PR after this preliminary PR is done, maybe after the round of preliminary reviews. > > This removes the 32-bit x86 port and does a deeper cleaning in Hotspot. The following paragraphs describe what and why was being done. > > Easy stuff first: all files named `*_x86_32` are gone. Those are only built when build system knows we are compiling for x86_32. There is therefore no impact on x86_64. > > The code under `!LP64`, `!AMD64` and `IA32` is removed in `x86`-specific files. There is quite a bit of the code, especially around `Assembler` and `MacroAssembler`. I think these removals make the whole thing cleaner. The downside is that some of the `MacroAssembler::*ptr` functions that were used to select the "machine pointer" instructions either from x86_64 or x86_32 are now exclusively for x86_64. I don't think we want to rewrite `*ptr` -> `*q` at this point. I think we gradually morph the code base to use `*q`-flavored methods in new code. > > x86_32 is the only platform that has special cases for x87 FPU. > > C1 even implements the whole separate thing to deal with x87 FPU: the parts of regalloc treat it specially, there is `FpuStackSim`, there is `VerifyFPU` family of flags, etc. There are also peculiarities with FP conversions that use FPU, that's why x86_32 used to have template interpreter stubs for FP conversion methods. None of that is needed anymore without x86_32. This cleans up some arch-specific code as well. > > Both C1 and C2 implement the workarounds for non-IEEE compliant rounding of x87 FPU. After x86_32 is gone, these are not needed anymore. This removes some C2 nodes, removes the rounding instructions in C1. > > x86_64 is baselined on SSE2+, the VM would not even start if SSE2 is not supported. Most of the checks that we have for `UseSSE < 2` are for the benefit of x86_32. Because of this I folded redundant `UseSSE` checks around Hotspot. > > The one thing I _deliberately_ avoided doing is merging `x86.ad` and `x86_64.ad`. It would likely introduce uncomfortable amount of conflicts with pending work in mainli... I reviewed the template interpreter changes. They look great. src/hotspot/cpu/x86/templateTable_x86.cpp line 330: > 328: void TemplateTable::dconst(int value) { > 329: transition(vtos, dtos); > 330: if (UseSSE >= 2) { I admit that I don't know what UseSSE is but now this is unconditional? Is there a further cleanup necessary for this option? ------------- PR Review: https://git.openjdk.org/jdk/pull/22567#pullrequestreview-2542434532 PR Review Comment: https://git.openjdk.org/jdk/pull/22567#discussion_r1910374250 From shade at openjdk.org Fri Jan 10 13:57:46 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Fri, 10 Jan 2025 13:57:46 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References:

Message-ID: On Fri, 10 Jan 2025 13:23:46 GMT, Coleen Phillimore wrote: >> **NOTE: This is work-in-progress draft for interested parties. The JEP is not even submitted, let alone targeted.** >> >> My plan is to to get this done in a quiet time in mainline to limit the ongoing conflicts with mainline. Feel free to comment in this PR, if you see something ahead of time. These comments might adjust the trajectory we take to implement this removal and/or allows us submit and work out more RFEs ahead of this removal. I plan to re-open a clean PR after this preliminary PR is done, maybe after the round of preliminary reviews. >> >> This removes the 32-bit x86 port and does a deeper cleaning in Hotspot. The following paragraphs describe what and why was being done. >> >> Easy stuff first: all files named `*_x86_32` are gone. Those are only built when build system knows we are compiling for x86_32. There is therefore no impact on x86_64. >> >> The code under `!LP64`, `!AMD64` and `IA32` is removed in `x86`-specific files. There is quite a bit of the code, especially around `Assembler` and `MacroAssembler`. I think these removals make the whole thing cleaner. The downside is that some of the `MacroAssembler::*ptr` functions that were used to select the "machine pointer" instructions either from x86_64 or x86_32 are now exclusively for x86_64. I don't think we want to rewrite `*ptr` -> `*q` at this point. I think we gradually morph the code base to use `*q`-flavored methods in new code. >> >> x86_32 is the only platform that has special cases for x87 FPU. >> >> C1 even implements the whole separate thing to deal with x87 FPU: the parts of regalloc treat it specially, there is `FpuStackSim`, there is `VerifyFPU` family of flags, etc. There are also peculiarities with FP conversions that use FPU, that's why x86_32 used to have template interpreter stubs for FP conversion methods. None of that is needed anymore without x86_32. This cleans up some arch-specific code as well. >> >> Both C1 and C2 implement the workarounds for non-IEEE compliant rounding of x87 FPU. After x86_32 is gone, these are not needed anymore. This removes some C2 nodes, removes the rounding instructions in C1. >> >> x86_64 is baselined on SSE2+, the VM would not even start if SSE2 is not supported. Most of the checks that we have for `UseSSE < 2` are for the benefit of x86_32. Because of this I folded redundant `UseSSE` checks around Hotspot. >> >> The one thing I _deliberately_ avoided doing is merging `x86.ad` and `x86_64.ad`. It would likely introduce uncomfortable amount of... > > src/hotspot/cpu/x86/templateTable_x86.cpp line 330: > >> 328: void TemplateTable::dconst(int value) { >> 329: transition(vtos, dtos); >> 330: if (UseSSE >= 2) { > > I admit that I don't know what UseSSE is but now this is unconditional? Is there a further cleanup necessary for this option? Yes, now it is unconditional. x86_64 [requires](https://github.com/openjdk/jdk/blob/ec7393e9190c1b93ca08e1107f734c869f400b89/src/hotspot/cpu/x86/vm_version_x86.cpp#L896-L903) UseSSE >= 2. Only x86_32 cared about UseSSE < 2, so now we can eliminate these checks. I think I got the majority, if not all of the cases where these checks are now redundant: there are more in various assemblers and compiler code. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22567#discussion_r1910408581 From coleenp at openjdk.org Fri Jan 10 16:24:47 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Fri, 10 Jan 2025 16:24:47 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References:

Message-ID: On Fri, 10 Jan 2025 13:54:32 GMT, Aleksey Shipilev wrote: >> src/hotspot/cpu/x86/templateTable_x86.cpp line 330: >> >>> 328: void TemplateTable::dconst(int value) { >>> 329: transition(vtos, dtos); >>> 330: if (UseSSE >= 2) { >> >> I admit that I don't know what UseSSE is but now this is unconditional? Is there a further cleanup necessary for this option? > > Yes, now it is unconditional. x86_64 [requires](https://github.com/openjdk/jdk/blob/ec7393e9190c1b93ca08e1107f734c869f400b89/src/hotspot/cpu/x86/vm_version_x86.cpp#L896-L903) UseSSE >= 2. Only x86_32 cared about UseSSE < 2, so now we can eliminate these checks. I think I got the majority, if not all of the cases where these checks are now redundant: there are more in various assemblers and compiler code. Maybe this should change from range (2,4) then. product(int, UseSSE, 4, \ "Highest supported SSE instructions set on x86/x64") \ range(0, 4) \ ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22567#discussion_r1910614336 From xpeng at openjdk.org Fri Jan 10 17:08:10 2025 From: xpeng at openjdk.org (Xiaolong Peng) Date: Fri, 10 Jan 2025 17:08:10 GMT Subject: RFR: 8338737: Shenandoah: Reset marking bitmaps after the cycle [v5] In-Reply-To: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com> References: <6duTgo8vKHyCUnasOsrHp341B2krxcK8jNogKjX09gs=.af63669e-9c8d-4f17-b055-bf3a03a9618e@github.com> Message-ID: > Reset marking bitmaps after collection cycle; for GenShen only do this for young generation, also choose not do this for Degen and full GC since both are running at safepoint, we should leave safepoint as ASAP. > > I have run same workload for 30s with Shenandoah in generational mode and classic mode, average average time of concurrent reset dropped significantly since in most case bitmap for young gen should have been reset after pervious concurrent cycle finishes if there is no need to preserve bitmap states. > > GenShen: > Before: > > [33.342s][info][gc,stats ] Concurrent Reset = 0.023 s (a = 1921 us) (n = 12) (lvls, us = 133, 385, 1191, 1836, 8878) > > > After: > > [33.597s][info][gc,stats ] Concurrent Reset = 0.004 s (a = 317 us) (n = 13) (lvls, us = 58, 119, 217, 410, 670) > [33.597s][info][gc,stats ] Concurrent Reset After Collect = 0.018 s (a = 1365 us) (n = 13) (lvls, us = 91, 186, 818, 1836, 3872) > > > Shenandoah: > Before: > > [33.144s][info][gc,stats ] Concurrent Reset = 0.014 s (a = 1067 us) (n = 13) (lvls, us = 139, 277, 898, 1328, 2118) > > After: > > [33.128s][info][gc,stats ] Concurrent Reset = 0.003 s (a = 225 us) (n = 13) (lvls, us = 32, 92, 137, 295, 542) > [33.128s][info][gc,stats ] Concurrent Reset After Collect = 0.009 s (a = 661 us) (n = 13) (lvls, us = 92, 160, 594, 896, 1661) > > > Additional changes: > * Remove `ShenandoahResetBitmapClosure` and `ShenandoahPrepareForMarkClosure`, merge the code with `ShenandoahResetBitmapClosure`, saving one iteration over all the regions. > * Use API `ShenandoahGeneration::parallel_heap_region_iterate_free` to iterate the regions, two benefits from this: > - Underneath it calls `ShenandoahHeap::parallel_heap_region_iterate`, which is faster for very light tasks, see https://bugs.openjdk.org/browse/JDK-8337154 > - `ShenandoahGeneration::parallel_heap_region_iterate_free` decorate the closure with `ShenandoahExcludeRegionClosure`, which simplifies the code in closure. > * When `_do_old_gc_bootstrap is true`, instead of reset mark bitmap for old gen separately, simply reset the global generations, so we don't need walk the all regions twice. > * Clean up FullGC code, remove duplicate code. > > Additional tests: > - [x] CONF=macosx-aarch64-server-fastdebug make test T... Xiaolong Peng has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 21 additional commits since the last revision: - Merge branch 'openjdk:master' into reset-bitmap - Adding condition "!_do_old_gc_bootstrap && !heap->is_concurrent_old_mark_in_progress()" back and address some PR comments - Remove entry_reset_after_collect from ShenandoahOldGC - Remove condition check !_do_old_gc_bootstrap && !heap->is_concurrent_old_mark_in_progress() from op_reset_after_collect - Merge branch 'openjdk:master' into reset-bitmap - Address review comments - Merge branch 'openjdk:master' into reset-bitmap - Remove ShenandoahResetUpdateRegionStateClosure - Always set_mark_incomplete when reset mark bitmap - Fix - ... and 11 more: https://git.openjdk.org/jdk/compare/15c49f96...5a181473 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/22778/files - new: https://git.openjdk.org/jdk/pull/22778/files/04299a76..5a181473 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=22778&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=22778&range=03-04 Stats: 20964 lines in 584 files changed: 5664 ins; 12721 del; 2579 mod Patch: https://git.openjdk.org/jdk/pull/22778.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22778/head:pull/22778 PR: https://git.openjdk.org/jdk/pull/22778 From ihse at openjdk.org Fri Jan 10 17:17:46 2025 From: ihse at openjdk.org (Magnus Ihse Bursie) Date: Fri, 10 Jan 2025 17:17:46 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References: Message-ID: On Thu, 5 Dec 2024 08:26:10 GMT, Aleksey Shipilev wrote: > **NOTE: This is work-in-progress draft for interested parties. The JEP is not even submitted, let alone targeted.** > > My plan is to to get this done in a quiet time in mainline to limit the ongoing conflicts with mainline. Feel free to comment in this PR, if you see something ahead of time. These comments might adjust the trajectory we take to implement this removal and/or allows us submit and work out more RFEs ahead of this removal. I plan to re-open a clean PR after this preliminary PR is done, maybe after the round of preliminary reviews. > > This removes the 32-bit x86 port and does a deeper cleaning in Hotspot. The following paragraphs describe what and why was being done. > > Easy stuff first: all files named `*_x86_32` are gone. Those are only built when build system knows we are compiling for x86_32. There is therefore no impact on x86_64. > > The code under `!LP64`, `!AMD64` and `IA32` is removed in `x86`-specific files. There is quite a bit of the code, especially around `Assembler` and `MacroAssembler`. I think these removals make the whole thing cleaner. The downside is that some of the `MacroAssembler::*ptr` functions that were used to select the "machine pointer" instructions either from x86_64 or x86_32 are now exclusively for x86_64. I don't think we want to rewrite `*ptr` -> `*q` at this point. I think we gradually morph the code base to use `*q`-flavored methods in new code. > > x86_32 is the only platform that has special cases for x87 FPU. > > C1 even implements the whole separate thing to deal with x87 FPU: the parts of regalloc treat it specially, there is `FpuStackSim`, there is `VerifyFPU` family of flags, etc. There are also peculiarities with FP conversions that use FPU, that's why x86_32 used to have template interpreter stubs for FP conversion methods. None of that is needed anymore without x86_32. This cleans up some arch-specific code as well. > > Both C1 and C2 implement the workarounds for non-IEEE compliant rounding of x87 FPU. After x86_32 is gone, these are not needed anymore. This removes some C2 nodes, removes the rounding instructions in C1. > > x86_64 is baselined on SSE2+, the VM would not even start if SSE2 is not supported. Most of the checks that we have for `UseSSE < 2` are for the benefit of x86_32. Because of this I folded redundant `UseSSE` checks around Hotspot. > > The one thing I _deliberately_ avoided doing is merging `x86.ad` and `x86_64.ad`. It would likely introduce uncomfortable amount of conflicts with pending work in mainli... Don't forget the 32-bit x86 classes under `src/jdk.hotspot.agent/share/classes/sun/jvm/hotspot`. There might be other x86-specific code in other JDK libraries as well, and not just in Hotspot. ------------- PR Comment: https://git.openjdk.org/jdk/pull/22567#issuecomment-2583294294 From shade at openjdk.org Fri Jan 10 18:23:49 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Fri, 10 Jan 2025 18:23:49 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References:

Message-ID: On Fri, 10 Jan 2025 16:22:06 GMT, Coleen Phillimore wrote: >> Yes, now it is unconditional. x86_64 [requires](https://github.com/openjdk/jdk/blob/ec7393e9190c1b93ca08e1107f734c869f400b89/src/hotspot/cpu/x86/vm_version_x86.cpp#L896-L903) UseSSE >= 2. Only x86_32 cared about UseSSE < 2, so now we can eliminate these checks. I think I got the majority, if not all of the cases where these checks are now redundant: there are more in various assemblers and compiler code. > > Maybe this should change from range (2,4) then. > product(int, UseSSE, 4, \ > "Highest supported SSE instructions set on x86/x64") \ > range(0, 4) \ Right. Now that I am thinking more deeply about it, maybe that would be a first step here: lift UseSSE >= 2 for x86_32 ahead of this JEP, eliminate all UseSSE < 2 parts. I can see how intrusive this gets. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22567#discussion_r1910843574 From kbarrett at openjdk.org Fri Jan 10 18:33:47 2025 From: kbarrett at openjdk.org (Kim Barrett) Date: Fri, 10 Jan 2025 18:33:47 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References:

Message-ID: On Fri, 10 Jan 2025 18:21:25 GMT, Aleksey Shipilev wrote: >> Maybe this should change from range (2,4) then. >> product(int, UseSSE, 4, \ >> "Highest supported SSE instructions set on x86/x64") \ >> range(0, 4) \ > > Right. Now that I am thinking more deeply about it, maybe that would be a first step here: lift UseSSE >= 2 for x86_32 ahead of this JEP, eliminate all UseSSE < 2 parts. I can see how intrusive this gets. [not reviewing, just a drive-by comment] Does UseSSE < 2 provide a way to _avoid_ using relevant parts of SSE on x86_64, perhaps for debugging? Or does x86_64 effectively hard-wire UseSSE >= 2? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22567#discussion_r1910878630 From wkemper at openjdk.org Fri Jan 10 19:35:46 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 10 Jan 2025 19:35:46 GMT Subject: RFR: 8344049: Shenandoah: Eliminate init-update-refs safepoint [v2] In-Reply-To: References: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com>

Message-ID: On Fri, 10 Jan 2025 00:40:26 GMT, Y. Srinivas Ramakrishna wrote: >> William Kemper has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 32 additional commits since the last revision: >> >> - Improve comments >> - Merge remote-tracking branch 'jdk/master' into remove-init-update-refs-safepoint >> - Fix comments >> - Fix comment, revert unnecessary change >> - Merge remote-tracking branch 'jdk/master' into remove-init-update-refs-safepoint >> - Fix phase encoding to handle weak roots >> - WIP: Use Threads::threads_do for propagating gc state (consolidated) >> - WIP: Use Threads::threads_do for propagating gc state >> - Remove unnecessary gc state propagations >> - Encapsulate gc state >> - ... and 22 more: https://git.openjdk.org/jdk/compare/34b4faa5...83ac7b49 > > src/hotspot/share/gc/shenandoah/shenandoahHeap.hpp line 368: > >> 366: // This updates the singular, global gc state. This call must happen on a safepoint. >> 367: // However, in some cases (init update refs, e.g.), the gc state may change concurrently >> 368: // and will be propagated to all threads by a handshake operation. > > I am a little bit confused by the statement starting at "However, ...". Did you mean that the "local copy of the global state" may be changed outside of a safepoint but not the global state itself? > I notice that `set_gc_state()` still asserts that we are at a safepoint: > https://github.com/openjdk/jdk/blob/83ac7b49d34081beb3ff58f1c159d22faacd077a/src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp#L2000 > > Ah, now I see that you use a different API for setting the global gc state outside of a safepoint. > > If my understanding is correct, then we should probably rename the APIs such that the one that is expected to be set at a safepoint uses `set_gc_state_at_safepoint()` and the one that doesn't might use `set_gc_state_concurrent()` or something like that. That would be less confusing. > > It also brings up the issue of what specific state predicates it's safe to test when. E.g. whether `is_gc_state()` can be safely tested any time during a safepoint or concurrently. I think it is safe, but explicitly stating this might be useful, not least because we seem to have one state change API that still asserts that we should be at a safepoint. It is always safe to call `is_gc_state` because it uses the most recently changed value. That is, if the `_gc_state` changes on a safepoint, `is_gc_state` will use the value set during the safepoint. Otherwise, it uses the value which was either changed concurrently through the thread local handshake (init-update-refs) or the value which was propagated to all threads at the end of the safepoint. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1911094315 From wkemper at openjdk.org Fri Jan 10 19:49:17 2025 From: wkemper at openjdk.org (William Kemper) Date: Fri, 10 Jan 2025 19:49:17 GMT Subject: RFR: 8344049: Shenandoah: Eliminate init-update-refs safepoint [v3] In-Reply-To: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> References: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> Message-ID: > Shenandoah typically takes 4 safepoints per GC cycle. Although Shenandoah itself does not spend much time on these safepoints, it may still take quite some time for all of the mutator threads to reach the safepoint. The occasionally long time-to-safepoint increases latency in the higher percentiles. > > The `init-update-refs` safepoint is responsible for retiring GCLABs (and PLABs) used during evacuation. Once evacuation is complete, no threads will access these LABs. This need not be done on a safepoint. `init-update-refs` is also where the global and thread local copies of the `gc_state` are updated. However, here we are turning off the `WEAK_ROOTS` flag _after_ all of the unmarked weak referents have been `nulled` out, so this does not need to happen atomically with respect to the mutators. Neither is it necessary to change the other state flags (EVACUATION, UPDATE_REFS) atomically across all mutators. > > Note that the `init-update-refs` safepoint is still taken if either verification or `ShenandoahPacing` are enabled. William Kemper has updated the pull request incrementally with one additional commit since the last revision: Improve comments and method names ------------- Changes: - all: https://git.openjdk.org/jdk/pull/22688/files - new: https://git.openjdk.org/jdk/pull/22688/files/83ac7b49..89c20a14 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=22688&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=22688&range=01-02 Stats: 14 lines in 2 files changed: 2 ins; 2 del; 10 mod Patch: https://git.openjdk.org/jdk/pull/22688.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22688/head:pull/22688 PR: https://git.openjdk.org/jdk/pull/22688 From kvn at openjdk.org Fri Jan 10 20:28:49 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Fri, 10 Jan 2025 20:28:49 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References:

Message-ID: On Fri, 10 Jan 2025 18:30:04 GMT, Kim Barrett wrote: >> Right. Now that I am thinking more deeply about it, maybe that would be a first step here: lift UseSSE >= 2 for x86_32 ahead of this JEP, eliminate all UseSSE < 2 parts. I can see how intrusive this gets. > > [not reviewing, just a drive-by comment] Does UseSSE < 2 provide a way to _avoid_ using relevant parts of > SSE on x86_64, perhaps for debugging? Or does x86_64 effectively hard-wire UseSSE >= 2? By default all 64-bits x86 CPU (starting from AMD64) supports all instructions up to SSE2. 32-bit x86 CPU may not support SSE2. We can generated sse1 or use FPU instructions in 64-bit VM but we decided not to do that - SSE2 instructions version were much easier to use. We purged all uses of FPU in JDK 15: [JDK-7175279](https://bugs.openjdk.org/browse/JDK-7175279) by using SSE set of instructions because we did not want to mess (save/restore state) with FPU anymore in 64-bit VM. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22567#discussion_r1911258635 From vlivanov at openjdk.org Fri Jan 10 20:28:49 2025 From: vlivanov at openjdk.org (Vladimir Ivanov) Date: Fri, 10 Jan 2025 20:28:49 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References: Message-ID: On Thu, 5 Dec 2024 08:26:10 GMT, Aleksey Shipilev wrote: > **NOTE: This is work-in-progress draft for interested parties. The JEP is not even submitted, let alone targeted.** > > My plan is to to get this done in a quiet time in mainline to limit the ongoing conflicts with mainline. Feel free to comment in this PR, if you see something ahead of time. These comments might adjust the trajectory we take to implement this removal and/or allows us submit and work out more RFEs ahead of this removal. I plan to re-open a clean PR after this preliminary PR is done, maybe after the round of preliminary reviews. > > This removes the 32-bit x86 port and does a deeper cleaning in Hotspot. The following paragraphs describe what and why was being done. > > Easy stuff first: all files named `*_x86_32` are gone. Those are only built when build system knows we are compiling for x86_32. There is therefore no impact on x86_64. > > The code under `!LP64`, `!AMD64` and `IA32` is removed in `x86`-specific files. There is quite a bit of the code, especially around `Assembler` and `MacroAssembler`. I think these removals make the whole thing cleaner. The downside is that some of the `MacroAssembler::*ptr` functions that were used to select the "machine pointer" instructions either from x86_64 or x86_32 are now exclusively for x86_64. I don't think we want to rewrite `*ptr` -> `*q` at this point. I think we gradually morph the code base to use `*q`-flavored methods in new code. > > x86_32 is the only platform that has special cases for x87 FPU. > > C1 even implements the whole separate thing to deal with x87 FPU: the parts of regalloc treat it specially, there is `FpuStackSim`, there is `VerifyFPU` family of flags, etc. There are also peculiarities with FP conversions that use FPU, that's why x86_32 used to have template interpreter stubs for FP conversion methods. None of that is needed anymore without x86_32. This cleans up some arch-specific code as well. > > Both C1 and C2 implement the workarounds for non-IEEE compliant rounding of x87 FPU. After x86_32 is gone, these are not needed anymore. This removes some C2 nodes, removes the rounding instructions in C1. > > x86_64 is baselined on SSE2+, the VM would not even start if SSE2 is not supported. Most of the checks that we have for `UseSSE < 2` are for the benefit of x86_32. Because of this I folded redundant `UseSSE` checks around Hotspot. > > The one thing I _deliberately_ avoided doing is merging `x86.ad` and `x86_64.ad`. It would likely introduce uncomfortable amount of conflicts with pending work in mainli... Personally, I'd prefer to see initial x86-32 removal changeset as straighforward as possible: x86-32-specific files, plus (optionally) x86-32-specific code in x86-specific files. IMO it's better to cover the rest (getting rid of unused features after x86-32 removal) as follow-up cleanups. ------------- PR Comment: https://git.openjdk.org/jdk/pull/22567#issuecomment-2584013803 From kvn at openjdk.org Fri Jan 10 20:33:46 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Fri, 10 Jan 2025 20:33:46 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References:

Message-ID: On Fri, 10 Jan 2025 20:23:28 GMT, Vladimir Kozlov wrote: >> [not reviewing, just a drive-by comment] Does UseSSE < 2 provide a way to _avoid_ using relevant parts of >> SSE on x86_64, perhaps for debugging? Or does x86_64 effectively hard-wire UseSSE >= 2? > > By default all 64-bits x86 CPU (starting from AMD64) supports all instructions up to SSE2. 32-bit x86 CPU may not support SSE2. > > We can generated sse1 or use FPU instructions in 64-bit VM but we decided not to do that - SSE2 instructions version were much easier to use. We purged all uses of FPU in JDK 15: [JDK-7175279](https://bugs.openjdk.org/browse/JDK-7175279) by using SSE set of instructions because we did not want to mess (save/restore state) with FPU anymore in 64-bit VM. I think there are several places in 64-bit VM where we assume SSE2 instructions are always available. So if you set `UseSSE=1 or = 0` in debugger VM may crash. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22567#discussion_r1911279275 From ysr at openjdk.org Sat Jan 11 02:04:49 2025 From: ysr at openjdk.org (Y. Srinivas Ramakrishna) Date: Sat, 11 Jan 2025 02:04:49 GMT Subject: RFR: 8344049: Shenandoah: Eliminate init-update-refs safepoint [v3] In-Reply-To: References: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> Message-ID: <2smNeh6fdjcA_HtcFLFy9IqJBFETW_CRnqzyW1Z7rbI=.8bd30d87-3602-42cd-9f54-c0b818446e7d@github.com> On Fri, 10 Jan 2025 19:49:17 GMT, William Kemper wrote: >> Shenandoah typically takes 4 safepoints per GC cycle. Although Shenandoah itself does not spend much time on these safepoints, it may still take quite some time for all of the mutator threads to reach the safepoint. The occasionally long time-to-safepoint increases latency in the higher percentiles. >> >> The `init-update-refs` safepoint is responsible for retiring GCLABs (and PLABs) used during evacuation. Once evacuation is complete, no threads will access these LABs. This need not be done on a safepoint. `init-update-refs` is also where the global and thread local copies of the `gc_state` are updated. However, here we are turning off the `WEAK_ROOTS` flag _after_ all of the unmarked weak referents have been `nulled` out, so this does not need to happen atomically with respect to the mutators. Neither is it necessary to change the other state flags (EVACUATION, UPDATE_REFS) atomically across all mutators. >> >> Note that the `init-update-refs` safepoint is still taken if either verification or `ShenandoahPacing` are enabled. > > William Kemper has updated the pull request incrementally with one additional commit since the last revision: > > Improve comments and method names Left a few more comments, and will make one more final readthrough and approve. Thanks for your continued patience with my slow review! src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 196: > 194: > 195: // Evacuation is complete, retire gc labs > 196: heap->concurrent_prepare_for_update_refs(); For consistency with other related method naming, can we use "updaterefs" instead of "update_refs" (makes IDE searches easier to locate related methods). src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 1245: > 1243: void do_thread(Thread* thread) override { > 1244: _propagator.do_thread(thread); > 1245: if (ShenandoahThreadLocalData::gclab(thread) != nullptr) { Which thread may have this be null? (I am looking at the ShenandoahRetireGCLabClosure which insists that this should be non-null.) I assume we have some threads here that have a gc state that must be updated but which don't have a gc lab. I am wondering if the check for an initialized gclab and in the generational case the plab can be pushed down into the closure rather than being exposed here. At that place, we would want to document (or as needed assert) why some threads targeted by the closure may have null gclab or plab. src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 1267: > 1265: > 1266: // This will propagate the gc state and retire gclabs and plabs for threads that require it. > 1267: ShenandoahPrepareForUpdateRefs prepare_for_update_refs(_gc_state.raw_value()); In looking at this I see that we do not set `_gc_state_changed` here because we don't want individual threads to observe the global state, but only their local state (when it's propagated below). It would be good to emphasise this in the documetation of `_gc_state_changed` use protocol. Indeed, as I had suggested before, I think this might be better encapsulated with a `set_gc_state_concurrent()` that is analogous to `set_gc_state_at_safepoint()` that takes the appropriate state value as an argument, and uses the appropriate `_gc_state_changed` protocol. IIUC, this will be re-used when other safepoints are eliminated in the future. ------------- PR Review: https://git.openjdk.org/jdk/pull/22688#pullrequestreview-2544388098 PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1911793884 PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1911797849 PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1911803283 From kbarrett at openjdk.org Sat Jan 11 14:11:48 2025 From: kbarrett at openjdk.org (Kim Barrett) Date: Sat, 11 Jan 2025 14:11:48 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v5] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: <0T6dqXyqum7hEpCvg97-WsP_zVfOO9JkBCnze1f3sxE=.9b5c6ba1-9f58-4e74-bee8-5478809216cc@github.com> On Tue, 7 Jan 2025 12:51:33 GMT, Coleen Phillimore wrote: >> There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. >> >> Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. > > Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: > > Restore copyright and macro. Looks good. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/22916#pullrequestreview-2544851802 From coleenp at openjdk.org Sat Jan 11 15:29:48 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Sat, 11 Jan 2025 15:29:48 GMT Subject: RFR: 8345169: Implement JEP XXX: Remove the 32-bit x86 Port In-Reply-To: References:

Message-ID: On Fri, 10 Jan 2025 20:30:32 GMT, Vladimir Kozlov wrote: >> By default all 64-bits x86 CPU (starting from AMD64) supports all instructions up to SSE2. 32-bit x86 CPU may not support SSE2. >> >> We can generated sse1 or use FPU instructions in 64-bit VM but we decided not to do that - SSE2 instructions version were much easier to use. We purged all uses of FPU in JDK 15: [JDK-7175279](https://bugs.openjdk.org/browse/JDK-7175279) by using SSE set of instructions because we did not want to mess (save/restore state) with FPU anymore in 64-bit VM. > > I think there are several places in 64-bit VM where we assume SSE2 instructions are always available. > So if you set `UseSSE=1 or = 0` in debugger VM may crash. Having some kind of pre-JEP patch for this this might be helpful so that we don't drill down on this rather than the whole patch. Maybe the JEP patch could simply be what @iwanowww suggests. Then have a post-JEP patch to remove everything else. Sort of like what we did with Security Manager. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22567#discussion_r1912067239 From dholmes at openjdk.org Mon Jan 13 05:05:49 2025 From: dholmes at openjdk.org (David Holmes) Date: Mon, 13 Jan 2025 05:05:49 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v5] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: On Tue, 7 Jan 2025 12:51:33 GMT, Coleen Phillimore wrote: >> There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. >> >> Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. > > Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: > > Restore copyright and macro. Sorry for a "dumb" question but `%z` is for size_t arguments, so why are we using it to replace INTX/UINTX_FORMAT ??? I get that size_t and intx happen to be the same size but still ... if I see `%z` I expect to see a size_t argument passed in. ------------- PR Review: https://git.openjdk.org/jdk/pull/22916#pullrequestreview-2545711471 From coleenp at openjdk.org Mon Jan 13 13:29:45 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Mon, 13 Jan 2025 13:29:45 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v5] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: On Tue, 7 Jan 2025 12:51:33 GMT, Coleen Phillimore wrote: >> There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. >> >> Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. > > Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: > > Restore copyright and macro. They are interchangeable and some places used UINTX_FORMAT when they should have used SIZE_FORMAT. Better to have just one and just use %zu, which looks better in the format specifiers. I'm going to do SIZE_FORMAT next but still negotiating how to handle review tedium. The error message can be confusing though because the error message for %z refers to size_t. But some of our use of intx should probably be size_t. ------------- PR Comment: https://git.openjdk.org/jdk/pull/22916#issuecomment-2587101349 From coleenp at openjdk.org Mon Jan 13 15:49:15 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Mon, 13 Jan 2025 15:49:15 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v6] In-Reply-To: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: > There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. > > Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: Add Oracle copyright to shenandoah files for this change. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/22916/files - new: https://git.openjdk.org/jdk/pull/22916/files/ae9d9f6f..763c3908 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=22916&range=05 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=22916&range=04-05 Stats: 4 lines in 4 files changed: 4 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/22916.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22916/head:pull/22916 PR: https://git.openjdk.org/jdk/pull/22916 From kbarrett at openjdk.org Mon Jan 13 16:54:53 2025 From: kbarrett at openjdk.org (Kim Barrett) Date: Mon, 13 Jan 2025 16:54:53 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v6] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: On Mon, 13 Jan 2025 15:49:15 GMT, Coleen Phillimore wrote: >> There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. >> >> Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. > > Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: > > Add Oracle copyright to shenandoah files for this change. Still good. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/22916#pullrequestreview-2547230951 From wkemper at openjdk.org Mon Jan 13 18:22:38 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 13 Jan 2025 18:22:38 GMT Subject: RFR: 8344049: Shenandoah: Eliminate init-update-refs safepoint [v3] In-Reply-To: <2smNeh6fdjcA_HtcFLFy9IqJBFETW_CRnqzyW1Z7rbI=.8bd30d87-3602-42cd-9f54-c0b818446e7d@github.com> References: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> <2smNeh6fdjcA_HtcFLFy9IqJBFETW_CRnqzyW1Z7rbI=.8bd30d87-3602-42cd-9f54-c0b818446e7d@github.com> Message-ID: On Sat, 11 Jan 2025 01:50:39 GMT, Y. Srinivas Ramakrishna wrote: >> William Kemper has updated the pull request incrementally with one additional commit since the last revision: >> >> Improve comments and method names > > src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 1245: > >> 1243: void do_thread(Thread* thread) override { >> 1244: _propagator.do_thread(thread); >> 1245: if (ShenandoahThreadLocalData::gclab(thread) != nullptr) { > > Which thread may have this be null? (I am looking at the ShenandoahRetireGCLabClosure which insists that this should be non-null.) > > I assume we have some threads here that have a gc state that must be updated but which don't have a gc lab. > > I am wondering if the check for an initialized gclab and in the generational case the plab can be pushed down into the closure rather than being exposed here. At that place, we would want to document (or as needed assert) why some threads targeted by the closure may have null gclab or plab. Only worker threads and java threads are required to have gclabs. In other use cases (`shHeap::make_labs_parsable`, `shHeap::retire_gclabs`), this closure is _only_ used on java and worker threads, so pushing the test into the closure would be redundant for other uses. I will put in a comment here instead? Additionally, I noticed an inconsistency between `make_labs_parsable` (which skips the safepoint workers) and `retire_gclabs` (which also visited the control thread). An earlier version of this PR had given gclabs to the control and vm threads, but these threads will only perform evacuations in very rare circumstances, so I've removed their gclabs. > src/hotspot/share/gc/shenandoah/shenandoahHeap.cpp line 1267: > >> 1265: >> 1266: // This will propagate the gc state and retire gclabs and plabs for threads that require it. >> 1267: ShenandoahPrepareForUpdateRefs prepare_for_update_refs(_gc_state.raw_value()); > > In looking at this I see that we do not set `_gc_state_changed` here because we don't want individual threads to observe the global state, but only their local state (when it's propagated below). It would be good to emphasise this in the documetation of `_gc_state_changed` use protocol. > > Indeed, as I had suggested before, I think this might be better encapsulated with a `set_gc_state_concurrent()` that is analogous to `set_gc_state_at_safepoint()` that takes the appropriate state value as an argument, and uses the appropriate `_gc_state_changed` protocol. > > IIUC, this will be re-used when other safepoints are eliminated in the future. I'll encapsulate the access here and improve the documentation for `_gc_state_changed`. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1913617339 PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1913620624 From wkemper at openjdk.org Mon Jan 13 18:32:47 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 13 Jan 2025 18:32:47 GMT Subject: RFR: 8344049: Shenandoah: Eliminate init-update-refs safepoint [v3] In-Reply-To: <2smNeh6fdjcA_HtcFLFy9IqJBFETW_CRnqzyW1Z7rbI=.8bd30d87-3602-42cd-9f54-c0b818446e7d@github.com> References: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> <2smNeh6fdjcA_HtcFLFy9IqJBFETW_CRnqzyW1Z7rbI=.8bd30d87-3602-42cd-9f54-c0b818446e7d@github.com> Message-ID: On Sat, 11 Jan 2025 01:35:06 GMT, Y. Srinivas Ramakrishna wrote: >> William Kemper has updated the pull request incrementally with one additional commit since the last revision: >> >> Improve comments and method names > > src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 196: > >> 194: >> 195: // Evacuation is complete, retire gc labs >> 196: heap->concurrent_prepare_for_update_refs(); > > For consistency with other related method naming, can we use "updaterefs" instead of "update_refs" (makes IDE searches easier to locate related methods). I'm all for making this consistent, but it seems that `update_refs` is more commonly used in method and variable declarations: [0] % grep -r --include "*.hpp" updaterefs src/hotspot/share/gc/shenandoah | wc -l 17 [0] % grep -r --include "*.hpp" update_refs src/hotspot/share/gc/shenandoah | wc -l 27 ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1913637958 From wkemper at openjdk.org Mon Jan 13 18:36:44 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 13 Jan 2025 18:36:44 GMT Subject: RFR: 8344049: Shenandoah: Eliminate init-update-refs safepoint [v3] In-Reply-To: References: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> <2smNeh6fdjcA_HtcFLFy9IqJBFETW_CRnqzyW1Z7rbI=.8bd30d87-3602-42cd-9f54-c0b818446e7d@github.com> Message-ID: On Mon, 13 Jan 2025 18:29:52 GMT, William Kemper wrote: >> src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp line 196: >> >>> 194: >>> 195: // Evacuation is complete, retire gc labs >>> 196: heap->concurrent_prepare_for_update_refs(); >> >> For consistency with other related method naming, can we use "updaterefs" instead of "update_refs" (makes IDE searches easier to locate related methods). > > I'm all for making this consistent, but it seems that `update_refs` is more commonly used in method and variable declarations: > > [0] % grep -r --include "*.hpp" updaterefs src/hotspot/share/gc/shenandoah | wc -l > 17 > > [0] % grep -r --include "*.hpp" update_refs src/hotspot/share/gc/shenandoah | wc -l > 27 If it's okay with you, I will do this on a separate PR so that this current PR is not cluttered by the change. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1913642618 From wkemper at openjdk.org Mon Jan 13 18:45:23 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 13 Jan 2025 18:45:23 GMT Subject: RFR: 8344049: Shenandoah: Eliminate init-update-refs safepoint [v4] In-Reply-To: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> References: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> Message-ID: > Shenandoah typically takes 4 safepoints per GC cycle. Although Shenandoah itself does not spend much time on these safepoints, it may still take quite some time for all of the mutator threads to reach the safepoint. The occasionally long time-to-safepoint increases latency in the higher percentiles. > > The `init-update-refs` safepoint is responsible for retiring GCLABs (and PLABs) used during evacuation. Once evacuation is complete, no threads will access these LABs. This need not be done on a safepoint. `init-update-refs` is also where the global and thread local copies of the `gc_state` are updated. However, here we are turning off the `WEAK_ROOTS` flag _after_ all of the unmarked weak referents have been `nulled` out, so this does not need to happen atomically with respect to the mutators. Neither is it necessary to change the other state flags (EVACUATION, UPDATE_REFS) atomically across all mutators. > > Note that the `init-update-refs` safepoint is still taken if either verification or `ShenandoahPacing` are enabled. William Kemper has updated the pull request incrementally with two additional commits since the last revision: - Encapsulate and document a method for making concurrent gc_state changes - Control thread doesn't need a gc lab, also make gclabs for safepoint workers parsable ------------- Changes: - all: https://git.openjdk.org/jdk/pull/22688/files - new: https://git.openjdk.org/jdk/pull/22688/files/89c20a14..26e382c5 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=22688&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=22688&range=02-03 Stats: 21 lines in 2 files changed: 14 ins; 3 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/22688.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/22688/head:pull/22688 PR: https://git.openjdk.org/jdk/pull/22688 From coleenp at openjdk.org Mon Jan 13 18:52:56 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Mon, 13 Jan 2025 18:52:56 GMT Subject: RFR: 8347566: Replace SSIZE_FORMAT with 'z' length modifier Message-ID: Please review this change that replaces SSIZE_FORMAT with %zd. Tested with tier1 on Oracle supported platforms (and here with GHA). ------------- Commit messages: - 8347566: Replace SSIZE_FORMAT with 'z' length modifier Changes: https://git.openjdk.org/jdk/pull/23084/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23084&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8347566 Stats: 82 lines in 7 files changed: 2 ins; 3 del; 77 mod Patch: https://git.openjdk.org/jdk/pull/23084.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23084/head:pull/23084 PR: https://git.openjdk.org/jdk/pull/23084 From wkemper at openjdk.org Mon Jan 13 18:55:50 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 13 Jan 2025 18:55:50 GMT Subject: RFR: 8344049: Shenandoah: Eliminate init-update-refs safepoint [v3] In-Reply-To: References: <6ZVLoWPco9LC3XZOturDKG9F42n20Ie4h61f5Ap5iIY=.bbeb52d3-3de0-4778-b504-a69dc6ef7d3b@github.com> <2smNeh6fdjcA_HtcFLFy9IqJBFETW_CRnqzyW1Z7rbI=.8bd30d87-3602-42cd-9f54-c0b818446e7d@github.com>

Message-ID: On Mon, 13 Jan 2025 18:34:06 GMT, William Kemper wrote: >> I'm all for making this consistent, but it seems that `update_refs` is more commonly used in method and variable declarations: >> >> [0] % grep -r --include "*.hpp" updaterefs src/hotspot/share/gc/shenandoah | wc -l >> 17 >> >> [0] % grep -r --include "*.hpp" update_refs src/hotspot/share/gc/shenandoah | wc -l >> 27 > > If it's okay with you, I will do this on a separate PR so that this current PR is not cluttered by the change. https://bugs.openjdk.org/browse/JDK-8347617 ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/22688#discussion_r1913666119 From dlong at openjdk.org Mon Jan 13 19:35:43 2025 From: dlong at openjdk.org (Dean Long) Date: Mon, 13 Jan 2025 19:35:43 GMT Subject: RFR: 8347566: Replace SSIZE_FORMAT with 'z' length modifier In-Reply-To: References: Message-ID: On Mon, 13 Jan 2025 18:47:35 GMT, Coleen Phillimore wrote: > Please review this change that replaces SSIZE_FORMAT with %zd. > Tested with tier1 on Oracle supported platforms (and here with GHA). src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 160: > 158: uintx free_bits = mutator_bits | collector_bits | old_collector_bits; > 159: uintx notfree_bits = ~free_bits; > 160: log_debug(gc)("%6zu : " SIZE_FORMAT_X_0 " 0x" SIZE_FORMAT_X_0 " 0x" SIZE_FORMAT_X_0 " 0x" SIZE_FORMAT_X_0, Why is this %6zu instead of %6zd? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23084#discussion_r1913708708 From wkemper at openjdk.org Mon Jan 13 20:13:19 2025 From: wkemper at openjdk.org (William Kemper) Date: Mon, 13 Jan 2025 20:13:19 GMT Subject: RFR: 8347620: Shenandoah: Use 'free' tag for free set related logging Message-ID: Without a distinguishing tag, debug logging is too voluminous to enable when we really only want the free set's debug messages. ------------- Commit messages: - Use 'free' tag with free set messages Changes: https://git.openjdk.org/jdk/pull/23086/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=23086&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8347620 Stats: 78 lines in 1 file changed: 7 ins; 7 del; 64 mod Patch: https://git.openjdk.org/jdk/pull/23086.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23086/head:pull/23086 PR: https://git.openjdk.org/jdk/pull/23086 From coleenp at openjdk.org Mon Jan 13 20:39:53 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Mon, 13 Jan 2025 20:39:53 GMT Subject: RFR: 8347566: Replace SSIZE_FORMAT with 'z' length modifier [v2] In-Reply-To: References: Message-ID: <8ngxf9I1hYdlTzQjB3ebRjQZr7ByhUlldFZH73SULPY=.11d7d8c3-0d97-4cfc-942c-924c668525b2@github.com> > Please review this change that replaces SSIZE_FORMAT with %zd. > Tested with tier1 on Oracle supported platforms (and here with GHA). Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: Fix one zu -> zd. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/23084/files - new: https://git.openjdk.org/jdk/pull/23084/files/1cf9c88e..011ab8d2 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=23084&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=23084&range=00-01 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/23084.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/23084/head:pull/23084 PR: https://git.openjdk.org/jdk/pull/23084 From coleenp at openjdk.org Mon Jan 13 20:39:53 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Mon, 13 Jan 2025 20:39:53 GMT Subject: RFR: 8347566: Replace SSIZE_FORMAT with 'z' length modifier [v2] In-Reply-To: References:

Message-ID: On Mon, 13 Jan 2025 19:32:06 GMT, Dean Long wrote: >> Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: >> >> Fix one zu -> zd. > > src/hotspot/share/gc/shenandoah/shenandoahFreeSet.cpp line 160: > >> 158: uintx free_bits = mutator_bits | collector_bits | old_collector_bits; >> 159: uintx notfree_bits = ~free_bits; >> 160: log_debug(gc)("%6zu : " SIZE_FORMAT_X_0 " 0x" SIZE_FORMAT_X_0 " 0x" SIZE_FORMAT_X_0 " 0x" SIZE_FORMAT_X_0, > > Why is this %6zu instead of %6zd? I must have done this one by hand. Thank you for spotting it. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/23084#discussion_r1913773820 From dlong at openjdk.org Mon Jan 13 20:44:40 2025 From: dlong at openjdk.org (Dean Long) Date: Mon, 13 Jan 2025 20:44:40 GMT Subject: RFR: 8347566: Replace SSIZE_FORMAT with 'z' length modifier [v2] In-Reply-To: <8ngxf9I1hYdlTzQjB3ebRjQZr7ByhUlldFZH73SULPY=.11d7d8c3-0d97-4cfc-942c-924c668525b2@github.com> References: <8ngxf9I1hYdlTzQjB3ebRjQZr7ByhUlldFZH73SULPY=.11d7d8c3-0d97-4cfc-942c-924c668525b2@github.com> Message-ID: On Mon, 13 Jan 2025 20:39:53 GMT, Coleen Phillimore wrote: >> Please review this change that replaces SSIZE_FORMAT with %zd. >> Tested with tier1 on Oracle supported platforms (and here with GHA). > > Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: > > Fix one zu -> zd. Marked as reviewed by dlong (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/23084#pullrequestreview-2547837099 From dholmes at openjdk.org Mon Jan 13 21:07:44 2025 From: dholmes at openjdk.org (David Holmes) Date: Mon, 13 Jan 2025 21:07:44 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v6] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: <9iKSUKvDXxUoqoAxOYVEqzUGmf2PQwDLy_MfbFABs88=.30a14b62-a21e-4a6e-bf32-31454689a33f@github.com> On Mon, 13 Jan 2025 15:49:15 GMT, Coleen Phillimore wrote: >> There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. >> >> Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. > > Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: > > Add Oracle copyright to shenandoah files for this change. Marked as reviewed by dholmes (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/22916#pullrequestreview-2547892700 From coleenp at openjdk.org Mon Jan 13 22:02:36 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Mon, 13 Jan 2025 22:02:36 GMT Subject: RFR: 8347566: Replace SSIZE_FORMAT with 'z' length modifier [v2] In-Reply-To: <8ngxf9I1hYdlTzQjB3ebRjQZr7ByhUlldFZH73SULPY=.11d7d8c3-0d97-4cfc-942c-924c668525b2@github.com> References: <8ngxf9I1hYdlTzQjB3ebRjQZr7ByhUlldFZH73SULPY=.11d7d8c3-0d97-4cfc-942c-924c668525b2@github.com> Message-ID: On Mon, 13 Jan 2025 20:39:53 GMT, Coleen Phillimore wrote: >> Please review this change that replaces SSIZE_FORMAT with %zd. >> Tested with tier1 on Oracle supported platforms (and here with GHA). > > Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: > > Fix one zu -> zd. Thanks Dean. ------------- PR Comment: https://git.openjdk.org/jdk/pull/23084#issuecomment-2588311136 From coleenp at openjdk.org Mon Jan 13 22:06:49 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Mon, 13 Jan 2025 22:06:49 GMT Subject: Integrated: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros In-Reply-To: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: On Fri, 3 Jan 2025 14:32:39 GMT, Coleen Phillimore wrote: > There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. > > Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. This pull request has now been integrated. Changeset: 379d05bc Author: Coleen Phillimore URL: https://git.openjdk.org/jdk/commit/379d05bcc130446086786ecf6ca5a6b8e977386c Stats: 344 lines in 83 files changed: 6 ins; 19 del; 319 mod 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros Reviewed-by: kbarrett, dholmes, matsaave ------------- PR: https://git.openjdk.org/jdk/pull/22916 From coleenp at openjdk.org Mon Jan 13 22:06:48 2025 From: coleenp at openjdk.org (Coleen Phillimore) Date: Mon, 13 Jan 2025 22:06:48 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v6] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: On Mon, 13 Jan 2025 15:49:15 GMT, Coleen Phillimore wrote: >> There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. >> >> Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. > > Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: > > Add Oracle copyright to shenandoah files for this change. Thank you Matias, Kim and David. ------------- PR Comment: https://git.openjdk.org/jdk/pull/22916#issuecomment-2588312123 From dholmes at openjdk.org Tue Jan 14 01:18:43 2025 From: dholmes at openjdk.org (David Holmes) Date: Tue, 14 Jan 2025 01:18:43 GMT Subject: RFR: 8346990: Remove INTX_FORMAT and UINTX_FORMAT macros [v6] In-Reply-To: References: <3DB-2pH7wwVWDuJfkD1XoQwGKJOYxJKhuDQ0UeuxBC4=.03b5f432-6051-49d9-8ea9-34a9ea769ad1@github.com> Message-ID: On Mon, 13 Jan 2025 15:49:15 GMT, Coleen Phillimore wrote: >> There are a lot of format modifiers that are noisy and unnecessary in the code. This change removes the INTX variants. It's not that disruptive even for backporting because %z modifier has been available for a long time so should backport fine. This was mostly done with a sed script plus some hand fixups. >> >> Testing mach5 and other platform cross compilations in progress. Opening this for GHA testing. > > Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: > > Add Oracle copyright to shenandoah files for this change. We have belatedly discovered that `0x%zx` and `%#zx` behave differently in their handling of zero. The former prints `0x0` while the latter just prints `0`. This has broken the compiler replay tests as the parsing of 0 no longer works. ------------- PR Comment: https://git.openjdk.org/jdk/pull/22916#issuecomment-2588550581 From dholmes at openjdk.org Tue Jan 14 06:55:41 2025 From: dholmes at openjdk.org (David Holmes) Date: Tue, 14 Jan 2025 06:55:41 GMT Subject: RFR: 8347566: Replace SSIZE_FORMAT with 'z' length modifier [v2] In-Reply-To: <8ngxf9I1hYdlTzQjB3ebRjQZr7ByhUlldFZH73SULPY=.11d7d8c3-0d97-4cfc-942c-924c668525b2@github.com> References: <8ngxf9I1hYdlTzQjB3ebRjQZr7ByhUlldFZH73SULPY=.11d7d8c3-0d97-4cfc-942c-924c668525b2@github.com> Message-ID: On Mon, 13 Jan 2025 20:39:53 GMT, Coleen Phillimore wrote: >> Please review this change that replaces SSIZE_FORMAT with %zd. >> Tested with tier1 on Oracle supported platforms (and here with GHA). > > Coleen Phillimore has updated the pull request incrementally with one additional commit since the last revision: > > Fix one zu -> zd. Looks good! Thanks ------------- Marked as reviewed by dholmes (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/23084#pullrequestreview-2548932282 From kbarrett at openjdk.org Tue Jan 14 08:18:39 2025 From: kbarrett at openjdk.org (Kim Barrett) Date: Tue, 14 Jan 2025 08:18:39 GMT Subject: RFR: 8347566: Replace SSIZE_FORMAT with 'z' length modifier [v2] In-Reply-To: <8ngxf9I1hYdlTzQjB3ebRjQZr7ByhUlldFZH73SULPY=.11d7d8c3-0d97-4cfc-942c-924c668525b2@github.com> References: <8ngxf9I1hYdlTzQjB3ebRjQZr7ByhUlldFZH73SULPY=.11d7d8c3-0d97-4cfc-942c-924c668525b2@github.com> Message-ID: On Mon, 13 Jan 2025 20:39:53 GMT, Coleen Phillimore