From kbarrett at openjdk.org Sun Feb 1 00:21:01 2026 From: kbarrett at openjdk.org (Kim Barrett) Date: Sun, 1 Feb 2026 00:21:01 GMT Subject: RFR: 8376131: Convert ContiguousSpace to use Atomic In-Reply-To: References: Message-ID: On Thu, 22 Jan 2026 17:51:08 GMT, Thomas Schatzl wrote: > Hi all, > > please review this conversions of `ContiguousSpace` to use `Atomic`. > > Testing: gha, tier1-5 > > Thanks, > Thomas Looks good. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/29370#pullrequestreview-3734044371 From jbhateja at openjdk.org Sun Feb 1 07:41:59 2026 From: jbhateja at openjdk.org (Jatin Bhateja) Date: Sun, 1 Feb 2026 07:41:59 GMT Subject: RFR: 8376187: [VectorAPI] Define new lane type constants and pass them to intrinsic entries [v5] In-Reply-To: References: Message-ID: > As per [discussions ](https://github.com/openjdk/jdk/pull/28002#issuecomment-3789507594) on JDK-8370691 pull request, splitting out portion of PR#28002 into a separate patch in preparation of Float16 vector API support. > > Patch add new lane type constants and pass them to vector intrinsic entry points. > > All existing Vector API jtreg test are passing with the patch. > > Kindly review and share your feedback. 
> > Best Regards, > Jatin Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision: Review comments resolution ------------- Changes: - all: https://git.openjdk.org/jdk/pull/29481/files - new: https://git.openjdk.org/jdk/pull/29481/files/ff73dc3d..0c60016b Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=29481&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=29481&range=03-04 Stats: 401 lines in 39 files changed: 28 ins; 62 del; 311 mod Patch: https://git.openjdk.org/jdk/pull/29481.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/29481/head:pull/29481 PR: https://git.openjdk.org/jdk/pull/29481 From jbhateja at openjdk.org Sun Feb 1 07:42:04 2026 From: jbhateja at openjdk.org (Jatin Bhateja) Date: Sun, 1 Feb 2026 07:42:04 GMT Subject: RFR: 8376187: [VectorAPI] Define new lane type constants and pass them to intrinsic entries [v4] In-Reply-To: <-fsfUEvFpvmAsupQFgx1CBkH9vr_efE5-qYeUzy5VFQ=.4abb05e0-1f82-4d6c-8bc4-ca4bc6fc5e80@github.com> References:

<-fsfUEvFpvmAsupQFgx1CBkH9vr_efE5-qYeUzy5VFQ=.4abb05e0-1f82-4d6c-8bc4-ca4bc6fc5e80@github.com>
Message-ID:

On Fri, 30 Jan 2026 23:31:29 GMT, Paul Sandoz wrote:

>> Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision:
>>
>> Review comments resolutions
>
> src/jdk.incubator.vector/share/classes/jdk/incubator/vector/AbstractSpecies.java line 152:
>
>> 150:     int laneTypeOrdinal() {
>> 151:         return laneType.ordinal();
>> 152:     }
>
> Is this needed? Won't all concrete sub types override this?

This interface provides access to the lane type constant through the species. It is used for consistency; please have a look at the following line and other places around it.
https://github.com/jatin-bhateja/jdk/blob/ff73dc3d48a9435c4395556c8325fbce7610cba9/src/jdk.incubator.vector/share/classes/jdk/incubator/vector/DoubleVector.java#L3374

> src/jdk.incubator.vector/share/classes/jdk/incubator/vector/Byte128Vector.java line 60:
>
>> 58:
>> 59:     static final int LANE_TYPE_ORDINAL = LT_BYTE;
>> 60:
>
> You can move this up to `ByteVector` and then reuse it to replace `byte.class`, so it is used consistently.

Done

> src/jdk.incubator.vector/share/classes/jdk/incubator/vector/VectorOperators.java line 821:
>
>> 819:         convert(String name, char kind, Class dom, Class ran, int opCode, int flags) {
>> 820:             int domran = ((LaneType.of(dom).ordinal() << VO_DOM_SHIFT) +
>> 821:                           (LaneType.of(ran).ordinal() << VO_RAN_SHIFT));
>
> As i understand this is still correct because the maximum ordinal value is less than 16 (as was already the case for the basic type).

Correct.
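The point about ordinals staying below 16 can be illustrated with a small packing sketch. It is written in C++ rather than Java, and the 4-bit field width, the shift values, and every name in it are assumptions made for illustration; the real `VO_DOM_SHIFT` and `VO_RAN_SHIFT` constants in `VectorOperators.java` may differ. It only shows why two ordinals can share one integer as long as each fits in its field.

```cpp
#include <cassert>
#include <cstdint>

// Assumed layout, for illustration only: two adjacent 4-bit fields.
constexpr int      kFieldBits = 4;
constexpr int      kDomShift  = 0;                       // hypothetical shift
constexpr int      kRanShift  = kDomShift + kFieldBits;  // hypothetical shift
constexpr uint32_t kFieldMask = (1u << kFieldBits) - 1;  // 0xF

// Packs the domain and range ordinals the same way the quoted Java code does:
// shift each into its field and add.
constexpr uint32_t pack_domran(uint32_t dom, uint32_t ran) {
  return (dom << kDomShift) + (ran << kRanShift);
}

// Extracts each field again by shifting back and masking.
constexpr uint32_t unpack_dom(uint32_t v) { return (v >> kDomShift) & kFieldMask; }
constexpr uint32_t unpack_ran(uint32_t v) { return (v >> kRanShift) & kFieldMask; }
```

With these assumed widths, any pair of ordinals in the range 0 to 15 round-trips unchanged, while an ordinal of 16 would spill into the neighbouring field. That is the condition the review comment checks.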
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/29481#discussion_r2750675259 PR Review Comment: https://git.openjdk.org/jdk/pull/29481#discussion_r2750675162 PR Review Comment: https://git.openjdk.org/jdk/pull/29481#discussion_r2750675209 From psandoz at openjdk.org Sun Feb 1 17:11:11 2026 From: psandoz at openjdk.org (Paul Sandoz) Date: Sun, 1 Feb 2026 17:11:11 GMT Subject: RFR: 8376187: [VectorAPI] Define new lane type constants and pass them to intrinsic entries [v4] In-Reply-To: References:

<-fsfUEvFpvmAsupQFgx1CBkH9vr_efE5-qYeUzy5VFQ=.4abb05e0-1f82-4d6c-8bc4-ca4bc6fc5e80@github.com> Message-ID: On Sun, 1 Feb 2026 07:36:35 GMT, Jatin Bhateja wrote: >> src/jdk.incubator.vector/share/classes/jdk/incubator/vector/AbstractSpecies.java line 152: >> >>> 150: int laneTypeOrdinal() { >>> 151: return laneType.ordinal(); >>> 152: } >> >> Is this needed? Won't all concrete sub types override this? > > This interface provides access to lane type constant though species, its used for consistency, please have a look at following line and other places around it. > https://github.com/jatin-bhateja/jdk/blob/ff73dc3d48a9435c4395556c8325fbce7610cba9/src/jdk.incubator.vector/share/classes/jdk/incubator/vector/DoubleVector.java#L3374 Agreed that this method is required, but i was wondering why `AbstractSpecies` need to implement it. Ok, i see now you are copying the same pattern as some other methods such as `elementType`, so this is a more general issue we should not resolve in this PR. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/29481#discussion_r2751614740 From psandoz at openjdk.org Sun Feb 1 17:15:09 2026 From: psandoz at openjdk.org (Paul Sandoz) Date: Sun, 1 Feb 2026 17:15:09 GMT Subject: RFR: 8376187: [VectorAPI] Define new lane type constants and pass them to intrinsic entries [v5] In-Reply-To: References:

Message-ID: On Sun, 1 Feb 2026 07:41:59 GMT, Jatin Bhateja wrote: >> As per [discussions ](https://github.com/openjdk/jdk/pull/28002#issuecomment-3789507594) on JDK-8370691 pull request, splitting out portion of PR#28002 into a separate patch in preparation of Float16 vector API support. >> >> Patch add new lane type constants and pass them to vector intrinsic entry points. >> >> All existing Vector API jtreg test are passing with the patch. >> >> Kindly review and share your feedback. >> >> Best Regards, >> Jatin > > Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision: > > Review comments resolution src/jdk.incubator.vector/share/classes/jdk/incubator/vector/ByteVector.java line 580: > 578: public static ByteVector zero(VectorSpecies species) { > 579: ByteSpecies vsp = (ByteSpecies) species; > 580: return VectorSupport.fromBitsCoerced(vsp.vectorType(), vsp.laneTypeOrdinal(), species.length(), You can now use `LANE_TYPE_ORDINAL` rather than `vsp.laneTypeOrdinal()`, which better fits the prior pattern. 
-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/29481#discussion_r2751629721

From dholmes at openjdk.org  Mon Feb  2 01:21:41 2026
From: dholmes at openjdk.org (David Holmes)
Date: Mon, 2 Feb 2026 01:21:41 GMT
Subject: RFR: 8376855: ASAN reports out-of-range read in strncmp in MethodHandles::is_basic_type_signature
Message-ID:

An ASAN enabled build reported heap-buffer-overflow in `MethodHandles::is_basic_type_signature` with `ASAN_OPTIONS=strict_string_checks=true` when running test `jdk/jdk/jfr/api/metadata/annotations/TestThrottle.java`

The code is here:

    bool MethodHandles::is_basic_type_signature(Symbol* sig) {
      assert(vmSymbols::object_signature()->utf8_length() == (int)OBJ_SIG_LEN, "");
      assert(vmSymbols::object_signature()->equals(OBJ_SIG), "");
      for (SignatureStream ss(sig, sig->starts_with(JVM_SIGNATURE_FUNC)); !ss.is_done(); ss.next()) {
        switch (ss.type()) {
        case T_OBJECT:
          // only java/lang/Object is valid here
          if (strncmp((char*) ss.raw_bytes(), OBJ_SIG, OBJ_SIG_LEN) != 0)

The ASAN `strncmp` interceptor acts as follows:

    INTERCEPTOR(int, strncmp, const char *s1, const char *s2, size_t n) {
        void *ctx;
        ASAN_INTERCEPTOR_ENTER(linker, strncmp);  // Sets up context
        ASAN_READ_RANGE(s1, n);                   // Validates s1
        ASAN_READ_RANGE(s2, n);                   // Validates s2
        return REAL(strncmp)(s1, s2, n);          // Calls original function
    }

With the test given `s1` is a buffer of size 15, containing a non-nul-terminated string, and `n` is 18, so `ASAN_READ_RANGE` fails for `s1` as we could potentially read beyond the end of the buffer.

In practice however, given `s1` is guaranteed to be a valid type-string from a signature symbol of type `T_OBJECT`, its final character is `;` and the final character of `s2` is also `;` (it is the string constant `Ljava/lang/Object;`). Hence the comparison must terminate before we can run off the end of `s1`.

To appease ASAN we can make a simple change to the `strncmp` call to compare at most `ss.raw_length()` bytes.
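As a rough standalone illustration of bounding the read by the buffer length, consider the sketch below. It is not the actual patch: the helper name `is_object_signature` and its parameters are hypothetical, and because a standalone function cannot rely on the trailing `;` invariant that HotSpot signatures provide, it swaps in an explicit length check followed by `memcmp` instead of a shortened `strncmp`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>

// Same constant the mail refers to; 18 characters without the terminating NUL.
static const char   OBJ_SIG[]   = "Ljava/lang/Object;";
static const size_t OBJ_SIG_LEN = sizeof(OBJ_SIG) - 1;

// Hypothetical helper: 'raw' points to 'raw_len' readable bytes and is not
// required to be NUL-terminated. No byte beyond raw[raw_len - 1] is read.
bool is_object_signature(const char* raw, size_t raw_len) {
  if (raw_len != OBJ_SIG_LEN) {
    return false;  // different length, so it cannot be Ljava/lang/Object;
  }
  return memcmp(raw, OBJ_SIG, OBJ_SIG_LEN) == 0;
}
```

For the case described above, a 15-byte buffer holding a different class name is rejected by the length check alone, so the comparison never touches bytes 15 to 17.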
Testing - ASAN no longer reports an error - tiers 1-3 sanity Thanks ------------- Commit messages: - copyright-year - 8376855: ASAN reports out-of-range read in strncmp in MethodHandles::is_basic_type_signature Changes: https://git.openjdk.org/jdk/pull/29516/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=29516&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8376855 Stats: 2 lines in 1 file changed: 0 ins; 0 del; 2 mod Patch: https://git.openjdk.org/jdk/pull/29516.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/29516/head:pull/29516 PR: https://git.openjdk.org/jdk/pull/29516 From dholmes at openjdk.org Mon Feb 2 02:14:06 2026 From: dholmes at openjdk.org (David Holmes) Date: Mon, 2 Feb 2026 02:14:06 GMT Subject: RFR: 8373367: interp-only mechanism fails to work for carrier threads in a corner case [v3] In-Reply-To: References: <4kL5ukI7hOKtKX0zkyc6K_7RMq3v1t_fJdvdwvmXfsw=.60ebbe1d-0133-4bff-953c-db953eed86db@github.com> Message-ID: On Fri, 30 Jan 2026 07:45:04 GMT, Serguei Spitsyn wrote: >> The `interp-only` mechanism is based on the `JavaThread` objects. Carrier and virtual threads can temporary share the same `JavaThread`. The `java_thread->jvmti_thread_state()` is re-linked to a virtual thread at `mount` and to the carrier thread at `unmount`. The `JvmtiThreadState` has a back link to the `JavaThread` which is also set for virtual thread at a `mount` and carrier thread at an `unmount`. Just one of these two links at the same time is set to the `JavaThread`, the other one has to be set to `nullptr`. The `interp-only` mechanism needs this invariant. >> However, there is a corner case when this invariant is broken. It happens when the `JvmtiThreadState` for carrier thread has just been created. In such case, the link to `JavaThread` is always `non-nullptr` even though a virtual thread is currently mounted on a carrier thread. This simple update fixes the issue in the `JvmtiThreadState` ctor. 
>> >> Testing: >> - TBD: Mach5 tiers 1-6 > > Serguei Spitsyn has updated the pull request incrementally with one additional commit since the last revision: > > review: moved and extended comment in JvmtiThreadState ctor I appreciate the expanded comments but I still don't fully understand what `_thread` and `_saved_thread` point to at different times. The lifecycle of these fields really needs to be clearly described somewhere. A couple of typos are present - see below. Thanks src/hotspot/share/prims/jvmtiThreadState.cpp line 61: > 59: > 60: // The _thread field is a link to the JavaThread associated with JvmtiThreadState. > 61: // The _thread_saved field is used for carrier threads only when a virtual thread, Suggestion: // The _thread_saved field is used for carrier threads only when a virtual thread src/hotspot/share/prims/jvmtiThreadState.cpp line 65: > 63: // Carrier and virtual threads can temporarily share same JavaThread. In such a case, > 64: // only virtual _thread should have a link from JvmtiThreadState to JavaThread. > 65: // The carrier thread _thread filed is set to nullptr if a virtual thread is monted. Suggestion: // The carrier thread _thread field is set to nullptr if a virtual thread is mounted. ------------- Changes requested by dholmes (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/29436#pullrequestreview-3737008826 PR Review Comment: https://git.openjdk.org/jdk/pull/29436#discussion_r2752276119 PR Review Comment: https://git.openjdk.org/jdk/pull/29436#discussion_r2752275470 From shade at openjdk.org Mon Feb 2 07:15:19 2026 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 2 Feb 2026 07:15:19 GMT Subject: RFR: 8376472: Shenandoah: Assembler store barriers read destination memory despite the decorators [v2] In-Reply-To: References:

Message-ID: <-XK_Jf4sJArKYhzJltTtV3CUe3k4iI1ZpVT4E5QaDbo=.52cd97d5-9d47-44bd-9618-01f10fb04ed9@github.com> On Fri, 30 Jan 2026 10:16:19 GMT, Aleksey Shipilev wrote: >> The issue is really a correctness issue, and it readily manifests in Valhalla, which sometimes does the stores with `IS_DEST_UNINITIALIZED` set. Unfortunately, Shenandoah SATB barriers ignore this attribute, and attempt to read the memory at store address. At best it crashes the VM with the "oopness" asserts, at worst it feeds "garbage" pointers into SATB machinery, which then wrecks havoc on everything else. >> >> We need to make sure store barriers are consistently checking these attributes. Unfortunately, that would mean doing the changes in arch-specific assembler code. >> >> This PR makes sure the ShenandoahBarrierSetAssembler store barriers are roughly in the same shape, and that they consult `ShenandoahBarrierSet::need_*_barrier` to make the proper decisions whether to use SATB/card barriers. >> >> `hotspot_gc_shenandoah` is enough to sanity-check this patch, but I am also running `all` tests for extra safety. >> >> Additional testing: >> - [x] Linux x86_64 server fastdebug, `hotspot_gc_shenandoah` >> - [x] Linux AArch64 server fastdebug, `hotspot_gc_shenandoah` >> - [x] Linux x86_64 server fastdebug, `all` + `-XX:+UseShenandoahGC` >> - [x] Linux AArch64 server fastdebug, `all` + `-XX:+UseShenandoahGC` >> - [x] Linux {PPC64, RISC-V, S390X} server fastdebug, cross-compilation > > Aleksey Shipilev has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains six additional commits since the last revision: > > - Missing return in PPC64 for non-reference stores > - Merge branch 'master' into JDK-8376472-shenandoah-store-barriers > - More polish > - RISC-V version > - More touchups, AArch64 version > - Store barrier cleanup Let's go! Thanks for reviews. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/29444#issuecomment-3833369307 From shade at openjdk.org Mon Feb 2 07:15:20 2026 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 2 Feb 2026 07:15:20 GMT Subject: Integrated: 8376472: Shenandoah: Assembler store barriers read destination memory despite the decorators In-Reply-To: References: Message-ID: On Tue, 27 Jan 2026 10:47:54 GMT, Aleksey Shipilev wrote: > The issue is really a correctness issue, and it readily manifests in Valhalla, which sometimes does the stores with `IS_DEST_UNINITIALIZED` set. Unfortunately, Shenandoah SATB barriers ignore this attribute, and attempt to read the memory at store address. At best it crashes the VM with the "oopness" asserts, at worst it feeds "garbage" pointers into SATB machinery, which then wrecks havoc on everything else. > > We need to make sure store barriers are consistently checking these attributes. Unfortunately, that would mean doing the changes in arch-specific assembler code. > > This PR makes sure the ShenandoahBarrierSetAssembler store barriers are roughly in the same shape, and that they consult `ShenandoahBarrierSet::need_*_barrier` to make the proper decisions whether to use SATB/card barriers. > > `hotspot_gc_shenandoah` is enough to sanity-check this patch, but I am also running `all` tests for extra safety. > > Additional testing: > - [x] Linux x86_64 server fastdebug, `hotspot_gc_shenandoah` > - [x] Linux AArch64 server fastdebug, `hotspot_gc_shenandoah` > - [x] Linux x86_64 server fastdebug, `all` + `-XX:+UseShenandoahGC` > - [x] Linux AArch64 server fastdebug, `all` + `-XX:+UseShenandoahGC` > - [x] Linux {PPC64, RISC-V, S390X} server fastdebug, cross-compilation This pull request has now been integrated. 
Changeset: f8b0ff26 Author: Aleksey Shipilev URL: https://git.openjdk.org/jdk/commit/f8b0ff26c9e6643e96f06c18c509ddaf50326205 Stats: 270 lines in 10 files changed: 48 ins; 61 del; 161 mod 8376472: Shenandoah: Assembler store barriers read destination memory despite the decorators Reviewed-by: mdoerr, wkemper ------------- PR: https://git.openjdk.org/jdk/pull/29444 From shade at openjdk.org Mon Feb 2 07:45:03 2026 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 2 Feb 2026 07:45:03 GMT Subject: RFR: 8376355: Update to use jtreg 8.2.1 In-Reply-To: References: Message-ID: On Tue, 27 Jan 2026 15:26:20 GMT, Christian Stein wrote: > Please review the change to update to using jtreg 8.2.1. > > The primary change is to the `jib-profiles.js` file, which specifies the version of jtreg to use, for those systems that rely on this file. In addition, the `requiredVersion` has been updated in the various `TEST.ROOT` files. Nice to see no actual test changes are required for compatibility. ------------- Marked as reviewed by shade (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/29452#pullrequestreview-3737777336 From tschatzl at openjdk.org Mon Feb 2 08:01:19 2026 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 2 Feb 2026 08:01:19 GMT Subject: RFR: 8376131: Convert ContiguousSpace to use Atomic In-Reply-To: References:

Message-ID: On Wed, 28 Jan 2026 04:59:09 GMT, David Holmes wrote: >> Hi all, >> >> please review this conversions of `ContiguousSpace` to use `Atomic`. >> >> Testing: gha, tier1-5 >> >> Thanks, >> Thomas > > Looks good. Thanks Thanks @dholmes-ora @kimbarrett for your reviews ------------- PR Comment: https://git.openjdk.org/jdk/pull/29370#issuecomment-3833527727 From tschatzl at openjdk.org Mon Feb 2 08:01:21 2026 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 2 Feb 2026 08:01:21 GMT Subject: Integrated: 8376131: Convert ContiguousSpace to use Atomic In-Reply-To: References: Message-ID: On Thu, 22 Jan 2026 17:51:08 GMT, Thomas Schatzl wrote: > Hi all, > > please review this conversions of `ContiguousSpace` to use `Atomic`. > > Testing: gha, tier1-5 > > Thanks, > Thomas This pull request has now been integrated. Changeset: f22bc1cd Author: Thomas Schatzl URL: https://git.openjdk.org/jdk/commit/f22bc1cd518bc7f09dc49b78e40d06210226d2b7 Stats: 21 lines in 4 files changed: 2 ins; 7 del; 12 mod 8376131: Convert ContiguousSpace to use Atomic Reviewed-by: dholmes, kbarrett ------------- PR: https://git.openjdk.org/jdk/pull/29370 From lkorinth at openjdk.org Mon Feb 2 08:05:21 2026 From: lkorinth at openjdk.org (Leo Korinth) Date: Mon, 2 Feb 2026 08:05:21 GMT Subject: RFR: 8367993: G1: Speed up ConcurrentMark initialization [v9] In-Reply-To: References:

Message-ID: On Thu, 29 Jan 2026 14:47:12 GMT, Leo Korinth wrote: >> This change moves almost all of the ConcurrentMark initialisation from its constructor to the method `G1ConcurrentMark::fully_initialize()`. Thus, creation time of the VM can be slightly improved by postponing creation of ConcurrentMark. Most time is saved postponing creation of statistics buffers and threads. >> >> It is not obvious that this is the best solution. I have earlier experimented with lazily allocating statistics buffers _only_. One could also initialise a little bit more eagerly (for example the concurrent mark thread) and maybe get a slightly cleaner change. However IMO it seems better to not have ConcurrentMark "half initiated" with a created mark thread, but un-initialised worker threads. >> >> This change is depending on the integration of https://bugs.openjdk.org/browse/JDK-8373253. >> >> I will be out for vacation, and will be back after new year (and will not answer questions during that time), but I thought I get the pull request out now so that you can have a look. > > Leo Korinth has updated the pull request incrementally with two additional commits since the last revision: > > - Reapply "remove commented out code" > > This reverts commit d0d1860058f0dae7813c3e5115e2784da8331f3b. > - Reapply "Stefan J 4" > > This reverts commit c5a7e2bb44ce111f8c8d1d7f728f1bf8013475e0. Thanks everyone! ------------- PR Comment: https://git.openjdk.org/jdk/pull/28723#issuecomment-3833554735 From lkorinth at openjdk.org Mon Feb 2 08:05:23 2026 From: lkorinth at openjdk.org (Leo Korinth) Date: Mon, 2 Feb 2026 08:05:23 GMT Subject: Integrated: 8367993: G1: Speed up ConcurrentMark initialization In-Reply-To: References: Message-ID: On Tue, 9 Dec 2025 14:56:49 GMT, Leo Korinth wrote: > This change moves almost all of the ConcurrentMark initialisation from its constructor to the method `G1ConcurrentMark::fully_initialize()`. 
Thus, creation time of the VM can be slightly improved by postponing creation of ConcurrentMark. Most time is saved postponing creation of statistics buffers and threads. > > It is not obvious that this is the best solution. I have earlier experimented with lazily allocating statistics buffers _only_. One could also initialise a little bit more eagerly (for example the concurrent mark thread) and maybe get a slightly cleaner change. However IMO it seems better to not have ConcurrentMark "half initiated" with a created mark thread, but un-initialised worker threads. > > This change is depending on the integration of https://bugs.openjdk.org/browse/JDK-8373253. > > I will be out for vacation, and will be back after new year (and will not answer questions during that time), but I thought I get the pull request out now so that you can have a look. This pull request has now been integrated. Changeset: 766e03b1 Author: Leo Korinth URL: https://git.openjdk.org/jdk/commit/766e03b151b2972108ddc207eed10428e9a91c30 Stats: 57 lines in 9 files changed: 30 ins; 6 del; 21 mod 8367993: G1: Speed up ConcurrentMark initialization Reviewed-by: sjohanss, tschatzl ------------- PR: https://git.openjdk.org/jdk/pull/28723 From erfang at openjdk.org Mon Feb 2 08:25:32 2026 From: erfang at openjdk.org (Eric Fang) Date: Mon, 2 Feb 2026 08:25:32 GMT Subject: RFR: 8374349: [VectorAPI]: AArch64: Prefer merging mode SVE CPY instruction [v2] In-Reply-To: <_0ouKSVAIyzg0g9hA2jZXNH-_cCqJjNCSh7kM2dn80w=.b93145c3-c465-423a-ab68-c8d7bd7e4280@github.com> References: <_0ouKSVAIyzg0g9hA2jZXNH-_cCqJjNCSh7kM2dn80w=.b93145c3-c465-423a-ab68-c8d7bd7e4280@github.com> Message-ID: <595lDgLFjcH0tzzdeacMVa_1fPt3PQhKIhibehSvpZk=.3f01b98a-ce6b-4c81-92da-235443e81f9b@github.com> > When optimizing some VectorMask related APIs , we found an optimization opportunity related to the `cpy (immediate, zeroing)` instruction [1]. 
Implementing the functionality of this instruction using `cpy (immediate, merging)` instruction [2] leads to better performance. > > Currently the `cpy (imm, zeroing)` instruction is used in code generated by `VectorStoreMaskNode` and `VectorReinterpretNode`. Doing this optimization benefits all vector APIs that generate these two IRs potentially, such as `VectorMask.intoArray()` and `VectorMask.toLong()`. > > Microbenchmarks show this change brings performance uplift ranging from **11%** to **33%**, depending on the specific operation and data types. > > The specific changes in this PR: > 1. Achieve the functionality of the `cpy (imm, zeroing)` instruction with the `movi + cpy (imm, merging)` instructions in assembler: > > cpy z17.d, p1/z, #1 => > > movi v17.2d, #0 // this instruction is zero cost > cpy z17.d, p1/m, #1 > > > 2. Add a new option `PreferSVEMergingModeCPY` to indicate whether to apply this optimization or not. > - This option belongs to the Arch product category. > - The default value is true on Neoverse-V1/V2 where the improvement has been confirmed, false on others. > - When its value is true, the change is applied. > > 3. Add a jtreg test to verify the behavior of this option. > > This PR was tested on aarch64 and x86 machines with different configurations, and all tests passed. 
>
> JMH benchmarks:
>
> On a Nvidia Grace (Neoverse-V2) machine with 128-bit SVE2:
>
> Benchmark           Unit    size    Before     Error     After      Error    Uplift
> byteIndexInRange    ops/ms  7.00    471816.15  1125.96   473237.77  1593.92  1.00
> byteIndexInRange    ops/ms  256.00  149654.21  416.57    149259.95  116.59   1.00
> byteIndexInRange    ops/ms  259.00  177850.31  991.13    179785.19  1110.07  1.01
> byteIndexInRange    ops/ms  512.00  133393.26  167.26    133484.61  281.83   1.00
> doubleIndexInRange  ops/ms  7.00    302176.39  12848.8   299813.02  37.76    0.99
> doubleIndexInRange  ops/ms  256.00  47831.93   56.70     46708.70   56.11    0.98
> doubleIndexInRange  ops/ms  259.00  11550.02   27.95     15333.50   10.40    1.33
> doubleIndexInRange  ops/ms  512.00  23687.76   61.65     23996.08   69.52    1.01
> floatIndexInRange   ops/ms  7.00    412195.79  124.71    411770.23  78.73    1.00
> floatIndexInRange   ops/ms  256.00  84479.98   70.69     84237.31   70.15    1.00
> floatIndexInRange   ops/ms  259.00  22585.65   80.07     28296.21   7.98     1.25
> floatIndexInRange   ops/ms  512.00  46902.99   51.60     46686.68   66.01    1.00
> intIndexInRange     ops/ms  7.00    413411.70  50.59     420684.66  253.55   1.02
> intIndexInRange     ops/...
Eric Fang has updated the pull request incrementally with one additional commit since the last revision: Move the implementation into C2_MacroAssembler ------------- Changes: - all: https://git.openjdk.org/jdk/pull/29359/files - new: https://git.openjdk.org/jdk/pull/29359/files/4f5a7bd7..884a11f2 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=29359&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=29359&range=00-01 Stats: 240 lines in 10 files changed: 37 ins; 171 del; 32 mod Patch: https://git.openjdk.org/jdk/pull/29359.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/29359/head:pull/29359 PR: https://git.openjdk.org/jdk/pull/29359 From erfang at openjdk.org Mon Feb 2 08:25:33 2026 From: erfang at openjdk.org (Eric Fang) Date: Mon, 2 Feb 2026 08:25:33 GMT Subject: RFR: 8374349: [VectorAPI]: AArch64: Prefer merging mode SVE CPY instruction In-Reply-To: References: <_0ouKSVAIyzg0g9hA2jZXNH-_cCqJjNCSh7kM2dn80w=.b93145c3-c465-423a-ab68-c8d7bd7e4280@github.com>

<_qJ_Qo_Mqexx7dYu0Vkc9ru4SxZ0izfqifaUIAL1iyQ=.741b11d4-a89d-495e-8d31-78fed690abf6@github.com>

Message-ID: On Wed, 28 Jan 2026 10:17:30 GMT, Andrew Haley wrote: >> @fg1417 thanks for your help, this is really helpful! >> >> You've also noticed slight regression in a few cases, which is reasonable. The optimization effect is influenced by multiple factors, such as the alignment you mentioned on N2, as well as code generation and register allocation. The underlying principle of this optimization is that the latency of the `cpy(imm, zeroing)` instruction seems quite high, while the `movi + cpy(imm, merging)` combination improves the parallelism of the program. In some cases, a `mov` or other instruction with the same effect is already generated before the `cpy(imm, zeroing)` instruction, thus achieving the optimization effect of the `movi + cpy(imm, merging)` instruction combination. Therefore, the slight regression caused by the extra `movi` instruction in these cases is reasonable. However, for cases where this optimization applies, the performance improvement will be more significant. For example, in the following case, I even saw a **2x** performance improvement on Neoverse-V2. 
>>
>>     @Param({"128"})
>>     private int loop_iteration;
>>     private static final VectorSpecies ispecies = VectorSpecies.ofLargestShape(int.class);
>>     private boolean[] mask_arr;
>>
>>     @Setup(Level.Trial)
>>     public void BmSetup() {
>>         int array_size = loop_iteration * bspecies.length();
>>         mask_arr = new boolean[array_size];
>>         Random r = new Random();
>>         for (int i = 0; i < array_size; i++) {
>>             mask_arr[i] = r.nextBoolean();
>>         }
>>     }
>>
>>     @CompilerControl(CompilerControl.Mode.INLINE)
>>     private long testIndexInRangeToLongKernel(VectorSpecies species) {
>>         long sum = 0;
>>         VectorMask m = VectorMask.fromArray(species, mask_arr, 0);
>>         for (int i = 0; i < loop_iteration; i++) {
>>             sum += m.indexInRange(i & (m.length() - 1), m.length()).toLong();
>>         }
>>         return sum;
>>     }
>>
>>     @Benchmark
>>     public long indexInRangeToLongInt() {
>>         return testIndexInRangeToLongKernel(ispecies);
>>     }
>>
>>
>> Therefore, when you test this change using the C case, you will see a significant performance improvement.
>>> I see 2% uplift on these numbers.
>>
>> @theRealAph And I think this also explains your question on these numbers.
>>
>>> One thing you can do is add a flag to control this minor optimization, but make it constexpr bool = true until we know what other SVE implementations might do.
>> In general:
>> Dea...
>
>> Therefore, when you test this change using the C case, you will see a significant performance improvement.
>>
>> > I see 2% uplift on these numbers.
>>
>> @theRealAph And I think this also explains your question on these numbers.
>
> Not at all.
>
> The performance claim above was:
>
>> Microbenchmarks show this change brings performance uplift ranging from 11% to 33%, depending on the specific operation and data types.
>
> But the real performance uplift, as measured in Java microbenchmarks, is 2%.
Hi @theRealAph, I have moved the implementation into C2_MacroAssembler and added a constexpr flag to guard this optimization. Would you mind taking another look? Thanks~

-------------

PR Comment: https://git.openjdk.org/jdk/pull/29359#issuecomment-3833659702

From iwalulya at openjdk.org  Mon Feb  2 08:43:30 2026
From: iwalulya at openjdk.org (Ivan Walulya)
Date: Mon, 2 Feb 2026 08:43:30 GMT
Subject: RFR: 8375438: G1: Convert G1HeapRegion related classes to use Atomic [v2]
In-Reply-To:
References:

Message-ID: On Sun, 1 Feb 2026 17:12:49 GMT, Paul Sandoz wrote: >> Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision: >> >> Review comments resolution > > src/jdk.incubator.vector/share/classes/jdk/incubator/vector/ByteVector.java line 580: > >> 578: public static ByteVector zero(VectorSpecies species) { >> 579: ByteSpecies vsp = (ByteSpecies) species; >> 580: return VectorSupport.fromBitsCoerced(vsp.vectorType(), vsp.laneTypeOrdinal(), species.length(), > > You can now use `LANE_TYPE_ORDINAL` rather than `vsp.laneTypeOrdinal()`, which better fits the prior pattern. Done ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/29481#discussion_r2753281411 From aph at openjdk.org Mon Feb 2 09:08:48 2026 From: aph at openjdk.org (Andrew Haley) Date: Mon, 2 Feb 2026 09:08:48 GMT Subject: RFR: 8374349: [VectorAPI]: AArch64: Prefer merging mode SVE CPY instruction [v2] In-Reply-To: <595lDgLFjcH0tzzdeacMVa_1fPt3PQhKIhibehSvpZk=.3f01b98a-ce6b-4c81-92da-235443e81f9b@github.com> References: <_0ouKSVAIyzg0g9hA2jZXNH-_cCqJjNCSh7kM2dn80w=.b93145c3-c465-423a-ab68-c8d7bd7e4280@github.com> <595lDgLFjcH0tzzdeacMVa_1fPt3PQhKIhibehSvpZk=.3f01b98a-ce6b-4c81-92da-235443e81f9b@github.com> Message-ID: <4zd3EIKU033iAFrjn7h3BQMbG-4R0DlhELQ_yEAXaZ0=.cfd0af29-7b06-45f9-8547-cf94b91aaab2@github.com> On Mon, 2 Feb 2026 08:25:32 GMT, Eric Fang wrote: >> When optimizing some VectorMask related APIs , we found an optimization opportunity related to the `cpy (immediate, zeroing)` instruction [1]. Implementing the functionality of this instruction using `cpy (immediate, merging)` instruction [2] leads to better performance. >> >> Currently the `cpy (imm, zeroing)` instruction is used in code generated by `VectorStoreMaskNode` and `VectorReinterpretNode`. Doing this optimization benefits all vector APIs that generate these two IRs potentially, such as `VectorMask.intoArray()` and `VectorMask.toLong()`. 
>> >> Microbenchmarks show this change brings performance uplift ranging from **11%** to **33%**, depending on the specific operation and data types. >> >> The specific changes in this PR: >> 1. Achieve the functionality of the `cpy (imm, zeroing)` instruction with the `movi + cpy (imm, merging)` instructions in assembler: >> >> cpy z17.d, p1/z, #1 => >> >> movi v17.2d, #0 // this instruction is zero cost >> cpy z17.d, p1/m, #1 >> >> >> 2. Add a new option `PreferSVEMergingModeCPY` to indicate whether to apply this optimization or not. >> - This option belongs to the Arch product category. >> - The default value is true on Neoverse-V1/V2 where the improvement has been confirmed, false on others. >> - When its value is true, the change is applied. >> >> 3. Add a jtreg test to verify the behavior of this option. >> >> This PR was tested on aarch64 and x86 machines with different configurations, and all tests passed. >> >> JMH benchmarks: >> >> On a Nvidia Grace (Neoverse-V2) machine with 128-bit SVE2: >> >> Benchmark Unit size Before Error After Error Uplift >> byteIndexInRange ops/ms 7.00 471816.15 1125.96 473237.77 1593.92 1.00 >> byteIndexInRange ops/ms 256.00 149654.21 416.57 149259.95 116.59 1.00 >> byteIndexInRange ops/ms 259.00 177850.31 991.13 179785.19 1110.07 1.01 >> byteIndexInRange ops/ms 512.00 133393.26 167.26 133484.61 281.83 1.00 >> doubleIndexInRange ops/ms 7.00 302176.39 12848.8 299813.02 37.76 0.99 >> doubleIndexInRange ops/ms 256.00 47831.93 56.70 46708.70 56.11 0.98 >> doubleIndexInRange ops/ms 259.00 11550.02 27.95 15333.50 10.40 1.33 >> doubleIndexInRange ops/ms 512.00 23687.76 61.65 23996.08 69.52 1.01 >> floatIndexInRange ops/ms 7.00 412195.79 124.71 411770.23 78.73 1.00 >> floatIndexInRange ops/ms 256.00 84479.98 70.69 84237.31 70.15 1.00 >> floatIndexInRange ops/ms 259.00 22585.65 80.07 28296.21 7.98 1.25 >> floatIndexInRange ops/ms 512.00 46902.99 51.60 46686.68 66.01 1.00 >> intInd... 
> > Eric Fang has updated the pull request incrementally with one additional commit since the last revision: > > Move the implementation into C2_MacroAssembler src/hotspot/cpu/aarch64/c2_MacroAssembler_aarch64.cpp line 2846: > 2844: void C2_MacroAssembler::sve_cpy_optimized(FloatRegister dst, SIMD_RegVariant T, > 2845: PRegister pg, int imm8, bool isMerge) { > 2846: // When prefer_sve_merging_mode_cpy is enabled, optimize the SVE `cpy This comment says nothing that is not obvious from the code. src/hotspot/cpu/aarch64/c2_MacroAssembler_aarch64.cpp line 2848: > 2846: // When prefer_sve_merging_mode_cpy is enabled, optimize the SVE `cpy > 2847: // (immediate, zeroing)` instruction as `movi + cpy (immediate, merging)` > 2848: // instructions for better performance. Most of this comment is obvious from reading the code. src/hotspot/cpu/aarch64/c2_MacroAssembler_aarch64.cpp line 2855: > 2853: // Z above 128, so this `movi` instruction effectively zeroes the > 2854: // entire Z register. According to the Arm Software Optimization > 2855: // Guide, `movi` is zero cost. I don't think it says that exactly. movi is handled early during renaming, but still occupies a decode slot. 
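To make the equivalence under discussion concrete, here is a minimal sketch in plain C++ of why `movi` followed by `cpy (imm, merging)` gives the same lanes as `cpy (imm, zeroing)`. It is only a model, not HotSpot code: the names `Vec`, `Pred`, `cpy_zeroing`, `cpy_merging`, `movi_then_cpy_merging` and `check_equivalent` are hypothetical, a register is assumed to be four 64-bit lanes, and the model says nothing about decode slots or latency, which is the point being debated above.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical model: a vector register as four 64-bit lanes,
// and a governing predicate as one boolean per lane.
constexpr std::size_t kLanes = 4;
using Vec  = std::array<std::int64_t, kLanes>;
using Pred = std::array<bool, kLanes>;

// Models `cpy zd.d, pg/z, #imm`: active lanes get imm, inactive lanes become zero.
Vec cpy_zeroing(const Pred& pg, std::int64_t imm) {
  Vec dst{};
  for (std::size_t i = 0; i < kLanes; ++i) {
    dst[i] = pg[i] ? imm : 0;
  }
  return dst;
}

// Models `cpy zd.d, pg/m, #imm`: active lanes get imm, inactive lanes keep their old value.
Vec cpy_merging(Vec dst, const Pred& pg, std::int64_t imm) {
  for (std::size_t i = 0; i < kLanes; ++i) {
    if (pg[i]) {
      dst[i] = imm;
    }
  }
  return dst;
}

// Models the two-instruction sequence: zero the whole register, then merge.
Vec movi_then_cpy_merging(const Pred& pg, std::int64_t imm) {
  Vec zeroed{};  // stands in for `movi v.2d, #0`
  return cpy_merging(zeroed, pg, imm);
}

// Checks that both forms agree for one predicate/immediate pair.
void check_equivalent(const Pred& pg, std::int64_t imm) {
  assert(cpy_zeroing(pg, imm) == movi_then_cpy_merging(pg, imm));
}
```

The model only works because the zeroing step clears every lane first, which matches the quoted comment's claim that the `movi` effectively zeroes the entire Z register.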
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/29359#discussion_r2753291396 PR Review Comment: https://git.openjdk.org/jdk/pull/29359#discussion_r2753296599 PR Review Comment: https://git.openjdk.org/jdk/pull/29359#discussion_r2753295400 From dfenacci at openjdk.org Mon Feb 2 09:31:40 2026 From: dfenacci at openjdk.org (Damon Fenacci) Date: Mon, 2 Feb 2026 09:31:40 GMT Subject: RFR: 8374582: [REDO] Move input validation checks to Java for java.lang.StringCoding intrinsics [v9] In-Reply-To: <3ci9RXEra2BlQPhYl-M0Wnu3hRpWaDvxPnMRzFnJA_k=.67795fb3-95d1-449b-a7a9-44b3776aa626@github.com> References: <3ci9RXEra2BlQPhYl-M0Wnu3hRpWaDvxPnMRzFnJA_k=.67795fb3-95d1-449b-a7a9-44b3776aa626@github.com> Message-ID: > ## Issue > > This is a redo of [JDK-8361842](https://bugs.openjdk.org/browse/JDK-8361842) which was backed out by [JDK-8374210](https://bugs.openjdk.org/browse/JDK-8374210) due to C2-related regressions. The original change moved input validation checks for java.lang.StringCoding from the intrinsic to Java code (leaving the intrinsic check only with the `VerifyIntrinsicChecks` flag). Refer to the [original PR](https://github.com/openjdk/jdk/pull/25998) for details. > > This additional issue happens because, in some cases, for instance when the Java checking code is not inlined and we give an out-of-range constant as input, we fold the data path but not the control path and we crash in the backend. > > ## Causes > > The cause of this is that the out-of-range constant (e.g. -1) floats into the intrinsic and there (assuming the input is valid) we add a constraint to its type to positive integers (e.g. to compute the array address) which makes it top. 
> > ## Fix > > A possible fix is to introduce an opaque node (OpaqueGuardNode) similar to what we do in `must_be_not_null` for values that we know cannot be null: > https://github.com/openjdk/jdk/blob/ce721665cd61d9a319c667d50d9917c359d6c104/src/hotspot/share/opto/graphKit.cpp#L1484 > This will temporarily add the range check to ensure that C2 figures that out-of-range values cannot reach the intrinsic. Then, during macro expansion, we replace the opaque node with the corresponding constant (true/false) in product builds such that the actually unneeded guards are folded and do not end up in the emitted code. > > # Testing > > * Tier 1-3+ > * 2 JTReg tests added > * `TestRangeCheck.java` as regression test for the reported issue > * `TestOpaqueGuardNodes.java` to check that opaque guard nodes are added when parsing and removed at macro expansion Damon Fenacci has updated the pull request incrementally with two additional commits since the last revision: - Merge branch 'JDK-8374582' of https://github.com/dafedafe/jdk into JDK-8374582 - JDK-8374582: add assert in opaque constructor ------------- Changes: - all: https://git.openjdk.org/jdk/pull/29164/files - new: https://git.openjdk.org/jdk/pull/29164/files/c5390e4a..5e7df6f4 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=29164&range=08 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=29164&range=07-08 Stats: 1 line in 1 file changed: 1 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/29164.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/29164/head:pull/29164 PR: https://git.openjdk.org/jdk/pull/29164 From mhaessig at openjdk.org Mon Feb 2 09:36:25 2026 From: mhaessig at openjdk.org (Manuel =?UTF-8?B?SMOkc3NpZw==?=) Date: Mon, 2 Feb 2026 09:36:25 GMT Subject: RFR: 8370519: C2: Hit MemLimit when running with +VerifyLoopOptimizations [v8] In-Reply-To: References:

Message-ID: <_CO2G_HBJteRozKtjofE4Esyfk0qgZYiqO1uhQxH6Sc=.029ae153-446a-4cf9-a561-a94b5eaca6ed@github.com> On Fri, 30 Jan 2026 16:10:25 GMT, Benoît Maillard wrote: >>> I was able to come up with this test, which is a bit more than 2 times faster than the original one on my machine. Its `memlimit` is set to `600M`, which is enough to make the old version fail. With the new one, the test passes even with a `memlimit` of `200M`, so this should be a good enough margin. >> >> Great. The new test looks good to me. I replaced the existing test with that one. Thanks for taking the time to do that. >> >>> While looking into this I have also found out that some programs have an unexpectedly high usage of `output` (as was the case in the test case that I initially suggested). I am trying to get a good reproducer and will most likely file a follow-up. >> >> Can you post links to the bugs? Thanks. > >> Can you post links to the bugs? Thanks. > > I haven't filed it yet. I observed something suspicious once, but at the moment I am not able to reproduce it anymore. I will take another look, and I will post here or tag you in the issue if there is any update @rwestrel. @benoitmaillard @mhaessig thanks for the reviews. @eme64 would you mind approving it again?
------------- PR Comment: https://git.openjdk.org/jdk/pull/28581#issuecomment-3833993449 From dfenacci at openjdk.org Mon Feb 2 09:55:37 2026 From: dfenacci at openjdk.org (Damon Fenacci) Date: Mon, 2 Feb 2026 09:55:37 GMT Subject: RFR: 8374582: [REDO] Move input validation checks to Java for java.lang.StringCoding intrinsics [v10] In-Reply-To: <3ci9RXEra2BlQPhYl-M0Wnu3hRpWaDvxPnMRzFnJA_k=.67795fb3-95d1-449b-a7a9-44b3776aa626@github.com> References: <3ci9RXEra2BlQPhYl-M0Wnu3hRpWaDvxPnMRzFnJA_k=.67795fb3-95d1-449b-a7a9-44b3776aa626@github.com> Message-ID: > ## Issue > > This is a redo of [JDK-8361842](https://bugs.openjdk.org/browse/JDK-8361842) which was backed out by [JDK-8374210](https://bugs.openjdk.org/browse/JDK-8374210) due to C2-related regressions. The original change moved input validation checks for java.lang.StringCoding from the intrinsic to Java code (leaving the intrinsic check only with the `VerifyIntrinsicChecks` flag). Refer to the [original PR](https://github.com/openjdk/jdk/pull/25998) for details. > > This additional issue happens because, in some cases, for instance when the Java checking code is not inlined and we give an out-of-range constant as input, we fold the data path but not the control path and we crash in the backend. > > ## Causes > > The cause of this is that the out-of-range constant (e.g. -1) floats into the intrinsic and there (assuming the input is valid) we add a constraint to its type to positive integers (e.g. to compute the array address) which makes it top. > > ## Fix > > A possible fix is to introduce an opaque node (OpaqueGuardNode) similar to what we do in `must_be_not_null` for values that we know cannot be null: > https://github.com/openjdk/jdk/blob/ce721665cd61d9a319c667d50d9917c359d6c104/src/hotspot/share/opto/graphKit.cpp#L1484 > This will temporarily add the range check to ensure that C2 figures that out-of-range values cannot reach the intrinsic. 
Then, during macro expansion, we replace the opaque node with the corresponding constant (true/false) in product builds such that the actually unneeded guards are folded and do not end up in the emitted code. > > # Testing > > * Tier 1-3+ > * 2 JTReg tests added > * `TestRangeCheck.java` as regression test for the reported issue > * `TestOpaqueGuardNodes.java` to check that opaque guard nodes are added when parsing and removed at macro expansion Damon Fenacci has updated the pull request incrementally with one additional commit since the last revision: JDK-8374582: revert wrong copyright change ------------- Changes: - all: https://git.openjdk.org/jdk/pull/29164/files - new: https://git.openjdk.org/jdk/pull/29164/files/5e7df6f4..5ac3e6e3 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=29164&range=09 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=29164&range=08-09 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/29164.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/29164/head:pull/29164 PR: https://git.openjdk.org/jdk/pull/29164 From erfang at openjdk.org Mon Feb 2 09:59:19 2026 From: erfang at openjdk.org (Eric Fang) Date: Mon, 2 Feb 2026 09:59:19 GMT Subject: RFR: 8374349: [VectorAPI]: AArch64: Prefer merging mode SVE CPY instruction [v2] In-Reply-To: <4zd3EIKU033iAFrjn7h3BQMbG-4R0DlhELQ_yEAXaZ0=.cfd0af29-7b06-45f9-8547-cf94b91aaab2@github.com> References: <_0ouKSVAIyzg0g9hA2jZXNH-_cCqJjNCSh7kM2dn80w=.b93145c3-c465-423a-ab68-c8d7bd7e4280@github.com> <595lDgLFjcH0tzzdeacMVa_1fPt3PQhKIhibehSvpZk=.3f01b98a-ce6b-4c81-92da-235443e81f9b@github.com> <4zd3EIKU033iAFrjn7h3BQMbG-4R0DlhELQ_yEAXaZ0=.cfd0af29-7b06-45f9-8547-cf94b91aaab2@github.com> Message-ID: <9ODLGgIWL6x0UlzS81yDsxVxWWESoCtZh77EcIAjH0U=.f8832988-2525-49dc-9a19-1c1d6f7a1d81@github.com> On Mon, 2 Feb 2026 09:04:21 GMT, Andrew Haley wrote: >> Eric Fang has updated the pull request incrementally with one additional commit since the last revision: 
>> >> Move the implementation into C2_MacroAssembler > > src/hotspot/cpu/aarch64/c2_MacroAssembler_aarch64.cpp line 2846: > >> 2844: void C2_MacroAssembler::sve_cpy_optimized(FloatRegister dst, SIMD_RegVariant T, >> 2845: PRegister pg, int imm8, bool isMerge) { >> 2846: // When prefer_sve_merging_mode_cpy is enabled, optimize the SVE `cpy > > This comment says nothing that is not obvious from the code. I'd like to briefly document the main idea of this method. How about adding a brief comment before the method like `Provide an optimized implementation for cpy (imm, zeroing) instruction`, or do you think it would be better to remove the comment? > src/hotspot/cpu/aarch64/c2_MacroAssembler_aarch64.cpp line 2855: > >> 2853: // Z above 128, so this `movi` instruction effectively zeroes the >> 2854: // entire Z register. According to the Arm Software Optimization >> 2855: // Guide, `movi` is zero cost. > > I don't think it says that exactly. movi is handled early during renaming, but still occupies a decode slot. Yeah you are right, and the movi uop gets eliminated shortly downstream of the decoder. I should say `zero latency`. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/29359#discussion_r2753482758 PR Review Comment: https://git.openjdk.org/jdk/pull/29359#discussion_r2753500143 From epeter at openjdk.org Mon Feb 2 10:04:44 2026 From: epeter at openjdk.org (Emanuel Peter) Date: Mon, 2 Feb 2026 10:04:44 GMT Subject: RFR: 8370519: C2: Hit MemLimit when running with +VerifyLoopOptimizations [v8] In-Reply-To: References:

Message-ID: On Fri, 30 Jan 2026 15:04:25 GMT, Roland Westrelin wrote: >> For this failure memory stats are: >> >> >> Total Usage: 1095525816 >> --- Arena Usage by Arena Type and compilation phase, at arena usage peak of 1095525816 --- >> Phase Total ra node comp type states reglive regsplit regmask superword cienv ha other >> none 5976032 331560 5402064 197512 33712 10200 0 0 984 0 0 0 0 >> parse 2716464 65456 1145480 196408 1112752 0 0 0 0 0 196368 0 0 >> optimizer 98184 0 32728 0 65456 0 0 0 0 0 0 0 0 >> connectionGraph 32728 0 0 32728 0 0 0 0 0 0 0 0 0 >> iterGVN 32728 0 32728 0 0 0 0 0 0 0 0 0 0 >> idealLoop 918189632 0 38687056 872824784 392776 0 0 0 0 0 6285016 0 0 >> idealLoopVerify 2228144 0 0 2228144 0 0 0 0 0 0 0 0 0 >> macroExpand 32728 0 32728 0 0 0 0 0 0 0 0 0 0 >> graphReshape 32728 0 32728 0 0 0 0 0 0 0 0 0 0 >> matcher 20135944 3369848 9033208 7536400 65456 131032 0 0 0 0 0 0 0 >> postselect_cleanup 294872 294872 0 0 0 0 0 0 0 0 0 0 0 >> scheduler 752944 196488 556456 0 0 0 0 0 0 0 0 0 0 >> regalloc 388736 388736 0 0 0 0 0 0 0 0 0 0 0 >> ... > > Roland Westrelin has updated the pull request incrementally with three additional commits since the last revision: > > - Update src/hotspot/share/memory/arena.hpp > > Co-authored-by: Manuel Hässig > - Update src/hotspot/share/opto/loopnode.cpp > > Co-authored-by: Manuel Hässig > - Update src/hotspot/share/opto/loopnode.hpp > > Co-authored-by: Manuel Hässig Looks good, thanks for the updates @rwestrel ! ------------- Marked as reviewed by epeter (Reviewer).
PR Review: https://git.openjdk.org/jdk/pull/28581#pullrequestreview-3738415604 From shade at openjdk.org Mon Feb 2 10:36:29 2026 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 2 Feb 2026 10:36:29 GMT Subject: RFR: 8376570: GrowableArray::remove_{till,range} should work on empty list In-Reply-To: References: Message-ID: On Wed, 28 Jan 2026 09:48:58 GMT, Aleksey Shipilev wrote: > Split from [JDK-8375046](https://bugs.openjdk.org/browse/JDK-8375046), we want to make sure GrowableArray removal methods work appropriately with empty lists. > > Testing: > - [x] New test > - [x] Linux x86_64 server fastdebug, `all` (in course of [JDK-8375046](https://bugs.openjdk.org/browse/JDK-8375046) testing) Thank you! Let's go. ------------- PR Comment: https://git.openjdk.org/jdk/pull/29462#issuecomment-3834285269 From shade at openjdk.org Mon Feb 2 10:36:30 2026 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 2 Feb 2026 10:36:30 GMT Subject: Integrated: 8376570: GrowableArray::remove_{till, range} should work on empty list In-Reply-To: References: Message-ID: On Wed, 28 Jan 2026 09:48:58 GMT, Aleksey Shipilev wrote: > Split from [JDK-8375046](https://bugs.openjdk.org/browse/JDK-8375046), we want to make sure GrowableArray removal methods work appropriately with empty lists. > > Testing: > - [x] New test > - [x] Linux x86_64 server fastdebug, `all` (in course of [JDK-8375046](https://bugs.openjdk.org/browse/JDK-8375046) testing) This pull request has now been integrated. 
Changeset: e370b8a1 Author: Aleksey Shipilev URL: https://git.openjdk.org/jdk/commit/e370b8a1d834a0a6ebcd1d5946a5533c015ed960 Stats: 122 lines in 2 files changed: 115 ins; 0 del; 7 mod 8376570: GrowableArray::remove_{till,range} should work on empty list Reviewed-by: kbarrett, iwalulya ------------- PR: https://git.openjdk.org/jdk/pull/29462 From shade at openjdk.org Mon Feb 2 11:07:46 2026 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 2 Feb 2026 11:07:46 GMT Subject: RFR: 8375438: G1: Convert G1HeapRegion related classes to use Atomic [v2] In-Reply-To: References:

Message-ID: On Tue, 20 Jan 2026 11:32:13 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review conversion of G1HeapRegion related classes to use Atomic. >> >> Testing: tier1, tier4, tier5 >> >> (The PipelineLeaksFD failure in gha is a known issue) >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: > > * shade review Marked as reviewed by shade (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/29301#pullrequestreview-3738736964 From aph at openjdk.org Mon Feb 2 11:18:38 2026 From: aph at openjdk.org (Andrew Haley) Date: Mon, 2 Feb 2026 11:18:38 GMT Subject: RFR: 8374349: [VectorAPI]: AArch64: Prefer merging mode SVE CPY instruction [v2] In-Reply-To: <9ODLGgIWL6x0UlzS81yDsxVxWWESoCtZh77EcIAjH0U=.f8832988-2525-49dc-9a19-1c1d6f7a1d81@github.com> References: <_0ouKSVAIyzg0g9hA2jZXNH-_cCqJjNCSh7kM2dn80w=.b93145c3-c465-423a-ab68-c8d7bd7e4280@github.com> <595lDgLFjcH0tzzdeacMVa_1fPt3PQhKIhibehSvpZk=.3f01b98a-ce6b-4c81-92da-235443e81f9b@github.com> <4zd3EIKU033iAFrjn7h3BQMbG-4R0DlhELQ_yEAXaZ0=.cfd0af29-7b06-45f9-8547-cf94b91aaab2@github.com> <9ODLGgIWL6x0UlzS81yDsxVxWWESoCtZh77EcIAjH0U=.f8832988-2525-49dc-9a19-1c1d6f7a1d81@github.com> Message-ID: <7g2UNaXs2NbRXX7r7YTHZFdq1X-q0Ix8wjSnHDoZnoQ=.59ae7bec-4a3e-4264-a997-e0ecb9fe0f06@github.com> On Mon, 2 Feb 2026 09:52:31 GMT, Eric Fang wrote: >> src/hotspot/cpu/aarch64/c2_MacroAssembler_aarch64.cpp line 2846: >> >>> 2844: void C2_MacroAssembler::sve_cpy_optimized(FloatRegister dst, SIMD_RegVariant T, >>> 2845: PRegister pg, int imm8, bool isMerge) { >>> 2846: // When prefer_sve_merging_mode_cpy is enabled, optimize the SVE `cpy >> >> This comment says nothing that is not obvious from the code. > > I'd like to briefly document the main idea of this method.
How about adding a brief comment before the method like `Provide an optimized implementation for cpy (imm, zeroing) instruction`, or do you think it would be better to remove the comment? If a comment says nothing that is not obvious from reading the code, the comment should be removed. It makes sense to explain why this is better, maybe with reference to documentation elsewhere. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/29359#discussion_r2753831979 From azafari at openjdk.org Mon Feb 2 11:29:03 2026 From: azafari at openjdk.org (Afshin Zafari) Date: Mon, 2 Feb 2026 11:29:03 GMT Subject: RFR: 8376855: ASAN reports out-of-range read in strncmp in MethodHandles::is_basic_type_signature In-Reply-To: References: Message-ID: On Mon, 2 Feb 2026 01:13:35 GMT, David Holmes wrote: > An ASAN enabled build reported heap-buffer-overflow in `MethodHandles::is_basic_type_signature` with `ASAN_OPTIONS=strict_string_checks=true` when running test `jdk/jdk/jfr/api/metadata/annotations/TestThrottle.java` > > The code is here: > > bool MethodHandles::is_basic_type_signature(Symbol* sig) { > assert(vmSymbols::object_signature()->utf8_length() == (int)OBJ_SIG_LEN, ""); > assert(vmSymbols::object_signature()->equals(OBJ_SIG), ""); > for (SignatureStream ss(sig, sig->starts_with(JVM_SIGNATURE_FUNC)); !ss.is_done(); ss.next()) { > switch (ss.type()) { > case T_OBJECT: > // only java/lang/Object is valid here > if (strncmp((char*) ss.raw_bytes(), OBJ_SIG, OBJ_SIG_LEN) != 0) > > The ASAN `strncmp` interceptor acts as follows: > > INTERCEPTOR(int, strncmp, const char *s1, const char *s2, size_t n) { > void *ctx; > ASAN_INTERCEPTOR_ENTER(linker, strncmp); // Sets up context > ASAN_READ_RANGE(s1, n); // Validates s1 > ASAN_READ_RANGE(s2, n); // Validates s2 > return REAL(strncmp)(s1, s2, n); // Calls original function > } > > With the test given `s1` is a buffer of size 15, containing a non-nul-terminated string, and `n` is 18, so `ASAN_READ_RANGE` fails for `s1` as we 
could potentially read beyond the end of the buffer. In practice however, given `s1` is guaranteed to be a valid type-string from a signature symbol of type `T_OBJECT`, its final character is `;` and the final character of `s2` is also `;` (it is the string constant `Ljava/lang/Object;`). Hence the comparison must terminate before we can run off the end of `s1`. > > To appease ASAN we can make a simple change to the `strncmp` call to compare at most `ss.raw_length()` bytes. > > Testing > - ASAN no longer reports an error > - tiers 1-3 sanity > > Thanks Thank you David for taking and fixing this. ------------- Marked as reviewed by azafari (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/29516#pullrequestreview-3738832412 From thartmann at openjdk.org Mon Feb 2 11:40:03 2026 From: thartmann at openjdk.org (Tobias Hartmann) Date: Mon, 2 Feb 2026 11:40:03 GMT Subject: RFR: 8374582: [REDO] Move input validation checks to Java for java.lang.StringCoding intrinsics [v10] In-Reply-To: References: <3ci9RXEra2BlQPhYl-M0Wnu3hRpWaDvxPnMRzFnJA_k=.67795fb3-95d1-449b-a7a9-44b3776aa626@github.com> Message-ID: On Mon, 2 Feb 2026 09:55:37 GMT, Damon Fenacci wrote: >> ## Issue >> >> This is a redo of [JDK-8361842](https://bugs.openjdk.org/browse/JDK-8361842) which was backed out by [JDK-8374210](https://bugs.openjdk.org/browse/JDK-8374210) due to C2-related regressions. The original change moved input validation checks for java.lang.StringCoding from the intrinsic to Java code (leaving the intrinsic check only with the `VerifyIntrinsicChecks` flag). Refer to the [original PR](https://github.com/openjdk/jdk/pull/25998) for details. >> >> This additional issue happens because, in some cases, for instance when the Java checking code is not inlined and we give an out-of-range constant as input, we fold the data path but not the control path and we crash in the backend. >> >> ## Causes >> >> The cause of this is that the out-of-range constant (e.g. 
-1) floats into the intrinsic and there (assuming the input is valid) we add a constraint to its type to positive integers (e.g. to compute the array address) which makes it top. >> >> ## Fix >> >> A possible fix is to introduce an opaque node (OpaqueGuardNode) similar to what we do in `must_be_not_null` for values that we know cannot be null: >> https://github.com/openjdk/jdk/blob/ce721665cd61d9a319c667d50d9917c359d6c104/src/hotspot/share/opto/graphKit.cpp#L1484 >> This will temporarily add the range check to ensure that C2 figures that out-of-range values cannot reach the intrinsic. Then, during macro expansion, we replace the opaque node with the corresponding constant (true/false) in product builds such that the actually unneeded guards are folded and do not end up in the emitted code. >> >> # Testing >> >> * Tier 1-3+ >> * 2 JTReg tests added >> * `TestRangeCheck.java` as regression test for the reported issue >> * `TestOpaqueGuardNodes.java` to check that opaque guard nodes are added when parsing and removed at macro expansion > > Damon Fenacci has updated the pull request incrementally with one additional commit since the last revision: > > JDK-8374582: revert wrong copyright change Thanks for working on this Damon. I added a few comments, otherwise it looks good! src/hotspot/share/opto/library_call.cpp line 894: > 892: > 893: inline Node* LibraryCallKit::generate_negative_guard(Node* index, RegionNode* region, > 894: Node** pos_index, bool is_opaque) { As we discussed offline, I think `with_opaque` is better here. src/hotspot/share/opto/opaquenode.hpp line 145: > 143: // with false in product builds such that the actually unneeded guards are folded and do not end up in the emitted code. > 144: // In debug builds, we keep the actual checks as additional verification code (i.e. removing OpaqueConstantBoolNodes and > 145: // use the BoolNode inputs instead). Nice comment! 
src/hotspot/share/opto/opaquenode.hpp line 148: > 146: class OpaqueConstantBoolNode : public Node { > 147: private: > 148: bool _constant; Should this be `const`? src/hotspot/share/opto/opaquenode.hpp line 150: > 148: bool _constant; > 149: public: > 150: OpaqueConstantBoolNode(Compile* C, Node* tst, bool constant) : Node(nullptr, tst), _constant(constant) { An alternative would be to have the `constant` be an actual input node instead of a field. In macro expansion, you could then do `_igvn.replace_node(n, n->in(2));` instead (maybe define an enum for the input indices). I don't have a strong opinion on this though and leave it up to you to decide. ------------- Marked as reviewed by thartmann (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/29164#pullrequestreview-3738450475 PR Review Comment: https://git.openjdk.org/jdk/pull/29164#discussion_r2753636949 PR Review Comment: https://git.openjdk.org/jdk/pull/29164#discussion_r2753548067 PR Review Comment: https://git.openjdk.org/jdk/pull/29164#discussion_r2753551976 PR Review Comment: https://git.openjdk.org/jdk/pull/29164#discussion_r2753586409 From roland at openjdk.org Mon Feb 2 11:46:30 2026 From: roland at openjdk.org (Roland Westrelin) Date: Mon, 2 Feb 2026 11:46:30 GMT Subject: RFR: 8370519: C2: Hit MemLimit when running with +VerifyLoopOptimizations [v8] In-Reply-To: References:

Message-ID: On Mon, 2 Feb 2026 10:01:33 GMT, Emanuel Peter wrote: >> Roland Westrelin has updated the pull request incrementally with three additional commits since the last revision: >> >> - Update src/hotspot/share/memory/arena.hpp >> >> Co-authored-by: Manuel Hässig >> - Update src/hotspot/share/opto/loopnode.cpp >> >> Co-authored-by: Manuel Hässig >> - Update src/hotspot/share/opto/loopnode.hpp >> >> Co-authored-by: Manuel Hässig > > Looks good, thanks for the updates @rwestrel ! @eme64 thanks! ------------- PR Comment: https://git.openjdk.org/jdk/pull/28581#issuecomment-3834624426 From roland at openjdk.org Mon Feb 2 11:46:32 2026 From: roland at openjdk.org (Roland Westrelin) Date: Mon, 2 Feb 2026 11:46:32 GMT Subject: Integrated: 8370519: C2: Hit MemLimit when running with +VerifyLoopOptimizations In-Reply-To: References: Message-ID: On Mon, 1 Dec 2025 15:40:00 GMT, Roland Westrelin wrote: > For this failure memory stats are: > > > Total Usage: 1095525816 > --- Arena Usage by Arena Type and compilation phase, at arena usage peak of 1095525816 --- > Phase Total ra node comp type states reglive regsplit regmask superword cienv ha other > none 5976032 331560 5402064 197512 33712 10200 0 0 984 0 0 0 0 > parse 2716464 65456 1145480 196408 1112752 0 0 0 0 0 196368 0 0 > optimizer 98184 0 32728 0 65456 0 0 0 0 0 0 0 0 > connectionGraph 32728 0 0 32728 0 0 0 0 0 0 0 0 0 > iterGVN 32728 0 32728 0 0 0 0 0 0 0 0 0 0 > idealLoop 918189632 0 38687056 872824784 392776 0 0 0 0 0 6285016 0 0 > idealLoopVerify 2228144 0 0 2228144 0 0 0 0 0 0 0 0 0 > macroExpand 32728 0 32728 0 0 0 0 0 0 0 0 0 0 > graphReshape 32728 0 32728 0 0 0 0 0 0 0 0 0 0 > matcher 20135944 3369848 9033208 7536400 65456 131032 0 0 0 0 0 0 0 > postselect_cleanup 294872 294872 0 0 0 0 0 0 0 0 0 0 0 > scheduler 752944 196488 556456 0 0 0 0 0 0 0 0 0 0 > regalloc 388736 388736 0 0 0 0 0 0 0 0 0 0 0 > ctorChaitin 160032 ... This pull request has now been integrated.
Changeset: 176422b8 Author: Roland Westrelin URL: https://git.openjdk.org/jdk/commit/176422b885d2d045dd44b61b7fcdcb01be2d00a7 Stats: 171 lines in 4 files changed: 147 ins; 14 del; 10 mod 8370519: C2: Hit MemLimit when running with +VerifyLoopOptimizations Co-authored-by: Benoît Maillard Reviewed-by: mhaessig, bmaillard, epeter ------------- PR: https://git.openjdk.org/jdk/pull/28581 From dfenacci at openjdk.org Mon Feb 2 12:01:51 2026 From: dfenacci at openjdk.org (Damon Fenacci) Date: Mon, 2 Feb 2026 12:01:51 GMT Subject: RFR: 8374582: [REDO] Move input validation checks to Java for java.lang.StringCoding intrinsics [v11] In-Reply-To: <3ci9RXEra2BlQPhYl-M0Wnu3hRpWaDvxPnMRzFnJA_k=.67795fb3-95d1-449b-a7a9-44b3776aa626@github.com> References: <3ci9RXEra2BlQPhYl-M0Wnu3hRpWaDvxPnMRzFnJA_k=.67795fb3-95d1-449b-a7a9-44b3776aa626@github.com> Message-ID: > ## Issue > > This is a redo of [JDK-8361842](https://bugs.openjdk.org/browse/JDK-8361842) which was backed out by [JDK-8374210](https://bugs.openjdk.org/browse/JDK-8374210) due to C2-related regressions. The original change moved input validation checks for java.lang.StringCoding from the intrinsic to Java code (leaving the intrinsic check only with the `VerifyIntrinsicChecks` flag). Refer to the [original PR](https://github.com/openjdk/jdk/pull/25998) for details. > > This additional issue happens because, in some cases, for instance when the Java checking code is not inlined and we give an out-of-range constant as input, we fold the data path but not the control path and we crash in the backend. > > ## Causes > > The cause of this is that the out-of-range constant (e.g. -1) floats into the intrinsic and there (assuming the input is valid) we add a constraint to its type to positive integers (e.g. to compute the array address) which makes it top.
> > ## Fix > > A possible fix is to introduce an opaque node (OpaqueGuardNode) similar to what we do in `must_be_not_null` for values that we know cannot be null: > https://github.com/openjdk/jdk/blob/ce721665cd61d9a319c667d50d9917c359d6c104/src/hotspot/share/opto/graphKit.cpp#L1484 > This will temporarily add the range check to ensure that C2 figures that out-of-range values cannot reach the intrinsic. Then, during macro expansion, we replace the opaque node with the corresponding constant (true/false) in product builds such that the actually unneeded guards are folded and do not end up in the emitted code. > > # Testing > > * Tier 1-3+ > * 2 JTReg tests added > * `TestRangeCheck.java` as regression test for the reported issue > * `TestOpaqueGuardNodes.java` to check that opaque guard nodes are added when parsing and removed at macro expansion Damon Fenacci has updated the pull request incrementally with two additional commits since the last revision: - JDK-8374582: add const - JDK-8374582: with_opaque ------------- Changes: - all: https://git.openjdk.org/jdk/pull/29164/files - new: https://git.openjdk.org/jdk/pull/29164/files/5ac3e6e3..0d4eef88 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=29164&range=10 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=29164&range=09-10 Stats: 7 lines in 3 files changed: 0 ins; 0 del; 7 mod Patch: https://git.openjdk.org/jdk/pull/29164.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/29164/head:pull/29164 PR: https://git.openjdk.org/jdk/pull/29164 From dfenacci at openjdk.org Mon Feb 2 12:13:36 2026 From: dfenacci at openjdk.org (Damon Fenacci) Date: Mon, 2 Feb 2026 12:13:36 GMT Subject: RFR: 8374582: [REDO] Move input validation checks to Java for java.lang.StringCoding intrinsics [v10] In-Reply-To: References: <3ci9RXEra2BlQPhYl-M0Wnu3hRpWaDvxPnMRzFnJA_k=.67795fb3-95d1-449b-a7a9-44b3776aa626@github.com>

Message-ID: On Mon, 2 Feb 2026 10:29:20 GMT, Tobias Hartmann wrote: >> Damon Fenacci has updated the pull request incrementally with one additional commit since the last revision: >> >> JDK-8374582: revert wrong copyright change > > src/hotspot/share/opto/library_call.cpp line 894: > >> 892: >> 893: inline Node* LibraryCallKit::generate_negative_guard(Node* index, RegionNode* region, >> 894: Node** pos_index, bool is_opaque) { > > As we discussed offline, I think `with_opaque` is better here. Renamed. Thanks @TobiHartmann. > src/hotspot/share/opto/opaquenode.hpp line 148: > >> 146: class OpaqueConstantBoolNode : public Node { >> 147: private: >> 148: bool _constant; > > Should this be `const`? Yep, fixed. > src/hotspot/share/opto/opaquenode.hpp line 150: > >> 148: bool _constant; >> 149: public: >> 150: OpaqueConstantBoolNode(Compile* C, Node* tst, bool constant) : Node(nullptr, tst), _constant(constant) { > > An alternative would be to have the `constant` be an actual input node instead of a field. In macro expansion, you could then do `_igvn.replace_node(n, n->in(2));` instead (maybe define an enum for the input indices). I don't have a strong opinion on this though and leave it up to you to decide. Cool trick! ... but now I can't decide between the two. @chhagedorn do you fancy being the tiebreaker? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/29164#discussion_r2754030906 PR Review Comment: https://git.openjdk.org/jdk/pull/29164#discussion_r2754030537 PR Review Comment: https://git.openjdk.org/jdk/pull/29164#discussion_r2754032138 From duke at openjdk.org Mon Feb 2 12:34:01 2026 From: duke at openjdk.org (duke) Date: Mon, 2 Feb 2026 12:34:01 GMT Subject: Withdrawn: 8369021: A crash in ConstantPool::klass_at_impl In-Reply-To: References: Message-ID: On Wed, 1 Oct 2025 20:21:45 GMT, Jan Kratochvil wrote: > https://bugs.openjdk.org/browse/JDK-8369021 This pull request has been closed without being integrated.
------------- PR: https://git.openjdk.org/jdk/pull/27595 From duke at openjdk.org Mon Feb 2 12:42:26 2026 From: duke at openjdk.org (Ruben) Date: Mon, 2 Feb 2026 12:42:26 GMT Subject: RFR: 8372942: AArch64: Set JVM flags for Neoverse V3AE core [v2] In-Reply-To: <8eI5E6cyFbIzKfiWurr2ovAUQEML2LDiJIW11BFX27w=.962bede5-2cc2-4e90-97f9-4953750f4b11@github.com> References:

<8eI5E6cyFbIzKfiWurr2ovAUQEML2LDiJIW11BFX27w=.962bede5-2cc2-4e90-97f9-4953750f4b11@github.com> Message-ID: On Wed, 14 Jan 2026 09:02:33 GMT, Andrew Haley wrote: >> Thanks, this is fine. >> I wonder if we should be thinking about replacing some of this open-coded logic with something more expressive and concise. This bunch of model_is() expressions could be a switch, for example. > >> Thank you for review, @theRealAph, >> >> > I wonder if we should be thinking about replacing some of this open-coded logic with something more expressive and concise. This bunch of model_is() expressions could be a switch, for example. >> >> While switch-case might not be easily applicable because we have two variables, both of which have to be compared with the values, > > I only see one here, the `model_is`. > >> perhaps an interface like `bool is_model_any_of(std::initializer_list list)` can simplify the code. Would this approach be suitable? > > Maybe, if it's made as simple as possible. > >> Would you like this to be changed within this PR? > > I think so. @theRealAph, > I'm thinking of bool is_model_any_of(std::initializer_list list) which would iterate over the list and call model_is for each candidate I've added this as `model_is_in` interface. Does the new implementation look suitable? ------------- PR Comment: https://git.openjdk.org/jdk/pull/28607#issuecomment-3834887509 From chagedorn at openjdk.org Mon Feb 2 13:28:57 2026 From: chagedorn at openjdk.org (Christian Hagedorn) Date: Mon, 2 Feb 2026 13:28:57 GMT Subject: RFR: 8374582: [REDO] Move input validation checks to Java for java.lang.StringCoding intrinsics [v10] In-Reply-To: References: <3ci9RXEra2BlQPhYl-M0Wnu3hRpWaDvxPnMRzFnJA_k=.67795fb3-95d1-449b-a7a9-44b3776aa626@github.com>

Message-ID: On Mon, 2 Feb 2026 12:10:48 GMT, Damon Fenacci wrote: >> src/hotspot/share/opto/opaquenode.hpp line 150: >> >>> 148: bool _constant; >>> 149: public: >>> 150: OpaqueConstantBoolNode(Compile* C, Node* tst, bool constant) : Node(nullptr, tst), _constant(constant) { >> >> An alternative would be to have the `constant` be an actual input node instead of a field. In macro expansion, you could then do `_igvn.replace_node(n, n->in(2));` instead (maybe define an enum for the input indices). I don't have a strong opinion on this though and leave it up to you to decide. > > Cool trick! ...but now I can't decide between the two. @chhagedorn do you fancy being the tiebreaker? The old `Opaque4` nodes used to have two data inputs where the second one was the replacement. I found it a little harder to view graphs in IGV with one more input. You also do not need to worry about trying to understand what the second input means. So, I would rather have a field if I may break the tie but both options are fine :-) When going with a field, you could add a `NOT_PRODUCT(void dump_spec(outputStream* st) const);` that prints `#true` or `#false` depending on `_constant` (that could also then be shown in IGV with the "Show custom node info").
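The `dump_spec` suggestion above can be pictured with a small stand-alone sketch. This is not the HotSpot code: `OpaqueConstantBoolSketch` and `dump_to_string` are hypothetical names, the class does not derive from C2's `Node`, and `std::ostream` stands in for HotSpot's `outputStream`. It only illustrates the idea of printing `#true` or `#false` from a constant field.

```cpp
#include <ostream>
#include <sstream>
#include <string>

// Hypothetical stand-in for a node that keeps its boolean constant as a field
// rather than as a second data input. Not derived from HotSpot's Node class.
class OpaqueConstantBoolSketch {
 private:
  const bool _constant;  // field instead of an extra input, as chosen in the thread

 public:
  explicit OpaqueConstantBoolSketch(bool constant) : _constant(constant) {}

  // Appends "#true" or "#false" to a node dump, depending on the field.
  // std::ostream is an assumption here; HotSpot uses its own outputStream.
  void dump_spec(std::ostream& st) const {
    st << (_constant ? " #true" : " #false");
  }
};

// Illustration helper (hypothetical): renders the spec into a string.
std::string dump_to_string(const OpaqueConstantBoolSketch& n) {
  std::ostringstream out;
  n.dump_spec(out);
  return out.str();
}
```

With a field, the graph keeps a single data input and the constant only shows up in the textual dump, which matches the readability argument made above.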
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/29164#discussion_r2754324868 From dfenacci at openjdk.org Mon Feb 2 14:03:00 2026 From: dfenacci at openjdk.org (Damon Fenacci) Date: Mon, 2 Feb 2026 14:03:00 GMT Subject: RFR: 8374582: [REDO] Move input validation checks to Java for java.lang.StringCoding intrinsics [v12] In-Reply-To: <3ci9RXEra2BlQPhYl-M0Wnu3hRpWaDvxPnMRzFnJA_k=.67795fb3-95d1-449b-a7a9-44b3776aa626@github.com> References: <3ci9RXEra2BlQPhYl-M0Wnu3hRpWaDvxPnMRzFnJA_k=.67795fb3-95d1-449b-a7a9-44b3776aa626@github.com> Message-ID: <2_bA8sRgRlbc279Aia0oD9gPBn8bcD5kLP3RnA4Xl4Q=.deaeaaf0-27a1-40f8-81f3-c8283c4d9529@github.com> > ## Issue > > This is a redo of [JDK-8361842](https://bugs.openjdk.org/browse/JDK-8361842) which was backed out by [JDK-8374210](https://bugs.openjdk.org/browse/JDK-8374210) due to C2-related regressions. The original change moved input validation checks for java.lang.StringCoding from the intrinsic to Java code (leaving the intrinsic check only with the `VerifyIntrinsicChecks` flag). Refer to the [original PR](https://github.com/openjdk/jdk/pull/25998) for details. > > This additional issue happens because, in some cases, for instance when the Java checking code is not inlined and we give an out-of-range constant as input, we fold the data path but not the control path and we crash in the backend. > > ## Causes > > The cause of this is that the out-of-range constant (e.g. -1) floats into the intrinsic and there (assuming the input is valid) we add a constraint to its type to positive integers (e.g. to compute the array address) which makes it top. 
> > ## Fix > > A possible fix is to introduce an opaque node (OpaqueGuardNode) similar to what we do in `must_be_not_null` for values that we know cannot be null: > https://github.com/openjdk/jdk/blob/ce721665cd61d9a319c667d50d9917c359d6c104/src/hotspot/share/opto/graphKit.cpp#L1484 > This will temporarily add the range check to ensure that C2 figures that out-of-range values cannot reach the intrinsic. Then, during macro expansion, we replace the opaque node with the corresponding constant (true/false) in product builds such that the actually unneeded guards are folded and do not end up in the emitted code. > > # Testing > > * Tier 1-3+ > * 2 JTReg tests added > * `TestRangeCheck.java` as regression test for the reported issue > * `TestOpaqueGuardNodes.java` to check that opaque guard nodes are added when parsing and removed at macro expansion Damon Fenacci has updated the pull request incrementally with two additional commits since the last revision: - JDK-8374582: remove empty line - JDK-8374582: add constant dump ------------- Changes: - all: https://git.openjdk.org/jdk/pull/29164/files - new: https://git.openjdk.org/jdk/pull/29164/files/0d4eef88..44b68dbc Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=29164&range=11 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=29164&range=10-11 Stats: 7 lines in 2 files changed: 7 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/29164.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/29164/head:pull/29164 PR: https://git.openjdk.org/jdk/pull/29164 From dfenacci at openjdk.org Mon Feb 2 14:05:43 2026 From: dfenacci at openjdk.org (Damon Fenacci) Date: Mon, 2 Feb 2026 14:05:43 GMT Subject: RFR: 8374582: [REDO] Move input validation checks to Java for java.lang.StringCoding intrinsics [v10] In-Reply-To: References: <3ci9RXEra2BlQPhYl-M0Wnu3hRpWaDvxPnMRzFnJA_k=.67795fb3-95d1-449b-a7a9-44b3776aa626@github.com>

Message-ID: <9TRsuJgH4W8hsmU02_3jXvLwPotWWdihBDjXoA_DZ3A=.577eb61f-6333-4cca-ad89-bc17c73bb660@github.com> On Mon, 2 Feb 2026 13:26:12 GMT, Christian Hagedorn wrote: > So, I would rather have a field if I may break Let's go with the field then. > `NOT_PRODUCT(void dump_spec(outputStream* st) const);` Good idea! Added. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/29164#discussion_r2754497278 From jsjolen at openjdk.org Mon Feb 2 14:05:43 2026 From: jsjolen at openjdk.org (Johan Sjölen) Date: Mon, 2 Feb 2026 14:05:43 GMT Subject: RFR: 8376855: ASAN reports out-of-range read in strncmp in MethodHandles::is_basic_type_signature In-Reply-To: References: Message-ID: On Mon, 2 Feb 2026 01:13:35 GMT, David Holmes wrote: > An ASAN enabled build reported heap-buffer-overflow in `MethodHandles::is_basic_type_signature` with `ASAN_OPTIONS=strict_string_checks=true` when running test `jdk/jdk/jfr/api/metadata/annotations/TestThrottle.java` > > The code is here: > > bool MethodHandles::is_basic_type_signature(Symbol* sig) { > assert(vmSymbols::object_signature()->utf8_length() == (int)OBJ_SIG_LEN, ""); > assert(vmSymbols::object_signature()->equals(OBJ_SIG), ""); > for (SignatureStream ss(sig, sig->starts_with(JVM_SIGNATURE_FUNC)); !ss.is_done(); ss.next()) { > switch (ss.type()) { > case T_OBJECT: > // only java/lang/Object is valid here > if (strncmp((char*) ss.raw_bytes(), OBJ_SIG, OBJ_SIG_LEN) != 0) > > The ASAN `strncmp` interceptor acts as follows: > > INTERCEPTOR(int, strncmp, const char *s1, const char *s2, size_t n) { > void *ctx; > ASAN_INTERCEPTOR_ENTER(linker, strncmp); // Sets up context > ASAN_READ_RANGE(s1, n); // Validates s1 > ASAN_READ_RANGE(s2, n); // Validates s2 > return REAL(strncmp)(s1, s2, n); // Calls original function > } > > With the test given `s1` is a buffer of size 15, containing a non-nul-terminated string, and `n` is 18, so `ASAN_READ_RANGE` fails for `s1` as we could potentially read beyond the end of
the buffer. In practice however, given `s1` is guaranteed to be a valid type-string from a signature symbol of type `T_OBJECT`, its final character is `;` and the final character of `s2` is also `;` (it is the string constant `Ljava/lang/Object;`). Hence the comparison must terminate before we can run off the end of `s1`. > > To appease ASAN we can make a simple change to the `strncmp` call to compare at most `ss.raw_length()` bytes. > > Testing > - ASAN no longer reports an error > - tiers 1-3 sanity > > Thanks LGTM ------------- Marked as reviewed by jsjolen (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/29516#pullrequestreview-3739605984 From iwalulya at openjdk.org Mon Feb 2 14:53:45 2026 From: iwalulya at openjdk.org (Ivan Walulya) Date: Mon, 2 Feb 2026 14:53:45 GMT Subject: RFR: 8376357: Parallel: Convert MutableSpace classes to use Atomic In-Reply-To: References: Message-ID: On Mon, 26 Jan 2026 17:14:07 GMT, Thomas Schatzl wrote: > Hi all, > > please review these changes that convert `MutableSpace` classes to use `Atomic`. > > Testing: gha, tier1-5 > > Thanks, > Thomas Marked as reviewed by iwalulya (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/29427#pullrequestreview-3739922441 From tschatzl at openjdk.org Mon Feb 2 15:19:28 2026 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 2 Feb 2026 15:19:28 GMT Subject: RFR: 8375438: G1: Convert G1HeapRegion related classes to use Atomic [v2] In-Reply-To: References:

Message-ID: On Mon, 2 Feb 2026 11:05:07 GMT, Aleksey Shipilev wrote: >> Thomas Schatzl has updated the pull request incrementally with one additional commit since the last revision: >> >> * shade review > > Marked as reviewed by shade (Reviewer). Thanks @shipilev @walulyai for your reviews ------------- PR Comment: https://git.openjdk.org/jdk/pull/29301#issuecomment-3835773407 From tschatzl at openjdk.org Mon Feb 2 15:19:40 2026 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 2 Feb 2026 15:19:40 GMT Subject: RFR: 8376357: Parallel: Convert MutableSpace classes to use Atomic In-Reply-To: <6AE-M8uvq7FAfVoArxQTWI5zM61RxTapF0OjUHz8YNY=.fed7cbf8-6f33-4c78-a38b-1beb56580beb@github.com> References: <6AE-M8uvq7FAfVoArxQTWI5zM61RxTapF0OjUHz8YNY=.fed7cbf8-6f33-4c78-a38b-1beb56580beb@github.com> Message-ID: On Wed, 28 Jan 2026 21:18:07 GMT, David Holmes wrote: >> Hi all, >> >> please review these changes that convert `MutableSpace` classes to use `Atomic`. >> >> Testing: gha, tier1-5 >> >> Thanks, >> Thomas > > Overall looks good to me, but one nit. > > Thanks Thanks @dholmes-ora @walulyai for your reviews ------------- PR Comment: https://git.openjdk.org/jdk/pull/29427#issuecomment-3835756480 From tschatzl at openjdk.org Mon Feb 2 15:19:43 2026 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 2 Feb 2026 15:19:43 GMT Subject: Integrated: 8376357: Parallel: Convert MutableSpace classes to use Atomic In-Reply-To: References: Message-ID: On Mon, 26 Jan 2026 17:14:07 GMT, Thomas Schatzl wrote: > Hi all, > > please review these changes that convert `MutableSpace` classes to use `Atomic`. > > Testing: gha, tier1-5 > > Thanks, > Thomas This pull request has now been integrated. 
Changeset: b7128b7c Author: Thomas Schatzl URL: https://git.openjdk.org/jdk/commit/b7128b7c30f3de2c1dcee2be567bb25d407c71a2 Stats: 30 lines in 4 files changed: 3 ins; 8 del; 19 mod 8376357: Parallel: Convert MutableSpace classes to use Atomic Reviewed-by: dholmes, iwalulya ------------- PR: https://git.openjdk.org/jdk/pull/29427 From tschatzl at openjdk.org Mon Feb 2 15:22:10 2026 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 2 Feb 2026 15:22:10 GMT Subject: Integrated: 8375438: G1: Convert G1HeapRegion related classes to use Atomic In-Reply-To: References: Message-ID: On Mon, 19 Jan 2026 13:32:46 GMT, Thomas Schatzl wrote: > Hi all, > > please review conversion of G1HeapRegion related classes to use Atomic. > > Testing: tier1, tier4, tier5 > > (The PipelineLeaksFD failure in gha is a known issue) > > Thanks, > Thomas This pull request has now been integrated. Changeset: 903b3fe1 Author: Thomas Schatzl URL: https://git.openjdk.org/jdk/commit/903b3fe19596adaeac7cfb0d749b6e83f668f52f Stats: 60 lines in 8 files changed: 19 ins; 3 del; 38 mod 8375438: G1: Convert G1HeapRegion related classes to use Atomic Reviewed-by: shade, iwalulya ------------- PR: https://git.openjdk.org/jdk/pull/29301 From tschatzl at openjdk.org Mon Feb 2 15:43:59 2026 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 2 Feb 2026 15:43:59 GMT Subject: RFR: 8375535: G1: Convert CardTableBarrierSet and subclasses to use Atomic [v2] In-Reply-To: References: Message-ID: <9Ky4MZ0LMzsR9J_k7ushMAg37KSBjhcjdBFN0zNvOQk=.352bedd6-6eba-49e1-90a0-4aebbc46510d@github.com> > Hi all, > > use `Atomic` instead of `AtomicAccess` in `CardTableBarrierSet` and subclasses. Since this modifies `CardTableBarrierSet::_card_table` the change has some fan-out. > > Testing: gha > > Thanks, > Thomas Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains two commits: - Merge branch 'master' into submit/8375535-use-atomic-t-cardtablebarrierset - 8375535 Hi all, use `Atomic` instead of `AtomicAccess` in `CardTableBarrierSet` and subclasses. Since this modifies `CardTableBarrierSet::_card_table` the change has some fan-out. Testing: gha Thanks, Thomas ------------- Changes: https://git.openjdk.org/jdk/pull/29360/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=29360&range=01 Stats: 35 lines in 7 files changed: 6 ins; 0 del; 29 mod Patch: https://git.openjdk.org/jdk/pull/29360.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/29360/head:pull/29360 PR: https://git.openjdk.org/jdk/pull/29360 From qpzhang at openjdk.org Mon Feb 2 15:47:00 2026 From: qpzhang at openjdk.org (Patrick Zhang) Date: Mon, 2 Feb 2026 15:47:00 GMT Subject: RFR: 8365991: AArch64: Ignore BlockZeroingLowLimit when UseBlockZeroing is false [v12] In-Reply-To: References: Message-ID: > Issue: > In AArch64 port, `UseBlockZeroing` is by default set to true and `BlockZeroingLowLimit` is initialized to 256. If `DC ZVA` is supported, `BlockZeroingLowLimit` is later updated to `4 * VM_Version::zva_length()`. When `UseBlockZeroing` is set to false, all related conditional checks should ignore `BlockZeroingLowLimit`. However, the function `MacroAssembler::zero_words(Register base, uint64_t cnt)` still evaluates the lower limit and bases its code generation logic on it, which seems to be an incomplete conditional check. > > This PR: > 1. Reset `BlockZeroingLowLimit` to `4 * VM_Version::zva_length()` or 256 with a warning message if it was manually configured from the default while `UseBlockZeroing` is disabled. > 2. Added necessary comments in `MacroAssembler::zero_words(Register base, uint64_t cnt)` and `MacroAssembler::zero_words(Register ptr, Register cnt)` to explain why we do not check `UseBlockZeroing` in the outer part of these functions. 
Instead, the decision is delegated to the stub function `zero_blocks`, which encapsulates the DC ZVA instructions and serves as the inner implementation of `zero_words`. This approach helps better control the increase in code cache size during array or object instance initialization. > 3. Added more testing sizes to `test/micro/org/openjdk/bench/vm/gc/RawAllocationRate.java` to better cover scenarios involving smaller arrays and objects. > > Tests: > 1. Performance tests on the bundled JMH benchmarks `vm.compiler.ClearMemory` and `vm.gc.RawAllocationRate` (including `arrayTest` and `instanceTest`) showed no obvious regression. Negative tests with `jdk/bin/java -jar images/test/micro/benchmarks.jar RawAllocationRate.arrayTest_C1 -bm thrpt -gc false -wi 0 -w 30 -i 1 -r 30 -t 1 -f 1 -tu s -jvmArgs "-XX:-UseBlockZeroing -XX:BlockZeroingLowLimit=8" -p size=32` demonstrated good wall times on `zero_words_reg_imm` calls, as expected. > 2. Jtreg tier1 tests on Ampere Altra, AmpereOne, Graviton2 and 3, tier2 on Altra. No new issues found. Passed tests of GHA Sanity Checks.
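Item 1 of the description above (resetting `BlockZeroingLowLimit` when `UseBlockZeroing` is off) could look roughly like the following stand-alone sketch. The struct `ZeroingFlags`, the function `normalize_low_limit`, and the warning text are hypothetical; real HotSpot flag handling goes through its own flag machinery, which is not shown in this thread.

```cpp
#include <cstdint>
#include <cstdio>

// Hypothetical holder for the two flags discussed above.
struct ZeroingFlags {
  bool use_block_zeroing;           // models -XX:+/-UseBlockZeroing
  int64_t block_zeroing_low_limit;  // models -XX:BlockZeroingLowLimit=<n>
  bool low_limit_is_default;        // false if the user set the limit explicitly
};

// Resets a user-supplied low limit when block zeroing is disabled.
// Returns true if the value was reset. zva_length is the DC ZVA block size
// in bytes and is only consulted when zva_supported is true.
bool normalize_low_limit(ZeroingFlags& f, bool zva_supported, int zva_length) {
  const int64_t fallback =
      zva_supported ? 4 * static_cast<int64_t>(zva_length) : 256;
  if (!f.use_block_zeroing && !f.low_limit_is_default) {
    // Assumed wording; the actual warning message is not quoted in the thread.
    std::fprintf(stderr,
                 "warning: BlockZeroingLowLimit ignored because UseBlockZeroing is disabled\n");
    f.block_zeroing_low_limit = fallback;
    return true;
  }
  return false;
}
```

For example, a command line like the negative test above (`-XX:-UseBlockZeroing -XX:BlockZeroingLowLimit=8`) would, under these assumptions, end up with the limit reset to four times the ZVA length.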
Patrick Zhang has updated the pull request incrementally with one additional commit since the last revision: Trigger OCA recheck ------------- Changes: - all: https://git.openjdk.org/jdk/pull/26917/files - new: https://git.openjdk.org/jdk/pull/26917/files/5535721e..082bafe0 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=26917&range=11 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=26917&range=10-11 Stats: 0 lines in 0 files changed: 0 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/26917.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/26917/head:pull/26917 PR: https://git.openjdk.org/jdk/pull/26917 From thartmann at openjdk.org Mon Feb 2 16:02:56 2026 From: thartmann at openjdk.org (Tobias Hartmann) Date: Mon, 2 Feb 2026 16:02:56 GMT Subject: RFR: 8374582: [REDO] Move input validation checks to Java for java.lang.StringCoding intrinsics [v12] In-Reply-To: <2_bA8sRgRlbc279Aia0oD9gPBn8bcD5kLP3RnA4Xl4Q=.deaeaaf0-27a1-40f8-81f3-c8283c4d9529@github.com> References: <3ci9RXEra2BlQPhYl-M0Wnu3hRpWaDvxPnMRzFnJA_k=.67795fb3-95d1-449b-a7a9-44b3776aa626@github.com> <2_bA8sRgRlbc279Aia0oD9gPBn8bcD5kLP3RnA4Xl4Q=.deaeaaf0-27a1-40f8-81f3-c8283c4d9529@github.com> Message-ID: <_mVonDnsPn3yCi7haKqAlC_3iD8GNOojYbMt4xuUf_Y=.2c887c19-5dba-4501-bec4-faba0a2dca9b@github.com> On Mon, 2 Feb 2026 14:03:00 GMT, Damon Fenacci wrote: >> ## Issue >> >> This is a redo of [JDK-8361842](https://bugs.openjdk.org/browse/JDK-8361842) which was backed out by [JDK-8374210](https://bugs.openjdk.org/browse/JDK-8374210) due to C2-related regressions. The original change moved input validation checks for java.lang.StringCoding from the intrinsic to Java code (leaving the intrinsic check only with the `VerifyIntrinsicChecks` flag). Refer to the [original PR](https://github.com/openjdk/jdk/pull/25998) for details. 
>> >> This additional issue happens because, in some cases, for instance when the Java checking code is not inlined and we give an out-of-range constant as input, we fold the data path but not the control path and we crash in the backend. >> >> ## Causes >> >> The cause of this is that the out-of-range constant (e.g. -1) floats into the intrinsic and there (assuming the input is valid) we add a constraint to its type to positive integers (e.g. to compute the array address) which makes it top. >> >> ## Fix >> >> A possible fix is to introduce an opaque node (OpaqueGuardNode) similar to what we do in `must_be_not_null` for values that we know cannot be null: >> https://github.com/openjdk/jdk/blob/ce721665cd61d9a319c667d50d9917c359d6c104/src/hotspot/share/opto/graphKit.cpp#L1484 >> This will temporarily add the range check to ensure that C2 figures that out-of-range values cannot reach the intrinsic. Then, during macro expansion, we replace the opaque node with the corresponding constant (true/false) in product builds such that the actually unneeded guards are folded and do not end up in the emitted code. >> >> # Testing >> >> * Tier 1-3+ >> * 2 JTReg tests added >> * `TestRangeCheck.java` as regression test for the reported issue >> * `TestOpaqueGuardNodes.java` to check that opaque guard nodes are added when parsing and removed at macro expansion > > Damon Fenacci has updated the pull request incrementally with two additional commits since the last revision: > > - JDK-8374582: remove empty line > - JDK-8374582: add constant dump That looks good to me. ------------- Marked as reviewed by thartmann (Reviewer). 
PR Review: https://git.openjdk.org/jdk/pull/29164#pullrequestreview-3740390312 From kbarrett at openjdk.org Mon Feb 2 16:06:34 2026 From: kbarrett at openjdk.org (Kim Barrett) Date: Mon, 2 Feb 2026 16:06:34 GMT Subject: RFR: 8375535: G1: Convert CardTableBarrierSet and subclasses to use Atomic [v2] In-Reply-To: <9Ky4MZ0LMzsR9J_k7ushMAg37KSBjhcjdBFN0zNvOQk=.352bedd6-6eba-49e1-90a0-4aebbc46510d@github.com> References: <9Ky4MZ0LMzsR9J_k7ushMAg37KSBjhcjdBFN0zNvOQk=.352bedd6-6eba-49e1-90a0-4aebbc46510d@github.com> Message-ID: On Mon, 2 Feb 2026 15:43:59 GMT, Thomas Schatzl wrote: >> Hi all, >> >> use `Atomic` instead of `AtomicAccess` in `CardTableBarrierSet` and subclasses. Since this modifies `CardTableBarrierSet::_card_table` the change has some fan-out. >> >> Testing: gha >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains two commits: > > - Merge branch 'master' into submit/8375535-use-atomic-t-cardtablebarrierset > - 8375535 > > Hi all, > > use `Atomic` instead of `AtomicAccess` in `CardTableBarrierSet` and subclasses. Since this modifies `CardTableBarrierSet::_card_table` the change has some fan-out. > > Testing: gha > > Thanks, > Thomas Still good. ------------- Marked as reviewed by kbarrett (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/29360#pullrequestreview-3740383119 From tschatzl at openjdk.org Mon Feb 2 16:06:35 2026 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 2 Feb 2026 16:06:35 GMT Subject: RFR: 8375535: G1: Convert CardTableBarrierSet and subclasses to use Atomic [v2] In-Reply-To: References: <9Ky4MZ0LMzsR9J_k7ushMAg37KSBjhcjdBFN0zNvOQk=.352bedd6-6eba-49e1-90a0-4aebbc46510d@github.com> Message-ID: On Mon, 2 Feb 2026 15:59:12 GMT, Kim Barrett wrote: >> Thomas Schatzl has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains two commits: >> >> - Merge branch 'master' into submit/8375535-use-atomic-t-cardtablebarrierset >> - 8375535 >> >> Hi all, >> >> use `Atomic` instead of `AtomicAccess` in `CardTableBarrierSet` and subclasses. Since this modifies `CardTableBarrierSet::_card_table` the change has some fan-out. >> >> Testing: gha >> >> Thanks, >> Thomas > > Still good. Thanks @kimbarrett @walulyai for your reviews ------------- PR Comment: https://git.openjdk.org/jdk/pull/29360#issuecomment-3836097517 From tschatzl at openjdk.org Mon Feb 2 16:06:37 2026 From: tschatzl at openjdk.org (Thomas Schatzl) Date: Mon, 2 Feb 2026 16:06:37 GMT Subject: Integrated: 8375535: G1: Convert CardTableBarrierSet and subclasses to use Atomic In-Reply-To: References: Message-ID: On Thu, 22 Jan 2026 12:58:39 GMT, Thomas Schatzl wrote: > Hi all, > > use `Atomic` instead of `AtomicAccess` in `CardTableBarrierSet` and subclasses. Since this modifies `CardTableBarrierSet::_card_table` the change has some fan-out. > > Testing: gha > > Thanks, > Thomas This pull request has now been integrated. Changeset: 9871e2d3 Author: Thomas Schatzl URL: https://git.openjdk.org/jdk/commit/9871e2d3f771ee2bc1b2473c0eb28a0bfc1c5456 Stats: 35 lines in 7 files changed: 6 ins; 0 del; 29 mod 8375535: G1: Convert CardTableBarrierSet and subclasses to use Atomic Reviewed-by: kbarrett, iwalulya ------------- PR: https://git.openjdk.org/jdk/pull/29360 From aph at openjdk.org Mon Feb 2 16:26:10 2026 From: aph at openjdk.org (Andrew Haley) Date: Mon, 2 Feb 2026 16:26:10 GMT Subject: RFR: 8372942: AArch64: Set JVM flags for Neoverse V3AE core [v2] In-Reply-To: References:

Message-ID: On Fri, 30 Jan 2026 22:22:50 GMT, Ruben wrote: >> For Neoverse N1, N2, N3, V1, V2 and V3, the following JVM flags are set: >> - UseSIMDForMemoryOps=true >> - OnSpinWaitInst=isb >> - OnSpinWaitInstCount=1 >> - AlwaysMergeDMB=false >> >> Additionally, for Neoverse V1, V2 and V3 only, these flags are set: >> - UseCryptoPmullForCRC32=true >> - CodeEntryAlignment=32 >> >> Enable the same flags for Neoverse V3AE. > > Ruben has updated the pull request incrementally with one additional commit since the last revision: > > Introduce `model_is_in` Great! Ship it. ------------- Marked as reviewed by aph (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/28607#pullrequestreview-3740534141 From iwalulya at openjdk.org Mon Feb 2 17:10:28 2026 From: iwalulya at openjdk.org (Ivan Walulya) Date: Mon, 2 Feb 2026 17:10:28 GMT Subject: RFR: 8376195: Convert ThreadLocalAllocBuffer to use Atomic [v2] In-Reply-To: <2rAYqg4JIvntyUtk-qDU1oywjCA372LK1JyAZYQxTss=.12c4303f-cbcf-464e-83fd-edde06c83f30@github.com> References: <1TAUwWsHEcAIzMF35Q3v9xsDhsNV6ZhGZDCO9fh93KI=.f14438d0-8874-4259-b33d-83ad8b9bf2b3@github.com> <2rAYqg4JIvntyUtk-qDU1oywjCA372LK1JyAZYQxTss=.12c4303f-cbcf-464e-83fd-edde06c83f30@github.com> Message-ID: On Mon, 26 Jan 2026 11:00:10 GMT, Thomas Schatzl wrote: >> Hi all, >> >> please review the change to use `Atomic` in `ThreadLocalAllocBuffer`. >> >> Testing: gha >> >> Thanks, >> Thomas > > Thomas Schatzl has updated the pull request incrementally with two additional commits since the last revision: > > - * kbarrett review > - * kbarrett review Marked as reviewed by iwalulya (Reviewer). 
------------- PR Review: https://git.openjdk.org/jdk/pull/29386#pullrequestreview-3740800467 From aph at openjdk.org Mon Feb 2 18:08:54 2026 From: aph at openjdk.org (Andrew Haley) Date: Mon, 2 Feb 2026 18:08:54 GMT Subject: RFR: 8328306: AArch64: MacOS lazy JIT "write xor execute" switching [v26] In-Reply-To: <3IdZZGAKHVuMXfeM10Z-VSDNlJmcu5XFilLQfEKb9OY=.5213f5ca-2bca-41eb-b7ce-7621510552be@github.com> References: <3IdZZGAKHVuMXfeM10Z-VSDNlJmcu5XFilLQfEKb9OY=.5213f5ca-2bca-41eb-b7ce-7621510552be@github.com> Message-ID: <5ndj0gE-T75cTq9SIs6slsLOnumMzlXPWOFGk3KZvgE=.a4d9ede5-5e71-4da5-a8f3-d380e58f1a34@github.com> > In MacOS/AArch64 HotSpot, we have to deal with the fact that a thread must be in one of two modes: it either may write to code cache memory or it may execute (and read) code or data in it. A system call `pthread_jit_write_protect_np(int enabled)` changes from one to the other. > > Today, we change mode whenever making a transition from interpreter to VM. This means that we change mode a lot: experiments have shown that during `jshell` startup we change mode 4 million times. Other experiments have shown that we only needed to change mode 45 thousand times. > > This "eager" mode switching is perhaps too eager, and we'd be better off switching lazily. While the system call that changes mode is very fast, mode switching still amounts to about 100ms of startup time. Switching eagerly also means that some native calls (e.g. to do arithmetic) are disproportionately expensive, given that they have no need of mode switching at all. > > The approach in this PR is to defer transitioning from exec-but-don't-write mode (`WXExec`) to write-but-don't-exec mode (`WXWrite`) until we need to write. Instead of enabling `WXWrite` immediately, we switch to a mode called `WXArmedForWrite`. When in this mode, when we need to write into code memory we call `os_bsd_jit_exec_enabled(false)` to enable writing and then set the current mode to `WXWrite`. 
> > We mark all sites that we know will write to code memory with > `MACOS_AARCH64_ONLY(os::thread_wx_enable_write());` Judicious placement of these markers, such as when entering patching code, means that we have a fairly small number of these. > > We also keep track (in thread-local storage) of the current state of `pthread_jit_write_protect_np` in order to avoid making the system call unnecessarily. > > It is possible that we have missed some sites where we do need to make a transition from write-protected to write-enabled. While we haven't seen any in testing, we have a fallback path. An attempt to write into code memory triggers a `SIGILL` signal. A signal handler detects this, and if the current mode is `WXArmedForWrite` it changes mode to write-enabled and returns. In addition, the handler "heals" the VM entry point so that next time the same point is entered (and for the rest of the lifetime of the VM) it will immediately transition to `WXWrite`. > > One other possibility remains: we could omit all of the `wx_enable_write` markers and use healing instead. We've experimented with this. It works well enough, but is rather crude, and it's better to be able to ...
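The lazy switching scheme described above can be modelled as a small state machine. The sketch below is a simplified illustration only: `ThreadWXState`, `set_hw_write`, `enter_vm`, and `leave_vm` are hypothetical names, the real `pthread_jit_write_protect_np` call is replaced by a counter, and the `SIGILL` fallback with entry-point healing is left out.

```cpp
// The three modes named in the PR description.
enum WXMode { WXExec, WXWrite, WXArmedForWrite };

// Hypothetical per-thread bookkeeping (the PR keeps this in thread-local storage).
struct ThreadWXState {
  WXMode mode = WXExec;
  bool hw_write_enabled = false;  // cached hardware state, avoids redundant calls
  int syscalls = 0;               // counts simulated mode-switch calls
};

// Stand-in for the system call that flips write protection; it only counts.
void set_hw_write(ThreadWXState& t, bool enable) {
  if (t.hw_write_enabled == enable) return;  // cached state matches: skip the call
  t.hw_write_enabled = enable;
  t.syscalls++;
}

// Interpreter-to-VM transition: arm for writing, but do not switch yet.
void enter_vm(ThreadWXState& t) {
  if (t.mode == WXExec) t.mode = WXArmedForWrite;
}

// Marker placed at sites known to write code memory: switch only when armed.
void thread_wx_enable_write(ThreadWXState& t) {
  if (t.mode == WXArmedForWrite) {
    set_hw_write(t, true);
    t.mode = WXWrite;
  }
}

// VM-to-Java transition: make sure code memory is executable again.
void leave_vm(ThreadWXState& t) {
  set_hw_write(t, false);
  t.mode = WXExec;
}
```

In this model a VM entry that never writes code memory costs no mode switch at all, which is the saving the description attributes to lazy switching.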
Andrew Haley has updated the pull request incrementally with one additional commit since the last revision: Back out 37730e6aac899e1fbdcf4f201ac2ae1013201432 ------------- Changes: - all: https://git.openjdk.org/jdk/pull/26562/files - new: https://git.openjdk.org/jdk/pull/26562/files/c6652628..05860429 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=26562&range=25 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=26562&range=24-25 Stats: 1 line in 1 file changed: 1 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/26562.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/26562/head:pull/26562 PR: https://git.openjdk.org/jdk/pull/26562 From aph at openjdk.org Mon Feb 2 18:11:12 2026 From: aph at openjdk.org (Andrew Haley) Date: Mon, 2 Feb 2026 18:11:12 GMT Subject: RFR: 8328306: AArch64: MacOS lazy JIT "write xor execute" switching [v23] In-Reply-To: References: <3IdZZGAKHVuMXfeM10Z-VSDNlJmcu5XFilLQfEKb9OY=.5213f5ca-2bca-41eb-b7ce-7621510552be@github.com>

Message-ID: On Fri, 30 Jan 2026 03:42:50 GMT, Dean Long wrote: >> So, I'll happily drop this one change. > > Yes, drop this change and I'll test it again. Done. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/26562#discussion_r2755622645 From missa at openjdk.org Mon Feb 2 18:40:37 2026 From: missa at openjdk.org (Mohamed Issa) Date: Mon, 2 Feb 2026 18:40:37 GMT Subject: RFR: 8371955: Support AVX10 floating point comparison instructions [v9] In-Reply-To: References:

Message-ID: <71uI1BCZPJmBfUhtRMBcRREf63StolB9Ch0vhgPgZeU=.c3bbfca7-e9da-4caf-82c6-be28ef4f98fe@github.com> On Thu, 29 Jan 2026 09:02:03 GMT, Emanuel Peter wrote: > FYI: testing launched @eme64 Did tests pass? ------------- PR Comment: https://git.openjdk.org/jdk/pull/28337#issuecomment-3836979281 From psandoz at openjdk.org Mon Feb 2 20:25:15 2026 From: psandoz at openjdk.org (Paul Sandoz) Date: Mon, 2 Feb 2026 20:25:15 GMT Subject: RFR: 8376187: [VectorAPI] Define new lane type constants and pass them to intrinsic entries [v6] In-Reply-To: References:

Message-ID: On Mon, 2 Feb 2026 09:07:21 GMT, Jatin Bhateja wrote: >> As per [discussions](https://github.com/openjdk/jdk/pull/28002#issuecomment-3789507594) on the JDK-8370691 pull request, this splits out a portion of PR#28002 into a separate patch in preparation for Float16 vector API support. >> >> The patch adds new lane type constants and passes them to vector intrinsic entry points. >> >> All existing Vector API jtreg tests pass with the patch. >> >> Kindly review and share your feedback. >> >> Best Regards, >> Jatin > > Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision: > > Review comment resolution Very good. Approved; there is just one comment related to adding a comment for the LT_* values. Thank you for separating this out from the float16 PR. Needs a HotSpot reviewer too. We will run it through tier 1 to 3 testing. src/hotspot/share/prims/vectorSupport.hpp line 140: > 138: }; > 139: > 140: enum LaneType { Please add a comment referencing `LaneType` and noting that the values in this enum correspond to the LaneType ordinal values. ------------- Marked as reviewed by psandoz (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/29481#pullrequestreview-3741431390 PR Review Comment: https://git.openjdk.org/jdk/pull/29481#discussion_r2755893774 From dholmes at openjdk.org Mon Feb 2 22:42:11 2026 From: dholmes at openjdk.org (David Holmes) Date: Mon, 2 Feb 2026 22:42:11 GMT Subject: RFR: 8376855: ASAN reports out-of-range read in strncmp in MethodHandles::is_basic_type_signature In-Reply-To: References:

Message-ID: On Mon, 2 Feb 2026 11:26:01 GMT, Afshin Zafari wrote: >> An ASAN enabled build reported heap-buffer-overflow in `MethodHandles::is_basic_type_signature` with `ASAN_OPTIONS=strict_string_checks=true` when running test `jdk/jdk/jfr/api/metadata/annotations/TestThrottle.java` >> >> The code is here: >> >> bool MethodHandles::is_basic_type_signature(Symbol* sig) { >> assert(vmSymbols::object_signature()->utf8_length() == (int)OBJ_SIG_LEN, ""); >> assert(vmSymbols::object_signature()->equals(OBJ_SIG), ""); >> for (SignatureStream ss(sig, sig->starts_with(JVM_SIGNATURE_FUNC)); !ss.is_done(); ss.next()) { >> switch (ss.type()) { >> case T_OBJECT: >> // only java/lang/Object is valid here >> if (strncmp((char*) ss.raw_bytes(), OBJ_SIG, OBJ_SIG_LEN) != 0) >> >> The ASAN `strncmp` interceptor acts as follows: >> >> INTERCEPTOR(int, strncmp, const char *s1, const char *s2, size_t n) { >> void *ctx; >> ASAN_INTERCEPTOR_ENTER(linker, strncmp); // Sets up context >> ASAN_READ_RANGE(s1, n); // Validates s1 >> ASAN_READ_RANGE(s2, n); // Validates s2 >> return REAL(strncmp)(s1, s2, n); // Calls original function >> } >> >> With the test given `s1` is a buffer of size 15, containing a non-nul-terminated string, and `n` is 18, so `ASAN_READ_RANGE` fails for `s1` as we could potentially read beyond the end of the buffer. In practice however, given `s1` is guaranteed to be a valid type-string from a signature symbol of type `T_OBJECT`, its final character is `;` and the final character of `s2` is also `;` (it is the string constant `Ljava/lang/Object;`). Hence the comparison must terminate before we can run off the end of `s1`. >> >> To appease ASAN we can make a simple change to the `strncmp` call to compare at most `ss.raw_length()` bytes. >> >> Testing >> - ASAN no longer reports an error >> - tiers 1-3 sanity >> >> Thanks > > Thank you David for taking and fixing this. Thanks for the reviews @afshin-zafari and @jdksjolen ! 
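
For readers following the fix, here is a minimal standalone sketch of the idea: bound the comparison by the number of bytes actually available in the non-NUL-terminated buffer, so that a checker which validates the whole `n`-byte range up front has nothing to flag. The helper name `is_object_signature` and its explicit length check are assumptions made for illustration. This is not the HotSpot patch itself, which only changes the `strncmp` call to compare at most `ss.raw_length()` bytes.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>

// Standalone stand-ins for the constants in the quoted snippet.
static const char OBJ_SIG[] = "Ljava/lang/Object;";
static const std::size_t OBJ_SIG_LEN = sizeof(OBJ_SIG) - 1;  // excludes the trailing NUL

// Hypothetical helper: 'bytes' points at exactly 'len' readable bytes and is
// not NUL-terminated. No byte beyond bytes[len - 1] is ever read.
bool is_object_signature(const char* bytes, std::size_t len) {
  assert(bytes != nullptr);
  if (len != OBJ_SIG_LEN) {
    return false;  // a different length cannot be the Object signature
  }
  // The bound is the buffer's own length, never more than what is readable.
  return std::strncmp(bytes, OBJ_SIG, len) == 0;
}
```

Called with a 15-byte buffer such as `Ljava/lang/Foo;`, the helper returns false without asking for 18 readable bytes.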
------------- PR Comment: https://git.openjdk.org/jdk/pull/29516#issuecomment-3837656354 From dholmes at openjdk.org Mon Feb 2 22:42:13 2026 From: dholmes at openjdk.org (David Holmes) Date: Mon, 2 Feb 2026 22:42:13 GMT Subject: Integrated: 8376855: ASAN reports out-of-range read in strncmp in MethodHandles::is_basic_type_signature In-Reply-To: References: Message-ID: <6hTVYTUP53ChQvL9GpxoDlnZ-WWyVgRa3CN-kt0hvDU=.1d5ef6c2-9c06-4b85-85b4-bcd3f98157f2@github.com> On Mon, 2 Feb 2026 01:13:35 GMT, David Holmes wrote: > An ASAN enabled build reported heap-buffer-overflow in `MethodHandles::is_basic_type_signature` with `ASAN_OPTIONS=strict_string_checks=true` when running test `jdk/jdk/jfr/api/metadata/annotations/TestThrottle.java` > > The code is here: > > bool MethodHandles::is_basic_type_signature(Symbol* sig) { > assert(vmSymbols::object_signature()->utf8_length() == (int)OBJ_SIG_LEN, ""); > assert(vmSymbols::object_signature()->equals(OBJ_SIG), ""); > for (SignatureStream ss(sig, sig->starts_with(JVM_SIGNATURE_FUNC)); !ss.is_done(); ss.next()) { > switch (ss.type()) { > case T_OBJECT: > // only java/lang/Object is valid here > if (strncmp((char*) ss.raw_bytes(), OBJ_SIG, OBJ_SIG_LEN) != 0) > > The ASAN `strncmp` interceptor acts as follows: > > INTERCEPTOR(int, strncmp, const char *s1, const char *s2, size_t n) { > void *ctx; > ASAN_INTERCEPTOR_ENTER(linker, strncmp); // Sets up context > ASAN_READ_RANGE(s1, n); // Validates s1 > ASAN_READ_RANGE(s2, n); // Validates s2 > return REAL(strncmp)(s1, s2, n); // Calls original function > } > > With the test given `s1` is a buffer of size 15, containing a non-nul-terminated string, and `n` is 18, so `ASAN_READ_RANGE` fails for `s1` as we could potentially read beyond the end of the buffer. 
In practice however, given `s1` is guaranteed to be a valid type-string from a signature symbol of type `T_OBJECT`, its final character is `;` and the final character of `s2` is also `;` (it is the string constant `Ljava/lang/Object;`). Hence the comparison must terminate before we can run off the end of `s1`. > > To appease ASAN we can make a simple change to the `strncmp` call to compare at most `ss.raw_length()` bytes. > > Testing > - ASAN no longer reports an error > - tiers 1-3 sanity > > Thanks This pull request has now been integrated. Changeset: 1cb4ef85 Author: David Holmes URL: https://git.openjdk.org/jdk/commit/1cb4ef8581b5c5572474a5376baf4fd88c5ffeab Stats: 2 lines in 1 file changed: 0 ins; 0 del; 2 mod 8376855: ASAN reports out-of-range read in strncmp in MethodHandles::is_basic_type_signature Reviewed-by: azafari, jsjolen ------------- PR: https://git.openjdk.org/jdk/pull/29516 From xuelei at openjdk.org Mon Feb 2 23:37:55 2026 From: xuelei at openjdk.org (Xue-Lei Andrew Fan) Date: Mon, 2 Feb 2026 23:37:55 GMT Subject: RFR: 8376125: Out of memory in the CDS archive error with lot of classes [v3] In-Reply-To: References:

<43jWfoF7waaehspCCA-pV-eWsXF5AGCKvjyiC2uguTU=.297fbe19-7cb1-49e9-9994-f4b8ffb1ef09@github.com>

Message-ID: <64hZpvWXWK3cRG_gVpeRvkMT2f35dkwJqgO0ZfY4YHY=.fe44589f-bf65-4ad7-bb57-f02da7f6548e@github.com> On Mon, 2 Feb 2026 20:17:15 GMT, Ashutosh Mehra wrote: >> Yes. Please refer to test/hotspot/jtreg/resourcehogs/runtime/aot/LargeArchive.java, where the archive size is more than 2GB. Without this update, the test will fail. > > umm, I removed this change to `os.cpp` and ran `LargeArchive.java` test on x86-64 system and it passed. On which platform/OS did you see the failure? I was on MacOS. Here is the failure if revert os.cpp update: % make test TEST="test/hotspot/jtreg/resourcehogs/runtime/aot/LargeArchive.java" JTREG="JAVA_OPTIONS=-Dtest.archive.large.all.workflows=true" ... [0.018s][info][aot] JVM_StartThread() ignored: java.lang.ref.Reference$ReferenceHandler [0.018s][info][aot] JVM_StartThread() ignored: java.lang.ref.Finalizer$FinalizerThread [0.039s][info][cds] Loading classes to share ... [0.039s][info][cds] Parsing LargeArchive.classlist [0.047s][info][aot] JVM_StartThread() ignored: jdk.internal.misc.InnocuousThread [136.124s][info][cds] Parsing /Users/xuelei.fan/workspace/openjdk/jdk-xf.git/build/macosx-aarch64-server-release/images/jdk/lib/classlist (lambda form invokers only) [136.126s][info][cds] Loading classes to share: done. [136.127s][info][aot] Rewriting and linking classes ... [136.245s][info][aot] Rewriting and linking classes: done [136.245s][info][aot] Regenerate MethodHandle Holder classes... [136.344s][info][aot] Regenerate MethodHandle Holder classes...done [136.351s][info][cds] Dumping shared data to file: LargeArchive.static.jsa [136.351s][info][cds] Gathering all archivable objects ... 
[136.371s][info][cds] Skipping java/lang/invoke/BoundMethodHandle$Species_F: dynamically generated [136.371s][info][cds] Skipping java/lang/invoke/BoundMethodHandle$Species_J: dynamically generated [136.413s][info][cds] Skipping jdk/internal/misc/CDS$UnregisteredClassLoader$Source: used only when dumping CDS archive [136.413s][info][cds] Skipping jdk/internal/misc/CDS$UnregisteredClassLoader: used only when dumping CDS archive [136.417s][info][cds] Heap range = [0x00000003c0000000 - 0x00000004c0000000] [136.430s][info][aot] Archived 7975 interned strings [136.431s][info][cds] Gathering classes and symbols ... [143.110s][info][cds] Sorting symbols ... [143.113s][info][cds] Sorting classes ... [149.873s][info][cds] Reserved output buffer space at 0x0000007000000000 [34359738368 bytes] [149.900s][info][cds] Allocating RW objects ... [149.954s][info][cds] done (46510 objects) [149.954s][info][cds] Allocating RO objects ... [151.033s][info][cds] done (142334 objects) [151.033s][info][cds] Relocating embedded pointers in core regions ... 
[152.400s][info][cds] Relocating 150345176 pointers, 0 tagged, 17461 nulled [152.400s][info][aot] Make classes shareable [152.558s][info][cds] Number of classes 4485 [152.558s][info][cds] instance classes = 4340, aot-linked = 0, inited = 0 [152.558s][info][cds] boot = 838, aot-linked = 0, inited = 0 [152.558s][info][cds] vm = 153, aot-linked = 0, inited = 0 [152.558s][info][cds] platform = 0, aot-linked = 0, inited = 0 [152.558s][info][cds] app = 3502, aot-linked = 0, inited = 0 [152.558s][info][cds] unregistered = 0, aot-linked = 0, inited = 0 [152.558s][info][cds] (enum) = 30, aot-linked = 0, inited = 0 [152.558s][info][cds] (hidden) = 8, aot-linked = 0, inited = 0 [152.558s][info][cds] (old) = 0, aot-linked = 0, inited = 0 [152.558s][info][cds] (unlinked) = 0, boot = 0, plat = 0, app = 0, unreg = 0 [152.558s][info][cds] obj array classes = 136 [152.558s][info][cds] type array classes = 9 [152.558s][info][cds] symbols = 93208 [153.627s][info][aot] sorting heap objects [153.628s][info][aot] computed ranks [153.629s][info][aot] sorting heap objects done [153.635s][info][aot] Size of heap region = 1461632 bytes, 31877 objects, 13159 roots, 0 native ptrs [153.642s][info][aot] oopmap = 4 ... 365408 ( 0% ... 100% = 99%) [153.642s][info][aot] ptrmap = 35175 ... 142547 ( 19% ... 78% = 58%) [153.642s][info][aot] Dumping symbol table ... [153.652s][info][aot] Archived 0 method handle intrinsics (16 bytes) [153.652s][info][aot] Adjust lambda proxy class dictionary [153.652s][info][cds] Make training data shareable [153.890s][info][cds] Shared file region (rw) 0: 305713152 bytes, addr 0x0000000800004000 file offset 0x00004000 crc 0xa7da7141 [154.232s][info][cds] Shared file region (ro) 1: 3269057552 bytes, addr 0x0000000812394000 file offset 0x12394000 crc 0xdce21a0b [154.237s][error][cds] An error has occurred while writing the shared archive file. [154.237s][error][cds] Unable to write to shared archive. 
[154.237s][error][cds] Unable to seek to position 3574808575 (errno=9: Bad file descriptor) [154.237s][info ][cds] An error has occurred while processing the shared archive file. Run with -Xlog:aot,cds for details. [154.237s][info ][cds] unrecoverable error Error occurred during initialization of VM Unable to use shared archive. Unrecoverable archive loading error (run with -Xlog:aot,cds for details): unrecoverable error ]; stderr: [] exitValue = 1 java.lang.RuntimeException: Expected to get exit value of [0], exit value is: [1] at jdk.test.lib.process.OutputAnalyzer.shouldHaveExitValue(OutputAnalyzer.java:549) at jdk.test.lib.cds.CDSAppTester.executeAndCheck(CDSAppTester.java:219) at jdk.test.lib.cds.CDSAppTester.dumpStaticArchive(CDSAppTester.java:319) at jdk.test.lib.cds.CDSAppTester.runStaticWorkflow(CDSAppTester.java:470) at jdk.test.lib.cds.SimpleCDSAppTester.runStaticWorkflow(SimpleCDSAppTester.java:196) at LargeArchive.main(LargeArchive.java:77) at java.base/jdk.internal.reflect.DirectMethodHandleAccessor.invoke(DirectMethodHandleAccessor.java:104) at java.base/java.lang.reflect.Method.invoke(Method.java:565) at com.sun.javatest.regtest.agent.MainWrapper$MainTask.run(MainWrapper.java:138) at java.base/java.lang.Thread.run(Thread.java:1516) JavaTest Message: Test threw exception: java.lang.RuntimeException: Expected to get exit value of [0], exit value is: [1] JavaTest Message: shutting down test ... ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/29494#discussion_r2756468548 From abakhtin at openjdk.org Tue Feb 3 00:41:00 2026 From: abakhtin at openjdk.org (Alexey Bakhtin) Date: Tue, 3 Feb 2026 00:41:00 GMT Subject: RFR: 8376125: Out of memory in the CDS archive error with lot of classes In-Reply-To: References:

Message-ID: <0T2Eu5ZqTNlBV3T3wsj11szuDgtVw7yxASYDRbAl5_0=.1d49b3e9-18fe-4763-b0cb-1d15e97f7272@github.com> On Sun, 1 Feb 2026 04:34:02 GMT, Xue-Lei Andrew Fan wrote: >> There are two more apis that return "unchecked" offset: `ArchiveBuilder::buffer_to_offset()` and `ArchiveBuilder::any_to_offset()`. These apis are not returning the scaled offset. I think it is better to get rid of these apis and replace their usage with `_u4` version which has the offset range check. I noticed there are only 1-2 instances that use these "unchecked" apis. > >> There are two more apis that return "unchecked" offset: `ArchiveBuilder::buffer_to_offset()` and `ArchiveBuilder::any_to_offset()`. These apis are not returning the scaled offset. I think it is better to get rid of these apis and replace their usage with `_u4` version which has the offset range check. I noticed there are only 1-2 instances that use these "unchecked" apis. > > Thanks for the suggestion. I looked into this and found that buffer_to_offset() and any_to_offset() serve a different purpose than the _u4 versions. The _u4 versions use scaled encoding (with MetadataOffsetShift) and return a compact u4 for metadata pointer storage. The raw versions return unscaled byte offsets stored in larger types. These usages cannot switch to _u4 versions because they need raw byte offsets (not scaled) and store them in 64-bit types. > > However, the comments for the methods may be misleading after introducing the _u4 methods. What do you think to revise the comment as: > > // The address p points to an object inside the output buffer. When the archive is mapped > // at the requested address, what's the byte offset of this object from _requested_static_archive_bottom? > uintx buffer_to_offset(address p) const; > > // Same as buffer_to_offset, except that the address p points to either (a) an object > // inside the output buffer, or (b), an object in the currently mapped static archive. 
> uintx any_to_offset(address p) const; > > // The reverse of buffer_to_offset_u4() - converts scaled offset units back to buffered address. > address offset_to_buffered_address(u4 offset_units) const; > > > I am also OK to rename the method names to: `buffer_to_offset_bytes()` and `any_to_offset_bytes()`, if the new names are clearer. > > @ashu-mehra What do you think? Hi @XueleiFan, I've tried the suggested code with an archive size more than 4Gb, but it fails with an assertion: # Internal Error (aotMetaspace.cpp:1955), pid=96332, tid=4099 # guarantee(archive_space_size < max_encoding_range_size - class_space_alignment) failed: Archive too large CDC archive was created successfully: [187.068s][info ][cds ] Shared file region (rw) 0: 822453584 bytes, addr 0x0000000800004000 file offset 0x00004000 crc 0x132b652e [189.176s][info ][cds ] Shared file region (ro) 1: 3576115584 bytes, addr 0x0000000831060000 file offset 0x31060000 crc 0x71b020a2 [197.653s][info ][cds ] Shared file region (ac) 4: 0 bytes [198.870s][info ][cds ] Shared file region (bm) 2: 56555664 bytes, addr 0x0000000000000000 file offset 0x1062d4000 crc 0xbd87f804 [199.504s][info ][cds ] Shared file region (hp) 3: 16091256 bytes, addr 0x00000000ff000000 file offset 0x1098c4000 crc 0x7834b7c3 [199.684s][debug ][cds ] bm space: 56555664 [ 1.3% of total] out of 56555664 bytes [100.0% used] [199.684s][debug ][cds ] hp space: 16091256 [ 0.4% of total] out of 16091256 bytes [100.0% used] at 0x0000000c6d000000 [199.684s][debug ][cds ] total : 4471216088 [100.0% of total] out of 4471228536 bytes [100.0% used] ------------- PR Comment: https://git.openjdk.org/jdk/pull/29494#issuecomment-3838062386 From jbhateja at openjdk.org Tue Feb 3 03:31:52 2026 From: jbhateja at openjdk.org (Jatin Bhateja) Date: Tue, 3 Feb 2026 03:31:52 GMT Subject: RFR: 8376187: [VectorAPI] Define new lane type constants and pass them to intrinsic entries [v7] In-Reply-To: References: Message-ID: > As per [discussions 
](https://github.com/openjdk/jdk/pull/28002#issuecomment-3789507594) on JDK-8370691 pull request, splitting out portion of PR#28002 into a separate patch in preparation of Float16 vector API support. > > Patch add new lane type constants and pass them to vector intrinsic entry points. > > All existing Vector API jtreg test are passing with the patch. > > Kindly review and share your feedback. > > Best Regards, > Jatin Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision: Review comments resolution ------------- Changes: - all: https://git.openjdk.org/jdk/pull/29481/files - new: https://git.openjdk.org/jdk/pull/29481/files/23022d42..c1935efc Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=29481&range=06 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=29481&range=05-06 Stats: 3 lines in 2 files changed: 1 ins; 2 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/29481.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/29481/head:pull/29481 PR: https://git.openjdk.org/jdk/pull/29481 From jbhateja at openjdk.org Tue Feb 3 03:31:54 2026 From: jbhateja at openjdk.org (Jatin Bhateja) Date: Tue, 3 Feb 2026 03:31:54 GMT Subject: RFR: 8376187: [VectorAPI] Define new lane type constants and pass them to intrinsic entries [v6] In-Reply-To: References:

Message-ID: <1g1hwUyCoVEwQmSnil3tnLEbyNDXAUGkfPSz3R8lNAg=.ca6498cb-acec-4b00-9b38-a01e720046df@github.com>

On Mon, 2 Feb 2026 20:22:46 GMT, Paul Sandoz wrote:

>> Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision:
>>
>> Review comment resolution
>
> Very good. Approved, there is just one comment related to adding a comment for the LT_* values. Thank you for separating this out from the float16 PR. Needs a HotSpot reviewer too. We will run it through tier 1 to 3 testing.

Thanks @PaulSandoz, @merykitty please let me know if this is good to land.

-------------

PR Comment: https://git.openjdk.org/jdk/pull/29481#issuecomment-3838835040

From iklam at openjdk.org Tue Feb 3 04:01:02 2026
From: iklam at openjdk.org (Ioi Lam)
Date: Tue, 3 Feb 2026 04:01:02 GMT
Subject: RFR: 8376125: Out of memory in the CDS archive error with lot of classes [v3]
In-Reply-To:
References:

Message-ID: <2HWRyOkAnKfSNQEOxjsezqs0Hgx-2w0PltNil27r86o=.7d4b31ee-c06e-4c54-9eb2-b46103e2a69d@github.com>

On Sun, 1 Feb 2026 03:47:08 GMT, Xue-Lei Andrew Fan wrote:

>> src/hotspot/share/cds/aotMetaspace.cpp line 2102:
>>
>>> 2100: unmap_archive(mapinfo);
>>> 2101: return MAP_ARCHIVE_OTHER_FAILURE;
>>> 2102: }
>>
>> Since `ArchiveUtils::OFFSET_SHIFT` is a constant for this JVM build, there's no need to save it into the archive and validate the saved value at runtime. We don't perform such checks for other constants.
>>
>> The archive contains the VM version string, so it cannot be used by a different JVM build.
>
> Makes sense to me. Updated.
>
> Is it OK for CURRENT_CDS_ARCHIVE_VERSION to stay at 19 in src/hotspot/share/include/cds.h?
>
> - #define CURRENT_CDS_ARCHIVE_VERSION 19
> + #define CURRENT_CDS_ARCHIVE_VERSION 20

Since the header has not been changed, I think we should leave the version number unchanged.

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/29494#discussion_r2757071704

From iklam at openjdk.org Tue Feb 3 04:13:01 2026
From: iklam at openjdk.org (Ioi Lam)
Date: Tue, 3 Feb 2026 04:13:01 GMT
Subject: RFR: 8376125: Out of memory in the CDS archive error with lot of classes [v3]
In-Reply-To: <64hZpvWXWK3cRG_gVpeRvkMT2f35dkwJqgO0ZfY4YHY=.fe44589f-bf65-4ad7-bb57-f02da7f6548e@github.com>
References:

<43jWfoF7waaehspCCA-pV-eWsXF5AGCKvjyiC2uguTU=.297fbe19-7cb1-49e9-9994-f4b8ffb1ef09@github.com>

<64hZpvWXWK3cRG_gVpeRvkMT2f35dkwJqgO0ZfY4YHY=.fe44589f-bf65-4ad7-bb57-f02da7f6548e@github.com> Message-ID: On Mon, 2 Feb 2026 23:35:13 GMT, Xue-Lei Andrew Fan wrote: >> umm, I removed this change to `os.cpp` and ran `LargeArchive.java` test on x86-64 system and it passed. On which platform/OS did you see the failure? > > I was on MacOS. Here is the failure without the os.cpp update: > > % make test TEST="test/hotspot/jtreg/resourcehogs/runtime/aot/LargeArchive.java" JTREG="JAVA_OPTIONS=-Dtest.archive.large.all.workflows=true" > ... > [0.018s][info][aot] JVM_StartThread() ignored: java.lang.ref.Reference$ReferenceHandler > [0.018s][info][aot] JVM_StartThread() ignored: java.lang.ref.Finalizer$FinalizerThread > [0.039s][info][cds] Loading classes to share ... > [0.039s][info][cds] Parsing LargeArchive.classlist > [0.047s][info][aot] JVM_StartThread() ignored: jdk.internal.misc.InnocuousThread > [136.124s][info][cds] Parsing /Users/xuelei.fan/workspace/openjdk/jdk-xf.git/build/macosx-aarch64-server-release/images/jdk/lib/classlist (lambda form invokers only) > [136.126s][info][cds] Loading classes to share: done. > [136.127s][info][aot] Rewriting and linking classes ... > [136.245s][info][aot] Rewriting and linking classes: done > [136.245s][info][aot] Regenerate MethodHandle Holder classes... > [136.344s][info][aot] Regenerate MethodHandle Holder classes...done > [136.351s][info][cds] Dumping shared data to file: LargeArchive.static.jsa > [136.351s][info][cds] Gathering all archivable objects ... 
> [136.371s][info][cds] Skipping java/lang/invoke/BoundMethodHandle$Species_F: dynamically generated > [136.371s][info][cds] Skipping java/lang/invoke/BoundMethodHandle$Species_J: dynamically generated > [136.413s][info][cds] Skipping jdk/internal/misc/CDS$UnregisteredClassLoader$Source: used only when dumping CDS archive > [136.413s][info][cds] Skipping jdk/internal/misc/CDS$UnregisteredClassLoader: used only when dumping CDS archive > [136.417s][info][cds] Heap range = [0x00000003c0000000 - 0x00000004c0000000] > [136.430s][info][aot] Archived 7975 interned strings > [136.431s][info][cds] Gathering classes and symbols ... > [143.110s][info][cds] Sorting symbols ... > [143.113s][info][cds] Sorting classes ... > [149.873s][info][cds] Reserved output buffer space at 0x0000007000000000 [34359738368 bytes] > [149.900s][info][cds] Allocating RW objects ... > [149.954s][info][cds] done (46510 objects) > [149.954s][info][cds] Allocating RO objects ... > [151.033s][info][cds] done (142334 objects) > [151.033s][info][cds] Relocating embedded pointers in core regions ... > [152.400s][info][cds] Relocating 150345176 pointers, 0 tagged, 17461 nulled > [152.400s][info][aot] Make classes shareable > [152.558s][info][cds] Number of classes 4485 > [152.558s][info][cds] instance classes = 4340, aot-linked = 0, inited = 0 > [152.558s][info][cds] ... According to https://gitlab.haskell.org/ghc/ghc/-/issues/17414 > File reads/writes bigger than 2GB result in an "Invalid argument" exception on macOS. Files bigger than 2GB still work, but individual read/write operations bigger than 2GB fail. I think it's better to move this fix into `os::pd_write()` (within `#ifdef __APPLE__`) to limit the writes to less than 2GB. 
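
The suggestion above, limiting each individual write to less than 2GB, can be sketched as a plain POSIX loop. This is an illustrative sketch only: `write_fully` and its `max_chunk` parameter are hypothetical names, not HotSpot's `os::pd_write()`, and the real change would sit behind `#ifdef __APPLE__` as proposed.

```cpp
#include <algorithm>
#include <cerrno>
#include <cstddef>
#include <unistd.h>

// Hypothetical helper: writes all 'nbytes' from 'buf' to 'fd', never passing
// more than 'max_chunk' bytes to a single write() call. Returns false on error.
bool write_fully(int fd, const void* buf, std::size_t nbytes, std::size_t max_chunk) {
  if (max_chunk == 0) {
    return false;  // a zero-sized chunk could never make progress
  }
  const char* p = static_cast<const char*>(buf);
  std::size_t remaining = nbytes;
  while (remaining > 0) {
    const std::size_t chunk = std::min(remaining, max_chunk);
    const ssize_t n = ::write(fd, p, chunk);
    if (n < 0) {
      if (errno == EINTR) {
        continue;  // interrupted before writing anything: retry the same chunk
      }
      return false;  // real error; errno is left for the caller to report
    }
    if (n == 0) {
      return false;  // no progress on a non-empty chunk: give up rather than spin
    }
    p += n;  // partial writes are fine, just advance by what was written
    remaining -= static_cast<std::size_t>(n);
  }
  return true;
}
```

A caller on macOS might pass a cap such as `INT_MAX` for `max_chunk`. Leaving `errno` untouched on failure also lets the caller print the underlying OS error.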
-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/29494#discussion_r2757096105

From asmehra at openjdk.org Tue Feb 3 04:28:04 2026
From: asmehra at openjdk.org (Ashutosh Mehra)
Date: Tue, 3 Feb 2026 04:28:04 GMT
Subject: RFR: 8376125: Out of memory in the CDS archive error with lot of classes [v3]
In-Reply-To:
References:

<43jWfoF7waaehspCCA-pV-eWsXF5AGCKvjyiC2uguTU=.297fbe19-7cb1-49e9-9994-f4b8ffb1ef09@github.com>

<64hZpvWXWK3cRG_gVpeRvkMT2f35dkwJqgO0ZfY4YHY=.fe44589f-bf65-4ad7-bb57-f02da7f6548e@github.com> Message-ID: On Tue, 3 Feb 2026 04:10:41 GMT, Ioi Lam wrote: >> I was on MacOS. Here is the failure without the os.cpp update: >> >> % make test TEST="test/hotspot/jtreg/resourcehogs/runtime/aot/LargeArchive.java" JTREG="JAVA_OPTIONS=-Dtest.archive.large.all.workflows=true" >> ... >> [0.018s][info][aot] JVM_StartThread() ignored: java.lang.ref.Reference$ReferenceHandler >> [0.018s][info][aot] JVM_StartThread() ignored: java.lang.ref.Finalizer$FinalizerThread >> [0.039s][info][cds] Loading classes to share ... >> [0.039s][info][cds] Parsing LargeArchive.classlist >> [0.047s][info][aot] JVM_StartThread() ignored: jdk.internal.misc.InnocuousThread >> [136.124s][info][cds] Parsing /Users/xuelei.fan/workspace/openjdk/jdk-xf.git/build/macosx-aarch64-server-release/images/jdk/lib/classlist (lambda form invokers only) >> [136.126s][info][cds] Loading classes to share: done. >> [136.127s][info][aot] Rewriting and linking classes ... >> [136.245s][info][aot] Rewriting and linking classes: done >> [136.245s][info][aot] Regenerate MethodHandle Holder classes... >> [136.344s][info][aot] Regenerate MethodHandle Holder classes...done >> [136.351s][info][cds] Dumping shared data to file: LargeArchive.static.jsa >> [136.351s][info][cds] Gathering all archivable objects ... 
>> [136.371s][info][cds] Skipping java/lang/invoke/BoundMethodHandle$Species_F: dynamically generated >> [136.371s][info][cds] Skipping java/lang/invoke/BoundMethodHandle$Species_J: dynamically generated >> [136.413s][info][cds] Skipping jdk/internal/misc/CDS$UnregisteredClassLoader$Source: used only when dumping CDS archive >> [136.413s][info][cds] Skipping jdk/internal/misc/CDS$UnregisteredClassLoader: used only when dumping CDS archive >> [136.417s][info][cds] Heap range = [0x00000003c0000000 - 0x00000004c0000000] >> [136.430s][info][aot] Archived 7975 interned strings >> [136.431s][info][cds] Gathering classes and symbols ... >> [143.110s][info][cds] Sorting symbols ... >> [143.113s][info][cds] Sorting classes ... >> [149.873s][info][cds] Reserved output buffer space at 0x0000007000000000 [34359738368 bytes] >> [149.900s][info][cds] Allocating RW objects ... >> [149.954s][info][cds] done (46510 objects) >> [149.954s][info][cds] Allocating RO objects ... >> [151.033s][info][cds] done (142334 objects) >> [151.033s][info][cds] Relocating embedded pointers in core regions ... >> [152.400s][info][cds] Relocating 150345176 pointers, 0 tagged, 17461 nulled >> [152.400s][info][aot] Make classes shareable >> [152.558s][info][cds] Number of classes 4485 >> [152.558s][info][cds] instance class... > > According to https://gitlab.haskell.org/ghc/ghc/-/issues/17414 > >> File reads/writes bigger than 2GB result in an "Invalid argument" exception on macOS. Files bigger than 2GB still work, but individual read/write operations bigger than 2GB fail. > > I think it's better to move this fix into `os::pd_write()` (within `#ifdef __APPLE__`) to limit the writes to less than 2GB. @iklam thanks for digging that up. It explains why the INT_MAX limit worked. But I should also mention that the above output does not show the actual reason for the failure. Here the `os::write` failed causing the `fd` to be closed in `FileMapInfo::write_bytes`. 
However, the error is not propagated up the call chain and we end up calling `FileMapInfo::seek_to_position` which throws EBADF (Bad file descriptor). So while we can keep the change in `os::write` (or `os::pd_write` as suggested) I think we should also fix `FileMapInfo::write_bytes` to 1) print the os error, and 2) terminate the write operation gracefully. I am also fine if this is done in a follow-up pr. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/29494#discussion_r2757134702 From kbarrett at openjdk.org Tue Feb 3 06:15:36 2026 From: kbarrett at openjdk.org (Kim Barrett) Date: Tue, 3 Feb 2026 06:15:36 GMT Subject: RFR: 8332189: Enable -Wzero-as-null-pointer-constant for gcc/clang Message-ID: Please review this change which enables `-Wzero-as-null-pointer-constant` warnings in HotSpot code when building with gcc or clang. There are three parts to this change. The first part augments the warning flags setup to support adding warning options that are only applied to HotSpot, rather than the JDK as a whole. There was previously some unused and possibly incomplete support for this when using gcc. Note that the Windows/Visual Studio support hasn't been tested much, and I think might not be working yet. I'm going to investigate that further in followup work. The second part enables `-Wzero-as-null-pointer-constant` for HotSpot code. This follows the guidance to avoid such in the HotSpot Style Guide. The third part removes a note in the HotSpot Style Guide about lingering uses of literal 0 as a null pointer constant. Those have been removed, and this change will block backsliding. Testing: mach5 tier1, GHA Sanity tests Integration of this change needs to wait for JDK-8376758. 
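
As a small illustration of what the new warning targets, the sketch below uses `nullptr` throughout. The `Node` type and `list_length` function are made up for this example and do not come from HotSpot. GCC documents `-Wzero-as-null-pointer-constant` as warning when a literal `0` is used as a null pointer constant, so an initialization such as `Node* q = 0;` is the kind of code the flag is meant to catch.

```cpp
#include <cstddef>

// Hypothetical singly linked list node, for illustration only.
struct Node {
  Node* next;
  int value;
};

// Counts the nodes reachable from 'head'. Null is always spelled 'nullptr';
// writing a literal 0 in its place is what the warning is meant to catch.
std::size_t list_length(const Node* head) {
  std::size_t n = 0;
  for (const Node* p = head; p != nullptr; p = p->next) {
    ++n;
  }
  return n;
}
```

Compiling this with `g++ -Wzero-as-null-pointer-constant` should be warning-free.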
------------- Commit messages: - remove obsolete note from style guide - enable -Wzero-as-null-pointer-constant for VM with gcc/clang - support hotspot-specific warnings Changes: https://git.openjdk.org/jdk/pull/29497/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=29497&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8332189 Stats: 40 lines in 3 files changed: 14 ins; 13 del; 13 mod Patch: https://git.openjdk.org/jdk/pull/29497.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/29497/head:pull/29497 PR: https://git.openjdk.org/jdk/pull/29497 From dholmes at openjdk.org Tue Feb 3 06:34:02 2026 From: dholmes at openjdk.org (David Holmes) Date: Tue, 3 Feb 2026 06:34:02 GMT Subject: RFR: 8332189: Enable -Wzero-as-null-pointer-constant for gcc/clang In-Reply-To: References: Message-ID: On Fri, 30 Jan 2026 00:16:54 GMT, Kim Barrett wrote: > Please review this change which enables `-Wzero-as-null-pointer-constant` > warnings in HotSpot code when building with gcc or clang. > > There are three parts to this change. > > The first part augments the warning flags setup to support adding warning > options that are only applied to HotSpot, rather than the JDK as a whole. > There was previously some unused and possibly incomplete support for this when > using gcc. Note that the Windows/Visual Studio support hasn't been tested > much, and I think might not be working yet. I'm going to investigate that > further in followup work. > > The second part enables `-Wzero-as-null-pointer-constant` for HotSpot code. > This follows the guidance to avoid such in the HotSpot Style Guide. > > The third part removes a note in the HotSpot Style Guide about lingering uses > of literal 0 as a null pointer constant. Those have been removed, and this > change will block backsliding. > > Testing: mach5 tier1, GHA Sanity tests > > Integration of this change needs to wait for JDK-8376758. Looks reasonable to me. Thanks ------------- Marked as reviewed by dholmes (Reviewer). 
PR Review: https://git.openjdk.org/jdk/pull/29497#pullrequestreview-3743249921 From iklam at openjdk.org Tue Feb 3 06:40:02 2026 From: iklam at openjdk.org (Ioi Lam) Date: Tue, 3 Feb 2026 06:40:02 GMT Subject: RFR: 8376125: Out of memory in the CDS archive error with lot of classes In-Reply-To: References:

Message-ID: <2wZEIEuVyQR2YTbWlib002hxcA5VGuGbPgijtBNqE7k=.d43836be-5ace-4445-9a84-986f31d45f9b@github.com> On Sun, 1 Feb 2026 04:34:02 GMT, Xue-Lei Andrew Fan wrote: >> There are two more apis that return "unchecked" offset: `ArchiveBuilder::buffer_to_offset()` and `ArchiveBuilder::any_to_offset()`. These apis are not returning the scaled offset. I think it is better to get rid of these apis and replace their usage with `_u4` version which has the offset range check. I noticed there are only 1-2 instances that use these "unchecked" apis. > >> There are two more apis that return "unchecked" offset: `ArchiveBuilder::buffer_to_offset()` and `ArchiveBuilder::any_to_offset()`. These apis are not returning the scaled offset. I think it is better to get rid of these apis and replace their usage with `_u4` version which has the offset range check. I noticed there are only 1-2 instances that use these "unchecked" apis. > > Thanks for the suggestion. I looked into this and found that buffer_to_offset() and any_to_offset() serve a different purpose than the _u4 versions. The _u4 versions use scaled encoding (with MetadataOffsetShift) and return a compact u4 for metadata pointer storage. The raw versions return unscaled byte offsets stored in larger types. These usages cannot switch to _u4 versions because they need raw byte offsets (not scaled) and store them in 64-bit types. > > However, the comments for the methods may be misleading after introducing the _u4 methods. What do you think to revise the comment as: > > // The address p points to an object inside the output buffer. When the archive is mapped > // at the requested address, what's the byte offset of this object from _requested_static_archive_bottom? > uintx buffer_to_offset(address p) const; > > // Same as buffer_to_offset, except that the address p points to either (a) an object > // inside the output buffer, or (b), an object in the currently mapped static archive. 
> uintx any_to_offset(address p) const;
>
> // The reverse of buffer_to_offset_u4() - converts scaled offset units back to buffered address.
> address offset_to_buffered_address(u4 offset_units) const;
>
>
> I am also OK with renaming the methods to `buffer_to_offset_bytes()` and `any_to_offset_bytes()`, if the new names are clearer.
>
> @ashu-mehra What do you think?

> Hi @XueleiFan,
>
> I've tried the suggested code with an archive size of more than 4GB, but it fails with an assertion:
>
> ```
> # Internal Error (aotMetaspace.cpp:1955), pid=96332, tid=4099
> # guarantee(archive_space_size < max_encoding_range_size - class_space_alignment) failed: Archive too large
> ```
>
> The CDS archive was created successfully:
>
> ```
> [187.068s][info ][cds ] Shared file region (rw) 0: 822453584 bytes, addr 0x0000000800004000 file offset 0x00004000 crc 0x132b652e
> [189.176s][info ][cds ] Shared file region (ro) 1: 3576115584 bytes, addr 0x0000000831060000 file offset 0x31060000 crc 0x71b020a2
> [197.653s][info ][cds ] Shared file region (ac) 4: 0 bytes
> [198.870s][info ][cds ] Shared file region (bm) 2: 56555664 bytes, addr 0x0000000000000000 file offset 0x1062d4000 crc 0xbd87f804
> [199.504s][info ][cds ] Shared file region (hp) 3: 16091256 bytes, addr 0x00000000ff000000 file offset 0x1098c4000 crc 0x7834b7c3
> [199.684s][debug ][cds ] bm space: 56555664 [ 1.3% of total] out of 56555664 bytes [100.0% used]
> [199.684s][debug ][cds ] hp space: 16091256 [ 0.4% of total] out of 16091256 bytes [100.0% used] at 0x0000000c6d000000
> [199.684s][debug ][cds ] total : 4471216088 [100.0% of total] out of 4471228536 bytes [100.0% used]
> ```

I think we need to make `ArchiveUtils::MaxMetadataOffsetBytes` around 3.5 GB, since all AOT metadata is mapped into the compressed klass space, whose max size is 4GB. We want to leave some headroom for loading new classes in the production run.
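To make the numbers in this exchange concrete, here is a minimal C++ sketch of the scaled-offset scheme and of a size cap with headroom. The shift of 3 (8-byte units) comes from the PR description. The names `kOffsetShift`, `kMaxMetadataBytes`, `encode_offset`, `decode_offset` and `fits_with_headroom` are hypothetical, the cap is simply 3.5 GiB taken from the suggestion above, and counting only the rw and ro regions against the cap is an assumption. None of this is HotSpot code.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical constants: a shift of 3 means one offset unit = 8 bytes.
constexpr unsigned kOffsetShift = 3;
constexpr std::uint64_t kGiB = 1024ULL * 1024 * 1024;
// Assumed cap of 3.5 GiB, leaving headroom below a 4 GiB encoding range.
constexpr std::uint64_t kMaxMetadataBytes = 7 * kGiB / 2;

// Convert an 8-byte-aligned byte offset into scaled units that fit in 32 bits.
inline std::uint32_t encode_offset(std::uint64_t offset_bytes) {
  assert((offset_bytes & ((1ULL << kOffsetShift) - 1)) == 0 && "must be 8-byte aligned");
  assert((offset_bytes >> kOffsetShift) <= UINT32_MAX && "offset out of range");
  return static_cast<std::uint32_t>(offset_bytes >> kOffsetShift);
}

// Convert scaled units back into a byte offset.
inline std::uint64_t decode_offset(std::uint32_t offset_units) {
  return static_cast<std::uint64_t>(offset_units) << kOffsetShift;
}

// True if the regions assumed to be mapped into the class space stay under the cap.
inline bool fits_with_headroom(std::uint64_t rw_bytes, std::uint64_t ro_bytes) {
  return rw_bytes + ro_bytes <= kMaxMetadataBytes;
}
```

With the region sizes from the log above, rw + ro is 822453584 + 3576115584 = 4398569168 bytes, which already exceeds 4 GiB (4294967296 bytes), so `fits_with_headroom` returns false for that archive under this sketch's assumptions.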
------------- PR Comment: https://git.openjdk.org/jdk/pull/29494#issuecomment-3839353960 From dholmes at openjdk.org Tue Feb 3 06:56:04 2026 From: dholmes at openjdk.org (David Holmes) Date: Tue, 3 Feb 2026 06:56:04 GMT Subject: RFR: 8376568: Change Thread::getStackTrace to use handshake op for all cases [v3] In-Reply-To: References:

<6WdkzWF-d6yGLKVUP9pCiYE1ghOdL5sTlcBiA1bE4c0=.802606b6-f958-4dea-a6a7-3d8a406c177c@github.com> Message-ID: On Fri, 30 Jan 2026 08:07:44 GMT, Alan Bateman wrote: >> Still not clear to me why any new thread is not already filtered out long before now; nor why we have not needed this in the past. > > We want ThreadSnapshot.of(Thread) to accept a Thread in any state. Existing behavior is to return null for platform threads that are not alive. For virtual threads it will return a snapshot so we want to change that. The ThreadNotAlive test in the PR allows us to test these cases as they are hard to demonstrate with the thread dump. > > ThreadSnapshot.of(Thread) does not filter out the "not alive" cases. It could, in which case ThreadSnapshotFactory::get_thread_snapshot would need to assert if called with a new/unstarted thread. The terminating thread case would still need to be handled by ThreadSnapshotFactory::get_thread_snapshot. For platform threads there is no JavaThread so it bails easy. For virtual threads it needs to examine the state. Would you prefer if ThreadSnapshot.of(Thread) pre-checked the state so that get_thread_snapshot could be guaranteed to never see a new/unstarted thread? > > Update: I changed ThreadSnapshot.of(Thread) to filter before calling get_thread_snapshot, hopefully this will be easier to understand. I was assuming/expecting that the top-level code in `ThreadDumper` would filter out not-alive threads the same way `Thread.getStackTrace` does. You don't want lower-level code to have to worry about NEW threads, though of course they still have to deal with races against termination. 
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/29461#discussion_r2757501486 From rsunderbabu at openjdk.org Tue Feb 3 07:06:15 2026 From: rsunderbabu at openjdk.org (Ramkumar Sunderbabu) Date: Tue, 3 Feb 2026 07:06:15 GMT Subject: RFR: 8375443: AVX-512: Disabling through UseSHA doesn't affect UseSHA3Intrinsics [v4] In-Reply-To: References: Message-ID: > UseSHA flag is not respected while enabling/disabling UseSHA3Intrinsics flag in x86 builds. > Added UseSHA in the mix. > > Testing: Only Basic testing done. I will run more compiler related testing. Ramkumar Sunderbabu has updated the pull request incrementally with two additional commits since the last revision: - add test for unsupported platform - simpler requires condition ------------- Changes: - all: https://git.openjdk.org/jdk/pull/29266/files - new: https://git.openjdk.org/jdk/pull/29266/files/84acd692..a09cb5ad Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=29266&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=29266&range=02-03 Stats: 153 lines in 2 files changed: 150 ins; 0 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/29266.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/29266/head:pull/29266 PR: https://git.openjdk.org/jdk/pull/29266 From stuefe at openjdk.org Tue Feb 3 07:08:05 2026 From: stuefe at openjdk.org (Thomas Stuefe) Date: Tue, 3 Feb 2026 07:08:05 GMT Subject: RFR: 8376125: Out of memory in the CDS archive error with lot of classes [v3] In-Reply-To: <2nI8SoEjkM35uhS-1dUEjvHOVj2RoSFGLzK6Tk4Ck7M=.a164d5e9-47ab-4be6-9f17-d770651b616b@github.com> References: <2nI8SoEjkM35uhS-1dUEjvHOVj2RoSFGLzK6Tk4Ck7M=.a164d5e9-47ab-4be6-9f17-d770651b616b@github.com> Message-ID: On Mon, 2 Feb 2026 22:02:10 GMT, Xue-Lei Andrew Fan wrote: >> **Summary** >> This change extends the CDS/AOT archive size limit from 2GB to 32GB by using scaled offset encoding. 
>>
>> **Problem**
>> Applications with a large number of classes (e.g., 300,000+) can exceed the current 2GB archive size limit, causing archive creation to fail with:
>>
>> [error][aot] Out of memory in the CDS archive: Please reduce the number of shared classes.
>>
>>
>> **Solution**
>> Instead of storing raw byte offsets in u4 fields (limited to ~2GB), we now store scaled offset units where each unit represents 8 bytes (OFFSET_SHIFT = 3). This allows addressing up to 32GB (2^32 * 8 bytes) while maintaining backward compatibility with the existing u4 offset fields.
>>
>> Current: address = base + offset_bytes (max ~2GB)
>> Proposed: address = base + (offset_units << 3) (max 32GB)
>>
>> All archived objects are guaranteed to be 8-byte aligned. This means the lower 3 bits of any valid byte offset are always zero - we're wasting them!
>>
>> Current byte offset (aligned to 8 bytes):
>> 0x00001000 = 0000 0000 0000 0000 0001 0000 0000 0|000
>>                                                   ^^^ Always 000!
>>
>> Scaled offset (shift=3):
>> 0x00000200 = Same address, but stored in 29 bits instead of 32
>> Frees up 3 bits -> 8x larger range!
>>
>> By storing `offset_bytes >> 3` instead of `offset_bytes`, we use all 32 bits of the u4 field to represent meaningful data, extending the addressable range from 2GB to 32GB.
>>
>> **Test**
>> All tier1 and tier2 tests passed. No visible performance impact. Local benchmark shows significant performance improvement for CDS, Dynamic CDS and AOT Cache archive loading, with huge archive size (>2GB).
>> >> Archive: >> - 300000 simple classes >> - 2000 mega-classes >> - 5000 FieldObject classes >> - Total: 307000 classes >> >> AOT Cache: >> Times (wall): create=250020ms verify=2771ms baseline=15470ms perf_with_aot=2388ms >> Times (classload): verify=965ms baseline=14771ms perf_with_aot=969ms >> >> Static CDS: >> Times (wall): create=161859ms verify=2055ms baseline=15592ms perf_with_cds=1996ms >> Times (classload): verify=1027ms baseline=14852ms perf_with_cds=1... > > Xue-Lei Andrew Fan has updated the pull request incrementally with one additional commit since the last revision: > > add hotspot_resourcehogs_no_cds test group This issue definitely needs more discussion. How would this work with compressed class pointers and a limited encoding space of 4G? Note that we are in the process of removing the uncompressed Klass pointer mode: [https://bugs.openjdk.org/browse/JDK-8363996 - see ](https://bugs.openjdk.org/browse/JDK-8372065) and https://github.com/openjdk/jdk/pull/28366. See also the preceding discussions. In the future, we plan to make compact object headers the default. The current limit gives us 4GB of encoding space; that is enough (with -UseCompressedKlassPointers) for roughly 5-6 million classes, possibly more. What scenario would require more classes than that? @ping rkennke ------------- PR Comment: https://git.openjdk.org/jdk/pull/29494#issuecomment-3839483028 From duke at openjdk.org Tue Feb 3 07:10:02 2026 From: duke at openjdk.org (Shawn M Emery) Date: Tue, 3 Feb 2026 07:10:02 GMT Subject: RFR: 8374516: -version asserts with "-XX:+UseAESCTRIntrinsics -XX:-UseAES": "need AES instructions and misaligned SSE support" in generate_counterMode_AESCrypt_Parallel() In-Reply-To: References: Message-ID: <1ueZt1yRnN71yJlDZ1jsOpXgGkp4bzOxNpWjbdiXx6I=.f58e8738-db1a-40bb-8d8a-bee26d7547fe@github.com> On Wed, 21 Jan 2026 08:32:59 GMT, Guanqiang Han wrote: > Please review this change. Thanks! 
> > **Description:** > > VM crashes during startup on x86 when running with -XX:+UseAESCTRIntrinsics -XX:-UseAES. In this configuration, UseAESCTRIntrinsics may remain enabled while UseAES is explicitly disabled, and the VM generates AES-CTR stubs, hitting an assert(UseAES) in generate_counterMode_AESCrypt_Parallel(). > > **Fix:** > > Update x86 flag initialization to enforce the dependency between UseAESCTRIntrinsics and UseAES. When UseAES is disabled, explicitly disable UseAESCTRIntrinsics (with a warning when it was set on the command line), aligning behavior with the existing UseAES/UseAESIntrinsics gating and avoiding stub generation with inconsistent flag states. > > **Test:** > > GHA Nice work! Just a couple of suggestions/comments. src/hotspot/cpu/x86/vm_version_x86.cpp line 1141: > 1139: FLAG_SET_DEFAULT(UseAESIntrinsics, false); > 1140: if (UseAESCTRIntrinsics && !FLAG_IS_DEFAULT(UseAESCTRIntrinsics)) { > 1141: warning("AES_CTR intrinsics require UseAES flag to be enabled. Intrinsics will be disabled."); I propose the following changes: OLD "Intrinsics will be disabled." NEW "AES_CTR intrinsics will be disabled." test/hotspot/jtreg/compiler/cpuflags/TestUseAESCTRIntrinsicsWithUseAESDisabled.java line 28: > 26: * @bug 8374516 > 27: * @summary Regression test for -XX:+UseAESCTRIntrinsics -XX:-UseAES crash > 28: * @requires os.arch=="amd64" | os.arch=="x86_64" These are the only two architectures that exhibit this bug? I was able to reproduce the problem with this test case on my x86_64 desktop and confirmed that the fix did indeed resolve the problem. 
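The dependency being enforced can be pictured with a small stand-alone C++ model. This is only a sketch: `Flag`, `CryptoFlags` and `enforce_aes_dependencies` are hypothetical names, not the HotSpot flag machinery, and the warning string uses the wording proposed in the comment above.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical flag model: a value plus whether the user set it explicitly.
struct Flag {
  bool value;
  bool set_on_command_line;
};

struct CryptoFlags {
  Flag use_aes;
  Flag use_aes_ctr_intrinsics;
};

// Disable the dependent flag when its prerequisite is off. A warning is
// recorded only if the user asked for the dependent flag explicitly.
// Returns the warnings that would be printed.
inline std::vector<std::string> enforce_aes_dependencies(CryptoFlags& f) {
  std::vector<std::string> warnings;
  if (!f.use_aes.value && f.use_aes_ctr_intrinsics.value) {
    if (f.use_aes_ctr_intrinsics.set_on_command_line) {
      warnings.push_back("AES_CTR intrinsics require UseAES flag to be enabled. "
                         "AES_CTR intrinsics will be disabled.");
    }
    f.use_aes_ctr_intrinsics.value = false;
  }
  return warnings;
}
```

In this model, the `-XX:+UseAESCTRIntrinsics -XX:-UseAES` combination leaves the CTR flag disabled and produces one warning, while a CTR flag that was merely on by default is switched off silently.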
------------- PR Review: https://git.openjdk.org/jdk/pull/29338#pullrequestreview-3743400759 PR Review Comment: https://git.openjdk.org/jdk/pull/29338#discussion_r2757536259 PR Review Comment: https://git.openjdk.org/jdk/pull/29338#discussion_r2757539553 From duke at openjdk.org Tue Feb 3 07:10:04 2026 From: duke at openjdk.org (Shawn M Emery) Date: Tue, 3 Feb 2026 07:10:04 GMT Subject: RFR: 8374516: -version asserts with "-XX:+UseAESCTRIntrinsics -XX:-UseAES": "need AES instructions and misaligned SSE support" in generate_counterMode_AESCrypt_Parallel() In-Reply-To: References:

Message-ID: On Wed, 28 Jan 2026 11:06:09 GMT, Guanqiang Han wrote: >> Please review this change. Thanks! >> >> **Description:** >> >> VM crashes during startup on x86 when running with -XX:+UseAESCTRIntrinsics -XX:-UseAES. In this configuration, UseAESCTRIntrinsics may remain enabled while UseAES is explicitly disabled, and the VM generates AES-CTR stubs, hitting an assert(UseAES) in generate_counterMode_AESCrypt_Parallel(). >> >> **Fix:** >> >> Update x86 flag initialization to enforce the dependency between UseAESCTRIntrinsics and UseAES. When UseAES is disabled, explicitly disable UseAESCTRIntrinsics (with a warning when it was set on the command line), aligning behavior with the existing UseAES/UseAESIntrinsics gating and avoiding stub generation with inconsistent flag states. >> >> **Test:** >> >> GHA > > Hi @vnkozlov and @ascarpino, Sorry for the ping - could you please take a look at this PR when you have a moment? Hi @hgqxjj, I will take a look at the changes later today. ------------- PR Comment: https://git.openjdk.org/jdk/pull/29338#issuecomment-3837219100 From stuefe at openjdk.org Tue Feb 3 07:23:08 2026 From: stuefe at openjdk.org (Thomas Stuefe) Date: Tue, 3 Feb 2026 07:23:08 GMT Subject: RFR: 8376125: Out of memory in the CDS archive error with lot of classes [v3] In-Reply-To: <2nI8SoEjkM35uhS-1dUEjvHOVj2RoSFGLzK6Tk4Ck7M=.a164d5e9-47ab-4be6-9f17-d770651b616b@github.com> References: <2nI8SoEjkM35uhS-1dUEjvHOVj2RoSFGLzK6Tk4Ck7M=.a164d5e9-47ab-4be6-9f17-d770651b616b@github.com> Message-ID: On Mon, 2 Feb 2026 22:02:10 GMT, Xue-Lei Andrew Fan wrote: >> **Summary** >> This change extends the CDS/AOT archive size limit from 2GB to 32GB by using scaled offset encoding. >> >> **Problem** >> Applications with a large number of classes (e.g., 300,000+) can exceed the current 2GB archive size limit, causing archive creation to fail with: >> >> [error][aot] Out of memory in the CDS archive: Please reduce the number of shared classes.
>>
>>
>> **Solution**
>> Instead of storing raw byte offsets in u4 fields (limited to ~2GB), we now store scaled offset units where each unit represents 8 bytes (OFFSET_SHIFT = 3). This allows addressing up to 32GB (2^32 * 8 bytes) while maintaining backward compatibility with the existing u4 offset fields.
>>
>> Current: address = base + offset_bytes (max ~2GB)
>> Proposed: address = base + (offset_units << 3) (max 32GB)
>>
>> All archived objects are guaranteed to be 8-byte aligned. This means the lower 3 bits of any valid byte offset are always zero - we're wasting them!
>>
>> Current byte offset (aligned to 8 bytes):
>> 0x00001000 = 0000 0000 0000 0000 0001 0000 0000 0|000
>>                                                   ^^^ Always 000!
>>
>> Scaled offset (shift=3):
>> 0x00000200 = Same address, but stored in 29 bits instead of 32
>> Frees up 3 bits -> 8x larger range!
>>
>> By storing `offset_bytes >> 3` instead of `offset_bytes`, we use all 32 bits of the u4 field to represent meaningful data, extending the addressable range from 2GB to 32GB.
>>
>> **Test**
>> All tier1 and tier2 tests passed. No visible performance impact. Local benchmark shows significant performance improvement for CDS, Dynamic CDS and AOT Cache archive loading, with huge archive size (>2GB).
>>
>> Archive:
>> - 300000 simple classes
>> - 2000 mega-classes
>> - 5000 FieldObject classes
>> - Total: 307000 classes
>>
>> AOT Cache:
>> Times (wall): create=250020ms verify=2771ms baseline=15470ms perf_with_aot=2388ms
>> Times (classload): verify=965ms baseline=14771ms perf_with_aot=969ms
>>
>> Static CDS:
>> Times (wall): create=161859ms verify=2055ms baseline=15592ms perf_with_cds=1996ms
>> Times (classload): verify=1027ms baseline=14852ms perf_with_cds=1...
> > Xue-Lei Andrew Fan has updated the pull request incrementally with one additional commit since the last revision: > > add hotspot_resourcehogs_no_cds test group Looking at the issue closer, and the provided example. The classes seem to be both numerous and monstrous. How realistic is this scenario? Such objects would pose other challenges too, e.g. to GC. We can, and should, certainly make the dividing line between CDS and class space more fluid to allow for a larger CDS at the cost of the class space. As @iklam wrote, 3.5 GB, possibly even more, can be done. ------------- PR Comment: https://git.openjdk.org/jdk/pull/29494#issuecomment-3839553187 From iklam at openjdk.org Tue Feb 3 07:38:04 2026 From: iklam at openjdk.org (Ioi Lam) Date: Tue, 3 Feb 2026 07:38:04 GMT Subject: RFR: 8376125: Out of memory in the CDS archive error with lot of classes In-Reply-To: References:

Message-ID: On Mon, 2 Feb 2026 20:34:29 GMT, Ashutosh Mehra wrote: > > These usages cannot switch to _u4 versions because they need raw byte offsets (not scaled) and store them in 64-bit types. > > I am not sure why we can't store the scaled offsets in such cases. Are data structures not aligned properly that prevents from storing as scaled offsets. Its true they are stored in 64-bit types but that doesn't prevent scaling the offsets. IMO I would rather have a single API to compute offsets, otherwise we will end up with a system that has two types of offsets and it would be confusing when to use which. @iklam what do you think? I tried switching everything to the encoded offsets, but the changes are quite extensive. Most tests passed but serviceability/sa/ClhsdbCDSCore.java is still failing. Here's my patch: https://github.com/openjdk/jdk/commit/3f6dea9963bba05ca2f22abfe02199fa7767f82d I think this should be done in a follow-up RFE. In this PR, I think we should update the APIs so it's more obvious which "offset" we are talking about: - byte offsets should be called "raw offset". 
- the "u4 offset" should be called "encoded offset" So we'd have - `ArchiveUtils::encoded_offset_to_archived_address()` - `ArchiveBuilder::buffer_to_raw_offset()` - `ArchiveBuilder::any_to_encoded_offset()` - etc Eventually, I want to move the encoding logic to its own class (patterned after `CompressedKlassPointers`): https://github.com/openjdk/jdk/commit/8d5b3d5e684381005f1631e1577af2f716c4be9c ------------- PR Comment: https://git.openjdk.org/jdk/pull/29494#issuecomment-3839602927 From iklam at openjdk.org Tue Feb 3 07:47:01 2026 From: iklam at openjdk.org (Ioi Lam) Date: Tue, 3 Feb 2026 07:47:01 GMT Subject: RFR: 8376125: Out of memory in the CDS archive error with lot of classes [v3] In-Reply-To: References: <2nI8SoEjkM35uhS-1dUEjvHOVj2RoSFGLzK6Tk4Ck7M=.a164d5e9-47ab-4be6-9f17-d770651b616b@github.com> Message-ID: On Tue, 3 Feb 2026 07:05:08 GMT, Thomas Stuefe wrote: > The current limit gives us 4GB of encoding space; that is enough (with -UseCompressedKlassPointers) for roughly 5-6 million classes, possibly more. What scenario would require more classes than that? The problem is that CDS stuffs all data (not just Klasses) into the ro/rw regions, which are mapped into the compressed class space. If we want to support millions of classes, we would need to split the classes out into its own region, and map only that into the CCS. In any case, supporting very large set of classes is not our priority. I think it's OK to make small tweaks to allow more classes, but we won't have time for more drastic changes. ------------- PR Comment: https://git.openjdk.org/jdk/pull/29494#issuecomment-3839637819 From stuefe at openjdk.org Tue Feb 3 08:00:06 2026 From: stuefe at openjdk.org (Thomas Stuefe) Date: Tue, 3 Feb 2026 08:00:06 GMT Subject: RFR: 8376125: Out of memory in the CDS archive error with lot of classes [v3] In-Reply-To: References: