From kvn at openjdk.org Sun Jun 1 00:11:55 2025
From: kvn at openjdk.org (Vladimir Kozlov)
Date: Sun, 1 Jun 2025 00:11:55 GMT
Subject: RFR: 8357175: Failure to generate or load AOT code should be handled gracefully
In-Reply-To:
References:

Message-ID: <_dcowgIjM5R9m4Ye0BNclWFRkNW_GisoCsrSAW4b0rI=.fced02a6-16ac-4a56-bd45-3e3b6e764bb5@github.com>

On Sat, 31 May 2025 19:16:17 GMT, Ashutosh Mehra wrote:

>> By default a failed AOT code should be discarded with UL message about it by request (`-Xlog:aot+codecache+*=debug`) and VM and AOT code processing should continue run.
>>
>> Unless we hit some catastrophic failure: OOM for example. This is similar how JIT compilers behave.
>>
>> I reordered VM configuration settings checking (`Config::verify()`) so that we switch off AOT code caching type which depends on these VM settings. For example, AOT adapters do not operate on oops - they are not affected by compressed oops settings/encoding. I removed `_objectAlignment` check because CDS already does this check when open archive.
>>
>> The AOT relocation processing for a blob will skip this blob when corresponding address is not found instead of bailing out VM in product mode. In debug VM it will issue assert so we know about missing address. These changes are in `AOTCodeAddressTable::id_for_address()`
>>
>> I kept `fatal()` in `AOTCodeAddressTable::for_address_for_id()` for incorrect ID we read from archive. The archive could be corrupted if ID is wrong.
>>
>> I did small code cleanup/renaming.
>>
>> Tested: tier1-10
>
> src/hotspot/share/code/aotCodeCache.cpp line 434:
>
>> 432:
>> 433: if (((_flags & enableContendedPadding) != 0) != EnableContended) {
>> 434: log_debug(aot, codecache, init)("AOT Code Cache disabled: it was created with EnableContended = %s", EnableContended ? "false" : "true");
>
> This check says code cache is disabled, but we still return true. Same with other checks following this. Is that intentional?

The rest of the checks are for nmethods, UseCodeCaching. Maybe I should remove it to avoid confusion.

> src/hotspot/share/code/aotCodeCache.cpp line 1011:
>
>> 1009: }
>> 1010: case relocInfo::runtime_call_w_cp_type:
>> 1011: log_debug(aot, codecache, reloc)("runtime_call_w_cp_type relocation is not unimplemented");
>
> typo: "relocation is not unimplemented" -> "relocation is unimplemented"

Fixed.

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/25525#discussion_r2118436203
PR Review Comment: https://git.openjdk.org/jdk/pull/25525#discussion_r2118436486

From kvn at openjdk.org Sun Jun 1 00:22:58 2025
From: kvn at openjdk.org (Vladimir Kozlov)
Date: Sun, 1 Jun 2025 00:22:58 GMT
Subject: RFR: 8357175: Failure to generate or load AOT code should be handled gracefully
In-Reply-To:
References:

Message-ID:

On Sat, 31 May 2025 19:50:04 GMT, Ashutosh Mehra wrote:

>> By default a failed AOT code should be discarded with UL message about it by request (`-Xlog:aot+codecache+*=debug`) and VM and AOT code processing should continue run.
>>
>> Unless we hit some catastrophic failure: OOM for example. This is similar how JIT compilers behave.
>>
>> I reordered VM configuration settings checking (`Config::verify()`) so that we switch off AOT code caching type which depends on these VM settings. For example, AOT adapters do not operate on oops - they are not affected by compressed oops settings/encoding. I removed `_objectAlignment` check because CDS already does this check when open archive.
>>
>> The AOT relocation processing for a blob will skip this blob when corresponding address is not found instead of bailing out VM in product mode. In debug VM it will issue assert so we know about missing address. These changes are in `AOTCodeAddressTable::id_for_address()`
>>
>> I kept `fatal()` in `AOTCodeAddressTable::for_address_for_id()` for incorrect ID we read from archive. The archive could be corrupted if ID is wrong.
>>
>> I did small code cleanup/renaming.
>>
>> Tested: tier1-10
>
> src/hotspot/share/code/aotCodeCache.cpp line 985:
>
>> 983: // ------------ process code and data --------------
>> 984:
>> 985: #define BAD_ADDRESS_ID -2
>
> Can you please add a comment to indicate why -1 is not used.
> From the comment in `id_for_address`, I guess it is because -1 is a valid id for representing jump to itself in static call stub. Is that correct?
>
>     int id = -1;
>     if (addr == (address)-1) { // Static call stub has jump to itself
>       return id;
>     }

Yes, it is correct. I will add the comment:

    // Can't use -1. It is a valid value for the jump-to-itself destination
    // used by static call stub: see NativeJump::jump_destination().

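The reason for picking -2 can be illustrated with a small model. The sketch below is Java rather than HotSpot's C++, and every name in it (`AddressTableSketch`, the `KNOWN` table, `canRelocate`) is hypothetical; it only assumes, as stated above, that -1 is already a valid ID for the jump-to-itself destination.

```java
import java.util.Map;

// Hypothetical model of the sentinel choice; this is not HotSpot code.
final class AddressTableSketch {
    // -1 is already taken: it is the ID of the "jump to itself" destination.
    static final int SELF_JUMP_ID = -1;
    // A distinct value is therefore needed to mean "address not found".
    static final int BAD_ADDRESS_ID = -2;
    // Models the C++ value (address)-1.
    static final long SELF_JUMP_ADDRESS = -1L;

    // Made-up table of known addresses and their IDs.
    private static final Map<Long, Integer> KNOWN = Map.of(0x1000L, 0, 0x2000L, 1);

    static int idForAddress(long addr) {
        if (addr == SELF_JUMP_ADDRESS) {
            return SELF_JUMP_ID; // valid result, must not look like a failure
        }
        return KNOWN.getOrDefault(addr, BAD_ADDRESS_ID);
    }

    // Returns false when any target is unknown, so the caller can skip
    // the blob instead of aborting.
    static boolean canRelocate(long[] targets) {
        for (long target : targets) {
            if (idForAddress(target) == BAD_ADDRESS_ID) {
                return false;
            }
        }
        return true;
    }
}
```

If -1 doubled as the failure marker, a static call stub's self-jump would be indistinguishable from a missing address, and a valid blob would be skipped.
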
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/25525#discussion_r2118454632 From kvn at openjdk.org Sun Jun 1 00:28:15 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Sun, 1 Jun 2025 00:28:15 GMT Subject: RFR: 8357175: Failure to generate or load AOT code should be handled gracefully In-Reply-To: References: Message-ID: On Thu, 29 May 2025 18:45:11 GMT, Vladimir Kozlov wrote: > By default a failed AOT code should be discarded with UL message about it by request (`-Xlog:aot+codecache+*=debug`) and VM and AOT code processing should continue run. > > Unless we hit some catastrophic failure: OOM for example. This is similar how JIT compilers behave. > > I reordered VM configuration settings checking (`Config::verify()`) so that we switch off AOT code caching type which depends on these VM settings. For example, AOT adapters do not operate on oops - they are not affected by compressed oops settings/encoding. I removed `_objectAlignment` check because CDS already does this check when open archive. > > The AOT relocation processing for a blob will skip this blob when corresponding address is not found instead of bailing out VM in product mode. In debug VM it will issue assert so we know about missing address. These changes are in `AOTCodeAddressTable::id_for_address()` > > I kept `fatal()` in `AOTCodeAddressTable::for_address_for_id()` for incorrect ID we read from archive. The archive could be corrupted if ID is wrong. > > I did small code cleanup/renaming. > > Tested: tier1-10 Thank you, @ashu-mehra. I addressed your comments. 
------------- PR Comment: https://git.openjdk.org/jdk/pull/25525#issuecomment-2926099925 From kvn at openjdk.org Sun Jun 1 00:28:15 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Sun, 1 Jun 2025 00:28:15 GMT Subject: RFR: 8357175: Failure to generate or load AOT code should be handled gracefully [v2] In-Reply-To: References: Message-ID: > By default a failed AOT code should be discarded with UL message about it by request (`-Xlog:aot+codecache+*=debug`) and VM and AOT code processing should continue run. > > Unless we hit some catastrophic failure: OOM for example. This is similar how JIT compilers behave. > > I reordered VM configuration settings checking (`Config::verify()`) so that we switch off AOT code caching type which depends on these VM settings. For example, AOT adapters do not operate on oops - they are not affected by compressed oops settings/encoding. I removed `_objectAlignment` check because CDS already does this check when open archive. > > The AOT relocation processing for a blob will skip this blob when corresponding address is not found instead of bailing out VM in product mode. In debug VM it will issue assert so we know about missing address. These changes are in `AOTCodeAddressTable::id_for_address()` > > I kept `fatal()` in `AOTCodeAddressTable::for_address_for_id()` for incorrect ID we read from archive. The archive could be corrupted if ID is wrong. > > I did small code cleanup/renaming. 
>
> Tested: tier1-10

Vladimir Kozlov has updated the pull request incrementally with one additional commit since the last revision:

  address comments

-------------

Changes:
  - all: https://git.openjdk.org/jdk/pull/25525/files
  - new: https://git.openjdk.org/jdk/pull/25525/files/3399e5f9..497c141d

Webrevs:
 - full: https://webrevs.openjdk.org/?repo=jdk&pr=25525&range=01
 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=25525&range=00-01

  Stats: 22 lines in 1 file changed: 2 ins; 18 del; 2 mod
  Patch: https://git.openjdk.org/jdk/pull/25525.diff
  Fetch: git fetch https://git.openjdk.org/jdk.git pull/25525/head:pull/25525

PR: https://git.openjdk.org/jdk/pull/25525

From kvn at openjdk.org Sun Jun 1 00:29:50 2025
From: kvn at openjdk.org (Vladimir Kozlov)
Date: Sun, 1 Jun 2025 00:29:50 GMT
Subject: RFR: 8358230: Incorrect location for the assert for blob != nullptr in CodeBlob::create
In-Reply-To:
References:
Message-ID:

On Sat, 31 May 2025 20:45:56 GMT, Ashutosh Mehra wrote:

> A trivial fix to move the assert for `blob != nullptr` before any usage of the `blob`

Yes, it is trivial.

-------------

Marked as reviewed by kvn (Reviewer).

PR Review: https://git.openjdk.org/jdk/pull/25566#pullrequestreview-2884824886

From asmehra at openjdk.org Sun Jun 1 01:05:09 2025
From: asmehra at openjdk.org (Ashutosh Mehra)
Date: Sun, 1 Jun 2025 01:05:09 GMT
Subject: RFR: 8357175: Failure to generate or load AOT code should be handled gracefully [v2]
In-Reply-To:
References:

Message-ID: On Sun, 1 Jun 2025 00:28:15 GMT, Vladimir Kozlov wrote: >> By default a failed AOT code should be discarded with UL message about it by request (`-Xlog:aot+codecache+*=debug`) and VM and AOT code processing should continue run. >> >> Unless we hit some catastrophic failure: OOM for example. This is similar how JIT compilers behave. >> >> I reordered VM configuration settings checking (`Config::verify()`) so that we switch off AOT code caching type which depends on these VM settings. For example, AOT adapters do not operate on oops - they are not affected by compressed oops settings/encoding. I removed `_objectAlignment` check because CDS already does this check when open archive. >> >> The AOT relocation processing for a blob will skip this blob when corresponding address is not found instead of bailing out VM in product mode. In debug VM it will issue assert so we know about missing address. These changes are in `AOTCodeAddressTable::id_for_address()` >> >> I kept `fatal()` in `AOTCodeAddressTable::for_address_for_id()` for incorrect ID we read from archive. The archive could be corrupted if ID is wrong. >> >> I did small code cleanup/renaming. >> >> Tested: tier1-10 > > Vladimir Kozlov has updated the pull request incrementally with one additional commit since the last revision: > > address comments Marked as reviewed by asmehra (Committer). Thanks for addressing the comments. Looks good. 
-------------

PR Review: https://git.openjdk.org/jdk/pull/25525#pullrequestreview-2884902972
PR Comment: https://git.openjdk.org/jdk/pull/25525#issuecomment-2926206541

From asmehra at openjdk.org Sun Jun 1 01:08:01 2025
From: asmehra at openjdk.org (Ashutosh Mehra)
Date: Sun, 1 Jun 2025 01:08:01 GMT
Subject: Integrated: 8358230: Incorrect location for the assert for blob != nullptr in CodeBlob::create
In-Reply-To:
References:
Message-ID:

On Sat, 31 May 2025 20:45:56 GMT, Ashutosh Mehra wrote:

> A trivial fix to move the assert for `blob != nullptr` before any usage of the `blob`

This pull request has now been integrated.

Changeset: 59dc8499
Author:    Ashutosh Mehra
URL:       https://git.openjdk.org/jdk/commit/59dc849909c1edc892c94a27b0340fcf53db3a98
Stats:     3 lines in 1 file changed: 2 ins; 1 del; 0 mod

8358230: Incorrect location for the assert for blob != nullptr in CodeBlob::create

Reviewed-by: kvn

-------------

PR: https://git.openjdk.org/jdk/pull/25566

From iveresov at openjdk.org Sun Jun 1 03:03:51 2025
From: iveresov at openjdk.org (Igor Veresov)
Date: Sun, 1 Jun 2025 03:03:51 GMT
Subject: RFR: 8357175: Failure to generate or load AOT code should be handled gracefully [v2]
In-Reply-To:
References:

Message-ID: On Sun, 1 Jun 2025 00:28:15 GMT, Vladimir Kozlov wrote: >> By default a failed AOT code should be discarded with UL message about it by request (`-Xlog:aot+codecache+*=debug`) and VM and AOT code processing should continue run. >> >> Unless we hit some catastrophic failure: OOM for example. This is similar how JIT compilers behave. >> >> I reordered VM configuration settings checking (`Config::verify()`) so that we switch off AOT code caching type which depends on these VM settings. For example, AOT adapters do not operate on oops - they are not affected by compressed oops settings/encoding. I removed `_objectAlignment` check because CDS already does this check when open archive. >> >> The AOT relocation processing for a blob will skip this blob when corresponding address is not found instead of bailing out VM in product mode. In debug VM it will issue assert so we know about missing address. These changes are in `AOTCodeAddressTable::id_for_address()` >> >> I kept `fatal()` in `AOTCodeAddressTable::for_address_for_id()` for incorrect ID we read from archive. The archive could be corrupted if ID is wrong. >> >> I did small code cleanup/renaming. >> >> Tested: tier1-10 > > Vladimir Kozlov has updated the pull request incrementally with one additional commit since the last revision: > > address comments Marked as reviewed by iveresov (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/25525#pullrequestreview-2884989001 From kvn at openjdk.org Sun Jun 1 03:59:55 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Sun, 1 Jun 2025 03:59:55 GMT Subject: Integrated: 8357175: Failure to generate or load AOT code should be handled gracefully In-Reply-To: References: Message-ID: On Thu, 29 May 2025 18:45:11 GMT, Vladimir Kozlov wrote: > By default a failed AOT code should be discarded with UL message about it by request (`-Xlog:aot+codecache+*=debug`) and VM and AOT code processing should continue run. 
> > Unless we hit some catastrophic failure: OOM for example. This is similar how JIT compilers behave. > > I reordered VM configuration settings checking (`Config::verify()`) so that we switch off AOT code caching type which depends on these VM settings. For example, AOT adapters do not operate on oops - they are not affected by compressed oops settings/encoding. I removed `_objectAlignment` check because CDS already does this check when open archive. > > The AOT relocation processing for a blob will skip this blob when corresponding address is not found instead of bailing out VM in product mode. In debug VM it will issue assert so we know about missing address. These changes are in `AOTCodeAddressTable::id_for_address()` > > I kept `fatal()` in `AOTCodeAddressTable::for_address_for_id()` for incorrect ID we read from archive. The archive could be corrupted if ID is wrong. > > I did small code cleanup/renaming. > > Tested: tier1-10 This pull request has now been integrated. Changeset: e3eb089d Author: Vladimir Kozlov URL: https://git.openjdk.org/jdk/commit/e3eb089d47d62ae6feeba3dc6b3752a025e27bed Stats: 130 lines in 2 files changed: 41 ins; 46 del; 43 mod 8357175: Failure to generate or load AOT code should be handled gracefully Reviewed-by: iveresov, asmehra ------------- PR: https://git.openjdk.org/jdk/pull/25525 From epeter at openjdk.org Sun Jun 1 05:37:08 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Sun, 1 Jun 2025 05:37:08 GMT Subject: RFR: 8344942: Template-Based Testing Framework [v61] In-Reply-To: References:

Message-ID: <2nsC6sfjkW6j7aMI9TwUgOM4qcyqQj03xGQ8WKfd2VU=.46a960b2-941e-40dc-917f-331aea0e6a70@github.com>

On Fri, 30 May 2025 07:54:53 GMT, Christian Hagedorn wrote:

>> Emanuel Peter has updated the pull request incrementally with two additional commits since the last revision:
>>
>> - Merge branch 'JDK-8344942-TemplateFramework-v3' of https://github.com/eme64/jdk into JDK-8344942-TemplateFramework-v3
>> - move verification
>
> test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestTutorial.java line 151:
>
>> 149: "System.out.println(", arg, ");\n", // capture arg via lambda argument
>> 150: "System.out.println(#arg);\n", // capture arg via hashtag replacement
>> 151: "System.out.println(#{arg});\n", // capture arg via hashtag replacement with brackets
>
> It's not clear here why one should use brackets. If there is an argument for those further down, then you can cross reference. Otherwise, it might need some explanation here.

I rewrote the whole section a little:

    // It would have been optimal to use Java String Templates to format
    // argument values into Strings. However, since these are not (yet)
    // available, the Template Framework provides two alternative ways of
    // formatting Strings:
    // 1) By appending to the comma-separated list of Tokens passed to body().
    // Appending as a Token works whenever one has a reference to the Object
    // in Java code. But often, this is rather cumbersome and looks awkward,
    // given all the additional quotes and commas required. Hence, it
    // is encouraged to only use this method when necessary.
    // 2) By hashtag replacements inside a single string. One can either
    // use "#arg" directly, or use brackets "#{arg}". When possible, one
    // should prefer avoiding the brackets, as they create additional
    // noise. However, there are cases where they are useful, for
    // example "#TYPE_CON" would be parsed as a hashtag replacement
    // for the hashtag name "TYPE_CON", whereas "#{TYPE}_CON" is
    // parsed as hashtag name "TYPE", followed by literal string "_CON".
    // See also: generateWithHashtagAndDollarReplacements2
    // There are two ways to define the value of a hashtag replacement:
    // a) Capturing Template arguments as Strings.
    // b) Using a "let" definition (see examples further down).
    // Which one should be preferred is a code style question. Generally, we
    // prefer the use of hashtag replacements because that allows easy use of
    // multiline strings (i.e. text blocks).

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/24217#discussion_r2118746980

From epeter at openjdk.org Sun Jun 1 05:45:08 2025
From: epeter at openjdk.org (Emanuel Peter)
Date: Sun, 1 Jun 2025 05:45:08 GMT
Subject: RFR: 8344942: Template-Based Testing Framework [v61]
In-Reply-To:
References:

Message-ID: On Fri, 30 May 2025 08:08:20 GMT, Christian Hagedorn wrote: >> Emanuel Peter has updated the pull request incrementally with two additional commits since the last revision: >> >> - Merge branch 'JDK-8344942-TemplateFramework-v3' of https://github.com/eme64/jdk into JDK-8344942-TemplateFramework-v3 >> - move verification > > test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestTutorial.java line 258: > >> 256: >> 257: // Render templateClass to String. >> 258: return templateClass.render(); > > When printing this, it starts at `var_2` and not `var_1`. Why is that? The `nextTemplateFrameId` starts at zero, and is incremented for every Template instantiation. The `templateClass` has `nextTemplateFrameId=1`. If there was any use of `$`, it would append `_1`. For `template1.asToken(1)` we have `nextTemplateFrameId=2` -> produces the `var_2`. Generally, the API does not make any guarantees about what id we give, it is just unique. Is that ok for you? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24217#discussion_r2118752174 From epeter at openjdk.org Sun Jun 1 05:45:08 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Sun, 1 Jun 2025 05:45:08 GMT Subject: RFR: 8344942: Template-Based Testing Framework [v61] In-Reply-To: References:

Message-ID: On Sun, 1 Jun 2025 05:41:29 GMT, Emanuel Peter wrote: >> test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestTutorial.java line 258: >> >>> 256: >>> 257: // Render templateClass to String. >>> 258: return templateClass.render(); >> >> When printing this, it starts at `var_2` and not `var_1`. Why is that? > > The `nextTemplateFrameId` starts at zero, and is incremented for every Template instantiation. > The `templateClass` has `nextTemplateFrameId=1`. If there was any use of `$`, it would append `_1`. > For `template1.asToken(1)` we have `nextTemplateFrameId=2` -> produces the `var_2`. > Generally, the API does not make any guarantees about what id we give, it is just unique. > > Is that ok for you? Ah, I guess the comment above talks about `var_1, var_2 ...` hmm. I suppose I can add another comment for that in the test code. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24217#discussion_r2118752394 From epeter at openjdk.org Sun Jun 1 06:02:05 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Sun, 1 Jun 2025 06:02:05 GMT Subject: RFR: 8344942: Template-Based Testing Framework [v67] In-Reply-To: References: Message-ID: <3o8bVN9T_7p1h4miFfyUXDnyESEh4YAMzJhPcmE6XmI=.be1003c1-6867-4971-be12-1aa9389cf25e@github.com> > **Goal** > We want to generate Java source code: > - Make it easy to generate variants of tests. E.g. for each offset, for each operator, for each type, etc. > - Enable the generation of domain specific fuzzers (e.g. random expressions and statements). > > Note: with the Template Library draft I was already able to find a [list of bugs](https://bugs.openjdk.org/issues/?jql=labels%20%3D%20template-framework%20ORDER%20BY%20created%20DESC%2C%20summary%20DESC). 
> > **How to get started** > When reviewing, please start by looking at: > https://github.com/openjdk/jdk/blob/d21a8aabaf3b191e851b6997c11bb30fcd0f942f/test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestSimple.java#L60-L76 > > We have a Template with two arguments. They are typed (Integer and String). We then apply the arguments `template.withArgs(42, "7")`, producing a `TemplateWithArgs`. This can then be `render`ed to a String. And then that can be compiled and executed with the CompileFramework. > > Second, look at this advanced test: > https://github.com/openjdk/jdk/blob/77079807042fc5a3af04e0ccccad4ecd89e21cdb/test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestAdvanced.java#L102-L119 > > And then for a "tutorial", look at: > `test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestTutorial.java` > > It shows these features: > - The `body` of a Template is essentially a list of `Token`s that are concatenated. > - Templates can be nested: a `TemplateWithArgs` is also a `Token`. > - We can use `#name` replacements to directly format values into the String. If we had proper String Templates in Java, we would not need this feature. > - We can use `$var` to make variable names unique: if we applied the same template twice, we would get variable collisions. `$var` is then replaced with e.g. `var_7` in one template use and `var_42` in the other template use. > - The use of `Hook`s to insert code into outer (earlier) code locations. This is useful, for example, to insert fields on demand. > - The use of recursive templates, and `fuel` to limit the recursion. > - `Name`s: useful to register field and variable names in code scopes. > > Next, look at the documentation in. This file is the heart of the Template Framework, and describes all the important features. 
> https://github.com/openjdk/jdk/blob/d21a8aabaf3b191e851b6997c11bb30fcd0f942f/test/hotspot/jtreg/compiler/lib/template_framework/Template.java#L31-L76 > > For a better experience, you may want to generate the `javadocs`: > `javadoc -sourcepath test/hotspot/j... Emanuel Peter has updated the pull request incrementally with one additional commit since the last revision: more improvements ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24217/files - new: https://git.openjdk.org/jdk/pull/24217/files/ea2bb65d..68b45b1c Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24217&range=66 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24217&range=65-66 Stats: 25 lines in 1 file changed: 21 ins; 1 del; 3 mod Patch: https://git.openjdk.org/jdk/pull/24217.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24217/head:pull/24217 PR: https://git.openjdk.org/jdk/pull/24217 From epeter at openjdk.org Sun Jun 1 06:02:05 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Sun, 1 Jun 2025 06:02:05 GMT Subject: RFR: 8344942: Template-Based Testing Framework [v61] In-Reply-To: References:

Message-ID:

On Sun, 1 Jun 2025 05:42:37 GMT, Emanuel Peter wrote:

>> The `nextTemplateFrameId` starts at zero, and is incremented for every Template instantiation.
>> The `templateClass` has `nextTemplateFrameId=1`. If there was any use of `$`, it would append `_1`.
>> For `template1.asToken(1)` we have `nextTemplateFrameId=2` -> produces the `var_2`.
>> Generally, the API does not make any guarantees about what id we give, it is just unique.
>>
>> Is that ok for you?
>
> Ah, I guess the comment above talks about `var_1, var_2 ...` hmm. I suppose I can add another comment for that in the test code.

Wrote this:

    var templateClass = Template.make(() -> body(
        // The Template Framework API only guarantees that every Template use
        // has a unique ID. When using the Templates, all we need is that
        // variables from different Template uses do not conflict. But it can
        // be helpful to understand how the IDs are produced. The implementation
        // simply gives the first Template use the ID=1, and increments from there.
        //
        // In this example, the templateClass is the first Template use, and
        // has ID=1. We never use a dollar replacement here, so the code will
        // not show any "_1".
        """
        package p.xyz;

        public class InnerTest3 {
            public static void main() {
        """,
        // Second Template use: ID=2 -> var_2
        template1.asToken(1),
        // Third Template use: ID=3 -> var_3
        template1.asToken(7),
        // Fourth Template use with template2, no use of dollar, so
        // no "_4" shows up in the generated code. Internally, it
        // calls template1, which is the fifth Template use, with
        // ID = 5 -> var_5
        template2.asToken(2),
        // Sixth and Seventh Template use -> var_7
        template2.asToken(5),
        // Eighth Template use with template4 -> var_8.
        // Ninth Template use with internal call to template3,
        // The local "$var" turns to "var_9", but the Template
        // argument captured value = "var_8" from the outer
        // template use of $("var").
        template4.asToken(),
        """
            }
        }
        """
    ));

    // Render templateClass to String.
    return templateClass.render();

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/24217#discussion_r2118771773

From epeter at openjdk.org Sun Jun 1 06:02:06 2025
From: epeter at openjdk.org (Emanuel Peter)
Date: Sun, 1 Jun 2025 06:02:06 GMT
Subject: RFR: 8344942: Template-Based Testing Framework [v61]
In-Reply-To:
References:

Message-ID: On Fri, 30 May 2025 08:22:00 GMT, Christian Hagedorn wrote: >> Emanuel Peter has updated the pull request incrementally with two additional commits since the last revision: >> >> - Merge branch 'JDK-8344942-TemplateFramework-v3' of https://github.com/eme64/jdk into JDK-8344942-TemplateFramework-v3 >> - move verification > > test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestTutorial.java line 306: > >> 304: var myHook = new Hook("MyHook"); >> 305: >> 306: var template1 = Template.make("name", "value", (String name, Integer value) -> body( > > One could generally think about using `_` for unused lambda parameters which I think is the common convention. But then I guess we would need to update the documentation about saying "name" and "String name" should be the same and make an exception for unused ones. I don't know. I think it is better to keep the names duplicated. This gives the reader an easier visual aid to check which name has which type. What do you think? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24217#discussion_r2118774938 From epeter at openjdk.org Sun Jun 1 16:03:08 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Sun, 1 Jun 2025 16:03:08 GMT Subject: RFR: 8344942: Template-Based Testing Framework [v61] In-Reply-To: References:

Message-ID:

On Fri, 30 May 2025 08:39:14 GMT, Christian Hagedorn wrote:

>> test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestTutorial.java line 358:
>>
>>> 356:
>>> 357: // We saw the use of custom hooks above, but now we look at the use of CLASS_HOOK and METHOD_HOOK
>>> 358: // from the Template Library.
>>
>> Can you expand here on why it's better to use them instead of creating your own? Is it just readability/convenience?
>
> Another question which is not evidently clear by following the examples: Can and should (not) you use the same hook inside the hook itself, i.e.:
>
>     Hooks.CLASS_HOOK.anchor(
>         Hooks.CLASS_HOOK.anchor(
>             // ...
>
> This is probably not done on purpose but such a situation could arise when nesting more templates and suddenly one anchors the same hook again?

I extended the explanations:

    // We saw the use of custom hooks above, but now we look at the use of CLASS_HOOK and METHOD_HOOK.
    // By convention, we use the CLASS_HOOK for class scopes, and METHOD_HOOK for method scopes.
    // Whenever we open a class scope, we should anchor a CLASS_HOOK for that scope, and whenever we
    // open a method, we should anchor a METHOD_HOOK. Conversely, this allows us to check if we are
    // inside a class or method scope by querying "isAnchored". This convention helps us when building
    // a large library of Templates. But if you are writing your own self-contained set of Templates,
    // you do not have to follow this convention.
    //
    // Hooks are "re-entrant", that is, we can anchor the same hook inside a scope where we have
    // already anchored it. The "Hook.insert" always goes to the innermost anchoring of that
    // hook. There are cases where "re-entrant" Hooks are helpful such as nested classes, where
    // there is a class scope inside another class scope. Similarly, we can nest lambda bodies
    // inside method bodies, so also METHOD_HOOK can be used in such a "re-entrant" way.

We could consider having both "re-entrant" and "non-re-entrant" Hooks. But I'm not yet convinced it is a very useful feature. Sure, there could be some confusion with nested hooks. But I think that confusion is inherent to code generation, because we can also nest class and method/lambda scopes.

What do you think?

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/24217#discussion_r2119274873

From epeter at openjdk.org Sun Jun 1 16:03:10 2025
From: epeter at openjdk.org (Emanuel Peter)
Date: Sun, 1 Jun 2025 16:03:10 GMT
Subject: RFR: 8344942: Template-Based Testing Framework [v61]
In-Reply-To:
References:

Message-ID:

On Fri, 30 May 2025 08:57:44 GMT, Christian Hagedorn wrote:

>> Emanuel Peter has updated the pull request incrementally with two additional commits since the last revision:
>>
>> - Merge branch 'JDK-8344942-TemplateFramework-v3' of https://github.com/eme64/jdk into JDK-8344942-TemplateFramework-v3
>> - move verification
>
> test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestTutorial.java line 454:
>
>> 452: // For every recursion depth, some fuel is automatically subtracted
>> 453: // so that the fuel slowly depletes with the depth.
>> 454: // We keep the recursion going until the fuel is depleted.
>
> You can also note here that if we forget to check the `fuel()`, the renderer causes a stack overflow because the recursion never ends.

Good idea! Added.

> test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestTutorial.java line 487:
>
>> 485: // in this scope, and in any nested scope, including nested Templates. This allows us to
>> 486: // add some fields and registers in one Template, and later on, in another Template, we
>> 487: // can access these fields and registers again with "dataNames()".
>
> What do you mean by "registers"?

Hmm good question. I think I meant "variables". Changed it!

> test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestTutorial.java line 596:
>
>> 594: @Override
>> 595: public boolean isSubtypeOf(DataName.Type other) {
>> 596: return other instanceof MyPrimitive(String n) && n == name();
>
> Is `==` intended? Should it be `equals()`?

Nice catch, fixed. Well it did not matter here, but it is good practice I guess.

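For readers wondering why `equals()` is the safer choice, here is a minimal sketch. `PrimitiveSketch` is a hypothetical stand-in for the tutorial's `MyPrimitive`, not the actual test code, and it assumes Java 16 or newer for records and `instanceof` patterns.

```java
// Hypothetical stand-in for the tutorial's MyPrimitive type.
record PrimitiveSketch(String name) {

    // Reference comparison: true only when both names are the same String object.
    boolean sameNameByIdentity(Object other) {
        return other instanceof PrimitiveSketch p && p.name() == name();
    }

    // Value comparison: true whenever the characters match.
    boolean sameNameByEquals(Object other) {
        return other instanceof PrimitiveSketch p && p.name().equals(name());
    }
}
```

Two names written as the same literal share one interned String, which is likely why `==` did not matter in the tutorial. A name created at runtime, for example with `new String("int")`, matches only under `equals()`.
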
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24217#discussion_r2119278069 PR Review Comment: https://git.openjdk.org/jdk/pull/24217#discussion_r2119276977 PR Review Comment: https://git.openjdk.org/jdk/pull/24217#discussion_r2119278275 From epeter at openjdk.org Sun Jun 1 16:06:53 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Sun, 1 Jun 2025 16:06:53 GMT Subject: RFR: 8344942: Template-Based Testing Framework [v68] In-Reply-To: References: Message-ID: <2xkmqbUmlAvSV6SUym7pUeA_gwTDErFOMPuzTZ86TAI=.4a2da8f6-db82-46a0-b5b7-3f8fa4b30385@github.com> > **Goal** > We want to generate Java source code: > - Make it easy to generate variants of tests. E.g. for each offset, for each operator, for each type, etc. > - Enable the generation of domain specific fuzzers (e.g. random expressions and statements). > > Note: with the Template Library draft I was already able to find a [list of bugs](https://bugs.openjdk.org/issues/?jql=labels%20%3D%20template-framework%20ORDER%20BY%20created%20DESC%2C%20summary%20DESC). > > **How to get started** > When reviewing, please start by looking at: > https://github.com/openjdk/jdk/blob/d21a8aabaf3b191e851b6997c11bb30fcd0f942f/test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestSimple.java#L60-L76 > > We have a Template with two arguments. They are typed (Integer and String). We then apply the arguments `template.withArgs(42, "7")`, producing a `TemplateWithArgs`. This can then be `render`ed to a String. And then that can be compiled and executed with the CompileFramework. 
> > Second, look at this advanced test: > https://github.com/openjdk/jdk/blob/77079807042fc5a3af04e0ccccad4ecd89e21cdb/test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestAdvanced.java#L102-L119 > > And then for a "tutorial", look at: > `test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestTutorial.java` > > It shows these features: > - The `body` of a Template is essentially a list of `Token`s that are concatenated. > - Templates can be nested: a `TemplateWithArgs` is also a `Token`. > - We can use `#name` replacements to directly format values into the String. If we had proper String Templates in Java, we would not need this feature. > - We can use `$var` to make variable names unique: if we applied the same template twice, we would get variable collisions. `$var` is then replaced with e.g. `var_7` in one template use and `var_42` in the other template use. > - The use of `Hook`s to insert code into outer (earlier) code locations. This is useful, for example, to insert fields on demand. > - The use of recursive templates, and `fuel` to limit the recursion. > - `Name`s: useful to register field and variable names in code scopes. > > Next, look at the documentation in. This file is the heart of the Template Framework, and describes all the important features. > https://github.com/openjdk/jdk/blob/d21a8aabaf3b191e851b6997c11bb30fcd0f942f/test/hotspot/jtreg/compiler/lib/template_framework/Template.java#L31-L76 > > For a better experience, you may want to generate the `javadocs`: > `javadoc -sourcepath test/hotspot/j... 
Emanuel Peter has updated the pull request incrementally with one additional commit since the last revision: more fixes from Christian ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24217/files - new: https://git.openjdk.org/jdk/pull/24217/files/68b45b1c..ab20c217 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24217&range=67 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24217&range=66-67 Stats: 19 lines in 1 file changed: 14 ins; 0 del; 5 mod Patch: https://git.openjdk.org/jdk/pull/24217.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24217/head:pull/24217 PR: https://git.openjdk.org/jdk/pull/24217 From epeter at openjdk.org Sun Jun 1 16:10:07 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Sun, 1 Jun 2025 16:10:07 GMT Subject: RFR: 8344942: Template-Based Testing Framework [v61] In-Reply-To: References:

Message-ID: On Fri, 30 May 2025 10:39:57 GMT, Christian Hagedorn wrote: >> Emanuel Peter has updated the pull request incrementally with two additional commits since the last revision: >> >> - Merge branch 'JDK-8344942-TemplateFramework-v3' of https://github.com/eme64/jdk into JDK-8344942-TemplateFramework-v3 >> - move verification > > Thanks for all the updates and discussions! I've worked my way through the documentation in `Template` and the examples again in some more detail. It's much better and the new explanations are well done, excellent work! > > I left some comments here and there but mostly minor things. I will have another look at the implementation - probably only finished by Monday. The design now looks great. I'm glad we could find a good solution now after some more iterations :-) @chhagedorn Thanks a lot for all the great suggestions! I have now addressed everything except for: Issue with `$$var` and `$1var`. Similarly, we would have issues with `##name` and `#1name`. https://github.com/openjdk/jdk/pull/24217#discussion_r2115232385 (I'll have to do some more experiments with parsing.) These are issues where we could continue the conversation, unless you are satisfied with my answers: https://github.com/openjdk/jdk/pull/24217#discussion_r2115388737 https://github.com/openjdk/jdk/pull/24217#discussion_r2115406391 ------------- PR Comment: https://git.openjdk.org/jdk/pull/24217#issuecomment-2927467228 From epeter at openjdk.org Sun Jun 1 16:14:06 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Sun, 1 Jun 2025 16:14:06 GMT Subject: RFR: 8344942: Template-Based Testing Framework [v61] In-Reply-To: References:

<4MgAjHfzurYkWqrZ6ah81SwKah7IHR7okOxnq5gapb8=.b7b7bfc8-6dd7-4186-9839-b446c86f21a3@github.com> Message-ID: On Sat, 31 May 2025 11:48:39 GMT, Emanuel Peter wrote: >> @chhagedorn >> The current parsing/regex-ing is relatively simple. We only parse the "valid" cases, so the description above is still relevant. >> Your example `$1var` is not a valid pattern, so the regex does not match, and there is no replacement. Sadly, in Java `$1var` is a valid variable name, so there is some chance that the user makes a mistake and gets tripped up by this. >> >> If the user does a call to `let` or `$` with such a bad string `1var`, then they get a `RendererException`. >> >> The question is this: >> Should I really try to parse these "bad" patterns, just to validate them as well? All solutions I can think of are really complicated. Is it worth it? Or is it just a mistake by the user, and so the matching does not happen, and that is the user's problem? > > FYI: in `$$var`, the first `$` is not a valid pattern, so it is not replaced. But `$var` is, and so that part gets replaced. The result is `$var_1`, which sadly happens to also be valid Java code. I think I just need to rewrite the way I parse and replace the strings. Doing a simple regex with `replaceAll` does not work if we also want to allow "bad" patterns such as `$$var` to be parsed, because of ambiguity. My new idea: split the string by `#` and `$`. The first string is just a regular string, because it has no `#` or `$` before it. But all others should start with either a `name` or `{name}` pattern. I should also do the `#` and `$` replacement in a single pass, so that we cannot have one replacement influence the other, i.e. that we have no "replacement injection" issues that may be confusing if anybody ever trips over it. 
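The single-pass idea described above could look roughly like the following. This is only a sketch under stated assumptions, not the Template Framework's actual implementation: the class `SinglePassReplacer`, the method `render`, the name grammar (a letter or underscore followed by letters, digits, or underscores), and the use of `IllegalArgumentException` instead of `RendererException` are all hypothetical. It scans the input once, requires every `#` or `$` to be followed by `name` or `{name}`, and appends replacement text verbatim so that it is never scanned again.

```java
import java.util.Map;

// Hypothetical sketch of single-pass '#' and '$' replacement; not the framework's code.
class SinglePassReplacer {
    static String render(String input, Map<String, String> hashValues, int id) {
        StringBuilder out = new StringBuilder();
        int i = 0;
        while (i < input.length()) {
            char c = input.charAt(i);
            if (c != '#' && c != '$') {
                out.append(c); // plain text is copied through unchanged
                i++;
                continue;
            }
            // After '#' or '$' we require either name or {name}.
            int start = i + 1;
            boolean braced = start < input.length() && input.charAt(start) == '{';
            if (braced) {
                start++;
            }
            int end = start;
            while (end < input.length() && isNameChar(input.charAt(end), end == start)) {
                end++;
            }
            if (end == start) {
                throw new IllegalArgumentException("Bad pattern at index " + i + " in: " + input);
            }
            if (braced && (end >= input.length() || input.charAt(end) != '}')) {
                throw new IllegalArgumentException("Missing '}' after index " + i + " in: " + input);
            }
            String name = input.substring(start, end);
            if (c == '$') {
                out.append(name).append('_').append(id); // $var becomes var_<id>
            } else {
                String value = hashValues.get(name);
                if (value == null) {
                    throw new IllegalArgumentException("No value for #" + name);
                }
                out.append(value); // appended verbatim, never re-scanned
            }
            i = braced ? end + 1 : end;
        }
        return out.toString();
    }

    // Assumed name grammar: [A-Za-z_][A-Za-z0-9_]*
    static boolean isNameChar(char ch, boolean first) {
        boolean letter = (ch >= 'a' && ch <= 'z') || (ch >= 'A' && ch <= 'Z') || ch == '_';
        return letter || (!first && ch >= '0' && ch <= '9');
    }

    public static void main(String[] args) {
        System.out.println(render("int $x = #v;", Map.of("v", "42"), 7)); // int x_7 = 42;
    }
}
```

In this sketch, `$$var` and `$1var` are rejected with an exception instead of being silently left in the output, and a `#name` value that itself contains `$b` is emitted as-is, which is one way to avoid the "replacement injection" concern.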
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24217#discussion_r2119291366 From epeter at openjdk.org Sun Jun 1 16:57:49 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Sun, 1 Jun 2025 16:57:49 GMT Subject: RFR: 8344942: Template-Based Testing Framework [v69] In-Reply-To: References: Message-ID:
Emanuel Peter has updated the pull request incrementally with one additional commit since the last revision: wip refactor parsing dollar and hashtag ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24217/files - new: https://git.openjdk.org/jdk/pull/24217/files/ab20c217..ccc132b5 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24217&range=68 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24217&range=67-68 Stats: 52 lines in 1 file changed: 20 ins; 13 del; 19 mod Patch: https://git.openjdk.org/jdk/pull/24217.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24217/head:pull/24217 PR: https://git.openjdk.org/jdk/pull/24217 From jbhateja at openjdk.org Sun Jun 1 17:26:07 2025 From: jbhateja at openjdk.org (Jatin Bhateja) Date: Sun, 1 Jun 2025 17:26:07 GMT Subject: RFR: 8352635: Improve inferencing of Float16 operations with constant inputs [v5] In-Reply-To: <44nVQBYgzCOB2mAB9xtAPvkUcOMJOITA2VjMdDFgm1g=.48266693-48bf-41db-8871-a7dcafe93509@github.com> References: 
<44nVQBYgzCOB2mAB9xtAPvkUcOMJOITA2VjMdDFgm1g=.48266693-48bf-41db-8871-a7dcafe93509@github.com> Message-ID: > This is a follow-up PR#22755 to improve Float16 operations inferencing. > > The existing scheme to detect Float16 operations for some operations is based on pattern matching which expects to receive inputs through ConvHF2F IR, this patch extends matching to accept constant floating point inputs within the Float16 value range. > > Best Regards, > Jatin Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision: Extending tests and review resolutions ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24179/files - new: https://git.openjdk.org/jdk/pull/24179/files/b44d62dc..4a491bef Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24179&range=04 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24179&range=03-04 Stats: 181 lines in 4 files changed: 112 ins; 5 del; 64 mod Patch: https://git.openjdk.org/jdk/pull/24179.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24179/head:pull/24179 PR: https://git.openjdk.org/jdk/pull/24179 From jbhateja at openjdk.org Sun Jun 1 17:26:10 2025 From: jbhateja at openjdk.org (Jatin Bhateja) Date: Sun, 1 Jun 2025 17:26:10 GMT Subject: RFR: 8352635: Improve inferencing of Float16 operations with constant inputs [v4] In-Reply-To: <6PFX21b9eT5mQv8Ym7b_RuKNpnuQ5CVqhc8TKxstlYo=.eb7d9f85-5e49-4e8f-b17a-c8e3728e7624@github.com> References: <44nVQBYgzCOB2mAB9xtAPvkUcOMJOITA2VjMdDFgm1g=.48266693-48bf-41db-8871-a7dcafe93509@github.com> <6PFX21b9eT5mQv8Ym7b_RuKNpnuQ5CVqhc8TKxstlYo=.eb7d9f85-5e49-4e8f-b17a-c8e3728e7624@github.com> Message-ID: <4kFfYPljgrRZSDgDmn4XbCB9iwnrETd0eFOxBSV-sVg=.422f1e5d-6182-4ad5-a509-3b1451a71dfc@github.com> On Wed, 28 May 2025 08:56:51 GMT, Emanuel Peter wrote: >> Jatin Bhateja has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains six commits: >> >> - Merge branch 'master' of http://github.com/openjdk/jdk into JDK-8352635 >> - Enabling some test points >> - Adding test points and some re-factoring >> - Merge branch 'master' of https://github.com/openjdk/jdk into JDK-8352635 >> - Merge branch 'master' of http://github.com/openjdk/jdk into JDK-8352635 >> - 8352635: Improve inferencing of Float16 operations with constant inputs > > src/hotspot/share/opto/convertnode.cpp line 290: > >> 288: // If constant lie within Float16 value range, convert it to >> 289: // a half-float constant. >> 290: if (StubRoutines::hf2f(StubRoutines::f2hf(conF)) == conF) { > > How does this behave with `NaN` values? Do you have a test for that below? Extended coverage for NaNs; yes, we have new test points for them. > src/hotspot/share/opto/convertnode.cpp line 298: > >> 296: } else { >> 297: f16bOp = phase->transform(Float16NodeFactory::make(f32bOp->Opcode(), f32bOp->in(0), new_var_inp, new_con_inp)); >> 298: } > > Why is the order important here? A comment could help :) Addressed. > src/hotspot/share/opto/subnode.cpp line 566: > >> 564: // applicable to other floating point types. >> 565: // There are no known undefined, unspecified or implimentation specific >> 566: // behaviors w.r.t to floating point non-pointer subtraction. > > That sounds like we are not quite sure "no known" ... problems. Could there be any, or are we sure there are none? C++ follows IEEE 754 semantics for floating-point subtraction and there is no specified undefined behavior related to it in the C++ standard. 
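The round-trip check quoted above (`hf2f(f2hf(conF)) == conF`) has a direct Java-level analogue, which may help when reasoning about the `NaN` question. The sketch below is an illustration only, not the HotSpot code: the class and method names are hypothetical, and it assumes JDK 20 or later for `Float.floatToFloat16` and `Float.float16ToFloat`. A `float` survives the round trip exactly when it is representable as a Float16 value; `NaN` never compares equal to itself, so in this Java analogue it fails the check.

```java
// Hypothetical sketch (JDK 20+); mirrors the round-trip idea, not the C2 implementation.
class Float16RangeCheck {
    // True if f converts to Float16 and back without changing its value.
    static boolean isExactInFloat16(float f) {
        short half = Float.floatToFloat16(f);
        return Float.float16ToFloat(half) == f;
    }

    public static void main(String[] args) {
        System.out.println(isExactInFloat16(1.5f));      // true: exactly representable
        System.out.println(isExactInFloat16(0.1f));      // false: needs more mantissa bits
        System.out.println(isExactInFloat16(65504.0f));  // true: largest finite Float16
        System.out.println(isExactInFloat16(65536.0f));  // false: overflows to infinity
        System.out.println(isExactInFloat16(Float.NaN)); // false: NaN != NaN
    }
}
```

Whether the C2 transformation should treat `NaN` constants specially is a separate question that this sketch does not answer; it only shows how an equality-based round-trip test behaves.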
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24179#discussion_r2119357586 PR Review Comment: https://git.openjdk.org/jdk/pull/24179#discussion_r2119357694 PR Review Comment: https://git.openjdk.org/jdk/pull/24179#discussion_r2119358354 From jbhateja at openjdk.org Sun Jun 1 17:26:10 2025 From: jbhateja at openjdk.org (Jatin Bhateja) Date: Sun, 1 Jun 2025 17:26:10 GMT Subject: RFR: 8352635: Improve inferencing of Float16 operations with constant inputs [v4] In-Reply-To: <-d846uXzYApO-CUq6peUgguY2YLpvG6ioAdVkN1wHG0=.94a09310-9d87-481c-b374-05ae99db0133@github.com> References: <44nVQBYgzCOB2mAB9xtAPvkUcOMJOITA2VjMdDFgm1g=.48266693-48bf-41db-8871-a7dcafe93509@github.com> <6PFX21b9eT5mQv8Ym7b_RuKNpnuQ5CVqhc8TKxstlYo=.eb7d9f85-5e49-4e8f-b17a-c8e3728e7624@github.com> <-d846uXzYApO-CUq6peUgguY2YLpvG6ioAdVkN1wHG0=.94a09310-9d87-481c-b374-05ae99db0133@github.com> Message-ID: On Wed, 28 May 2025 09:09:46 GMT, Emanuel Peter wrote: >> test/hotspot/jtreg/compiler/c2/irTests/TestFloat16ScalarOperations.java line 320: >> >>> 318: res += Float.floatToFloat16(POSITIVE_ZERO_VAR.floatValue() / INEXACT_FP16); >>> 319: assertResult(Float.float16ToFloat(res), 32.125f, "testInexactFP16ConstantPatterns"); >>> 320: } >> >> Alignment is messed up by one space indentation. >> >> Can you add a comment why we are expecting none of the `HF` ops here? >> Are we expecting any other ops, maybe `F` ops? >> It could be good to check for that, so that we are sure that we get anything even close to our expectation. > > Same for the tests below :) Fixed, IR checks and indentaitons. >> test/hotspot/jtreg/compiler/c2/irTests/TestFloat16ScalarOperations.java line 363: >> >>> 361: res += Float.floatToFloat16(POSITIVE_ZERO_VAR.floatValue() / EXACT_FP16); >>> 362: assertResult(Float.float16ToFloat(res), 32.125f, "testExactFP16ConstantPatterns"); >>> 363: } >> >> Can we have a test that picks a random `FP16` value, and does result verification on it? 
Because currently, you are testing the new pattern only with a few example values. > > And: your pattern matching allows the constant to be lhs or rhs, so you should add corresponding tests. Done. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24179#discussion_r2119358477 PR Review Comment: https://git.openjdk.org/jdk/pull/24179#discussion_r2119358543 From kvn at openjdk.org Sun Jun 1 21:23:53 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Sun, 1 Jun 2025 21:23:53 GMT Subject: RFR: 8358236: [AOT] Graal crashes when trying to use persisted MDOs In-Reply-To: References: Message-ID: On Sun, 1 Jun 2025 19:01:27 GMT, Igor Veresov wrote: > Forgot to null out MethodData::_failed_speculations before snapshotting. As a result it gets restored with a dangling pointer. > Testing looks clean. Trivial. ------------- Marked as reviewed by kvn (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/25570#pullrequestreview-2886119546 From iveresov at openjdk.org Sun Jun 1 21:23:54 2025 From: iveresov at openjdk.org (Igor Veresov) Date: Sun, 1 Jun 2025 21:23:54 GMT Subject: Integrated: 8358236: [AOT] Graal crashes when trying to use persisted MDOs In-Reply-To: References: Message-ID: <2VQGaTWxeSr29uU3Ih3S5kF9l70w3xwlkHNG_pVFr7U=.3279eb7c-5bf8-4df1-8405-61b1678552d5@github.com> On Sun, 1 Jun 2025 19:01:27 GMT, Igor Veresov wrote: > Forgot to null out MethodData::_failed_speculations before snapshotting. As a result it gets restored with a dangling pointer. > Testing looks clean. This pull request has now been integrated. 
Changeset: 85e36d79 Author: Igor Veresov URL: https://git.openjdk.org/jdk/commit/85e36d79246913abb8b85c2be719670655d619ab Stats: 3 lines in 1 file changed: 3 ins; 0 del; 0 mod 8358236: [AOT] Graal crashes when trying to use persisted MDOs Reviewed-by: kvn ------------- PR: https://git.openjdk.org/jdk/pull/25570 From epeter at openjdk.org Mon Jun 2 03:09:09 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Mon, 2 Jun 2025 03:09:09 GMT Subject: RFR: 8344942: Template-Based Testing Framework [v70] In-Reply-To: References: Message-ID:
Emanuel Peter has updated the pull request incrementally with one additional commit since the last revision: dollar and hashtag parsing validatiaon ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24217/files - new: https://git.openjdk.org/jdk/pull/24217/files/ccc132b5..21d3f507 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24217&range=69 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24217&range=68-69 Stats: 31 lines in 2 files changed: 26 ins; 0 del; 5 mod Patch: https://git.openjdk.org/jdk/pull/24217.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24217/head:pull/24217 PR: https://git.openjdk.org/jdk/pull/24217 From epeter at openjdk.org Mon Jun 2 03:30:24 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Mon, 2 Jun 2025 03:30:24 GMT Subject: RFR: 8344942: Template-Based Testing Framework [v71] In-Reply-To: References: Message-ID:
Emanuel Peter has updated the pull request with a new target base due to a merge or a rebase. 
The pull request now contains 91 commits: - Merge branch 'master' into JDK-8344942-TemplateFramework-v3 - validation tests - dollar and hashtag parsing validatiaon - wip refactor parsing dollar and hashtag - more fixes from Christian - more improvements - more suggestions applied - good practice - rename template arguments - more from Christian - ... and 81 more: https://git.openjdk.org/jdk/compare/90d6ad01...cb7037e7 ------------- Changes: https://git.openjdk.org/jdk/pull/24217/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24217&range=70 Stats: 6683 lines in 27 files changed: 6683 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/24217.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24217/head:pull/24217 PR: https://git.openjdk.org/jdk/pull/24217 From epeter at openjdk.org Mon Jun 2 03:30:24 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Mon, 2 Jun 2025 03:30:24 GMT Subject: RFR: 8344942: Template-Based Testing Framework [v61] In-Reply-To: References:

Message-ID: On Fri, 30 May 2025 10:39:57 GMT, Christian Hagedorn wrote: >> Emanuel Peter has updated the pull request incrementally with two additional commits since the last revision: >> >> - Merge branch 'JDK-8344942-TemplateFramework-v3' of https://github.com/eme64/jdk into JDK-8344942-TemplateFramework-v3 >> - move verification > > Thanks for all the updates and discussions! I've worked my way through the documentation in `Template` and the examples again in some more detail. It's much better and the new explanations are well done, excellent work! > > I left some comments here and there but mostly minor things. I will have another look at the implementation - probably only finished by Monday. The design now looks great. I'm glad we could find a good solution now after some more iterations :-) @chhagedorn Alright, I now have a decent solution for `$$var` and `$1var` etc. I also added tests for it. These are issues where we could continue the conversation, unless you are satisfied with my answers: https://github.com/openjdk/jdk/pull/24217#discussion_r2115388737 https://github.com/openjdk/jdk/pull/24217#discussion_r2115406391 This is now ready for another review pass. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24217#issuecomment-2928567671 From amitkumar at openjdk.org Mon Jun 2 03:37:57 2025 From: amitkumar at openjdk.org (Amit Kumar) Date: Mon, 2 Jun 2025 03:37:57 GMT Subject: RFR: 8353500: [s390x] Intrinsify Unsafe::setMemory [v5] In-Reply-To: References:

Message-ID: On Fri, 30 May 2025 08:32:30 GMT, Andrew Haley wrote: > What are all those `nopr`s for? Sorry that is old code; nops were inserted for the loop alignment; this is the newer stub code:

    - - - [BEGIN] - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    StubRoutines::unsafe_setmemory [0x000003ffa84b63c0, 0x000003ffa84b644c] (140 bytes)
    --------------------------------------------------------------------------------
    BFD: unknown S/390 disassembler option: s390
    .long 0x00000000
    0x000003ffa84b63c0: vlvgb %v0,%r4,0
    0x000003ffa84b63c6: vrepb %v0,%v0,0
    0x000003ffa84b63cc: aghi %r3,-32
    0x000003ffa84b63d0: jl 0x000003ffa84b63ec
    0x000003ffa84b63d4: vst %v0,0(%r2)
    0x000003ffa84b63da: vst %v0,16(%r2)
    0x000003ffa84b63e0: aghi %r2,32
    0x000003ffa84b63e4: aghi %r3,-32
    0x000003ffa84b63e8: jhe 0x000003ffa84b63d4
    0x000003ffa84b63ec: tmll %r3,16
    0x000003ffa84b63f0: je 0x000003ffa84b63fe
    0x000003ffa84b63f4: vst %v0,0(%r2)
    0x000003ffa84b63fa: aghi %r2,16
    0x000003ffa84b63fe: tmll %r3,8
    0x000003ffa84b6402: je 0x000003ffa84b6410
    0x000003ffa84b6406: vsteg %v0,0(%r2),0
    0x000003ffa84b640c: aghi %r2,8
    0x000003ffa84b6410: tmll %r3,7
    0x000003ffa84b6414: je 0x000003ffa84b644a
    0x000003ffa84b6418: tmll %r3,4
    0x000003ffa84b641c: je 0x000003ffa84b642a
    0x000003ffa84b6420: vstef %v0,0(%r2),0
    0x000003ffa84b6426: aghi %r2,4
    0x000003ffa84b642a: tmll %r3,2
    0x000003ffa84b642e: je 0x000003ffa84b643c
    0x000003ffa84b6432: vsteh %v0,0(%r2),0
    0x000003ffa84b6438: aghi %r2,2
    0x000003ffa84b643c: tmll %r3,1
    0x000003ffa84b6440: je 0x000003ffa84b644a
    0x000003ffa84b6444: vsteb %v0,0(%r2),0
    0x000003ffa84b644a: br %r14
    --------------------------------------------------------------------------------
    - - - [END] - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

------------- PR Comment: https://git.openjdk.org/jdk/pull/24480#issuecomment-2928591294 From epeter at openjdk.org Mon Jun 2 04:54:53 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Mon, 2 Jun 2025 04:54:53 GMT 
Subject: RFR: 8350896: Integer/Long.compress gets wrong type from CompressBitsNode::Value [v8] In-Reply-To: References:

Message-ID: On Fri, 30 May 2025 17:43:27 GMT, Jatin Bhateja wrote: >> Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision: >> >> Review comments resolutions > > We can further constrain the value range bounds of bit compression and expansion once PR #17508 gets integrated. For now, I have developed the following draft demonstrates bound constraining with KnownBitLattice. > > > // > // Prototype of bit compress/expand value range computation > // using KnownBits infrastructure. > // > > #include > #include > #include > #include > > template > class KnownBitsLattice { > private: > U zeros; > U ones; > > public: > KnownBitsLattice(U lb, U ub); > > U getKnownZeros() { > return zeros; > } > > U getKnownOnes() { > return ones; > } > > long getKnownZerosCount() { > uint64_t count = 0; > asm volatile ("popcntq %1, %0 \n\t" : "=r"(count) : "r"(zeros) : "cc"); > return count; > } > > long getKnownOnesCount() { > uint64_t count = 0; > asm volatile ("popcntq %1, %0 \n\t" : "=r"(count) : "r"(ones) : "cc"); > return count; > } > > bool check_voilation() { > // A given bit cannot be both zero or one. > return (zeros & ones) != 0; > } > > bool is_MSB_KnownOneBitsSet() { > return (ones >> 63) == 1; > } > > bool is_MSB_KnownZeroBitsSet() { > return (zeros >> 63) == 1; > } > }; > > template > KnownBitsLattice::KnownBitsLattice(U lb, U ub) { > // To find KnownBitsLattice from a given value range > // we first find the common prefix b/w upper and lower > // bound, we then concertize known zeros and ones bit > // based on common prefix. > // e.g. > // lb = 00110001 > // ub = 00111111 > // common prefix = 0011XXXX > // knownbits.zeros = 11000000 > // knownbits.ones = 00110000 > // > // conversely, for a give knownbits value we can find > // lower and upper value ranges. > // e.g. 
> // knownbits.zeros = 0x00010001 > // knownbits.ones = 0x10001100 > // range.lo = knownbits.ones, this is because knownbits.ones are > // guaranteed to be one. > // range.hi = ~knownbits.zeros, this is an optimistic upper bound > // which assumes all unset knownbits.zero > // are ones. > // Thus in above example, > // range.lo = 0x8C > // range.hi = 0xEE > > U lzcnt = 0; > U common_prefix = lb ^ ub; > asm volatile ("lzcntq %1, %0 \n\t" : "=r"(lzcnt) : "r"(common_prefix) : "cc"); > U common_prefix_mask = lzcnt == 0 ? 0xFFFFFFFFFFFFFFFFL : ~((1ULL << (64 - lzcnt)) - 1); > zeros = (~lb) & common_prefix_mask; > ones = (lb) & c... @jatin-bhateja Nice! Yes I'm looking forward to reviewing all the KnownBits extensions! @jatin-bhateja Let me know whenever this is ready for another pass of reviews :) ------------- PR Comment: https://git.openjdk.org/jdk/pull/23947#issuecomment-2928741573 From rehn at openjdk.org Mon Jun 2 05:45:59 2025 From: rehn at openjdk.org (Robbin Ehn) Date: Mon, 2 Jun 2025 05:45:59 GMT Subject: RFR: 8357968: RISC-V: Interpreter volatile reference stores with G1 are not sequentially consistent In-Reply-To: References: Message-ID: On Wed, 28 May 2025 16:47:06 GMT, Robbin Ehn wrote: > Hi please consider. > > As ref: https://github.com/openjdk/jdk/pull/25483 > As suggested in that PR - I removed these helpers as it's very hard to see that you get registers clobbered. > > Sanity tested, running t1. > > /Robbin Thanks all! ------------- PR Comment: https://git.openjdk.org/jdk/pull/25502#issuecomment-2928896307 From rehn at openjdk.org Mon Jun 2 05:45:59 2025 From: rehn at openjdk.org (Robbin Ehn) Date: Mon, 2 Jun 2025 05:45:59 GMT Subject: Integrated: 8357968: RISC-V: Interpreter volatile reference stores with G1 are not sequentially consistent In-Reply-To: References: Message-ID: On Wed, 28 May 2025 16:47:06 GMT, Robbin Ehn wrote: > Hi please consider. 
> > As ref: https://github.com/openjdk/jdk/pull/25483 > As suggested in that PR - I removed these helpers as it's very hard to see that you get registers clobbered. > > Sanity tested, running t1. > > /Robbin This pull request has now been integrated. Changeset: c5a1543e Author: Robbin Ehn URL: https://git.openjdk.org/jdk/commit/c5a1543ee3e68775f09ca29fb07efd9aebfdb33e Stats: 27 lines in 1 file changed: 0 ins; 18 del; 9 mod 8357968: RISC-V: Interpreter volatile reference stores with G1 are not sequentially consistent Reviewed-by: eosterlund, fbredberg, shade, fyang ------------- PR: https://git.openjdk.org/jdk/pull/25502 From mchevalier at openjdk.org Mon Jun 2 06:53:50 2025 From: mchevalier at openjdk.org (Marc Chevalier) Date: Mon, 2 Jun 2025 06:53:50 GMT Subject: RFR: 8353266: C2: Wrong execution with Integer.bitCount(int) intrinsic on AArch64 In-Reply-To: References:

Message-ID: On Sat, 31 May 2025 02:59:48 GMT, SendaoYan wrote: > Hi, how was this bug found? It seems the original test case was generated by a fuzzing tool. Seems so, given what the initial reproducer looks like, but I'm not sure. The ticket was opened 3 years ago, not sure anyone remembers. If you want to know more context, maybe you can ask the initial reporter. ------------- PR Comment: https://git.openjdk.org/jdk/pull/25551#issuecomment-2929087749 From jbhateja at openjdk.org Mon Jun 2 07:44:58 2025 From: jbhateja at openjdk.org (Jatin Bhateja) Date: Mon, 2 Jun 2025 07:44:58 GMT Subject: RFR: 8355563: VectorAPI: Refactor current implementation of subword gather load API In-Reply-To: References: Message-ID: On Fri, 9 May 2025 07:35:41 GMT, Xiaohong Gong wrote: > JDK-8318650 introduced hotspot intrinsification of subword gather load APIs for X86 platforms [1]. However, the current implementation is not optimal for the AArch64 SVE platform, which natively supports vector instructions for subword gather load operations using an int vector for indices (see [2][3]). > > Two key areas require improvement: > 1. At the Java level, vector indices generated for range validation could be reused for the subsequent gather load operation on architectures with native vector instructions like AArch64 SVE. However, the current implementation prevents compiler reuse of these index vectors due to divergent control flow, potentially impacting performance. > 2. At the compiler IR level, the additional `offset` input for `LoadVectorGather`/`LoadVectorGatherMasked` with subword types increases IR complexity and complicates backend implementation. Furthermore, generating `add` instructions before each memory access negatively impacts performance. > > This patch refactors the implementation at both the Java level and compiler mid-end to improve efficiency and maintainability across different architectures. > > Main changes: > 1.
Java-side API refactoring: > - Explicitly passes generated index vectors to hotspot, eliminating duplicate index vectors for gather load instructions on > architectures like AArch64. > 2. C2 compiler IR refactoring: > - Refactors `LoadVectorGather`/`LoadVectorGatherMasked` IR for subword types by removing the memory offset input and incorporating it into the memory base `addr` at the IR level. This simplifies backend implementation, reduces add operations, and unifies the IR across all types. > 3. Backend changes: > - Streamlines X86 implementation of subword gather operations following the removal of the offset input from the IR level. > > Performance: > The performance of the related JMH benchmarks improves by up to 27% on an X86 AVX512 system. Please see the data below: > > Benchmark Mode Cnt Unit SIZE Before After Gain > GatherOperationsBenchmark.microByteGather128 thrpt 30 ops/ms 64 53682.012 52650.325 0.98 > GatherOperationsBenchmark.microByteGather128 thrpt 30 ops/ms 256 14484.252 14255.156 0.98 > GatherOperationsBenchmark.microByteGather128 thrpt 30 ops/ms 1024 3664.900 3595.615 0.98 > GatherOperationsBenchmark.microByteGather128 thrpt 30 ops/ms 4096 908.312 935.269 1.02 > GatherOperationsBenchmark.micr... Hi @XiaohongGong , Looks good to me, thanks again for this re-factor !! Best Regards, Jatin ------------- Marked as reviewed by jbhateja (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/25138#pullrequestreview-2887157235 From duke at openjdk.org Mon Jun 2 08:15:36 2025 From: duke at openjdk.org (Tom Shull) Date: Mon, 2 Jun 2025 08:15:36 GMT Subject: RFR: 8357987: [JVMCI] Add support for retrieving all methods of a ResolvedJavaType [v2] In-Reply-To: References: Message-ID: > Currently from ResolvedJavaType one can retrieve all declared methods, static methods, and constructors of the given type. However, internally in HotSpot there are also VM-internal methods, such as overpass methods, associated with a given type which we cannot access via the API.
> > To correct this, we should add a new method which enables VM-internal methods, such as overpass methods, to be accessed. Tom Shull has updated the pull request incrementally with one additional commit since the last revision: format javadoc and update test ------------- Changes: - all: https://git.openjdk.org/jdk/pull/25498/files - new: https://git.openjdk.org/jdk/pull/25498/files/1f42f05f..0de1feae Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=25498&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=25498&range=00-01 Stats: 16 lines in 4 files changed: 2 ins; 2 del; 12 mod Patch: https://git.openjdk.org/jdk/pull/25498.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/25498/head:pull/25498 PR: https://git.openjdk.org/jdk/pull/25498 From shade at openjdk.org Mon Jun 2 08:18:53 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 2 Jun 2025 08:18:53 GMT Subject: RFR: 8358169: Shenandoah/JVMCI: Export GC state constants In-Reply-To: References: Message-ID: On Fri, 30 May 2025 16:09:03 GMT, Roman Kennke wrote: > We need the GC state enum constants available in JVMCI. Marked as reviewed by shade (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/25552#pullrequestreview-2887264825 From dbriemann at openjdk.org Mon Jun 2 08:27:56 2025 From: dbriemann at openjdk.org (David Briemann) Date: Mon, 2 Jun 2025 08:27:56 GMT Subject: RFR: 8357793: [PPC64] VM crashes with -XX:-UseSIGTRAP -XX:-ImplicitNullChecks [v2] In-Reply-To: References: <5XqAA3Z2G0uwOBkitUrqkG3Y68xtpRuvBwj_cEIFECs=.18259520-6f73-406f-a46f-fa025c12b303@github.com> Message-ID: On Wed, 28 May 2025 19:12:55 GMT, Martin Doerr wrote: >> In case of -XX:-UseSIGTRAP -XX:-ImplicitNullChecks, we use the manually selected entry. (The same is true for -XX:-TrapBasedNullChecks -XX:-ImplicitNullChecks.) >> We only need to use the correct NullPointerException entry in the compiler case. 
>> >> With this patch, the manually selected entry matches the one selected by `PosixSignals::pd_hotspot_signal_handler`. > > Martin Doerr has updated the pull request incrementally with one additional commit since the last revision: > > Fix bastore without ImplicitNullChecks. LGTM. Thanks ------------- Marked as reviewed by dbriemann (Author). PR Review: https://git.openjdk.org/jdk/pull/25504#pullrequestreview-2887294855 From mdoerr at openjdk.org Mon Jun 2 08:33:56 2025 From: mdoerr at openjdk.org (Martin Doerr) Date: Mon, 2 Jun 2025 08:33:56 GMT Subject: RFR: 8357793: [PPC64] VM crashes with -XX:-UseSIGTRAP -XX:-ImplicitNullChecks [v2] In-Reply-To: References: <5XqAA3Z2G0uwOBkitUrqkG3Y68xtpRuvBwj_cEIFECs=.18259520-6f73-406f-a46f-fa025c12b303@github.com> Message-ID: On Wed, 28 May 2025 19:12:55 GMT, Martin Doerr wrote: >> In case of -XX:-UseSIGTRAP -XX:-ImplicitNullChecks, we use the manually selected entry. (The same is true for -XX:-TrapBasedNullChecks -XX:-ImplicitNullChecks.) >> We only need to use the correct NullPointerException entry in the compiler case. >> >> With this patch, the manually selected entry matches the one selected by `PosixSignals::pd_hotspot_signal_handler`. > > Martin Doerr has updated the pull request incrementally with one additional commit since the last revision: > > Fix bastore without ImplicitNullChecks. Thanks for the reviews! 
------------- PR Comment: https://git.openjdk.org/jdk/pull/25504#issuecomment-2929418451 From mdoerr at openjdk.org Mon Jun 2 08:33:57 2025 From: mdoerr at openjdk.org (Martin Doerr) Date: Mon, 2 Jun 2025 08:33:57 GMT Subject: Integrated: 8357793: [PPC64] VM crashes with -XX:-UseSIGTRAP -XX:-ImplicitNullChecks In-Reply-To: <5XqAA3Z2G0uwOBkitUrqkG3Y68xtpRuvBwj_cEIFECs=.18259520-6f73-406f-a46f-fa025c12b303@github.com> References: <5XqAA3Z2G0uwOBkitUrqkG3Y68xtpRuvBwj_cEIFECs=.18259520-6f73-406f-a46f-fa025c12b303@github.com> Message-ID: On Wed, 28 May 2025 17:00:48 GMT, Martin Doerr wrote: > In case of -XX:-UseSIGTRAP -XX:-ImplicitNullChecks, we use the manually selected entry. (The same is true for -XX:-TrapBasedNullChecks -XX:-ImplicitNullChecks.) > We only need to use the correct NullPointerException entry in the compiler case. > > With this patch, the manually selected entry matches the one selected by `PosixSignals::pd_hotspot_signal_handler`. This pull request has now been integrated. Changeset: ba9f44c9 Author: Martin Doerr URL: https://git.openjdk.org/jdk/commit/ba9f44c90fe8da2d97d67b6878ac2c0c14e35bd0 Stats: 4 lines in 2 files changed: 2 ins; 0 del; 2 mod 8357793: [PPC64] VM crashes with -XX:-UseSIGTRAP -XX:-ImplicitNullChecks Reviewed-by: shade, dbriemann ------------- PR: https://git.openjdk.org/jdk/pull/25504 From duke at openjdk.org Mon Jun 2 08:39:31 2025 From: duke at openjdk.org (Tom Shull) Date: Mon, 2 Jun 2025 08:39:31 GMT Subject: RFR: 8357660: [JVMCI] Add support for retrieving all BootstrapMethodInvocations directly from ConstantPool [v2] In-Reply-To: References: Message-ID: > This PR adds support for directly retrieving both all invokedynamic and all condy BootstrapMethodInvocations from a ConstantPool via the new method `List lookupBootstrapMethodInvocations(boolean invokeDynamic)`. > > In addition, two methods are added to the BootstrapMethodInvocations: > 1. `void resolve()` > 2. 
`JavaConstant lookup()` > > The combination of these two features allows one to directly interact with all BSM information of a given ConstantPool without having to iterate through all of the Classfile's methods to find all invokedynamic bytecodes and/or iterate through all Constant Pool entries. Tom Shull has updated the pull request incrementally with one additional commit since the last revision: reviewer feedback and update javadoc formatting ------------- Changes: - all: https://git.openjdk.org/jdk/pull/25420/files - new: https://git.openjdk.org/jdk/pull/25420/files/519be178..60c39b5e Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=25420&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=25420&range=00-01 Stats: 23 lines in 2 files changed: 3 ins; 1 del; 19 mod Patch: https://git.openjdk.org/jdk/pull/25420.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/25420/head:pull/25420 PR: https://git.openjdk.org/jdk/pull/25420 From shade at openjdk.org Mon Jun 2 08:42:55 2025 From: shade at openjdk.org (Aleksey Shipilev) Date: Mon, 2 Jun 2025 08:42:55 GMT Subject: RFR: 8357223: AArch64: Optimize interpreter profile updates [v2] In-Reply-To: <7wo-_Wt-EiVGKgxMxU_MnTA8o1QQxH_LDtNzDShlOIY=.9c8093b7-ed4b-487d-afbe-5227362f1ade@github.com> References: <7wo-_Wt-EiVGKgxMxU_MnTA8o1QQxH_LDtNzDShlOIY=.9c8093b7-ed4b-487d-afbe-5227362f1ade@github.com> Message-ID: <-gNhkdcFda-JXrWH4bpViukhPFnm0EyO71u1o2ZyV68=.0228af79-58ec-4fb5-9ca0-85148cc8365d@github.com> On Thu, 29 May 2025 23:04:25 GMT, Chad Rakoczy wrote: >> [JDK-8357223](https://bugs.openjdk.org/browse/JDK-8357223) >> >> The aarch64 version of [JDK-8356946](https://bugs.openjdk.org/browse/JDK-8356946) >> >> The reasoning for this change is the same as the x86 version's PR: >> >>> First, we carry the implementation for counter decrements without using them. This is dead code, and can be purged. >>> >>> Second, we care about overflows for 64-bit for some reason. 
I think this is reminiscent of 32-bit x86 support, where we can plausibly have 32-bit counter overflow in a reasonable timeframe. But for a 64-bit counter, we need tens of years of constantly bashing the counter to get it to overflow. No other profile counter update code, e.g. in C1, cares about this. >> >> Additional testing: >> >> - [x] Linux aarch64 fastdebug tier 1/2/3/4 > > Chad Rakoczy has updated the pull request incrementally with one additional commit since the last revision: > > Address comments Marked as reviewed by shade (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/25512#pullrequestreview-2887351889 From rkennke at openjdk.org Mon Jun 2 08:59:56 2025 From: rkennke at openjdk.org (Roman Kennke) Date: Mon, 2 Jun 2025 08:59:56 GMT Subject: Integrated: 8358169: Shenandoah/JVMCI: Export GC state constants In-Reply-To: References: Message-ID: On Fri, 30 May 2025 16:09:03 GMT, Roman Kennke wrote: > We need the GC state enum constants available in JVMCI. This pull request has now been integrated.
Changeset: eb9badd8 Author: Roman Kennke URL: https://git.openjdk.org/jdk/commit/eb9badd8a4ea6dca834525fd49429e2ce771a76c Stats: 8 lines in 1 file changed: 8 ins; 0 del; 0 mod 8358169: Shenandoah/JVMCI: Export GC state constants Reviewed-by: dnsimon, shade ------------- PR: https://git.openjdk.org/jdk/pull/25552 From galder at openjdk.org Mon Jun 2 09:17:52 2025 From: galder at openjdk.org (Galder =?UTF-8?B?WmFtYXJyZcOxbw==?=) Date: Mon, 2 Jun 2025 09:17:52 GMT Subject: RFR: 8357726: C2 fails to recognize the counted loop when induction variable range is changed multiple times In-Reply-To: <-SKyhptjFPhuOPflySOZXJloR_Vgr4sC-xB5dSQXxZU=.fd6922bc-2498-4f4e-873a-999f82cd0a1a@github.com> References: <-SKyhptjFPhuOPflySOZXJloR_Vgr4sC-xB5dSQXxZU=.fd6922bc-2498-4f4e-873a-999f82cd0a1a@github.com> Message-ID: On Fri, 30 May 2025 07:43:29 GMT, Xiaohong Gong wrote: > C2 compiler fails to recognize counted loops when the induction variable is constrained by multiple consecutive `CastII` nodes. > This prevents optimizations like range check elimination, loop unrolling and auto-vectorization for these loops. Please refer > to the detailed discussion for a related performance issue from [1]. > > The ideal graph of such a loop typically looks like: > > > /-----------| > | | > | ConI | > loop | / / > | | / / > \ AddI / > RangeCheck \ / | > | \ / | > IfTrue Phi | > \ | | > RangeCheck \ | | > \ CastII / <- Range check #1 > | | / > IfTrue | | > \ | | > CastII | <- Range check #2 > | / > |-------/ > > > > For a counted loop, the loop induction variable (i.e `Phi`) should be the input of `AddI` ideally. However, in above case, it is used > by two consecutive `CastII` nodes generated by two different range check operations. Compiler should skip all such kind of `CastII` when recognizing a counted loop. 
> > This patch modifies the counted loop recognition code to iteratively uncast the loop `iv` until no `CastII` nodes remain, enabling proper counted loop recognition even when the induction variable undergoes multiple range constraint operations. > > Test: > - Tested tier1, tier2, tier3, and no regressions are found. > - An additional test case is added to verify the fix. > > Performance: > Here is the performance gain on a NVIDIA Grace machine which is an AArch64 architecture: > > > Benchmark Mode Cnt Unit Before After Gain > CountedLoopCastIV.loop_iv_int thrpt 30 ops/s 941482.597 4389292.439 4.66 > CountedLoopCastIV.loop_iv_long thrpt 30 ops/s 884563.232 1441485.455 1.62 > > > We can also observe the similar uplift on a x86_64 machine. > > [1] https://github.com/openjdk/jdk/pull/25138#issuecomment-2892720654 Marked as reviewed by galder (Author). ------------- PR Review: https://git.openjdk.org/jdk/pull/25539#pullrequestreview-2887478434 From epeter at openjdk.org Mon Jun 2 10:30:51 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Mon, 2 Jun 2025 10:30:51 GMT Subject: RFR: 8357726: C2 fails to recognize the counted loop when induction variable range is changed multiple times In-Reply-To: <-SKyhptjFPhuOPflySOZXJloR_Vgr4sC-xB5dSQXxZU=.fd6922bc-2498-4f4e-873a-999f82cd0a1a@github.com> References: <-SKyhptjFPhuOPflySOZXJloR_Vgr4sC-xB5dSQXxZU=.fd6922bc-2498-4f4e-873a-999f82cd0a1a@github.com> Message-ID: On Fri, 30 May 2025 07:43:29 GMT, Xiaohong Gong wrote: > C2 compiler fails to recognize counted loops when the induction variable is constrained by multiple consecutive `CastII` nodes. > This prevents optimizations like range check elimination, loop unrolling and auto-vectorization for these loops. Please refer > to the detailed discussion for a related performance issue from [1]. 
> > The ideal graph of such a loop typically looks like: > > > /-----------| > | | > | ConI | > loop | / / > | | / / > \ AddI / > RangeCheck \ / | > | \ / | > IfTrue Phi | > \ | | > RangeCheck \ | | > \ CastII / <- Range check #1 > | | / > IfTrue | | > \ | | > CastII | <- Range check #2 > | / > |-------/ > > > > For a counted loop, the loop induction variable (i.e `Phi`) should be the input of `AddI` ideally. However, in above case, it is used > by two consecutive `CastII` nodes generated by two different range check operations. Compiler should skip all such kind of `CastII` when recognizing a counted loop. > > This patch modifies the counted loop recognition code to iteratively uncast the loop `iv` until no `CastII` nodes remain, enabling proper counted loop recognition even when the induction variable undergoes multiple range constraint operations. > > Test: > - Tested tier1, tier2, tier3, and no regressions are found. > - An additional test case is added to verify the fix. > > Performance: > Here is the performance gain on a NVIDIA Grace machine which is an AArch64 architecture: > > > Benchmark Mode Cnt Unit Before After Gain > CountedLoopCastIV.loop_iv_int thrpt 30 ops/s 941482.597 4389292.439 4.66 > CountedLoopCastIV.loop_iv_long thrpt 30 ops/s 884563.232 1441485.455 1.62 > > > We can also observe the similar uplift on a x86_64 machine. > > [1] https://github.com/openjdk/jdk/pull/25138#issuecomment-2892720654 test/hotspot/jtreg/compiler/c2/irTests/TestCountedLoopCastIV.java line 2: > 1: /* > 2: * Copyright (c) 2025, NVIDIA CORPORATION & AFFILIATES. All rights reserved. Can you please move the test to `test/hotspot/jtreg/compiler/loopopts`? The `irTests` directory was not the best idea, it makes more sense to have tests thematically grouped. 
------------- PR Review Comment: https://git.openjdk.org/jdk/pull/25539#discussion_r2120715051 From mchevalier at openjdk.org Mon Jun 2 10:37:11 2025 From: mchevalier at openjdk.org (Marc Chevalier) Date: Mon, 2 Jun 2025 10:37:11 GMT Subject: RFR: 8353266: C2: Wrong execution with Integer.bitCount(int) intrinsic on AArch64 [v2] In-Reply-To: References: Message-ID: > ### Problem > > On AArch64, using `Integer.bitCount` can modify its argument. The problem comes from the implementation of `popCountI` on AArch64. For instance, this is what we get with the reproducer `Reduced.java` on the related issue: > > ; Load lFld into local x > ldr x11, [x10, #120] > ; popCountI > mov w11, w11 > mov v16.d[0], x11 > cnt v16.8b, v16.8b > addv b16, v16.8b > mov x13, v16.d[0] > ; [...] > ; store local x (which is believed to still contain lFld) into result > str x11, [x10, #128] > > > The instruction `mov w11, w11` is used to clear the 32 higher bits of `x11` since we use `popCountI` (from `Integer.bitCount`): on AArch64 (like other architectures), assigning the 32 lower bits of a register resets the 32 higher bits. In short: the input is modified, but the implementation of `popCountI` doesn't declare it: > > instruct popCountI(iRegINoSp dst, iRegIorL2I src, vRegF tmp) %{ > match(Set dst (PopCountI src)); > effect(TEMP tmp); > [...] > %} > > > But then, why reset the upper word of `x11`? It all starts with vector instructions: > > cnt v16.8b, v16.8b > addv b16, v16.8b > > The `8b` specifies that it operates on the 8 lower bytes of `v16`. It would be nice to simply use `4b`, but that doesn't exist: vector instructions can only work on either the whole 128-bit register, or the 64 lower bits (by blocks of 1, 2, 4, 8 or 16 bytes). There is no suffix (and encoding) for a vector instruction to work only on the 32 lower bits, so, in order not to pollute the bit count, we need to reset the 32 higher bits of `v16.d[0]` (aka `d16`), that is `v16.s[1]`, that is `v16[32:63]` in a more bit-explicit notation.
Moreover, unlike with a general-purpose register, doing > > mov v16.s[0], w11 > > would set `v16[0:31]` to `w11`, but not reset `v16[32:63]`. Which makes sense! Otherwise, using vector registers would be impractical if writing any piece would reset the rest... So we indeed need to set all of `v16[0:63]`, which > > mov w11, w11 > mov v16.d[0], x11 > > does, but by destroying `x11`. > > ### Solution > > Simply adding `USE_KILL src` in the effects would be nice, but unfortunately not possible: `iRegIorL2I` is an operand class (either a 32-bit register or an L2I of a 64-bit register) and those cannot be used in effect lists. > > The approach I chose is not to modify the source, but rather to write the two lower words of `v16` we are interested in separately: > > mov v16.s[1], wzr ; Reset the 1-indexed word of v16, that is v16[32:63] <- 0 > mov v16.s[0], w11 ; Set the 0-ind... Marc Chevalier has updated the pull request incrementally with one additional commit since the last revision: Apply suggestions ------------- Changes: - all: https://git.openjdk.org/jdk/pull/25551/files - new: https://git.openjdk.org/jdk/pull/25551/files/fb8d64d9..8318b50c Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=25551&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=25551&range=00-01 Stats: 6 lines in 2 files changed: 0 ins; 2 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/25551.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/25551/head:pull/25551 PR: https://git.openjdk.org/jdk/pull/25551 From mchevalier at openjdk.org Mon Jun 2 10:37:12 2025 From: mchevalier at openjdk.org (Marc Chevalier) Date: Mon, 2 Jun 2025 10:37:12 GMT Subject: RFR: 8353266: C2: Wrong execution with Integer.bitCount(int) intrinsic on AArch64 In-Reply-To: References: Message-ID: <5KRLt28hn0r2ZL_M0Rdx7LOThZPIymChXhWGP7SVLXI=.0a0bc3f7-b81d-4271-8044-8431edd6196d@github.com> On Fri, 30 May 2025 15:33:14 GMT, Marc Chevalier wrote: > ### Problem > > On AArch64, using `Integer.bitCount`
can modify its argument. The problem comes from the implementation of `popCountI` on AArch64. For instance, this is what we get with the reproducer `Reduced.java` on the related issue: > > ; Load lFld into local x > ldr x11, [x10, #120] > ; popCountI > mov w11, w11 > mov v16.d[0], x11 > cnt v16.8b, v16.8b > addv b16, v16.8b > mov x13, v16.d[0] > ; [...] > ; store local x (which is believed to still contain lFld) into result > str x11, [x10, #128] > > > The instruction `mov w11, w11` is used to clear the 32 higher bits of `x11` since we use `popCountI` (from `Integer.bitCount`): on AArch64 (like other architectures), assigning the 32 lower bits of a register resets the 32 higher bits. In short: the input is modified, but the implementation of `popCountI` doesn't declare it: > > instruct popCountI(iRegINoSp dst, iRegIorL2I src, vRegF tmp) %{ > match(Set dst (PopCountI src)); > effect(TEMP tmp); > [...] > %} > > > But then, why reset the upper word of `x11`? It all starts with vector instructions: > > cnt v16.8b, v16.8b > addv b16, v16.8b > > The `8b` specifies that it operates on the 8 lower bytes of `v16`. It would be nice to simply use `4b`, but that doesn't exist: vector instructions can only work on either the whole 128-bit register, or the 64 lower bits (by blocks of 1, 2, 4, 8 or 16 bytes). There is no suffix (and encoding) for a vector instruction to work only on the 32 lower bits, so, in order not to pollute the bit count, we need to reset the 32 higher bits of `v16.d[0]` (aka `d16`), that is `v16.s[1]`, that is `v16[32:63]` in a more bit-explicit notation. Moreover, unlike with a general-purpose register, doing > > mov v16.s[0], w11 > > would set `v16[0:31]` to `w11`, but not reset `v16[32:63]`. Which makes sense! Otherwise, using vector registers would be impractical if writing any piece would reset the rest... So we indeed need to set all of `v16[0:63]`, which > > mov w11, w11 > mov v16.d[0], x11 > > does, but by destroying `x11`.
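To illustrate the width issue described in the quoted problem statement, here is a minimal Java sketch. It is only a model, not HotSpot code: `PopCountWidthSketch`, `countLane64`, and `countIntViaLane` are hypothetical names, `Long.bitCount` stands in for the `cnt`/`addv` pair operating on the low 64 bits of the vector register, and the sample value is the constant that appears in the quoted regression test.

```java
final class PopCountWidthSketch {
    // Models `cnt`/`addv` on the low 64 bits of a vector register:
    // every bit of the 64-bit lane contributes to the count.
    static int countLane64(long lane) {
        return Long.bitCount(lane);
    }

    // Models filling the lane from a 32-bit source with the upper word cleared
    // (lane[32:63] <- 0, lane[0:31] <- src) before counting. The caller's
    // 64-bit value is never written to.
    static int countIntViaLane(int src) {
        long lane = src & 0xFFFF_FFFFL; // zero-extend a copy of the source
        return countLane64(lane);
    }

    public static void main(String[] args) {
        long lFld = 0xfedc_ba98_7654_3210L;
        int low = (int) lFld; // the L2I conversion of the 64-bit field

        // Counting the whole lane would include the 32 upper bits of lFld.
        System.out.println("whole lane:       " + countLane64(lFld));
        // Counting a zero-extended copy matches Integer.bitCount.
        System.out.println("zero-extended:    " + countIntViaLane(low));
        System.out.println("Integer.bitCount: " + Integer.bitCount(low));
        // The original 64-bit value is still intact afterwards.
        System.out.println("lFld unchanged:   " + (lFld == 0xfedc_ba98_7654_3210L));
    }
}
```

The point of the model is that the zero-extension is applied to a copy placed in the lane, so the count covers 32 bits while the original 64-bit value stays untouched.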
> > ### Solution > > Simply adding `USE_KILL src` in the effects would be nice, but unfortunately not possible: `iRegIorL2I` is an operand class (either a 32-bit register or an L2I of a 64-bit register) and those cannot be used in effect lists. > > The approach I chose is not to modify the source, but rather to write the two lower words of `v16` we are interested in separately: > > mov v16.s[1], wzr ; Reset the 1-indexed word of v16, that is v16[32:63] <- 0 > mov v16.s[0], w11 ; Set the 0-ind... I've changed the two `mov`s into an `fmovs` as suggested and adapted the format part. Tests seem happy. ------------- PR Comment: https://git.openjdk.org/jdk/pull/25551#issuecomment-2929946930 From mchevalier at openjdk.org Mon Jun 2 10:37:12 2025 From: mchevalier at openjdk.org (Marc Chevalier) Date: Mon, 2 Jun 2025 10:37:12 GMT Subject: RFR: 8353266: C2: Wrong execution with Integer.bitCount(int) intrinsic on AArch64 [v2] In-Reply-To: References:

Message-ID: On Sat, 31 May 2025 14:29:26 GMT, Andrew Haley wrote: >> Marc Chevalier has updated the pull request incrementally with one additional commit since the last revision: >> >> Apply suggestions > > src/hotspot/cpu/aarch64/aarch64.ad line 7771: > >> 7769: ins_encode %{ >> 7770: __ mov($tmp$$FloatRegister, __ S, 1, zr); // tmp[32:63] <- 0 >> 7771: __ mov($tmp$$FloatRegister, __ S, 0, $src$$Register); // tmp[ 0:31] <- src > > "Where the entire 128-bit wide register is not fully utilized, the vector or scalar quantity is held in the least significant bits of the register, with the most significant bits being cleared to zero on a write." > > Suggestion: > > __ fmovs($tmp$$FloatRegister, $src$$Register); > > should do it. Yes! Nicer, thanks. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/25551#discussion_r2120723694 From mchevalier at openjdk.org Mon Jun 2 10:37:12 2025 From: mchevalier at openjdk.org (Marc Chevalier) Date: Mon, 2 Jun 2025 10:37:12 GMT Subject: RFR: 8353266: C2: Wrong execution with Integer.bitCount(int) intrinsic on AArch64 [v2] In-Reply-To: References:

Message-ID: On Sat, 31 May 2025 03:11:28 GMT, SendaoYan wrote: >> Marc Chevalier has updated the pull request incrementally with one additional commit since the last revision: >> >> Apply suggestions > > test/hotspot/jtreg/compiler/intrinsics/BitCountIAarch64PreservesArgument.java line 58: > >> 56: if (result != 0xfedc_ba98_7654_3210L) { >> 57: // Wrongly outputs the cut input 0x7654_3210 == 1985229328 >> 58: throw new RuntimeException("Wrong result. lFld=" + lFld + "; result=" + result); > > How about: > > > throw new RuntimeException("Wrong result. Expected result = " + lFld + "; Actual result = " + result); That looks better indeed. Applied. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/25551#discussion_r2120724260 From epeter at openjdk.org Mon Jun 2 10:45:50 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Mon, 2 Jun 2025 10:45:50 GMT Subject: RFR: 8357726: C2 fails to recognize the counted loop when induction variable range is changed multiple times In-Reply-To: <-SKyhptjFPhuOPflySOZXJloR_Vgr4sC-xB5dSQXxZU=.fd6922bc-2498-4f4e-873a-999f82cd0a1a@github.com> References: <-SKyhptjFPhuOPflySOZXJloR_Vgr4sC-xB5dSQXxZU=.fd6922bc-2498-4f4e-873a-999f82cd0a1a@github.com> Message-ID: <1Er4nlGWx_yp6RIkqSo0PUk84lX50sTAGmGbnu4jokY=.74dc326e-9038-40d0-9b00-f5eaef1bd504@github.com> On Fri, 30 May 2025 07:43:29 GMT, Xiaohong Gong wrote: > C2 compiler fails to recognize counted loops when the induction variable is constrained by multiple consecutive `CastII` nodes. > This prevents optimizations like range check elimination, loop unrolling and auto-vectorization for these loops. Please refer > to the detailed discussion for a related performance issue from [1]. 
> > The ideal graph of such a loop typically looks like: > > > /-----------| > | | > | ConI | > loop | / / > | | / / > \ AddI / > RangeCheck \ / | > | \ / | > IfTrue Phi | > \ | | > RangeCheck \ | | > \ CastII / <- Range check #1 > | | / > IfTrue | | > \ | | > CastII | <- Range check #2 > | / > |-------/ > > > > For a counted loop, the loop induction variable (i.e `Phi`) should be the input of `AddI` ideally. However, in above case, it is used > by two consecutive `CastII` nodes generated by two different range check operations. Compiler should skip all such kind of `CastII` when recognizing a counted loop. > > This patch modifies the counted loop recognition code to iteratively uncast the loop `iv` until no `CastII` nodes remain, enabling proper counted loop recognition even when the induction variable undergoes multiple range constraint operations. > > Test: > - Tested tier1, tier2, tier3, and no regressions are found. > - An additional test case is added to verify the fix. > > Performance: > Here is the performance gain on a NVIDIA Grace machine which is an AArch64 architecture: > > > Benchmark Mode Cnt Unit Before After Gain > CountedLoopCastIV.loop_iv_int thrpt 30 ops/s 941482.597 4389292.439 4.66 > CountedLoopCastIV.loop_iv_long thrpt 30 ops/s 884563.232 1441485.455 1.62 > > > We can also observe the similar uplift on a x86_64 machine. > > [1] https://github.com/openjdk/jdk/pull/25138#issuecomment-2892720654 @XiaohongGong Nice work! @chhagedorn And I quickly discussed it offline, and we think this is a good approach. 
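As a rough illustration of the iterative uncasting described in the quoted patch summary, here is a toy model in Java. It is a sketch under stated assumptions, not the C2 implementation: `UncastSketch`, `Node`, `uncastOnce`, and `uncastAll` are hypothetical names, and nodes are reduced to a kind string plus a single input.

```java
final class UncastSketch {
    // Hypothetical, heavily simplified IR node: a kind and one input edge.
    static final class Node {
        final String kind;
        final Node in;

        Node(String kind, Node in) {
            this.kind = kind;
            this.in = in;
        }
    }

    // Strips at most one CastII, which leaves a second cast in place.
    static Node uncastOnce(Node n) {
        return "CastII".equals(n.kind) ? n.in : n;
    }

    // Strips CastII nodes repeatedly until a non-cast node is reached.
    static Node uncastAll(Node n) {
        while ("CastII".equals(n.kind)) {
            n = n.in;
        }
        return n;
    }

    public static void main(String[] args) {
        // Phi -> CastII (range check #1) -> CastII (range check #2)
        Node phi = new Node("Phi", null);
        Node cast1 = new Node("CastII", phi);
        Node cast2 = new Node("CastII", cast1);

        System.out.println("single step reaches: " + uncastOnce(cast2).kind);
        System.out.println("iterative reaches:   " + uncastAll(cast2).kind);
    }
}
```

In this model a single uncast step still ends on a `CastII`, whereas the loop reaches the `Phi`, which is the shape the counted-loop recognition needs to see.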
------------- PR Comment: https://git.openjdk.org/jdk/pull/25539#issuecomment-2929984802 From epeter at openjdk.org Mon Jun 2 10:49:51 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Mon, 2 Jun 2025 10:49:51 GMT Subject: RFR: 8357726: C2 fails to recognize the counted loop when induction variable range is changed multiple times In-Reply-To: <-SKyhptjFPhuOPflySOZXJloR_Vgr4sC-xB5dSQXxZU=.fd6922bc-2498-4f4e-873a-999f82cd0a1a@github.com> References: <-SKyhptjFPhuOPflySOZXJloR_Vgr4sC-xB5dSQXxZU=.fd6922bc-2498-4f4e-873a-999f82cd0a1a@github.com> Message-ID: On Fri, 30 May 2025 07:43:29 GMT, Xiaohong Gong wrote: > C2 compiler fails to recognize counted loops when the induction variable is constrained by multiple consecutive `CastII` nodes. > This prevents optimizations like range check elimination, loop unrolling and auto-vectorization for these loops. Please refer > to the detailed discussion for a related performance issue from [1]. > > The ideal graph of such a loop typically looks like: > > > /-----------| > | | > | ConI | > loop | / / > | | / / > \ AddI / > RangeCheck \ / | > | \ / | > IfTrue Phi | > \ | | > RangeCheck \ | | > \ CastII / <- Range check #1 > | | / > IfTrue | | > \ | | > CastII | <- Range check #2 > | / > |-------/ > > > > For a counted loop, the loop induction variable (i.e `Phi`) should be the input of `AddI` ideally. However, in above case, it is used > by two consecutive `CastII` nodes generated by two different range check operations. Compiler should skip all such kind of `CastII` when recognizing a counted loop. > > This patch modifies the counted loop recognition code to iteratively uncast the loop `iv` until no `CastII` nodes remain, enabling proper counted loop recognition even when the induction variable undergoes multiple range constraint operations. > > Test: > - Tested tier1, tier2, tier3, and no regressions are found. > - An additional test case is added to verify the fix. 
> > Performance: > Here is the performance gain on a NVIDIA Grace machine which is an AArch64 architecture: > > > Benchmark Mode Cnt Unit Before After Gain > CountedLoopCastIV.loop_iv_int thrpt 30 ops/s 941482.597 4389292.439 4.66 > CountedLoopCastIV.loop_iv_long thrpt 30 ops/s 884563.232 1441485.455 1.62 > > > We can also observe the similar uplift on a x86_64 machine. > > [1] https://github.com/openjdk/jdk/pull/25138#issuecomment-2892720654 test/hotspot/jtreg/compiler/c2/irTests/TestCountedLoopCastIV.java line 57: > 55: out[i] = 0; > 56: } > 57: } You could also just use `Arrays.fill` test/hotspot/jtreg/compiler/c2/irTests/TestCountedLoopCastIV.java line 174: > 172: > 173: public static void main(String[] args) { > 174: TestFramework.runWithFlags("-XX:LoopUnrollLimit=0"); What is the reason for the flag here? Do you really need it? test/micro/org/openjdk/bench/vm/compiler/CountedLoopCastIV.java line 54: > 52: Random r = new Random(); > 53: start = r.nextInt(LEN >> 2); > 54: limit = r.nextInt(LEN >> 1, LEN - 3); Does this not mean that we use a different seed every time, and therefore the loop has different lengths, and so the results can be influenced accordingly? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/25539#discussion_r2120762941 PR Review Comment: https://git.openjdk.org/jdk/pull/25539#discussion_r2120766290 PR Review Comment: https://git.openjdk.org/jdk/pull/25539#discussion_r2120770394 From epeter at openjdk.org Mon Jun 2 10:50:54 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Mon, 2 Jun 2025 10:50:54 GMT Subject: RFR: 8355563: VectorAPI: Refactor current implementation of subword gather load API In-Reply-To: References:

Message-ID: On Fri, 30 May 2025 08:15:22 GMT, Xiaohong Gong wrote: >>> @XiaohongGong Thanks for splitting this one out, and for investigating the regressions here. >>> >>> Putting the permalink here, fixed to the current change (the link you pasted will always refer to the newest, which may later on point to the wrong line when lines above are inserted / deleted): >>> >>> https://github.com/openjdk/jdk/blob/7077535c0b0a6ea0a2a167f9135b1504a3d71fb3/src/hotspot/share/opto/loopnode.cpp#L1659-L1661 >>> >>> I wonder if we should just use `Node::uncast` there? But I'm quite unsure about that. >> >> Sounds good to me. I will have a deep investigation for it. Thanks! >> >> >> >>> > Yes, I also observed such regression. >>> > It would be nice if you proactively mentioned regressions, so it does not have to be pointed out by reviewers. >>> >>> For me, it could be ok to fix it in a follow-up patch. I think we are too close to RDP1 for JDK25 now anyway, and so we could push this patch here into JDK26, and then we have enough time in JDK26 to investigate the regression. Even better would be if we could do the other patch first, so we never even encounter a regression. >> >> Sounds good to me. Thanks! > >> > @XiaohongGong Thanks for splitting this one out, and for investigating the regressions here. >> > Putting the permalink here, fixed to the current change (the link you pasted will always refer to the newest, which may later on point to the wrong line when lines above are inserted / deleted): >> > https://github.com/openjdk/jdk/blob/7077535c0b0a6ea0a2a167f9135b1504a3d71fb3/src/hotspot/share/opto/loopnode.cpp#L1659-L1661 >> > >> > I wonder if we should just use `Node::uncast` there? But I'm quite unsure about that. >> >> Sounds good to me. I will have a deep investigation for it. Thanks! > > Hi @eme64 @jatin-bhateja, I'v created a PR https://github.com/openjdk/jdk/pull/25539 to fix this issue. With this change, the performance regression can be fixed as well. 
Could you please take a look at that change and help to run the test on different X86 machines? Thanks a lot! @XiaohongGong I reviewed https://github.com/openjdk/jdk/pull/25539. Since it is a relatively simple patch, I suggest that we integrate that one first, and come back to this here later. Is that ok for you? ------------- PR Comment: https://git.openjdk.org/jdk/pull/25138#issuecomment-2930007655 From epeter at openjdk.org Mon Jun 2 10:53:52 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Mon, 2 Jun 2025 10:53:52 GMT Subject: RFR: 8357726: C2 fails to recognize the counted loop when induction variable range is changed multiple times In-Reply-To: <-SKyhptjFPhuOPflySOZXJloR_Vgr4sC-xB5dSQXxZU=.fd6922bc-2498-4f4e-873a-999f82cd0a1a@github.com> References: <-SKyhptjFPhuOPflySOZXJloR_Vgr4sC-xB5dSQXxZU=.fd6922bc-2498-4f4e-873a-999f82cd0a1a@github.com> Message-ID: On Fri, 30 May 2025 07:43:29 GMT, Xiaohong Gong wrote: > C2 compiler fails to recognize counted loops when the induction variable is constrained by multiple consecutive `CastII` nodes. > This prevents optimizations like range check elimination, loop unrolling and auto-vectorization for these loops. Please refer > to the detailed discussion for a related performance issue from [1]. > > The ideal graph of such a loop typically looks like: > > > /-----------| > | | > | ConI | > loop | / / > | | / / > \ AddI / > RangeCheck \ / | > | \ / | > IfTrue Phi | > \ | | > RangeCheck \ | | > \ CastII / <- Range check #1 > | | / > IfTrue | | > \ | | > CastII | <- Range check #2 > | / > |-------/ > > > > For a counted loop, the loop induction variable (i.e `Phi`) should be the input of `AddI` ideally. However, in above case, it is used > by two consecutive `CastII` nodes generated by two different range check operations. Compiler should skip all such kind of `CastII` when recognizing a counted loop. 
> > This patch modifies the counted loop recognition code to iteratively uncast the loop `iv` until no `CastII` nodes remain, enabling proper counted loop recognition even when the induction variable undergoes multiple range constraint operations. > > Test: > - Tested tier1, tier2, tier3, and no regressions are found. > - An additional test case is added to verify the fix. > > Performance: > Here is the performance gain on a NVIDIA Grace machine which is an AArch64 architecture: > > > Benchmark Mode Cnt Unit Before After Gain > CountedLoopCastIV.loop_iv_int thrpt 30 ops/s 941482.597 4389292.439 4.66 > CountedLoopCastIV.loop_iv_long thrpt 30 ops/s 884563.232 1441485.455 1.62 > > > We can also observe the similar uplift on a x86_64 machine. > > [1] https://github.com/openjdk/jdk/pull/25138#issuecomment-2892720654 @XiaohongGong I suggest you change the title from: `8357726: C2 fails to recognize the counted loop when induction variable range is changed multiple times` to `8357726: C2 recognize loops with multiple casts in trip counter` or even: `8357726: C2 recognize loops with multiple casts in trip counter: phi -> CastII* -> AddI -> phi` ------------- PR Comment: https://git.openjdk.org/jdk/pull/25539#issuecomment-2930020530 From epeter at openjdk.org Mon Jun 2 11:06:53 2025 From: epeter at openjdk.org (Emanuel Peter) Date: Mon, 2 Jun 2025 11:06:53 GMT Subject: RFR: 8356813: Improve Mod(I|L)Node::Value [v4] In-Reply-To: References: <2Jf_gfvRlKcmCFoQHp5T0WW_fU_yK5-0Z3z41f00-YU=.164be9f0-fae1-44bb-84c3-846d8c2c0db2@github.com> Message-ID: On Fri, 30 May 2025 07:26:13 GMT, Hannes Greule wrote: >> This change improves the precision of the `Mod(I|L)Node::Value()` functions. >> >> I reordered the structure a bit. First, we handle constants, afterwards, we handle ranges. The bottom checks seem to be excessive (`Type::BOTTOM` is covered by using `isa_(int|long)()`, the local bottom is just the full range). 
Given we can even give reasonable bounds if only one input has any bounds, we don't want to return early. >> The changes after that are commented. Please let me know if the explanations are good, or if you have any suggestions. >> >> ### Monotonicity >> >> Before, a 0 divisor resulted in `Type(Int|Long)::POS`. Initially I wanted to keep it this way, but that violates monotonicity during PhaseCCP. As an example, if we see a 0 divisor first and a 3 afterwards, we might try to go from `>=0` to `-2..2`, but the meet of these would be `>=-2` rather than `-2..2`. Using `Type(Int|Long)::ZERO` instead (zero is always in the resulting value if we cover a range). >> >> ### Testing >> >> I added tests for cases around the relevant bounds. I also ran tier1, tier2, and tier3 but didn't see any related failures after addressing the monotonicity problem described above (I'm having a few unrelated failures on my system currently, so separate testing would be appreciated in case I missed something). >> >> Please review and let me know what you think. >> >> ### Other >> >> The `UMod(I|L)Node`s were adjusted to be more in line with its signed variants. This change diverges them again, but similar improvements could be made after #17508. >> >> During experimenting with these changes, I stumbled upon a few things that aren't directly related to this change, but might be worth to further look into: >> - If the divisor is a constant, we will directly replace the `Mod(I|L)Node` with more but less expensive nodes in `::Ideal()`. Type analysis for these nodes combined is less precise, means we miss potential cases were this would help e.g., removing range checks. Would it make sense to delay the replacement? >> - To force non-negative ranges, I'm using `char`. I noticed that method parameters of sub-int integer types all fall back to `TypeInt::INT`. This seems to be an intentional change of https://github.com/openjdk/jdk/commit/200784d505dd98444c48c9ccb7f2e4df36dcbb6a. 
The bug report is private, so I can't really judge if that part is necessary, but it seems odd.
>
> Hannes Greule has updated the pull request incrementally with one additional commit since the last revision:
>
>   Add randomized test

src/hotspot/share/opto/divnode.cpp line 1206:

> 1204:
> 1205: //------------------------------Value------------------------------------------
> 1206: static const Type* mod_value(const PhaseGVN* phase, const Node* in1, const Node* in2, const BasicType bt, const Type* bottom) {

You did choose the `bt` path here! I would add an assert that we only allow `T_INT` and `T_LONG`.

src/hotspot/share/opto/divnode.cpp line 1237:

> 1235: // We don't need to check for min_jint % '-1' as its result is defined when using jlong.
> 1236: if (i1->get_con_as_long(bt) == min_jlong && i2->get_con_as_long(bt) == -1) {
> 1237: return TypeInteger::zero(bt);

Is this correct? For `bt = T_INT` is this really equivalent?

`i1->get_con() == min_jint`
We might get `min_jint` back here.

`i1->get_con_as_long(bt) == min_jlong`
Would we not return `min_jint` here, and then the condition is false?

Do we have an IR test for this?

src/hotspot/share/opto/divnode.cpp line 1241:

> 1239: return TypeInteger::make(i1->get_con_as_long(bt) % i2->get_con_as_long(bt), bt);
> 1240: }
> 1241: // The magnitude of the divisor is in range [1, 2^63].

You should probably also mention the `2^31` variant.

src/hotspot/share/opto/divnode.cpp line 1247:

> 1245: // JVMS lrem bytecode: "the magnitude of the result is always less than the magnitude of the divisor"
> 1246: // "less than" means we can subtract 1 to get an inclusive upper bound in [0, 2^63-1]
> 1247: jlong hi = static_cast<jlong>(divisor_magnitude - 1);

Hmm, this also looks confusing for the `T_INT` case. What about `-5`, does that then not become `max_julong - 5`, but it should have been `max_juint - 1`?
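The `T_INT` questions above come down to how an int constant behaves once it is widened to 64 bits. The following is a rough Java sketch of that arithmetic only, not HotSpot code: the class and method names are hypothetical, and it assumes the int constant is sign-extended to a 64-bit value before its magnitude is taken.

```java
class ModBoundsSketch {
    // Hypothetical helper: magnitude of an int divisor after widening to long.
    // Widening sign-extends, so -5 becomes -5L (magnitude 5) and
    // Integer.MIN_VALUE becomes -2^31 (magnitude 2^31), which fits in a long.
    static long magnitudeOfIntDivisor(int divisor) {
        return Math.abs((long) divisor);
    }

    // Hypothetical helper: inclusive upper bound on |x % divisor| for a
    // non-zero int divisor, using "result magnitude < divisor magnitude".
    static long inclusiveUpperBound(int divisor) {
        return magnitudeOfIntDivisor(divisor) - 1;
    }

    // min_jint % -1 evaluated with 64-bit operands.
    static long minIntModMinusOneAsLong() {
        return ((long) Integer.MIN_VALUE) % -1L;
    }
}
```

Under that assumption, a divisor of `-5` gives an upper bound of 4, and `Integer.MIN_VALUE` gives `2^31 - 1`. Whether the C++ code takes the same path for `T_INT` is exactly what the review comment asks.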

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/25254#discussion_r2120802575
PR Review Comment: https://git.openjdk.org/jdk/pull/25254#discussion_r2120800945
PR Review Comment: https://git.openjdk.org/jdk/pull/25254#discussion_r2120801900
PR Review Comment: https://git.openjdk.org/jdk/pull/25254#discussion_r2120805584

From epeter at openjdk.org Mon Jun 2 11:07:53 2025
From: epeter at openjdk.org (Emanuel Peter)
Date: Mon, 2 Jun 2025 11:07:53 GMT
Subject: RFR: 8252473: [TESTBUG] compiler tests fail with minimal VM: Unrecognized VM option [v2]
In-Reply-To:
References:

Message-ID:

On Wed, 28 May 2025 18:45:49 GMT, Zdenek Zambersky wrote:

> (I have not changed JIRA as there is no info about fix. Should I add it there?)

Yes please, that is generally what we should do :)

-------------

PR Comment: https://git.openjdk.org/jdk/pull/24262#issuecomment-2930075745

From epeter at openjdk.org Mon Jun 2 11:10:53 2025
From: epeter at openjdk.org (Emanuel Peter)
Date: Mon, 2 Jun 2025 11:10:53 GMT
Subject: RFR: 8252473: [TESTBUG] compiler tests fail with minimal VM: Unrecognized VM option [v3]
In-Reply-To:
References:

Message-ID:

On Wed, 28 May 2025 18:39:27 GMT, Zdenek Zambersky wrote:

>> This change adds `-XX:-IgnoreUnrecognizedVMOptions` to problematic tests (or `@requires vm.compiler2.enabled` in one case), to prevent failures `Unrecognized VM option` on client VM.
>
> Zdenek Zambersky has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains one commit:
>
>   Fix of compiler tests for client VM

Still looks reasonable. I'll run some testing now, please ping me again in 24h :)

-------------

PR Review: https://git.openjdk.org/jdk/pull/24262#pullrequestreview-2887893389

From dnsimon at openjdk.org Mon Jun 2 11:11:52 2025
From: dnsimon at openjdk.org (Doug Simon)
Date: Mon, 2 Jun 2025 11:11:52 GMT
Subject: RFR: 8357987: [JVMCI] Add support for retrieving all methods of a ResolvedJavaType [v2]
In-Reply-To:
References:

Message-ID:

On Mon, 2 Jun 2025 08:15:36 GMT, Tom Shull wrote:

>> Currently from ResolvedJavaType one can retrieve all declared methods, static methods, and constructors of the given type. However, internally in HotSpot there are also VM-internal methods, such as overpass methods, associated with a given type which we cannot access via the API.
>>
>> To correct this, we should add a new method which enables VM-internal methods, such as overpass methods, to be accessed.
>
> Tom Shull has updated the pull request incrementally with one additional commit since the last revision:
>
>   format javadoc and update test

Looks good to me.

-------------

Marked as reviewed by dnsimon (Reviewer).

PR Review: https://git.openjdk.org/jdk/pull/25498#pullrequestreview-2887897833

From dnsimon at openjdk.org Mon Jun 2 11:16:53 2025
From: dnsimon at openjdk.org (Doug Simon)
Date: Mon, 2 Jun 2025 11:16:53 GMT
Subject: RFR: 8357660: [JVMCI] Add support for retrieving all BootstrapMethodInvocations directly from ConstantPool [v2]
In-Reply-To:
References:

Message-ID: <1jDUbEJHRDYuT4RDOHlEeY5C4IWwwcenweFgZcwnUsU=.bc8d84ad-13bb-4b5a-9d02-de020301e3d6@github.com>

On Mon, 2 Jun 2025 08:39:31 GMT, Tom Shull wrote:

>> This PR adds support for directly retrieving both all invokedynamic and all condy BootstrapMethodInvocations from a ConstantPool via the new method `List lookupBootstrapMethodInvocations(boolean invokeDynamic)`.
>>
>> In addition, two methods are added to the BootstrapMethodInvocations:
>> 1. `void resolve()`
>> 2. `JavaConstant lookup()`
>>
>> The combination of these two features allows one to directly interact with all BSM information of a given ConstantPool without having to iterate through all of the Classfile's methods to find all invokedynamic bytecodes and/or iterate through all Constant Pool entries.
>
> Tom Shull has updated the pull request incrementally with one additional commit since the last revision:
>
>   reviewer feedback and update javadoc formatting

Looks good to me. Please enable GitHub Actions on your JDK fork.

-------------

Marked as reviewed by dnsimon (Reviewer).

PR Review: https://git.openjdk.org/jdk/pull/25420#pullrequestreview-2887908832

From hgreule at openjdk.org Mon Jun 2 11:33:53 2025
From: hgreule at openjdk.org (Hannes Greule)
Date: Mon, 2 Jun 2025 11:33:53 GMT
Subject: RFR: 8356813: Improve Mod(I|L)Node::Value [v4]
In-Reply-To:
References: <2Jf_gfvRlKcmCFoQHp5T0WW_fU_yK5-0Z3z41f00-YU=.164be9f0-fae1-44bb-84c3-846d8c2c0db2@github.com>

Message-ID:

On Mon, 2 Jun 2025 10:58:45 GMT, Emanuel Peter wrote:

>> Hannes Greule has updated the pull request incrementally with one additional commit since the last revision:
>>
>>   Add randomized test
>
> src/hotspot/share/opto/divnode.cpp line 1237:
>
>> 1235: // We don't need to check for min_jint % '-1' as its result is defined when using jlong.
>> 1236: if (i1->get_con_as_long(bt) == min_jlong && i2->get_con_as_long(bt) == -1) {
>> 1237: return TypeInteger::zero(bt);
>
> Is this correct? For `bt = T_INT` is this really equivalent?
>
> `i1->get_con() == min_jint`
> We might get `min_jint` back here.
>
> `i1->get_con_as_long(bt) == min_jlong`
> Would we not return `min_jint` here, and then the condition is false?
>
> Do we have an IR test for this?

This special case is only needed because `min_jlong % -1L` in C++ is UB (afaik) and the idiv instruction triggers a SIGFPE in such a case. But `min_jint % -1L` *using long arithmetic* correctly produces 0.

I think it would make sense to expand tests for constant folding, but I'll have to check if that actually gets called, see **Other** in the PR description (copied):

> If the divisor is a constant, we will directly replace the Mod(I|L)Node with more but less expensive nodes in ::Ideal(). Type analysis for these nodes combined is less precise, which means we miss potential cases where this would help, e.g., removing range checks. Would it make sense to delay the replacement?

So there's a chance this code was never called before...

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/25254#discussion_r2120868235

From hgreule at openjdk.org Mon Jun 2 11:36:54 2025
From: hgreule at openjdk.org (Hannes Greule)
Date: Mon, 2 Jun 2025 11:36:54 GMT
Subject: RFR: 8356813: Improve Mod(I|L)Node::Value [v4]
In-Reply-To:
References: <2Jf_gfvRlKcmCFoQHp5T0WW_fU_yK5-0Z3z41f00-YU=.164be9f0-fae1-44bb-84c3-846d8c2c0db2@github.com>

Message-ID: <5FnA_gZNzRom3MBShwfbdCffeRGogf1cyKo0nF40c4I=.9db6f973-e6a5-4852-b82e-24ccc198bcb9@github.com>

On Mon, 2 Jun 2025 11:01:29 GMT, Emanuel Peter wrote:

>> Hannes Greule has updated the pull request incrementally with one additional commit since the last revision:
>>
>>   Add randomized test
>
> src/hotspot/share/opto/divnode.cpp line 1247:
>
>> 1245: // JVMS lrem bytecode: "the magnitude of the result is always less than the magnitude of the divisor"
>> 1246: // "less than" means we can subtract 1 to get an inclusive upper bound in [0, 2^63-1]
>> 1247: jlong hi = static_cast<jlong>(divisor_magnitude - 1);
>
> Hmm, this also looks confusing for the `T_INT` case. What about `-5`, does that then not become `max_julong - 5`, but it should have been `max_juint - 1`?

We use `g_uabs()` to get the absolute value, which shouldn't exceed 2^31 for int values (i.e., `g_uabs(min_jint) == 2^31`). So we should get into the right range here again. But I guess I can expand the comment to better explain that part.

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/25254#discussion_r2120875453

From epeter at openjdk.org Mon Jun 2 11:46:54 2025
From: epeter at openjdk.org (Emanuel Peter)
Date: Mon, 2 Jun 2025 11:46:54 GMT
Subject: RFR: 8356813: Improve Mod(I|L)Node::Value [v4]
In-Reply-To: <5FnA_gZNzRom3MBShwfbdCffeRGogf1cyKo0nF40c4I=.9db6f973-e6a5-4852-b82e-24ccc198bcb9@github.com>
References: <2Jf_gfvRlKcmCFoQHp5T0WW_fU_yK5-0Z3z41f00-YU=.164be9f0-fae1-44bb-84c3-846d8c2c0db2@github.com>

 <5FnA_gZNzRom3MBShwfbdCffeRGogf1cyKo0nF40c4I=.9db6f973-e6a5-4852-b82e-24ccc198bcb9@github.com>
Message-ID:

On Mon, 2 Jun 2025 11:34:22 GMT, Hannes Greule wrote:

>> src/hotspot/share/opto/divnode.cpp line 1247:
>>
>>> 1245: // JVMS lrem bytecode: "the magnitude of the result is always less than the magnitude of the divisor"
>>> 1246: // "less than" means we can subtract 1 to get an inclusive upper bound in [0, 2^63-1]
>>> 1247: jlong hi = static_cast<jlong>(divisor_magnitude - 1);
>>
>> Hmm, this also looks confusing for the `T_INT` case. What about `-5`, does that then not become `max_julong - 5`, but it should have been `max_juint - 1`?
>
> We use `g_uabs()` to get the absolute value, that should't exceed 2^31 for int values (i.e., `g_uabs(min_jint) == 2^31`). So we should get into the right range here again. But I guess I can expand the comment to better explain that part.

@SirYwell I'm not 100% sure here, so please correct me if I'm wrong. You are now always passing in a `jlong` value, so you always use `static inline julong g_uabs(jlong n) { return g_uabs((julong)n); }`, even for `T_INT`.

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/25254#discussion_r2120898799

From epeter at openjdk.org Mon Jun 2 11:53:00 2025
From: epeter at openjdk.org (Emanuel Peter)
Date: Mon, 2 Jun 2025 11:53:00 GMT
Subject: RFR: 8347555: [REDO] C2: implement optimization for series of Add of unique value [v7]
In-Reply-To:
References:

Message-ID:

On Wed, 28 May 2025 14:47:02 GMT, Kangcheng Xu wrote:

>> @tabjy Thanks for your patience, this one took me longer than I wanted. I responded like this above:
>>
>>> Hmm, ok I see. Why don't you remove the asserts for now, and we see how clear the code looks now. I think I asked for the consistency check because I was confused by the previous code structure. Maybe it is ok now as it is.
>
> Ping @eme64 again for awareness. :)

@tabjy
> I could, at very least, try to swap LHS and RHS if no match is found

I think that would be a good idea, and not very hard. You can just have a function `add_pattern(lhs, rhs)`, and then run it also with `add_pattern(rhs, lhs)` for **swapping**.

Personally, I would have preferred a recursive algorithm, but that could have some compile time overhead. @chhagedorn was a little more skeptical about the recursive algorithm.

It seems the motivation for this change is the benchmark from here:
ArithmeticCanonicalizationBenchmark
https://ionutbalosin.com/2024/02/jvm-performance-comparison-for-jdk-21/#jit-compiler
This benchmark is of course somewhat arbitrary, and so are now all of your added patterns. Having a most general solution would be nice, but maybe the recursive algorithm is too much, I'm not 100% sure. Of course we now still have cases that do not optimize/canonicalize, and so someone could write a benchmark for those cases still.. oh well.

What I would like to see for **testing**: add some more patterns with IR rules. More that now optimize, and also a few that do not optimize, just so we have a bit of a sense what we are still missing.

@rwestrel Filed this issue. I wonder: what do you think we should do here? How general should the optimization/canonicalization be?
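The swapping idea above can be sketched in a few lines. This is a minimal Java illustration, not C2 code: the `Expr`, `Var` and `MulCon` types and both method names are hypothetical, and the single pattern (`base + base * con`) is only assumed as a stand-in for the patterns in the patch.

```java
import java.util.OptionalLong;

class AddPatternSketch {
    // Hypothetical miniature expression tree; not C2's Node hierarchy.
    interface Expr {}
    record Var(String name) implements Expr {}
    record MulCon(Expr base, long con) implements Expr {}

    // Matches "base + base * con" with the operands in exactly this order and
    // returns the combined multiplier (con + 1), or empty if nothing matches.
    static OptionalLong addPattern(Expr lhs, Expr rhs) {
        if (rhs instanceof MulCon m && m.base().equals(lhs)) {
            return OptionalLong.of(m.con() + 1);
        }
        return OptionalLong.empty();
    }

    // Addition is commutative, so retry with swapped operands if the
    // first order does not match.
    static OptionalLong addPatternCommutative(Expr lhs, Expr rhs) {
        OptionalLong result = addPattern(lhs, rhs);
        return result.isPresent() ? result : addPattern(rhs, lhs);
    }
}
```

With this shape, each pattern is written once for one operand order, and the wrapper covers the mirrored case. It does not address deeper nesting, which is where the recursive approach mentioned above would come in.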

-------------

PR Comment: https://git.openjdk.org/jdk/pull/23506#issuecomment-2930295143

From aph at openjdk.org Mon Jun 2 11:56:52 2025
From: aph at openjdk.org (Andrew Haley)
Date: Mon, 2 Jun 2025 11:56:52 GMT
Subject: RFR: 8353266: C2: Wrong execution with Integer.bitCount(int) intrinsic on AArch64 [v2]
In-Reply-To:
References:

Message-ID: On Mon, 2 Jun 2025 10:37:11 GMT, Marc Chevalier wrote: >> ### Problem >> >> On Aarch64, using `Integer.bitCount` can modify its argument. The problem comes from the implementation of `popCountI` on Aarch64. For instance, that's what we get with the reproducer `Reduced.java` on the related issue: >> >> ; Load lFld into local x >> ldr x11, [x10, #120] >> ; popCountI >> mov w11, w11 >> mov v16.d[0], x11 >> cnt v16.8b, v16.8b >> addv b16, v16.8b >> mov x13, v16.d[0] >> ; [...] >> ; store local x (which is believed to still contain lFld) into result >> str x11, [x10, #128] >> >> >> The instruction `mov w11, w11` is used to cut the 32 higher bits of `x11` since we use `popCountI` (from `Integer.bitCount`): on aarch64 (like other architectures), assigning the 32 lower bits of a register reset the 32 higher bits. Short: the input is modified, but the implementation of `popCountI` doesn't declare it: >> >> instruct popCountI(iRegINoSp dst, iRegIorL2I src, vRegF tmp) %{ >> match(Set dst (PopCountI src)); >> effect(TEMP tmp); >> [...] >> %} >> >> >> But then, why resetting the upper word of `x11`? It all starts with vector instructions: >> >> cnt v16.8b, v16.8b >> addv b16, v16.8b >> >> The `8b` specifies that it operates on the 8 lower bytes of `v16`, it would be nice to simply use `4b`, but that doesn't exist: vector instructions can only work on either the whole 128-bit register, or the 64 lower bits (by blocks of 1, 2, 4, 8 or 16 bytes). There is no suffix (and encoding) for a vector instruction to work only on the 32 lower bits, so not to pollute the bit count, we need to reset the 32 higher bits of `v16.d[0]` (aka `d16`), that is `v16.s[1]`, that is `v16[32:63]` in a more bit-explicit notation. Moreover, unlike with general purpose register doing >> >> mov v16.s[0], w11 >> >> would set `v16[0:31]` to `w11`, but not reset `v16[32:63]`. Which makes sense! Otherwise, using vector registers would be impractical if writing any piece would reset the rest... 
So we indeed need to set all of `v16[0:63]`, which >> >> mov w11, w11 >> mov v16.d[0], x11 >> >> does, but by destroying `x11`. >> >> ### Solution >> >> Simply adding `USE_KILL src` in the effects would be nice, but unfortunately not possible: `iRegIorL2I` is an operand class (either a 32-bit register or a L2I of a 64-bit register) and those cannot be used in effect lists. >> >> The way I went for is rather not to modify the source, but rather do write the two lower words of `v16` we are interested in separately: >> >> mov v16.s[1], wzr ... > > Marc Chevalier has updated the pull request incrementally with one additional commit since the last revision: > > Apply suggestions Marked as reviewed by aph (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/25551#pullrequestreview-2888056446 From chagedorn at openjdk.org Mon Jun 2 12:08:10 2025 From: chagedorn at openjdk.org (Christian Hagedorn) Date: Mon, 2 Jun 2025 12:08:10 GMT Subject: RFR: 8344942: Template-Based Testing Framework [v61] In-Reply-To: References:

Message-ID:

On Sun, 1 Jun 2025 05:58:13 GMT, Emanuel Peter wrote:

>> test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestTutorial.java line 306:
>>
>>> 304: var myHook = new Hook("MyHook");
>>> 305:
>>> 306: var template1 = Template.make("name", "value", (String name, Integer value) -> body(
>>
>> One could generally think about using `_` for unused lambda parameters which I think is the common convention. But then I guess we would need to update the documentation about saying "name" and "String name" should be the same and make an exception for unused ones. I don't know.
>
> I think it is better to keep the names duplicated. This gives the reader an easier visual aid to check which name has which type. What do you think?

That's totally fine and easy to follow.

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/24217#discussion_r2120948254

From jbhateja at openjdk.org Mon Jun 2 12:08:54 2025
From: jbhateja at openjdk.org (Jatin Bhateja)
Date: Mon, 2 Jun 2025 12:08:54 GMT
Subject: RFR: 8352635: Improve inferencing of Float16 operations with constant inputs [v4]
In-Reply-To: <6PFX21b9eT5mQv8Ym7b_RuKNpnuQ5CVqhc8TKxstlYo=.eb7d9f85-5e49-4e8f-b17a-c8e3728e7624@github.com>
References: <44nVQBYgzCOB2mAB9xtAPvkUcOMJOITA2VjMdDFgm1g=.48266693-48bf-41db-8871-a7dcafe93509@github.com>
 <6PFX21b9eT5mQv8Ym7b_RuKNpnuQ5CVqhc8TKxstlYo=.eb7d9f85-5e49-4e8f-b17a-c8e3728e7624@github.com>
Message-ID:

On Wed, 28 May 2025 09:15:31 GMT, Emanuel Peter wrote:

>> Jatin Bhateja has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains six commits:
>>
>>  - Merge branch 'master' of http://github.com/openjdk/jdk into JDK-8352635
>>  - Enabling some test points
>>  - Adding test points and some re-factoring
>>  - Merge branch 'master' of https://github.com/openjdk/jdk into JDK-8352635
>>  - Merge branch 'master' of http://github.com/openjdk/jdk into JDK-8352635
>>  - 8352635: Improve inferencing of Float16 operations with constant inputs
>
> @jatin-bhateja That looks very promising, thanks for working on that!

Hi @eme64,
Your comments have been addressed.

Best Regards

-------------

PR Comment: https://git.openjdk.org/jdk/pull/24179#issuecomment-2930355506

From yzheng at openjdk.org Mon Jun 2 12:11:57 2025
From: yzheng at openjdk.org (Yudi Zheng)
Date: Mon, 2 Jun 2025 12:11:57 GMT
Subject: RFR: 8357987: [JVMCI] Add support for retrieving all methods of a ResolvedJavaType [v2]
In-Reply-To:
References:

Message-ID: <0o43MdXkVHVU8JQIoBSQ-46j3jLJjvAEqARhk88aeEw=.a202168b-af3f-4ce4-b274-f1cbbd4295fa@github.com>

On Mon, 2 Jun 2025 08:15:36 GMT, Tom Shull wrote:

>> Currently from ResolvedJavaType one can retrieve all declared methods, static methods, and constructors of the given type. However, internally in HotSpot there are also VM-internal methods, such as overpass methods, associated with a given type which we cannot access via the API.
>>
>> To correct this, we should add a new method which enables VM-internal methods, such as overpass methods, to be accessed.
>
> Tom Shull has updated the pull request incrementally with one additional commit since the last revision:
>
>   format javadoc and update test

src/jdk.internal.vm.ci/share/classes/jdk/vm/ci/hotspot/HotSpotResolvedObjectTypeImpl.java line 1079:

> 1077: return List.of();
> 1078: }
> 1079: return Collections.unmodifiableList(Arrays.asList(instanceMethods));

`return List.of(instanceMethods);` should work. We can then replace the above with `return List.of(runtime().compilerToVm.getAllMethods(this));`

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/25498#discussion_r2120952854

From chagedorn at openjdk.org Mon Jun 2 12:12:11 2025
From: chagedorn at openjdk.org (Christian Hagedorn)
Date: Mon, 2 Jun 2025 12:12:11 GMT
Subject: RFR: 8344942: Template-Based Testing Framework [v61]
In-Reply-To:
References:

Message-ID:

On Sun, 1 Jun 2025 15:56:18 GMT, Emanuel Peter wrote:

>> Another question which is not evidently clear by following the examples: Can and should (not) you use the same hook inside the hook itself, i.e.:
>>
>> Hooks.CLASS_HOOK.anchor(
>> Hooks.CLASS_HOOK.anchor(
>> // ...
>>
>> This is probably not done on purpose but such a situation could arise when nesting more templates and suddenly one anchors the same hook again?
>
> I extended the explanations:
>
> // We saw the use of custom hooks above, but now we look at the use of CLASS_HOOK and METHOD_HOOK.
> // By convention, we use the CLASS_HOOK for class scopes, and METHOD_HOOK for method scopes.
> // Whenever we open a class scope, we should anchor a CLASS_HOOK for that scope, and whenever we
> // open a method, we should anchor a METHOD_HOOK. Conversely, this allows us to check if we are
> // inside a class or method scope by querying "isAnchored". This convention helps us when building
> // a large library of Templates. But if you are writing your own self-contained set of Templates,
> // you do not have to follow this convention.
> //
> // Hooks are "re-entrant", that is we can anchor the same hook inside a scope that we already
> // anchored it previously. The "Hook.insert" always goes to the innermost anchoring of that
> // hook. There are cases where "re-entrant" Hooks are helpful such as nested classes, where
> // there is a class scope inside another class scope. Similarly, we can nest lambda bodies
> // inside method bodies, so also METHOD_HOOK can be used in such a "re-entrant" way.
>
> We could consider having both "re-entrant" and "non-re-entrant" Hooks. But I'm not yet convinced it is a very useful feature. Sure, there could be some confusion with nested hooks. But I think that confusion is inherent to code generation, because we can also nest class and method/lambda scopes.
>
> What do you think?

The updated explanation does a very good job of making clear when we could/want to have nested hooks.

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/24217#discussion_r2120955442

From chagedorn at openjdk.org Mon Jun 2 12:18:10 2025
From: chagedorn at openjdk.org (Christian Hagedorn)
Date: Mon, 2 Jun 2025 12:18:10 GMT
Subject: RFR: 8344942: Template-Based Testing Framework [v61]
In-Reply-To:
References:

Message-ID:

On Fri, 30 May 2025 10:39:57 GMT, Christian Hagedorn wrote:

>> Emanuel Peter has updated the pull request incrementally with two additional commits since the last revision:
>>
>>  - Merge branch 'JDK-8344942-TemplateFramework-v3' of https://github.com/eme64/jdk into JDK-8344942-TemplateFramework-v3
>>  - move verification
>
> Thanks for all the updates and discussions! I've worked my way through the documentation in `Template` and the examples again in some more detail. It's much better and the new explanations are well done, excellent work!
>
> I left some comments here and there but mostly minor things. I will have another look at the implementation - probably only finished by Monday. The design now looks great. I'm glad we could find a good solution now after some more iterations :-)

> @chhagedorn Alright, I now have a decent solution for `$$var` and `$1var` etc. I also added tests for it.
>
> These are issues where we could continue the conversation, unless you are satisfied with my answers: [#24217 (comment)](https://github.com/openjdk/jdk/pull/24217#discussion_r2115388737) [#24217 (comment)](https://github.com/openjdk/jdk/pull/24217#discussion_r2115406391)
>
> This is now ready for another review pass.

Awesome, thanks for spending some more time with these nasty edge-cases and finding a solution! I had a look at your updates for all my comments, they look good, thanks! I'm going to make a pass over the implementation classes now and will have a look at the `Renderer` updates as well :-)

-------------

PR Comment: https://git.openjdk.org/jdk/pull/24217#issuecomment-2930394221

From thartmann at openjdk.org Mon Jun 2 12:49:52 2025
From: thartmann at openjdk.org (Tobias Hartmann)
Date: Mon, 2 Jun 2025 12:49:52 GMT
Subject: RFR: 8353266: C2: Wrong execution with Integer.bitCount(int) intrinsic on AArch64 [v2]
In-Reply-To:
References:

Message-ID:

On Mon, 2 Jun 2025 10:37:11 GMT, Marc Chevalier wrote:

>> ### Problem
>>
>> On AArch64, using `Integer.bitCount` can modify its argument. The problem comes from the implementation of `popCountI` on AArch64. For instance, this is what we get with the reproducer `Reduced.java` on the related issue:
>>
>>     ; Load lFld into local x
>>     ldr x11, [x10, #120]
>>     ; popCountI
>>     mov w11, w11
>>     mov v16.d[0], x11
>>     cnt v16.8b, v16.8b
>>     addv b16, v16.8b
>>     mov x13, v16.d[0]
>>     ; [...]
>>     ; store local x (which is believed to still contain lFld) into result
>>     str x11, [x10, #128]
>>
>>
>> The instruction `mov w11, w11` is used to clear the 32 higher bits of `x11` since we use `popCountI` (from `Integer.bitCount`): on AArch64 (like other architectures), assigning the 32 lower bits of a register resets the 32 higher bits. In short: the input is modified, but the implementation of `popCountI` doesn't declare it:
>>
>>     instruct popCountI(iRegINoSp dst, iRegIorL2I src, vRegF tmp) %{
>>       match(Set dst (PopCountI src));
>>       effect(TEMP tmp);
>>       [...]
>>     %}
>>
>>
>> But then, why reset the upper word of `x11`? It all starts with vector instructions:
>>
>>     cnt v16.8b, v16.8b
>>     addv b16, v16.8b
>>
>> The `8b` specifies that it operates on the 8 lower bytes of `v16`. It would be nice to simply use `4b`, but that doesn't exist: vector instructions can only work on either the whole 128-bit register, or the 64 lower bits (by blocks of 1, 2, 4, 8 or 16 bytes). There is no suffix (and encoding) for a vector instruction to work only on the 32 lower bits, so in order not to pollute the bit count, we need to reset the 32 higher bits of `v16.d[0]` (aka `d16`), that is `v16.s[1]`, that is `v16[32:63]` in a more bit-explicit notation. Moreover, unlike with a general-purpose register, doing
>>
>>     mov v16.s[0], w11
>>
>> would set `v16[0:31]` to `w11`, but not reset `v16[32:63]`. Which makes sense! Otherwise, using vector registers would be impractical if writing any piece would reset the rest... So we indeed need to set all of `v16[0:63]`, which
>>
>>     mov w11, w11
>>     mov v16.d[0], x11
>>
>> does, but by destroying `x11`.
>>
>> ### Solution
>>
>> Simply adding `USE_KILL src` in the effects would be nice, but unfortunately not possible: `iRegIorL2I` is an operand class (either a 32-bit register or a L2I of a 64-bit register) and those cannot be used in effect lists.
>>
>> The way I went for is not to modify the source, but rather to write the two lower words of `v16` we are interested in separately:
>>
>>     mov v16.s[1], wzr ...
>
> Marc Chevalier has updated the pull request incrementally with one additional commit since the last revision:
>
>   Apply suggestions

Nice analysis, Marc! The fix looks good to me and I don't have a strong opinion about the print format.

-------------

Marked as reviewed by thartmann (Reviewer).

PR Review: https://git.openjdk.org/jdk/pull/25551#pullrequestreview-2888252754

From hgreule at openjdk.org  Mon Jun  2 12:55:50 2025
From: hgreule at openjdk.org (Hannes Greule)
Date: Mon, 2 Jun 2025 12:55:50 GMT
Subject: RFR: 8356813: Improve Mod(I|L)Node::Value [v4]
In-Reply-To:
References: <2Jf_gfvRlKcmCFoQHp5T0WW_fU_yK5-0Z3z41f00-YU=.164be9f0-fae1-44bb-84c3-846d8c2c0db2@github.com>

 <5FnA_gZNzRom3MBShwfbdCffeRGogf1cyKo0nF40c4I=.9db6f973-e6a5-4852-b82e-24ccc198bcb9@github.com>
Message-ID:

On Mon, 2 Jun 2025 11:44:36 GMT, Emanuel Peter wrote:

>> We use `g_uabs()` to get the absolute value, which shouldn't exceed 2^31 for int values (i.e., `g_uabs(min_jint) == 2^31`). So we should get into the right range here again. But I guess I can expand the comment to better explain that part.
>
> @SirYwell I'm not 100% sure here, so please correct me if I'm wrong.
> You are now always passing in a `jlong` value, so you always use `static inline julong g_uabs(jlong n) { return g_uabs((julong)n); }`, even for `T_INT`.

Yes that's correct, and it should still work due to how negation works for negative inputs.

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/25254#discussion_r2121069875

From mhaessig at openjdk.org  Mon Jun  2 12:58:53 2025
From: mhaessig at openjdk.org (Manuel Hässig)
Date: Mon, 2 Jun 2025 12:58:53 GMT
Subject: RFR: 8354930: IGV: dump C2 graph before and after live range stretching
In-Reply-To:
References:
Message-ID:

On Wed, 28 May 2025 11:54:24 GMT, Manuel Hässig wrote:

> This PR introduces a new phase `LIVE_RANGE_STRETCHING` that prints after live ranges have been stretched, if that happens at all. The phase `INITIAL_LIVENESS` is moved before live range stretching so we can compare the live ranges before and after stretching in IGV, which is useful for debugging why an oop suddenly belongs to an oop map.
>
> ## Testing
>
> - [x] [Github Actions](https://github.com/mhaessig/jdk/actions/runs/15299362485)
> - [x] tier1 and tier1, plus additional Oracle internal testing for all Oracle supported platforms and OSs
> - [x] verified that the new phase prints when it should in IGV and with `-XX:PrintPhaseLevel=4`

Thank you for your reviews!
-------------

PR Comment: https://git.openjdk.org/jdk/pull/25492#issuecomment-2930572102

From duke at openjdk.org  Mon Jun  2 12:58:53 2025
From: duke at openjdk.org (duke)
Date: Mon, 2 Jun 2025 12:58:53 GMT
Subject: RFR: 8354930: IGV: dump C2 graph before and after live range stretching
In-Reply-To:
References:
Message-ID:

On Wed, 28 May 2025 11:54:24 GMT, Manuel Hässig wrote:

> This PR introduces a new phase `LIVE_RANGE_STRETCHING` that prints after live ranges have been stretched, if that happens at all. The phase `INITIAL_LIVENESS` is moved before live range stretching so we can compare the live ranges before and after stretching in IGV, which is useful for debugging why an oop suddenly belongs to an oop map.
>
> ## Testing
>
> - [x] [Github Actions](https://github.com/mhaessig/jdk/actions/runs/15299362485)
> - [x] tier1 and tier1, plus additional Oracle internal testing for all Oracle supported platforms and OSs
> - [x] verified that the new phase prints when it should in IGV and with `-XX:PrintPhaseLevel=4`

@mhaessig Your change (at version df3c396f5a26658f6efbaf4f7a153f7214be5e57) is now ready to be sponsored by a Committer.

-------------

PR Comment: https://git.openjdk.org/jdk/pull/25492#issuecomment-2930573797

From chagedorn at openjdk.org  Mon Jun  2 13:58:22 2025
From: chagedorn at openjdk.org (Christian Hagedorn)
Date: Mon, 2 Jun 2025 13:58:22 GMT
Subject: RFR: 8344942: Template-Based Testing Framework [v71]
In-Reply-To:
References:

Message-ID:

On Mon, 2 Jun 2025 03:30:24 GMT, Emanuel Peter wrote:

>> **Goal**
>> We want to generate Java source code:
>> - Make it easy to generate variants of tests. E.g. for each offset, for each operator, for each type, etc.
>> - Enable the generation of domain-specific fuzzers (e.g. random expressions and statements).
>>
>> Note: with the Template Library draft I was already able to find a [list of bugs](https://bugs.openjdk.org/issues/?jql=labels%20%3D%20template-framework%20ORDER%20BY%20created%20DESC%2C%20summary%20DESC).
>>
>> **How to get started**
>> When reviewing, please start by looking at:
>> https://github.com/openjdk/jdk/blob/d21a8aabaf3b191e851b6997c11bb30fcd0f942f/test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestSimple.java#L60-L76
>>
>> We have a Template with two arguments. They are typed (Integer and String). We then apply the arguments `template.withArgs(42, "7")`, producing a `TemplateWithArgs`. This can then be `render`ed to a String. And then that can be compiled and executed with the CompileFramework.
>>
>> Second, look at this advanced test:
>> https://github.com/openjdk/jdk/blob/77079807042fc5a3af04e0ccccad4ecd89e21cdb/test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestAdvanced.java#L102-L119
>>
>> And then for a "tutorial", look at:
>> `test/hotspot/jtreg/testlibrary_tests/template_framework/examples/TestTutorial.java`
>>
>> It shows these features:
>> - The `body` of a Template is essentially a list of `Token`s that are concatenated.
>> - Templates can be nested: a `TemplateWithArgs` is also a `Token`.
>> - We can use `#name` replacements to directly format values into the String. If we had proper String Templates in Java, we would not need this feature.
>> - We can use `$var` to make variable names unique: if we applied the same template twice, we would get variable collisions. `$var` is then replaced with e.g. `var_7` in one template use and `var_42` in the other template use.
>> - The use of `Hook`s to insert code into outer (earlier) code locations. This is useful, for example, to insert fields on demand.
>> - The use of recursive templates, and `fuel` to limit the recursion.
>> - `Name`s: useful to register field and variable names in code scopes.
>>
>> Next, look at the documentation in `Template.java`. This file is the heart of the Template Framework, and describes all the important features.
>> https://github.com/openjdk/jdk/blob/d21a8aabaf3b191e851b6997c11bb30fcd0f942f/test/hotspot/jtreg/compiler/lib/template_framework/Template.java#L31-L76
>>
>> For a better experience, you may want...
>
> Emanuel Peter has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 91 commits:
>
>  - Merge branch 'master' into JDK-8344942-TemplateFramework-v3
>  - validation tests
>  - dollar and hashtag parsing validatiaon
>  - wip refactor parsing dollar and hashtag
>  - more fixes from Christian
>  - more improvements
>  - more suggestions applied
>  - good practice
>  - rename template arguments
>  - more from Christian
>  - ... and 81 more: https://git.openjdk.org/jdk/compare/90d6ad01...cb7037e7

I worked my way through the rest of the implementation. Impressive work Emanuel! I left some more mostly minor comments. But otherwise, this looks great!

test/hotspot/jtreg/compiler/lib/template_framework/Code.java line 26:

> 24: package compiler.lib.template_framework;
> 25: 
> 26: import java.util.ArrayList;

Unused:
Suggestion:

test/hotspot/jtreg/compiler/lib/template_framework/Code.java line 33:

> 31:  * All the {@link String}s are later collected in a {@link StringBuilder}. If we used a {@link StringBuilder}
> 32:  * directly to collect the {@link String}s, we could not as easily insert code at an "earlier" position, i.e.
> 33:  * reaching out to a {@link Hook#set}.

Suggestion:

 * reaching out to a {@link Hook#anchor}.

test/hotspot/jtreg/compiler/lib/template_framework/CodeFrame.java line 37:

> 35:  * When a {@link Hook} is {@link Hook#set}, this separates the Template into an outer and inner
> 36:  * {@link CodeFrame}, ensuring that names that are {@link Template#addName}'d inside the inner frame
> 37:  * are only available inside that frame.

Still references old method names.
Suggestion:

 * The {@link CodeFrame} represents a frame (i.e. scope) of code, appending {@link Code} to the {@code 'codeList'}
 * as {@link Token}s are rendered, and adding names to the {@link NameSet}s with {@link Template#addStructuralName}/
 * {@link Template#addDataName}. {@link Hook}s can be added to a frame, which allows code to be inserted at that
 * location later. When a {@link Hook} is {@link Hook#anchor}ed, it separates the Template into an outer and inner
 * {@link CodeFrame}, ensuring that names that are added inside the inner frame are only available inside that frame.

test/hotspot/jtreg/compiler/lib/template_framework/CodeFrame.java line 52:

> 50: class CodeFrame {
> 51:     public final CodeFrame parent;
> 52:     private final List<Code> codeList = new ArrayList<Code>();

Suggestion:

    private final List<Code> codeList = new ArrayList<>();

test/hotspot/jtreg/compiler/lib/template_framework/CodeFrame.java line 58:

> 56:      * The {@link NameSet} is used for variable and fields etc.
> 57:      */
> 58:     final NameSet names;

I think this can also be made private:

Suggestion:

    private final NameSet names;

test/hotspot/jtreg/compiler/lib/template_framework/CodeFrame.java line 70:

> 68:         } else {
> 69:             // New NameSet, to make sure we have a nested scope for the names.
> 70:             this.names     = new NameSet(parent.names);

Indentation is off:
Suggestion:

            this.names = parent.names;
        } else {
            // New NameSet, to make sure we have a nested scope for the names.
            this.names = new NameSet(parent.names);

test/hotspot/jtreg/compiler/lib/template_framework/CodeFrame.java line 92:

> 90:     /**
> 91:      * Creates a special frame, which has a {@link #parent} but uses the {@link NameSet}
> 92:      * from the parent frame, allowing {@link Template#defineName} to persist in the outer

`defineName` -> `addName`?

test/hotspot/jtreg/compiler/lib/template_framework/CodeFrame.java line 96:

> 94:      * where we would possibly want to make field or variable definitions during the insertion
> 95:      * that are not just local to the insertion but affect the {@link CodeFrame} that we
> 96:      * {@link Hook#set} earlier and are now {@link Hook#insert}ing into.

Suggestion:

     * {@link Hook#anchor} earlier and are now {@link Hook#insert}ing into.

test/hotspot/jtreg/compiler/lib/template_framework/CodeFrame.java line 118:

> 116:     }
> 117: 
> 118:     boolean hasHook(Hook hook) {

Can be made private:
Suggestion:

    private boolean hasHook(Hook hook) {

test/hotspot/jtreg/compiler/lib/template_framework/DataName.java line 33:

> 31:  * count, list or even sample random {@link DataName}s. Every {@link DataName} has a {@link DataName.Type},
> 32:  * so that sampling can be restricted to these types.
> 33:  *

Suggestion:

 *
 * <p>

test/hotspot/jtreg/compiler/lib/template_framework/DataName.java line 123:

> 121:                 if (mutability == Mutability.IMMUTABLE && dn.mutable()) { return false; }
> 122:                 if (subtype != null && !dn.type().isSubtypeOf(subtype)) { return false; }
> 123:                 if (supertype != null && !supertype.isSubtypeOf(dn.type())) { return false; }

I suggest using the full name:
Suggestion:

                if (!(name instanceof DataName dataName)) { return false; }
                if (mutability == Mutability.MUTABLE && !dataName.mutable()) { return false; }
                if (mutability == Mutability.IMMUTABLE && dataName.mutable()) { return false; }
                if (subtype != null && !dataName.type().isSubtypeOf(subtype)) { return false; }
                if (supertype != null && !supertype.isSubtypeOf(dataName.type())) { return false; }

test/hotspot/jtreg/compiler/lib/template_framework/DataName.java line 134:

> 132:          * @return The filtered {@link View}.
> 133:          * @throws UnsupportedOperationException If this {@link View} was already filtered with
> 134:          *                                       {@link subtypeOf} or {@link exactOf}.

Same for the links in the methods below:
Suggestion:

         *                                       {@link #subtypeOf} or {@link #exactOf}.

test/hotspot/jtreg/compiler/lib/template_framework/DataName.java line 144:

> 142: 
> 143:         /**
> 144:          * Create a filtered {@link View}, where all {@link DataName}s must be subtypes of {@code type}.

Suggestion:

         * Create a filtered {@link View}, where all {@link DataName}s must be supertypes of {@code type}.

test/hotspot/jtreg/compiler/lib/template_framework/DataName.java line 181:

> 179:          */
> 180:         public DataName sample() {
> 181:             DataName n = (DataName)Renderer.getCurrent().sampleName(predicate());

Do you really need this cast? Can't you just return a `Name`? From the uses, it seems that you only call interface methods from `Name` at the use-sites.

test/hotspot/jtreg/compiler/lib/template_framework/Hook.java line 34:

> 32:  * "back" or to some outer scope, e.g. while generating code for a method, one can reach out
> 33:  * to the class scope to insert fields.
> 34:  *

Suggestion:

 *
 * <p>


test/hotspot/jtreg/compiler/lib/template_framework/Name.java line 35:

> 33:      * The name of the name, that can be used in code.
> 34:      *
> 35:      * @return The {@String} name of the name, that can be used in code.

Suggestion:

     * @return The {@link String} name of the name, that can be used in code.

test/hotspot/jtreg/compiler/lib/template_framework/Name.java line 54:

> 52:     int weight();
> 53: 
> 54:     public interface Type {

Implicitly public:
Suggestion:

    interface Type {
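
For reference, member types declared inside an interface are implicitly `public` and `static`, which is why the modifier is redundant. A minimal sketch that checks this via reflection; `NameLike` and `ImplicitModifierDemo` are hypothetical stand-ins, not the framework's classes:

```java
import java.lang.reflect.Modifier;

// Hypothetical stand-in for an interface with a nested member interface.
interface NameLike {
    String name();

    // No modifier written: member types of an interface are implicitly public and static.
    interface Type {
        boolean isSubtypeOf(Type other);
    }
}

class ImplicitModifierDemo {
    // Reads the modifiers recorded for the nested interface.
    static boolean nestedTypeIsPublic() {
        return Modifier.isPublic(NameLike.Type.class.getModifiers());
    }

    static boolean nestedTypeIsStatic() {
        return Modifier.isStatic(NameLike.Type.class.getModifiers());
    }

    public static void main(String[] args) {
        System.out.println("public: " + nestedTypeIsPublic());
        System.out.println("static: " + nestedTypeIsStatic());
    }
}
```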

test/hotspot/jtreg/compiler/lib/template_framework/NameSet.java line 38:

> 36:  */
> 37: class NameSet {
> 38:     static final Random RANDOM = Utils.getRandomInstance();

Suggestion:

    private static final Random RANDOM = Utils.getRandomInstance();

test/hotspot/jtreg/compiler/lib/template_framework/NameSet.java line 58:

> 56: 
> 57:     private long weight(Predicate predicate) {
> 58:         long w = names.stream().filter(n -> predicate.check(n)).mapToInt(Name::weight).sum();

Suggestion:

        long w = names.stream().filter(predicate::check).mapToInt(Name::weight).sum();
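
As an illustration of why this rewrite is behavior-preserving, here is a small sketch comparing the lambda and the method reference. `NamePredicate` is a hypothetical stand-in for a custom predicate type with a `check` method; it is not the framework's `Predicate`:

```java
import java.util.List;

class MethodRefDemo {
    // Hypothetical stand-in for a custom predicate type with a check() method.
    interface NamePredicate {
        boolean check(String name);
    }

    // Lambda form, as in the quoted code.
    static long countWithLambda(List<String> names, NamePredicate predicate) {
        return names.stream().filter(n -> predicate.check(n)).count();
    }

    // Method-reference form, as in the suggestion.
    static long countWithMethodRef(List<String> names, NamePredicate predicate) {
        return names.stream().filter(predicate::check).count();
    }

    public static void main(String[] args) {
        List<String> names = List.of("var_7", "field_1", "var_42");
        NamePredicate isVar = name -> name.startsWith("var_");
        System.out.println(countWithLambda(names, isVar));
        System.out.println(countWithMethodRef(names, isVar));
    }
}
```

One subtle difference: a bound method reference such as `predicate::check` evaluates (and null-checks) the receiver when the reference is created, whereas the lambda only dereferences `predicate` when it is invoked. For a non-null receiver the two forms give the same result.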

test/hotspot/jtreg/compiler/lib/template_framework/NameSet.java line 64:

> 62: 
> 63:     public int count(Predicate predicate) {
> 64:         int c = (int)names.stream().filter(n -> predicate.check(n)).count();

Suggestion:

        int c = (int)names.stream().filter(predicate::check).count();

test/hotspot/jtreg/compiler/lib/template_framework/NameSet.java line 70:

> 68: 
> 69:     public boolean hasAny(Predicate predicate) {
> 70:         return names.stream().anyMatch(n -> predicate.check(n)) ||

Suggestion:

        return names.stream().anyMatch(predicate::check) ||

test/hotspot/jtreg/compiler/lib/template_framework/NameSet.java line 77:

> 75:         List<Name> list = (parent != null) ? parent.toList(predicate)
> 76:                                            : new ArrayList<>();
> 77:         list.addAll(names.stream().filter(n -> predicate.check(n)).toList());

Suggestion:

        list.addAll(names.stream().filter(predicate::check).toList());

test/hotspot/jtreg/compiler/lib/template_framework/NameSet.java line 88:

> 86:         if (w <= 0) {
> 87:             return null;
> 88:         }

Shouldn't the weight always be positive?
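
Assuming `w` is a sum over the names that match the predicate, as in the `weight(Predicate)` method quoted earlier, one case where the total is zero is when no name matches. A sketch of weighted sampling under that assumption (Java 17 or later); `Entry`, `totalWeight` and `sample` are hypothetical and not the `NameSet` implementation:

```java
import java.util.List;
import java.util.Random;
import java.util.function.Predicate;

class WeightedSampleDemo {
    // Hypothetical stand-in for a name with a sampling weight.
    record Entry(String name, int weight) {}

    // Sum of the weights of all entries accepted by the filter; 0 if none match.
    static long totalWeight(List<Entry> entries, Predicate<Entry> filter) {
        return entries.stream().filter(filter).mapToInt(Entry::weight).sum();
    }

    // Picks a matching entry with probability proportional to its weight,
    // or returns null if the filtered total weight is not positive.
    static Entry sample(List<Entry> entries, Predicate<Entry> filter, Random random) {
        long total = totalWeight(entries, filter);
        if (total <= 0) {
            return null;
        }
        long r = random.nextLong(total); // uniform in [0, total)
        for (Entry e : entries) {
            if (!filter.test(e)) { continue; }
            if (r < e.weight()) { return e; }
            r -= e.weight();
        }
        throw new IllegalStateException("unreachable for non-negative weights");
    }

    public static void main(String[] args) {
        List<Entry> entries = List.of(new Entry("a", 1), new Entry("b", 3));
        System.out.println(sample(entries, e -> true, new Random()));
        System.out.println(sample(entries, e -> false, new Random()));
    }
}
```

In this sketch the `total <= 0` guard is what handles the "nothing matches" case, even if every individual weight is positive.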

test/hotspot/jtreg/compiler/lib/template_framework/Renderer.java line 66:

> 64:             // another non-capturing group.
> 65:             "(?:\\{" +
> 66:                 // capturing group for "name" inside of "{name}"

Suggestion:

                // capturing group for "name" inside "{name}"

test/hotspot/jtreg/compiler/lib/template_framework/Renderer.java line 199:

> 197:     /**
> 198:      * Formats values to {@link String} with the goal of using them in Java code.
> 199:      * By default we use the overrides of {@link Object#toString}.

Suggestion:

     * By default, we use the overrides of {@link Object#toString}.

test/hotspot/jtreg/compiler/lib/template_framework/Renderer.java line 266:

> 264:             case StringToken(String s) -> {
> 265:                 renderStringWithDollarAndHashtagReplacements(s);
> 266:             }

Suggestion:

            case StringToken(String s) -> renderStringWithDollarAndHashtagReplacements(s);
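
For context, in an arrow-form `switch` a case whose body is a single expression statement does not need braces, while a multi-statement body still does. A small sketch with hypothetical token records (Java 21 or later for record patterns in `switch`); these are not the framework's `Token` types:

```java
import java.util.ArrayList;
import java.util.List;

class SwitchArrowDemo {
    // Hypothetical sealed hierarchy, so the switch below is exhaustive without a default.
    sealed interface DemoToken {}
    record TextToken(String value) implements DemoToken {}
    record NumberToken(int value) implements DemoToken {}

    static void render(DemoToken token, List<String> out) {
        switch (token) {
            // Single statement: no braces needed.
            case TextToken(String s) -> out.add(s);
            // Several statements: braces are required.
            case NumberToken(int n) -> {
                String text = Integer.toString(n);
                out.add(text);
            }
        }
    }

    static List<String> renderAll(List<DemoToken> tokens) {
        List<String> out = new ArrayList<>();
        for (DemoToken token : tokens) {
            render(token, out);
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(renderAll(List.of(new TextToken("x = "), new NumberToken(42))));
    }
}
```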

test/hotspot/jtreg/compiler/lib/template_framework/Renderer.java line 321:

> 319:                 callerCodeFrame.addCode(currentCodeFrame.getCode());
> 320:                 currentCodeFrame = callerCodeFrame;
> 321:             }

For readability:
Suggestion:

            case HookInsertToken(Hook hook, TemplateToken templateToken) -> {
                // Switch to hook CodeFrame.
                CodeFrame callerCodeFrame = currentCodeFrame;
                CodeFrame hookCodeFrame = codeFrameForHook(hook);

                // Use a transparent nested CodeFrame. We need a CodeFrame so that the code generated
                // by the TemplateToken can be collected, and hook insertions from it can still
                // be made to the hookCodeFrame before the code from the TemplateToken is added to
                // the hookCodeFrame.
                // But the CodeFrame must be transparent, so that its name definitions go out to
                // the hookCodeFrame, and are not limited to the CodeFrame for the TemplateToken.
                currentCodeFrame = CodeFrame.makeTransparentForNames(hookCodeFrame);

                renderTemplateToken(templateToken);

                hookCodeFrame.addCode(currentCodeFrame.getCode());

                // Switch back from hook CodeFrame to caller CodeFrame.
                currentCodeFrame = callerCodeFrame;
            }
            case TemplateToken templateToken -> {
                // Use a nested CodeFrame.
                CodeFrame callerCodeFrame = currentCodeFrame;
                currentCodeFrame = CodeFrame.make(currentCodeFrame);

                renderTemplateToken(templateToken);

                callerCodeFrame.addCode(currentCodeFrame.getCode());
                currentCodeFrame = callerCodeFrame;
            }

test/hotspot/jtreg/compiler/lib/template_framework/Renderer.java line 324:

> 322:             case AddNameToken(Name name) -> {
> 323:                 currentCodeFrame.addName(name);
> 324:             }

Suggestion:

            case AddNameToken(Name name) -> currentCodeFrame.addName(name);

test/hotspot/jtreg/compiler/lib/template_framework/Renderer.java line 338:

> 336:     }
> 337: 
> 338:     private void renderStringWithDollarAndHashtagReplacements(String s) {

It is hard to grasp the logic of that method. But I trust you on that :-) I leave it up to you whether you want to improve readability by extracting some of the logic into separate methods, so that this method becomes easier to understand.

test/hotspot/jtreg/compiler/lib/template_framework/StructuralName.java line 33:

> 31:  * count, list or even sample random {@link StructuralName}s. Every {@link StructuralName} has a {@link StructuralName.Type},
> 32:  * so that sampling can be restricted to these types.
> 33:  *

Suggestion:

 *
 * <p>


test/hotspot/jtreg/compiler/lib/template_framework/StructuralName.java line 47:

> 45:      */
> 46:     public StructuralName {
> 47:     }

Is this required? Is it not automatically added? Same for `DataName`.
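
For a record, the canonical constructor is generated automatically when none is declared, so an empty compact constructor does not change behavior. One possible reason to keep one is to have a place for a Javadoc comment; whether that is the motivation here is an assumption. A sketch with hypothetical records:

```java
import java.lang.reflect.Constructor;
import java.util.Arrays;

class RecordCtorDemo {
    // No constructor declared: the canonical constructor (String, int) is generated.
    record Implicit(String name, int weight) {}

    // Empty compact canonical constructor: same signature, same behavior.
    record Explicit(String name, int weight) {
        /** A place to document the constructor. */
        Explicit {
        }
    }

    // Describes the declared constructors: their count and the parameter types of the first one.
    static String signature(Class<?> recordClass) {
        Constructor<?>[] ctors = recordClass.getDeclaredConstructors();
        return ctors.length + ":" + Arrays.toString(ctors[0].getParameterTypes());
    }

    public static void main(String[] args) {
        System.out.println(signature(Implicit.class));
        System.out.println(signature(Explicit.class));
    }
}
```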

test/hotspot/jtreg/compiler/lib/template_framework/StructuralName.java line 68:

> 66:          */
> 67:         boolean isSubtypeOf(StructuralName.Type other);
> 68:     }

This is identical to `DataName.Type`. What is the benefit of having separate interfaces `DataName.Type` and `StructuralName.Type`? Couldn't we just move `isSubtypeOf()` directly to the `Name.Type` interface and use that one below and for the fields and expose that one instead to the user? This would mean that you can update all `DataName/StructuralName.Type` to `Name.Type`. I have not checked if this is fully possible but it just occurred to me when reviewing this duplicated interface now.

test/hotspot/jtreg/compiler/lib/template_framework/StructuralName.java line 96:

> 94:                 if (!(name instanceof StructuralName dn)) { return false; }
> 95:                 if (subtype != null && !dn.type().isSubtypeOf(subtype)) { return false; }
> 96:                 if (supertype != null && !supertype.isSubtypeOf(dn.type())) { return false; }

Suggestion:

                if (!(name instanceof StructuralName structuralName)) { return false; }
                if (subtype != null && !structuralName.type().isSubtypeOf(subtype)) { return false; }
                if (supertype != null && !supertype.isSubtypeOf(structuralName.type())) { return false; }

test/hotspot/jtreg/compiler/lib/template_framework/StructuralName.java line 107:

> 105:          * @return The filtered {@link View}.
> 106:          * @throws UnsupportedOperationException If this {@link View} was already filtered with
> 107:          *                                       {@link subtypeOf} or {@link exactOf}.

Same here and in methods below:
Suggestion:

         *                                       {@link #subtypeOf} or {@link #exactOf}.

test/hotspot/jtreg/compiler/lib/template_framework/StructuralName.java line 117:

> 115: 
> 116:         /**
> 117:          * Create a filtered {@link View}, where all {@link StructuralName}s must be subtypes of {@code type}.

Suggestion:

         * Create a filtered {@link View}, where all {@link StructuralName}s must be supertypes of {@code type}.

test/hotspot/jtreg/compiler/lib/template_framework/TemplateBinding.java line 43:

> 41:      * Creates a new {@link TemplateBinding} that has no Template bound to it yet.
> 42:      */
> 43:     public TemplateBinding() {}

Can also be removed since it's the default constructor that is automatically added for you.
Suggestion:

test/hotspot/jtreg/compiler/lib/template_framework/Token.java line 31:

> 29: 
> 30: /**
> 31:  * The {@link Template#body} and {@link Hook#set} are given a list of tokens, which are either

Suggestion:

 * The {@link Template#body} and {@link Hook#anchor} are given a list of tokens, which are either

test/hotspot/jtreg/compiler/lib/template_framework/Token.java line 74:

> 72:             case Float s   -> outputList.add(new StringToken(Renderer.format(s)));
> 73:             case Boolean s -> outputList.add(new StringToken(Renderer.format(s)));
> 74:             case List l    -> parseList(l, outputList);

Not sure if we should use a raw `List` here. Would `List` work as well? Would then need to update `parseList(List