From dholmes at openjdk.org Tue Apr 1 06:03:47 2025 From: dholmes at openjdk.org (David Holmes) Date: Tue, 1 Apr 2025 06:03:47 GMT Subject: RFR: 8323100: com/sun/tools/attach/StartManagementAgent.java failed with "WaitForSingleObject failed" Message-ID: See bug report for gory details. Short version: in the Windows version of `VirtualMachineImpl::execute`, if an exception occurred after we created the `SocketInputStreamImpl` (which is the test scenario of the failing test), we would close the native `HANDLE` to the pipe twice. But after the first close the `HANDLE` could be reassigned to another object (e.g. the `_ParkHandle` of the `StreamPumper` thread) and the second close would close that `HANDLE` resulting in the failure of `WaitForSingleObject`. (Other failure modes with different invalid handles have also been seen.) Fix: shorten the outer try/catch block so that we only directly close the pipe if the `IOException` happens before we create the `SocketStreamImpl` - after which the closing of the stream will close the pipe `HANDLE`. Testing: - ran the com/sun/tools/attach tets group 2500 times without failure - tiers 3-5 as a sanity check (Windows only) Thanks ------------- Commit messages: - 8323100: com/sun/tools/attach/StartManagementAgent.java failed with "WaitForSingleObject failed" Changes: https://git.openjdk.org/jdk/pull/24346/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24346&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8323100 Stats: 26 lines in 1 file changed: 14 ins; 11 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/24346.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24346/head:pull/24346 PR: https://git.openjdk.org/jdk/pull/24346 From kevinw at openjdk.org Tue Apr 1 09:11:21 2025 From: kevinw at openjdk.org (Kevin Walls) Date: Tue, 1 Apr 2025 09:11:21 GMT Subject: RFR: 8323100: com/sun/tools/attach/StartManagementAgent.java failed with "WaitForSingleObject failed" In-Reply-To: References: Message-ID: On Tue, 1 Apr 2025 05:57:55 GMT, David Holmes wrote: > See bug report for gory details. > > Short version: in the Windows version of `VirtualMachineImpl::execute`, if an exception occurred after we created the `SocketInputStreamImpl` (which is the test scenario of the failing test), we would close the native `HANDLE` to the pipe twice. But after the first close the `HANDLE` could be reassigned to another object (e.g. the `_ParkHandle` of the `StreamPumper` thread) and the second close would close that `HANDLE` resulting in the failure of `WaitForSingleObject`. (Other failure modes with different invalid handles have also been seen.) > > Fix: shorten the outer try/catch block so that we only directly close the pipe if the `IOException` happens before we create the `SocketStreamImpl` - after which the closing of the stream will close the pipe `HANDLE`. > > Testing: > - ran the com/sun/tools/attach tets group 2500 times without failure > - tiers 3-5 as a sanity check (Windows only) > > Thanks Excellent to have this found, they just looked like an "impossible" failures. 8-) Also good lessons in naming and commenting around processCompletionStatus. ------------- Marked as reviewed by kevinw (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24346#pullrequestreview-2732025593 From varadam at openjdk.org Tue Apr 1 10:30:56 2025 From: varadam at openjdk.org (Varada M) Date: Tue, 1 Apr 2025 10:30:56 GMT Subject: RFR: 8352392: AIX: implement attach API v2 and streaming output [v4] In-Reply-To: References: Message-ID: > AIX changes for attach API to support arbitrary length arguments and the streaming output support. > serviceability/attach/AttachAPIv2/StreamingOutputTest.java test passes > > tier1, tier2 and tier3 testing is successful with fastdebug level > > JBS Issue : [JDK-8352392](https://bugs.openjdk.org/browse/JDK-8352392) Varada M has updated the pull request incrementally with one additional commit since the last revision: 8352392: AIX: implement attach API v2 and streaming output ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24177/files - new: https://git.openjdk.org/jdk/pull/24177/files/d22f0ab9..234f6d17 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24177&range=03 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24177&range=02-03 Stats: 20 lines in 2 files changed: 1 ins; 0 del; 19 mod Patch: https://git.openjdk.org/jdk/pull/24177.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24177/head:pull/24177 PR: https://git.openjdk.org/jdk/pull/24177 From varadam at openjdk.org Tue Apr 1 10:35:20 2025 From: varadam at openjdk.org (Varada M) Date: Tue, 1 Apr 2025 10:35:20 GMT Subject: RFR: 8352392: AIX: implement attach API v2 and streaming output [v3] In-Reply-To: References: <4PJ92vw13If5UwOGWdLxkM0uaUiOa4GqogzCB37xSLk=.2b0950bb-937f-408d-9c69-31dd06e391d3@github.com> Message-ID: On Mon, 31 Mar 2025 10:16:31 GMT, Martin Doerr wrote: >> Varada M has updated the pull request incrementally with two additional commits since the last revision: >> >> - updated copyright header >> - removed StreamingOutputTest.java from problem list > > src/jdk.attach/aix/classes/sun/tools/attach/VirtualMachineImpl.java line 196: > >> 194: } >> 195: } >> 196: > > Same here. Hi Martin, I have added the indentation fix and comment ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24177#discussion_r2022594632 From kevinw at openjdk.org Tue Apr 1 10:39:28 2025 From: kevinw at openjdk.org (Kevin Walls) Date: Tue, 1 Apr 2025 10:39:28 GMT Subject: RFR: 8353231: Test com/sun/management/OperatingSystemMXBean/GetProcessCpuLoad still fails intermittently Message-ID: Test com/sun/management/OperatingSystemMXBean/GetProcessCpuLoad still fails intermittently. On failure, 10 attempts with sleep(200) each time, only read -1 from mbean.getProcessCpuLoad(). The method is documented to return -1 when info is not available, but want to avoid the test accepting a -1 and masking real problems. Test failures are happening when multiple CPU load reding tests ran on the same host, at the same second. Add a TEST.properties file containing: exclusiveAccess.dirs=. ------------- Commit messages: - 8353231: Test com/sun/management/OperatingSystemMXBean cpuLoad still fails intermittently Changes: https://git.openjdk.org/jdk/pull/24352/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24352&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8353231 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/24352.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24352/head:pull/24352 PR: https://git.openjdk.org/jdk/pull/24352 From ihse at openjdk.org Tue Apr 1 12:02:19 2025 From: ihse at openjdk.org (Magnus Ihse Bursie) Date: Tue, 1 Apr 2025 12:02:19 GMT Subject: RFR: 8349638: Build libjdwp with SIZE optimization In-Reply-To: References: Message-ID: On Tue, 11 Feb 2025 15:56:39 GMT, Matthias Baesken wrote: > The libjdwp is currently built with LOW optimization level, it could be built with SIZE optimization to lower the lib size by ~ 10 % on UNIX. > On Windows LOW and SIZE currently translate to the same O1 optimization flag so no difference there. > > On Linux x86_64 for example the lib shrinks from > 300K to 268K and the debuginfo file shrinks from 1.9M to 1.7M . > > On Linux ppc64le for example the lib shrinks from > 428K to 368K and the debuginfo file shrinks from 2.0M to 1.7M . It would be interesting to also see how compilation times varies with optimization level. At least some kind of hint if HIGHEST is like 2x slower than LOW, or if SIZE is slower than LOW at all, etc. The relative speed difference is interesting, but so is it in absolute terms. If a library takes 0.5 seconds on LOW but 1.1 seconds on HIGH on a particular system, it is unlikely to matter much to overall build time anywhere. But if it goes from 15s to 30s on a fast machine, it might be a problem if such performance regressions stack up, especially on slower machines (which includes the ones running GHA). ------------- PR Comment: https://git.openjdk.org/jdk/pull/23563#issuecomment-2769121979 From dholmes at openjdk.org Tue Apr 1 12:03:23 2025 From: dholmes at openjdk.org (David Holmes) Date: Tue, 1 Apr 2025 12:03:23 GMT Subject: RFR: 8323100: com/sun/tools/attach/StartManagementAgent.java failed with "WaitForSingleObject failed" In-Reply-To: References:

Message-ID: On Tue, 1 Apr 2025 09:08:06 GMT, Kevin Walls wrote: >> See bug report for gory details. >> >> Short version: in the Windows version of `VirtualMachineImpl::execute`, if an exception occurred after we created the `SocketInputStreamImpl` (which is the test scenario of the failing test), we would close the native `HANDLE` to the pipe twice. But after the first close the `HANDLE` could be reassigned to another object (e.g. the `_ParkHandle` of the `StreamPumper` thread) and the second close would close that `HANDLE` resulting in the failure of `WaitForSingleObject`. (Other failure modes with different invalid handles have also been seen.) >> >> Fix: shorten the outer try/catch block so that we only directly close the pipe if the `IOException` happens before we create the `SocketStreamImpl` - after which the closing of the stream will close the pipe `HANDLE`. >> >> Testing: >> - ran the com/sun/tools/attach tets group 2500 times without failure >> - tiers 3-5 as a sanity check (Windows only) >> >> Thanks > > Excellent to have this found, they just looked like an "impossible" failures. 8-) > > Also good lessons in naming and commenting around processCompletionStatus. Thanks for the review @kevinjwalls ! ------------- PR Comment: https://git.openjdk.org/jdk/pull/24346#issuecomment-2769126946 From mdoerr at openjdk.org Tue Apr 1 12:22:20 2025 From: mdoerr at openjdk.org (Martin Doerr) Date: Tue, 1 Apr 2025 12:22:20 GMT Subject: RFR: 8352392: AIX: implement attach API v2 and streaming output [v3] In-Reply-To: References: <4PJ92vw13If5UwOGWdLxkM0uaUiOa4GqogzCB37xSLk=.2b0950bb-937f-408d-9c69-31dd06e391d3@github.com>

Message-ID: On Tue, 1 Apr 2025 10:32:55 GMT, Varada M wrote: >> src/jdk.attach/aix/classes/sun/tools/attach/VirtualMachineImpl.java line 196: >> >>> 194: } >>> 195: } >>> 196: >> >> Same here. > > Hi Martin, I have added the indentation fix and comment Thanks! I think it's good. Let's wait for @JoKern65 's review. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24177#discussion_r2022743745 From sgehwolf at openjdk.org Tue Apr 1 14:59:37 2025 From: sgehwolf at openjdk.org (Severin Gehwolf) Date: Tue, 1 Apr 2025 14:59:37 GMT Subject: RFR: 8336881: [Linux] Support for hierarchical limits for Metrics [v17] In-Reply-To: References: Message-ID: > Please review this fix for cgroups-based metrics reporting in the `jdk.internal.platform` package. This fix is supposed to address wrong reporting of certain limits if the limits aren't set at the leaf nodes. > > For example, on cg v2, the memory limit interface file is `memory.max`. Consider a cgroup path of `/a/b/c/d`. The current code only reports the limits (via Metrics) correctly if it's set at `/a/b/c/d/memory.max`. However, some systems - like a systemd slice - sets those limits further up the hierarchy. For example at `/a/b/c/memory.max`. `/a/b/c/d/memory.max` might be set to the value `max` (for unlimited), yet `/a/b/c/memory.max` would report the actual limit value (e.g. `1048576000`). > > This patch addresses this issue by: > > 1. Refactoring the interface lookup code to relevant controllers for cpu/memory. The CgroupSubsystem classes then delegate to those for the lookup. This facilitates having an API for the lookup of an updated limit in step 2. > 2. Walking the full hierarchy of the cgroup path (if any), looking for a lower limit than at the leaf. Note that it's not possible to raise the limit set at a path closer to the root via the interface file at a further-to-the-leaf-level. The odd case out seems to be `max` values on some systems (which seems to be the default value). > > As an optimization this hierarchy walk is skipped on containerized systems (like K8S), where the limits are set in interface files at the leaf nodes of the hierarchy. Therefore there should be no change on those systems. > > This patch depends on the Hotspot change implementing the same for the JVM so that `Metrics.isContainerized()` works correctly on affected systems where `-XshowSettings:system` currently reports `System not containerized` due to the missing JVM fix. A test framework for such hierarchical systems has been added in [JDK-8333446](https://bugs.openjdk.org/browse/JDK-8333446). This patch adds a test using that framework among some simpler unit tests. > > Thoughts? > > Testing: > > - [x] GHA > - [x] Container tests on Linux x86_64 on cg v1 and cg v2 systems > - [x] Some manual testing using systemd slices Severin Gehwolf has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 44 commits: - Merge branch 'master' into jdk-8336881-metrics-systemd-slice - JDK-8350103 - Merge branch 'master' into jdk-8336881-metrics-systemd-slice - Merge branch 'master' into jdk-8336881-metrics-systemd-slice - Fix missing imports - Merge branch 'master' into jdk-8336881-metrics-systemd-slice - Merge branch 'master' into jdk-8336881-metrics-systemd-slice - Merge branch 'master' into jdk-8336881-metrics-systemd-slice - Merge branch 'master' into jdk-8336881-metrics-systemd-slice - Merge branch 'master' into jdk-8336881-metrics-systemd-slice - ... and 34 more: https://git.openjdk.org/jdk/compare/6801eb87...32960cd6 ------------- Changes: https://git.openjdk.org/jdk/pull/20280/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=20280&range=16 Stats: 1621 lines in 27 files changed: 1373 ins; 152 del; 96 mod Patch: https://git.openjdk.org/jdk/pull/20280.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/20280/head:pull/20280 PR: https://git.openjdk.org/jdk/pull/20280 From stefank at openjdk.org Tue Apr 1 15:32:17 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Tue, 1 Apr 2025 15:32:17 GMT Subject: RFR: 8341491: Reserve and commit memory operations should be protected by NMT lock In-Reply-To: References:

Message-ID: On Mon, 31 Mar 2025 14:09:22 GMT, Robert Toyonaga wrote: > OK should I update this PR to do the following things: > > * Add comments explaining the asymmetrical locking and warning against patterns that lead to races Sounds like a good idea. > > * swapping the order of `NmtVirtualMemoryLocker` and release/uncommit I wonder if this should be done as new RFE after the change below. It might need a bit of investigation to make sure that the reasoning around this is correct. > > * Fail fatally if release/uncommit does not complete. I think this would be a good, separate RFE to be done before we try to swap the order. > > > Or does it make more sense to do that in a different issue/PR? > > Also, do we want to keep the new tests and the refactorings (see below)? > > ``` > if (MemTracker::enabled()) { > MemTracker::NmtVirtualMemoryLocker nvml; > result = pd_some_operation(addr, bytes); > if (result != nullptr) { > MemTracker::record_some_operation(addr, bytes); > } > } else { > result = pd_unmap_memory(addr, bytes); > } > ``` > > To: > > ``` > MemTracker::NmtVirtualMemoryLocker nvml; > result = pd_unmap_memory(addr, bytes); > MemTracker::record_some_operation(addr, bytes); > ``` My thinking is that after you done (2) above, then you will not need to expose the NMT lock to this level. The code would be: MemTracker::record_some_operation(addr, bytes); // Lock confined inside this pd_unmap_memory(addr, bytes); So, I would wait with this cleanup until we know more about (2). ------------- PR Comment: https://git.openjdk.org/jdk/pull/24084#issuecomment-2769766908 From egor.ushakov at jetbrains.com Tue Apr 1 17:14:43 2025 From: egor.ushakov at jetbrains.com (Egor Ushakov) Date: Tue, 1 Apr 2025 19:14:43 +0200 Subject: Debugger overhead for virtual threads creation Message-ID: Hi everyone! Is it expected that with the debugger attached creating virtual threads is much slower? We're getting bugs like: https://youtrack.jetbrains.com/issue/IDEA-365900 And I can reproduce it easily with jdb... Just attaching the debugger immediately slows down virtual threads creation significantly. >java -agentlib:jdwp=transport=dt_shmem,server=y,suspend=n,address=8000 app ... 6808805 (1.2046688E7 threads per second) ... after >jdb -attach 8000 ... 30215 (95986.055 threads per second) ... Thanks, Egor From amenkov at openjdk.org Tue Apr 1 18:38:33 2025 From: amenkov at openjdk.org (Alex Menkov) Date: Tue, 1 Apr 2025 18:38:33 GMT Subject: RFR: 8323100: com/sun/tools/attach/StartManagementAgent.java failed with "WaitForSingleObject failed" In-Reply-To: References: Message-ID: On Tue, 1 Apr 2025 05:57:55 GMT, David Holmes wrote: > See bug report for gory details. > > Short version: in the Windows version of `VirtualMachineImpl::execute`, if an exception occurred after we created the `SocketInputStreamImpl` (which is the test scenario of the failing test), we would close the native `HANDLE` to the pipe twice. But after the first close the `HANDLE` could be reassigned to another object (e.g. the `_ParkHandle` of the `StreamPumper` thread) and the second close would close that `HANDLE` resulting in the failure of `WaitForSingleObject`. (Other failure modes with different invalid handles have also been seen.) > > Fix: shorten the outer try/catch block so that we only directly close the pipe if the `IOException` happens before we create the `SocketStreamImpl` - after which the closing of the stream will close the pipe `HANDLE`. > > Testing: > - ran the com/sun/tools/attach tets group 2500 times without failure > - tiers 3-5 as a sanity check (Windows only) > > Thanks Marked as reviewed by amenkov (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/24346#pullrequestreview-2733724000 From cjplummer at openjdk.org Tue Apr 1 20:00:22 2025 From: cjplummer at openjdk.org (Chris Plummer) Date: Tue, 1 Apr 2025 20:00:22 GMT Subject: RFR: 8352088: Call of com.sun.jdi.ThreadReference.threadGroups() can lock up target VM [v7] In-Reply-To: References: Message-ID: > Calling ThreadGroupReference.groups() from an event handler can cause a deadlock. Details in first comment. Tested with :jdk_lang on all supported platforms and tier1, tier2, tier3, and tier5 svc testing. Chris Plummer has updated the pull request incrementally with one additional commit since the last revision: minor comment update ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24236/files - new: https://git.openjdk.org/jdk/pull/24236/files/977ecf15..80f75c60 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24236&range=06 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24236&range=05-06 Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/24236.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24236/head:pull/24236 PR: https://git.openjdk.org/jdk/pull/24236 From sspitsyn at openjdk.org Tue Apr 1 20:36:25 2025 From: sspitsyn at openjdk.org (Serguei Spitsyn) Date: Tue, 1 Apr 2025 20:36:25 GMT Subject: RFR: 8352088: Call of com.sun.jdi.ThreadReference.threadGroups() can lock up target VM [v7] In-Reply-To: References:

Message-ID: On Tue, 1 Apr 2025 20:00:22 GMT, Chris Plummer wrote: >> Calling ThreadGroupReference.groups() from an event handler can cause a deadlock. Details in first comment. Tested with :jdk_lang on all supported platforms and tier1, tier2, tier3, and tier5 svc testing. > > Chris Plummer has updated the pull request incrementally with one additional commit since the last revision: > > minor comment update Marked as reviewed by sspitsyn (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/24236#pullrequestreview-2733985461 From amenkov at openjdk.org Tue Apr 1 21:32:35 2025 From: amenkov at openjdk.org (Alex Menkov) Date: Tue, 1 Apr 2025 21:32:35 GMT Subject: RFR: 8353479: jcmd with streaming output breaks intendation Message-ID: `outputStream` implementations should call `update_position` from `write` to correctly handle indentation. The fix adds the call to `attachStream::write` testing: sanity tier1; in progress: tier2..4,hs-tier5-svc ------------- Commit messages: - fix Changes: https://git.openjdk.org/jdk/pull/24368/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24368&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8353479 Stats: 1 line in 1 file changed: 1 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/24368.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24368/head:pull/24368 PR: https://git.openjdk.org/jdk/pull/24368 From amenkov at openjdk.org Tue Apr 1 21:32:35 2025 From: amenkov at openjdk.org (Alex Menkov) Date: Tue, 1 Apr 2025 21:32:35 GMT Subject: RFR: 8353479: jcmd with streaming output breaks intendation In-Reply-To: References: Message-ID: On Tue, 1 Apr 2025 21:23:01 GMT, Alex Menkov wrote: > `outputStream` implementations should call `update_position` from `write` to correctly handle indentation. > The fix adds the call to `attachStream::write` > > testing: sanity tier1; > in progress: tier2..4,hs-tier5-svc Example of output with and without streaming output: $ JAVA_TOOL_OPTIONS=-Djdk.attach.allowStreamingOutput=true jcmd 31544 VM.native_memory Picked up JAVA_TOOL_OPTIONS: -Djdk.attach.allowStreamingOutput=true 31544: Native Memory Tracking: (Omitting categories weighting less than 1KB) Total: reserved=9953148KB, committed=407888KB malloc: 57776KB #293244, peak=75433KB #212390 mmap: reserved=9895372KB, committed=350112KB - Java Heap (reserved=8298496KB, committed=270336KB) (mmap: reserved=8298496KB, committed=270336KB, peak=520192KB) - Class (reserved=1048938KB, committed=2922KB) (classes #3716) ( instance classes #3406, array classes #310) (malloc=362KB #7060 ) (peak=364KB #7068) (mmap: reserved=1048576KB, committed=2560KB, at peak) ( Metadata: ) ( reserved=65536KB, committed=24384KB ) ( used=24223KB) ( waste=161KB =0.66%) ( Class space:) ( reserved=1048576KB, committed=2560KB ) ( used=2423KB) ( waste=137KB =5.36%) $ jcmd 31544 VM.native_memory 31544: Native Memory Tracking: (Omitting categories weighting less than 1KB) Total: reserved=9953152KB, committed=407892KB malloc: 57780KB #293282, peak=75433KB #212390 mmap: reserved=9895372KB, committed=350112KB - Java Heap (reserved=8298496KB, committed=270336KB) (mmap: reserved=8298496KB, committed=270336KB, peak=520192KB) - Class (reserved=1048938KB, committed=2922KB) (classes #3716) ( instance classes #3406, array classes #310) (malloc=362KB #7060) (peak=364KB #7068) (mmap: reserved=1048576KB, committed=2560KB, at peak) ( Metadata: ) ( reserved=65536KB, committed=24384KB) ( used=24223KB) ( waste=161KB =0.66%) ( Class space:) ( reserved=1048576KB, committed=2560KB) ( used=2423KB) ( waste=137KB =5.36%) ------------- PR Comment: https://git.openjdk.org/jdk/pull/24368#issuecomment-2770724520 From chris.plummer at oracle.com Tue Apr 1 23:39:07 2025 From: chris.plummer at oracle.com (Chris Plummer) Date: Tue, 1 Apr 2025 16:39:07 -0700 Subject: Debugger overhead for virtual threads creation In-Reply-To: References: Message-ID: The short answer is yes. The debug agent needs to deal with JVMTI_EVENT_VIRTUAL_THREAD_START/END events for every virtual thread. What makes it worse is when there are a large number of virtual threads that are currently alive. They are tracked on a list of ThreadNodes that starts to slow down debug agent performance when it gets too long. I have a work in progress that proactively purges these ThreadNodes so the list does not get too big.?I've been meaning to revive this project for quite some time. If you have a test case I'd be willing to experiment with these changes some more. I could not access to the IDEA-365900 link you provided. Note I think after the work is done to purge ThreadNodes proactively it might not be that hard of step to move to not needing JVMTI_EVENT_VIRTUAL_THREAD_START/END events enabled, which will help performance a lot more. Chris On 4/1/25 10:14 AM, Egor Ushakov wrote: > Hi everyone! > > Is it expected that with the debugger attached creating virtual > threads is much slower? > We're getting bugs like: https://youtrack.jetbrains.com/issue/IDEA-365900 > And I can reproduce it easily with jdb... > Just attaching the debugger immediately slows down virtual threads > creation significantly. > > >java > -agentlib:jdwp=transport=dt_shmem,server=y,suspend=n,address=8000 app > ... > 6808805 (1.2046688E7 threads per second) > ... > after >jdb -attach 8000 > ... > 30215 (95986.055 threads per second) > ... > > Thanks, > Egor From jpai at openjdk.org Wed Apr 2 01:40:31 2025 From: jpai at openjdk.org (Jaikiran Pai) Date: Wed, 2 Apr 2025 01:40:31 GMT Subject: RFR: 8352088: Call of com.sun.jdi.ThreadReference.threadGroups() can lock up target VM [v7] In-Reply-To: References:

Message-ID: <5kETU88L5YqVVQgMSB7QDo8heC9-R4Ia_JhXt2ZkfVQ=.2ce9a39e-ad21-433d-b724-de1ceb3336f5@github.com> On Tue, 1 Apr 2025 20:00:22 GMT, Chris Plummer wrote: >> Calling ThreadGroupReference.groups() from an event handler can cause a deadlock. Details in first comment. Tested with :jdk_lang on all supported platforms and tier1, tier2, tier3, and tier5 svc testing. > > Chris Plummer has updated the pull request incrementally with one additional commit since the last revision: > > minor comment update Marked as reviewed by jpai (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/24236#pullrequestreview-2734354863 From dholmes at openjdk.org Wed Apr 2 03:00:17 2025 From: dholmes at openjdk.org (David Holmes) Date: Wed, 2 Apr 2025 03:00:17 GMT Subject: RFR: 8323100: com/sun/tools/attach/StartManagementAgent.java failed with "WaitForSingleObject failed" In-Reply-To: References:

Message-ID: On Tue, 1 Apr 2025 18:35:36 GMT, Alex Menkov wrote: >> See bug report for gory details. >> >> Short version: in the Windows version of `VirtualMachineImpl::execute`, if an exception occurred after we created the `SocketInputStreamImpl` (which is the test scenario of the failing test), we would close the native `HANDLE` to the pipe twice. But after the first close the `HANDLE` could be reassigned to another object (e.g. the `_ParkHandle` of the `StreamPumper` thread) and the second close would close that `HANDLE` resulting in the failure of `WaitForSingleObject`. (Other failure modes with different invalid handles have also been seen.) >> >> Fix: shorten the outer try/catch block so that we only directly close the pipe if the `IOException` happens before we create the `SocketStreamImpl` - after which the closing of the stream will close the pipe `HANDLE`. >> >> Testing: >> - ran the com/sun/tools/attach tets group 2500 times without failure >> - tiers 3-5 as a sanity check (Windows only) >> >> Thanks > > Marked as reviewed by amenkov (Reviewer). Thanks for the review @alexmenkov ! ------------- PR Comment: https://git.openjdk.org/jdk/pull/24346#issuecomment-2771215407 From dholmes at openjdk.org Wed Apr 2 03:00:18 2025 From: dholmes at openjdk.org (David Holmes) Date: Wed, 2 Apr 2025 03:00:18 GMT Subject: Integrated: 8323100: com/sun/tools/attach/StartManagementAgent.java failed with "WaitForSingleObject failed" In-Reply-To: References: Message-ID: <6XUN6YuzWuA_WbJMP4CzRUyfFJr3aeozRTUcXJe1pFo=.451b7623-8873-4d35-a826-1c22665f39db@github.com> On Tue, 1 Apr 2025 05:57:55 GMT, David Holmes wrote: > See bug report for gory details. > > Short version: in the Windows version of `VirtualMachineImpl::execute`, if an exception occurred after we created the `SocketInputStreamImpl` (which is the test scenario of the failing test), we would close the native `HANDLE` to the pipe twice. But after the first close the `HANDLE` could be reassigned to another object (e.g. the `_ParkHandle` of the `StreamPumper` thread) and the second close would close that `HANDLE` resulting in the failure of `WaitForSingleObject`. (Other failure modes with different invalid handles have also been seen.) > > Fix: shorten the outer try/catch block so that we only directly close the pipe if the `IOException` happens before we create the `SocketStreamImpl` - after which the closing of the stream will close the pipe `HANDLE`. > > Testing: > - ran the com/sun/tools/attach tets group 2500 times without failure > - tiers 3-5 as a sanity check (Windows only) > > Thanks This pull request has now been integrated. Changeset: e6fe2490 Author: David Holmes URL: https://git.openjdk.org/jdk/commit/e6fe2490bc48acf01ccf81b38d578d20ed09f238 Stats: 26 lines in 1 file changed: 14 ins; 11 del; 1 mod 8323100: com/sun/tools/attach/StartManagementAgent.java failed with "WaitForSingleObject failed" Reviewed-by: kevinw, amenkov ------------- PR: https://git.openjdk.org/jdk/pull/24346 From alanb at openjdk.org Wed Apr 2 06:13:11 2025 From: alanb at openjdk.org (Alan Bateman) Date: Wed, 2 Apr 2025 06:13:11 GMT Subject: RFR: 8352088: Call of com.sun.jdi.ThreadReference.threadGroups() can lock up target VM [v7] In-Reply-To: References:

Message-ID: <5451248a-ded9-4b57-bc4d-23c336adca0f@jetbrains.com> Thanks Chris! I've made the bug?https://youtrack.jetbrains.com/issue/IDEA-365900 visible, there's a reproducer there. Thanks, Egor On 02.04.2025 01:39, Chris Plummer wrote: > The short answer is yes. The debug agent needs to deal with > JVMTI_EVENT_VIRTUAL_THREAD_START/END events for every virtual thread. > What makes it worse is when there are a large number of virtual > threads that are currently alive. They are tracked on a list of > ThreadNodes that starts to slow down debug agent performance when it > gets too long. I have a work in progress that proactively purges these > ThreadNodes so the list does not get too big.?I've been meaning to > revive this project for quite some time. If you have a test case I'd > be willing to experiment with these changes some more. I could not > access to the IDEA-365900 link you provided. > > Note I think after the work is done to purge ThreadNodes proactively > it might not be that hard of step to move to not needing > JVMTI_EVENT_VIRTUAL_THREAD_START/END events enabled, which will help > performance a lot more. > > Chris > > On 4/1/25 10:14 AM, Egor Ushakov wrote: >> Hi everyone! >> >> Is it expected that with the debugger attached creating virtual >> threads is much slower? >> We're getting bugs like: >> https://youtrack.jetbrains.com/issue/IDEA-365900 >> And I can reproduce it easily with jdb... >> Just attaching the debugger immediately slows down virtual threads >> creation significantly. >> >> >java >> -agentlib:jdwp=transport=dt_shmem,server=y,suspend=n,address=8000 app >> ... >> 6808805 (1.2046688E7 threads per second) >> ... >> after >jdb -attach 8000 >> ... >> 30215 (95986.055 threads per second) >> ... >> >> Thanks, >> Egor From jkern at openjdk.org Wed Apr 2 11:05:01 2025 From: jkern at openjdk.org (Joachim Kern) Date: Wed, 2 Apr 2025 11:05:01 GMT Subject: RFR: 8352392: AIX: implement attach API v2 and streaming output [v4] In-Reply-To: References:

Message-ID: On Tue, 1 Apr 2025 10:30:56 GMT, Varada M wrote: >> AIX changes for attach API to support arbitrary length arguments and the streaming output support. >> serviceability/attach/AttachAPIv2/StreamingOutputTest.java test passes >> >> tier1, tier2 and tier3 testing is successful with fastdebug level >> >> JBS Issue : [JDK-8352392](https://bugs.openjdk.org/browse/JDK-8352392) > > Varada M has updated the pull request incrementally with one additional commit since the last revision: > > 8352392: AIX: implement attach API v2 and streaming output Hi Varada, I see that you've largely adapted the code to the POSIX version. Essentially, only the shutdown special handling remains. But the AIX special handling that you removed was introduced for some reason. Do you know why? Does this reasoning no longer apply? I have no idea and can't judge whether the different semantics have created holes in the code that could reappear under certain circumstances. Therefore, I would like an explanation from you as to why you can make this change now. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24177#issuecomment-2772207203 From jsjolen at openjdk.org Wed Apr 2 12:45:59 2025 From: jsjolen at openjdk.org (Johan =?UTF-8?B?U2rDtmxlbg==?=) Date: Wed, 2 Apr 2025 12:45:59 GMT Subject: RFR: 8353479: jcmd with streaming output breaks intendation In-Reply-To: References: Message-ID: On Tue, 1 Apr 2025 21:23:01 GMT, Alex Menkov wrote: > `outputStream` implementations should call `update_position` from `write` to correctly handle indentation. > The fix adds the call to `attachStream::write` > > testing: sanity tier1; > in progress: tier2..4,hs-tier5-svc Thanks! ------------- Marked as reviewed by jsjolen (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24368#pullrequestreview-2736139795 From duke at openjdk.org Wed Apr 2 14:02:25 2025 From: duke at openjdk.org (Robert Toyonaga) Date: Wed, 2 Apr 2025 14:02:25 GMT Subject: RFR: 8341491: Reserve and commit memory operations should be protected by NMT lock [v2] In-Reply-To: References: Message-ID: > ### Summary: > This PR makes memory operations atomic with NMT accounting. > > ### The problem: > In memory related functions like `os::commit_memory` and `os::reserve_memory` the OS memory operations are currently done before acquiring the the NMT mutex. And the the virtual memory accounting is done later in `MemTracker`, after the lock has been acquired. Doing the memory operations outside of the lock scope can lead to races. > > 1.1 Thread_1 releases range_A. > 1.2 Thread_1 tells NMT "range_A has been released". > > 2.1 Thread_2 reserves (the now free) range_A. > 2.2 Thread_2 tells NMT "range_A is reserved". > > Since the sequence (1.1) (1.2) is not atomic, if Thread_2 begins operating after (1.1), we can have (1.1) (2.1) (2.2) (1.2). The OS sees two valid subsequent calls (release range_A, followed by map range_A). But NMT sees "reserve range_A", "release range_A" and is now out of sync with the OS. > > ### Solution: > Where memory operations such as reserve, commit, or release virtual memory happen, I've expanded the scope of `NmtVirtualMemoryLocker` to protect both the NMT accounting and the memory operation itself. > > ### Other notes: > I also simplified this pattern found in many places: > > if (MemTracker::enabled()) { > MemTracker::NmtVirtualMemoryLocker nvml; > result = pd_some_operation(addr, bytes); > if (result != nullptr) { > MemTracker::record_some_operation(addr, bytes); > } > } else { > result = pd_unmap_memory(addr, bytes); > } > ``` > To: > > MemTracker::NmtVirtualMemoryLocker nvml; > result = pd_unmap_memory(addr, bytes); > MemTracker::record_some_operation(addr, bytes); > ``` > This is possible because `NmtVirtualMemoryLocker` now checks `MemTracker::enabled()`. `MemTracker::record_some_operation` already checks `MemTracker::enabled()` and checks against nullptr. This refactoring previously wasn't possible because `ThreadCritical` was used before https://github.com/openjdk/jdk/pull/22745 introduced `NmtVirtualMemoryLocker`. > > I considered moving the locking and NMT accounting down into platform specific code: Ex. lock around { munmap() + MemTracker::record }. The hope was that this would help reduce the size of the critical section. However, I found that the OS-specific "pd_" functions are already short and to-the-point, so doing this wasn't reducing the lock scope very much. Instead it just makes the code more messy by having to maintain the locking and NMT accounting in each platform specific implementation. > > In many places I've done minor refactoring by relocating call... Robert Toyonaga has updated the pull request incrementally with two additional commits since the last revision: - tests and comments - Revert "make memory op and NMT accounting atomic" This reverts commit 86423d0b7e8e2b0b313a686a64c803028a5f2420. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24084/files - new: https://git.openjdk.org/jdk/pull/24084/files/86423d0b..74f31202 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24084&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24084&range=00-01 Stats: 246 lines in 12 files changed: 60 ins; 123 del; 63 mod Patch: https://git.openjdk.org/jdk/pull/24084.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24084/head:pull/24084 PR: https://git.openjdk.org/jdk/pull/24084 From duke at openjdk.org Wed Apr 2 14:06:09 2025 From: duke at openjdk.org (Robert Toyonaga) Date: Wed, 2 Apr 2025 14:06:09 GMT Subject: RFR: 8341491: Reserve and commit memory operations should be protected by NMT lock [v2] In-Reply-To: References:

Message-ID: On Wed, 2 Apr 2025 14:02:25 GMT, Robert Toyonaga wrote: >> ### Summary: >> This PR makes memory operations atomic with NMT accounting. >> >> ### The problem: >> In memory related functions like `os::commit_memory` and `os::reserve_memory` the OS memory operations are currently done before acquiring the the NMT mutex. And the the virtual memory accounting is done later in `MemTracker`, after the lock has been acquired. Doing the memory operations outside of the lock scope can lead to races. >> >> 1.1 Thread_1 releases range_A. >> 1.2 Thread_1 tells NMT "range_A has been released". >> >> 2.1 Thread_2 reserves (the now free) range_A. >> 2.2 Thread_2 tells NMT "range_A is reserved". >> >> Since the sequence (1.1) (1.2) is not atomic, if Thread_2 begins operating after (1.1), we can have (1.1) (2.1) (2.2) (1.2). The OS sees two valid subsequent calls (release range_A, followed by map range_A). But NMT sees "reserve range_A", "release range_A" and is now out of sync with the OS. >> >> ### Solution: >> Where memory operations such as reserve, commit, or release virtual memory happen, I've expanded the scope of `NmtVirtualMemoryLocker` to protect both the NMT accounting and the memory operation itself. >> >> ### Other notes: >> I also simplified this pattern found in many places: >> >> if (MemTracker::enabled()) { >> MemTracker::NmtVirtualMemoryLocker nvml; >> result = pd_some_operation(addr, bytes); >> if (result != nullptr) { >> MemTracker::record_some_operation(addr, bytes); >> } >> } else { >> result = pd_unmap_memory(addr, bytes); >> } >> ``` >> To: >> >> MemTracker::NmtVirtualMemoryLocker nvml; >> result = pd_unmap_memory(addr, bytes); >> MemTracker::record_some_operation(addr, bytes); >> ``` >> This is possible because `NmtVirtualMemoryLocker` now checks `MemTracker::enabled()`. `MemTracker::record_some_operation` already checks `MemTracker::enabled()` and checks against nullptr. This refactoring previously wasn't possible because `ThreadCritical` was used before https://github.com/openjdk/jdk/pull/22745 introduced `NmtVirtualMemoryLocker`. >> >> I considered moving the locking and NMT accounting down into platform specific code: Ex. lock around { munmap() + MemTracker::record }. The hope was that this would help reduce the size of the critical section. However, I found that the OS-specific "pd_" functions are already short and to-the-point, so doing this wasn't reducing the lock scope very much. Instead it just makes the code more messy by having to maintain the locking and NMT accounting in each platform specific i... > > Robert Toyonaga has updated the pull request incrementally with two additional commits since the last revision: > > - tests and comments > - Revert "make memory op and NMT accounting atomic" > > This reverts commit 86423d0b7e8e2b0b313a686a64c803028a5f2420. OK I have reverted the original changes, added comments, and kept the new tests that are still relevant. Please have another look when you have time. I'll go ahead and open RFE's for the topics you suggested above. Thanks! ------------- PR Comment: https://git.openjdk.org/jdk/pull/24084#issuecomment-2772669368 From duke at openjdk.org Wed Apr 2 16:03:04 2025 From: duke at openjdk.org (Larry Cable) Date: Wed, 2 Apr 2025 16:03:04 GMT Subject: Integrated: 8344671: Few JFR streaming tests fail with application not alive error on MacOS 15 In-Reply-To: <3xUroXKNX4bBRb0L4r5WJ9V_TEJRbtS_hmdZ3AMCTFo=.86aaf7a8-d2c1-4f07-9f74-4e2cab2d0fa2@github.com> References: <3xUroXKNX4bBRb0L4r5WJ9V_TEJRbtS_hmdZ3AMCTFo=.86aaf7a8-d2c1-4f07-9f74-4e2cab2d0fa2@github.com> Message-ID: On Mon, 17 Mar 2025 18:26:57 GMT, Larry Cable wrote: > on both Linux and MacOS libattach utilizes UNIX signal (QUIT) to cause a target JVM (attachee) to create the socket file used as transport for subsequent jcmds (and other attach based interactions) and to listen upon that for such. > > it should be noted that the default behavior for QUIT (if not blocked or caught) is to terminate the signalled process. > > during the early lifetime of a JVM, its signal handlers are not yet installed, and thus any signal such as QUIT will cause the > default behavior to occur, in this case the JVM will be terminated. > > this is why some tests are failing with "not alive" > > the "fix" is similar in nature to that already implemented for linux (however using a different OS dependent mechanism to obtain the attachee JVM's signal masks: sysctl(2)). > > the method "checkCatchesAndSendQuitTo" will now obtain the "attachee" JVM signal masks and only kill(QUIT) if the > current masks indicate that the JVM's signals are now being handled. > > the behavior in the success case is now identical to the previous implementation, however should the target JVM not > become "ready" (signal handlers installed) prior to the attach "timeout" occurring the attach operation will throw an > "AttachNotSupportedException" with a suitable error message. > > see also: https://bugs.openjdk.org/browse/JDK-8350766 This pull request has now been integrated. Changeset: d979bd85 Author: Larry Cable Committer: Kevin Walls URL: https://git.openjdk.org/jdk/commit/d979bd859215a16e6398ae627acfd40e8d71102c Stats: 60 lines in 3 files changed: 44 ins; 3 del; 13 mod 8344671: Few JFR streaming tests fail with application not alive error on MacOS 15 Reviewed-by: dholmes, kevinw ------------- PR: https://git.openjdk.org/jdk/pull/24085 From cjplummer at openjdk.org Wed Apr 2 17:07:02 2025 From: cjplummer at openjdk.org (Chris Plummer) Date: Wed, 2 Apr 2025 17:07:02 GMT Subject: RFR: 8352088: Call of com.sun.jdi.ThreadReference.threadGroups() can lock up target VM [v7] In-Reply-To: References:

Message-ID: On Tue, 1 Apr 2025 20:00:22 GMT, Chris Plummer wrote: >> Calling ThreadGroupReference.groups() from an event handler can cause a deadlock. Details in first comment. Tested with :jdk_lang on all supported platforms and tier1, tier2, tier3, and tier5 svc testing. > > Chris Plummer has updated the pull request incrementally with one additional commit since the last revision: > > minor comment update Thanks reviews Sergei, Jai, and Alan ------------- PR Comment: https://git.openjdk.org/jdk/pull/24236#issuecomment-2773202187 From cjplummer at openjdk.org Wed Apr 2 17:07:03 2025 From: cjplummer at openjdk.org (Chris Plummer) Date: Wed, 2 Apr 2025 17:07:03 GMT Subject: Integrated: 8352088: Call of com.sun.jdi.ThreadReference.threadGroups() can lock up target VM In-Reply-To: References: Message-ID: On Tue, 25 Mar 2025 20:36:28 GMT, Chris Plummer wrote: > Calling ThreadGroupReference.groups() from an event handler can cause a deadlock. Details in first comment. Tested with :jdk_lang on all supported platforms and tier1, tier2, tier3, and tier5 svc testing. This pull request has now been integrated. Changeset: cc870d49 Author: Chris Plummer URL: https://git.openjdk.org/jdk/commit/cc870d4960b3e121afc76df546228cda4b600632 Stats: 130 lines in 2 files changed: 128 ins; 0 del; 2 mod 8352088: Call of com.sun.jdi.ThreadReference.threadGroups() can lock up target VM Reviewed-by: alanb, jpai, sspitsyn ------------- PR: https://git.openjdk.org/jdk/pull/24236 From duke at openjdk.org Thu Apr 3 02:56:54 2025 From: duke at openjdk.org (duke) Date: Thu, 3 Apr 2025 02:56:54 GMT Subject: Withdrawn: 8336017: Deprecate java.util.logging.LoggingMXBean, its implementation, and accessor method for removal In-Reply-To: References: Message-ID: On Thu, 23 Jan 2025 15:23:37 GMT, Kevin Walls wrote: > java.util.logging.LoggingMXBean and java.util.logging.LogManager::getLoggingMXBean are deprecated since JDK-8139982 in JDK 9. > > These deprecations should be uprated to state they are for future removal. > > java.util.logging.Logging (implements LoggingMXBean) should also be deprecated for removal. This pull request has been closed without being integrated. ------------- PR: https://git.openjdk.org/jdk/pull/23271 From iklam at openjdk.org Thu Apr 3 04:09:28 2025 From: iklam at openjdk.org (Ioi Lam) Date: Thu, 3 Apr 2025 04:09:28 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output Message-ID: Since [JEP 483: Ahead-of-Time Class Loading & Linking](https://openjdk.org/jeps/483), VM options such as `-XX:AOTCache `are implemented as aliases of "classical" CDS options such as `-XX:SharedArchiveFile`. In anticipation of the [JEP: Ahead-of-time Command Line Ergonomics](https://bugs.openjdk.org/browse/JDK-8350022), we should refactor the code that deals with the AOT options. Specifically, as we expect the JVM to be able to load from an "input AOT cache" and write to an "output AOT cache", we should clearly identify the input and output caches in separate APIs: const char* CDSConfig::input_static_archive_path(); const char* CDSConfig::input_dynamic_archive_path(); const char* CDSConfig::output_archive_path(); This PR also cleans up the code by: - renaming a few function to reflect what they actually do - moving more "config" management code into cdsConfig.cpp There's also a behavioral bug fix: before this PR, `-XX:AOTCache` was handled by the `ergo_init_classic_archive_paths()` function, which allows two files to be specified. E.g., `java -XX:AOTCache=static.jsa:dynamic.jsa`. That's because `-XX:AOTCache` was implemented as an alias of `-XX:SharedArchiveFile`, and the latter allows this usage. However, this behavior is not specified in JEP 483. Allowing two files in -XX:AOTCache will cause unnecessary complexity when we implement [JDK-8353598: Allow AOT cache to be used in training run](https://bugs.openjdk.org/browse/JDK-8353598). Therefore, I added new test cases to disallow the use of two files. This also means that we don't need to modify the already over-complicated `ergo_init_classic_archive_paths()` for the AOT use cases ------------- Depends on: https://git.openjdk.org/jdk/pull/24272 Commit messages: - Minimized changes in ergo_init_classic_archive_paths() - Clean up CDS input/output path handling Changes: https://git.openjdk.org/jdk/pull/24401/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24401&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8353597 Stats: 304 lines in 15 files changed: 156 ins; 55 del; 93 mod Patch: https://git.openjdk.org/jdk/pull/24401.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24401/head:pull/24401 PR: https://git.openjdk.org/jdk/pull/24401 From iklam at openjdk.org Thu Apr 3 04:09:29 2025 From: iklam at openjdk.org (Ioi Lam) Date: Thu, 3 Apr 2025 04:09:29 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output In-Reply-To: References: Message-ID: On Thu, 3 Apr 2025 04:00:59 GMT, Ioi Lam wrote: > Since [JEP 483: Ahead-of-Time Class Loading & Linking](https://openjdk.org/jeps/483), VM options such as `-XX:AOTCache `are implemented as aliases of "classical" CDS options such as `-XX:SharedArchiveFile`. > > In anticipation of the [JEP: Ahead-of-time Command Line Ergonomics](https://bugs.openjdk.org/browse/JDK-8350022), we should refactor the code that deals with the AOT options. Specifically, as we expect the JVM to be able to load from an "input AOT cache" and write to an "output AOT cache", we should clearly identify the input and output caches in separate APIs: > > > const char* CDSConfig::input_static_archive_path(); > const char* CDSConfig::input_dynamic_archive_path(); > const char* CDSConfig::output_archive_path(); > > > This PR also cleans up the code by: > - renaming a few function to reflect what they actually do > - moving more "config" management code into cdsConfig.cpp > > There's also a behavioral bug fix: before this PR, `-XX:AOTCache` was handled by the `ergo_init_classic_archive_paths()` function, which allows two files to be specified. E.g., `java -XX:AOTCache=static.jsa:dynamic.jsa`. That's because `-XX:AOTCache` was implemented as an alias of `-XX:SharedArchiveFile`, and the latter allows this usage. > > However, this behavior is not specified in JEP 483. Allowing two files in -XX:AOTCache will cause unnecessary complexity when we implement [JDK-8353598: Allow AOT cache to be used in training run](https://bugs.openjdk.org/browse/JDK-8353598). Therefore, I added new test cases to disallow the use of two files. This also means that we don't need to modify the already over-complicated `ergo_init_classic_archive_paths()` for the AOT use cases src/hotspot/share/cds/dynamicArchive.cpp line 499: > 497: } > 498: } > 499: Moved to `CDSConfig::prepare_for_dumping()` src/hotspot/share/cds/metaspaceShared.cpp line 795: > 793: assert(CDSConfig::is_dumping_archive(), "sanity"); > 794: CDSConfig::check_unsupported_dumping_module_options(); > 795: } Moved to `CDSConfig::prepare_for_dumping()` ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24401#discussion_r2026141716 PR Review Comment: https://git.openjdk.org/jdk/pull/24401#discussion_r2026142127 From varadam at openjdk.org Thu Apr 3 07:23:05 2025 From: varadam at openjdk.org (Varada M) Date: Thu, 3 Apr 2025 07:23:05 GMT Subject: RFR: 8352392: AIX: implement attach API v2 and streaming output [v4] In-Reply-To: References:

<5451248a-ded9-4b57-bc4d-23c336adca0f@jetbrains.com> Message-ID: Hi Egor, Thank you for reporting this scalability issue when debugger is enabled. It looks like a JVMTI problem and we have some guesses but need some investigation to identify it better. Thanks, Serguei From: serviceability-dev on behalf of Egor Ushakov Date: Wednesday, April 2, 2025 at 3:32?AM To: Chris Plummer , serviceability-dev Subject: Re: Debugger overhead for virtual threads creation Thanks Chris! I've made the bug https://youtrack.jetbrains.com/issue/IDEA-365900 visible, there's a reproducer there. Thanks, Egor On 02.04.2025 01:39, Chris Plummer wrote: > The short answer is yes. The debug agent needs to deal with > JVMTI_EVENT_VIRTUAL_THREAD_START/END events for every virtual thread. > What makes it worse is when there are a large number of virtual > threads that are currently alive. They are tracked on a list of > ThreadNodes that starts to slow down debug agent performance when it > gets too long. I have a work in progress that proactively purges these > ThreadNodes so the list does not get too big. I've been meaning to > revive this project for quite some time. If you have a test case I'd > be willing to experiment with these changes some more. I could not > access to the IDEA-365900 link you provided. > > Note I think after the work is done to purge ThreadNodes proactively > it might not be that hard of step to move to not needing > JVMTI_EVENT_VIRTUAL_THREAD_START/END events enabled, which will help > performance a lot more. > > Chris > > On 4/1/25 10:14 AM, Egor Ushakov wrote: >> Hi everyone! >> >> Is it expected that with the debugger attached creating virtual >> threads is much slower? >> We're getting bugs like: >> https://youtrack.jetbrains.com/issue/IDEA-365900 >> And I can reproduce it easily with jdb... >> Just attaching the debugger immediately slows down virtual threads >> creation significantly. >> >> >java >> -agentlib:jdwp=transport=dt_shmem,server=y,suspend=n,address=8000 app >> ... >> 6808805 (1.2046688E7 threads per second) >> ... >> after >jdb -attach 8000 >> ... >> 30215 (95986.055 threads per second) >> ... >> >> Thanks, >> Egor -------------- next part -------------- An HTML attachment was scrubbed... URL: From jkern at openjdk.org Thu Apr 3 10:23:55 2025 From: jkern at openjdk.org (Joachim Kern) Date: Thu, 3 Apr 2025 10:23:55 GMT Subject: RFR: 8341491: Reserve and commit memory operations should be protected by NMT lock [v2] In-Reply-To: References:

Message-ID: <-x81ydaT2ImEfbGOLisk3XiQDsbln91IUf3jDNA1NDk=.d035465a-cb77-4c88-99aa-f7fc171411d0@github.com> On Tue, 1 Apr 2025 10:30:56 GMT, Varada M wrote: >> AIX changes for attach API to support arbitrary length arguments and the streaming output support. >> serviceability/attach/AttachAPIv2/StreamingOutputTest.java test passes >> >> tier1, tier2 and tier3 testing is successful with fastdebug level >> >> JBS Issue : [JDK-8352392](https://bugs.openjdk.org/browse/JDK-8352392) > > Varada M has updated the pull request incrementally with one additional commit since the last revision: > > 8352392: AIX: implement attach API v2 and streaming output Reasonable change. ------------- Marked as reviewed by jkern (Committer). PR Review: https://git.openjdk.org/jdk/pull/24177#pullrequestreview-2739729179 From mdoerr at openjdk.org Thu Apr 3 14:48:01 2025 From: mdoerr at openjdk.org (Martin Doerr) Date: Thu, 3 Apr 2025 14:48:01 GMT Subject: RFR: 8352392: AIX: implement attach API v2 and streaming output [v4] In-Reply-To: References:

Message-ID: On Tue, 1 Apr 2025 10:30:56 GMT, Varada M wrote: >> AIX changes for attach API to support arbitrary length arguments and the streaming output support. >> serviceability/attach/AttachAPIv2/StreamingOutputTest.java test passes >> >> tier1, tier2 and tier3 testing is successful with fastdebug level >> >> JBS Issue : [JDK-8352392](https://bugs.openjdk.org/browse/JDK-8352392) > > Varada M has updated the pull request incrementally with one additional commit since the last revision: > > 8352392: AIX: implement attach API v2 and streaming output Marked as reviewed by mdoerr (Reviewer). ------------- PR Review: https://git.openjdk.org/jdk/pull/24177#pullrequestreview-2740165191 From duke at openjdk.org Thu Apr 3 15:27:15 2025 From: duke at openjdk.org (Robert Toyonaga) Date: Thu, 3 Apr 2025 15:27:15 GMT Subject: RFR: 8341491: Reserve and commit memory operations should be protected by NMT lock [v3] In-Reply-To: References: Message-ID: > ### Summary: > This PR makes memory operations atomic with NMT accounting. > > ### The problem: > In memory related functions like `os::commit_memory` and `os::reserve_memory` the OS memory operations are currently done before acquiring the the NMT mutex. And the the virtual memory accounting is done later in `MemTracker`, after the lock has been acquired. Doing the memory operations outside of the lock scope can lead to races. > > 1.1 Thread_1 releases range_A. > 1.2 Thread_1 tells NMT "range_A has been released". > > 2.1 Thread_2 reserves (the now free) range_A. > 2.2 Thread_2 tells NMT "range_A is reserved". > > Since the sequence (1.1) (1.2) is not atomic, if Thread_2 begins operating after (1.1), we can have (1.1) (2.1) (2.2) (1.2). The OS sees two valid subsequent calls (release range_A, followed by map range_A). But NMT sees "reserve range_A", "release range_A" and is now out of sync with the OS. > > ### Solution: > Where memory operations such as reserve, commit, or release virtual memory happen, I've expanded the scope of `NmtVirtualMemoryLocker` to protect both the NMT accounting and the memory operation itself. > > ### Other notes: > I also simplified this pattern found in many places: > > if (MemTracker::enabled()) { > MemTracker::NmtVirtualMemoryLocker nvml; > result = pd_some_operation(addr, bytes); > if (result != nullptr) { > MemTracker::record_some_operation(addr, bytes); > } > } else { > result = pd_unmap_memory(addr, bytes); > } > ``` > To: > > MemTracker::NmtVirtualMemoryLocker nvml; > result = pd_unmap_memory(addr, bytes); > MemTracker::record_some_operation(addr, bytes); > ``` > This is possible because `NmtVirtualMemoryLocker` now checks `MemTracker::enabled()`. `MemTracker::record_some_operation` already checks `MemTracker::enabled()` and checks against nullptr. This refactoring previously wasn't possible because `ThreadCritical` was used before https://github.com/openjdk/jdk/pull/22745 introduced `NmtVirtualMemoryLocker`. > > I considered moving the locking and NMT accounting down into platform specific code: Ex. lock around { munmap() + MemTracker::record }. The hope was that this would help reduce the size of the critical section. However, I found that the OS-specific "pd_" functions are already short and to-the-point, so doing this wasn't reducing the lock scope very much. Instead it just makes the code more messy by having to maintain the locking and NMT accounting in each platform specific implementation. > > In many places I've done minor refactoring by relocating call... Robert Toyonaga has updated the pull request incrementally with one additional commit since the last revision: exclude file mapping tests on AIX. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24084/files - new: https://git.openjdk.org/jdk/pull/24084/files/74f31202..5c23a76a Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24084&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24084&range=01-02 Stats: 2 lines in 1 file changed: 2 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/24084.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24084/head:pull/24084 PR: https://git.openjdk.org/jdk/pull/24084 From duke at openjdk.org Thu Apr 3 15:27:16 2025 From: duke at openjdk.org (Robert Toyonaga) Date: Thu, 3 Apr 2025 15:27:16 GMT Subject: RFR: 8341491: Reserve and commit memory operations should be protected by NMT lock [v2] In-Reply-To: References:

Message-ID: On Thu, 3 Apr 2025 10:21:29 GMT, Joachim Kern wrote: > Internal Error (os_aix.cpp:1917), pid=26476938, tid=258 Error: guarantee((vmi)) failed > > This will happen if a `os::pd_commit_memory()` or `os::pd_release_memory()` or `os::pd_uncommit_memory()` is called on memory not allocated with `os::pd_reserve_memory()` or `os::pd_attempt_map_memory_to_file_at()` or `os::pd_attempt_reserve_memory_at()` Thank you for running the tests on AIX. I've excluded the file mapping tests that don't meet that criteria on AIX. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24084#issuecomment-2776162180 From iklam at openjdk.org Thu Apr 3 15:46:31 2025 From: iklam at openjdk.org (Ioi Lam) Date: Thu, 3 Apr 2025 15:46:31 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output [v2] In-Reply-To: References: Message-ID: > Since [JEP 483: Ahead-of-Time Class Loading & Linking](https://openjdk.org/jeps/483), VM options such as `-XX:AOTCache `are implemented as aliases of "classical" CDS options such as `-XX:SharedArchiveFile`. > > In anticipation of the [JEP: Ahead-of-time Command Line Ergonomics](https://bugs.openjdk.org/browse/JDK-8350022), we should refactor the code that deals with the AOT options. Specifically, as we expect the JVM to be able to load from an "input AOT cache" and write to an "output AOT cache", we should clearly identify the input and output caches in separate APIs: > > > const char* CDSConfig::input_static_archive_path(); > const char* CDSConfig::input_dynamic_archive_path(); > const char* CDSConfig::output_archive_path(); > > > This PR also cleans up the code by: > - renaming a few function to reflect what they actually do > - moving more "config" management code into cdsConfig.cpp > > There's also a behavioral bug fix: before this PR, `-XX:AOTCache` was handled by the `ergo_init_classic_archive_paths()` function, which allows two files to be specified. E.g., `java -XX:AOTCache=static.jsa:dynamic.jsa`. That's because `-XX:AOTCache` was implemented as an alias of `-XX:SharedArchiveFile`, and the latter allows this usage. > > However, this behavior is not specified in JEP 483. Allowing two files in -XX:AOTCache will cause unnecessary complexity when we implement [JDK-8353598: Allow AOT cache to be used in training run](https://bugs.openjdk.org/browse/JDK-8353598). Therefore, I added new test cases to disallow the use of two files. This also means that we don't need to modify the already over-complicated `ergo_init_classic_archive_paths()` for the AOT use cases Ioi Lam has updated the pull request incrementally with one additional commit since the last revision: more clean up ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24401/files - new: https://git.openjdk.org/jdk/pull/24401/files/9e17fedb..3ad42a3e Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24401&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24401&range=00-01 Stats: 8 lines in 1 file changed: 4 ins; 0 del; 4 mod Patch: https://git.openjdk.org/jdk/pull/24401.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24401/head:pull/24401 PR: https://git.openjdk.org/jdk/pull/24401 From lmesnik at openjdk.org Thu Apr 3 19:36:55 2025 From: lmesnik at openjdk.org (Leonid Mesnik) Date: Thu, 3 Apr 2025 19:36:55 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output [v2] In-Reply-To: References:

Message-ID: On Thu, 3 Apr 2025 15:46:31 GMT, Ioi Lam wrote: >> Since [JEP 483: Ahead-of-Time Class Loading & Linking](https://openjdk.org/jeps/483), VM options such as `-XX:AOTCache `are implemented as aliases of "classical" CDS options such as `-XX:SharedArchiveFile`. >> >> In anticipation of the [JEP: Ahead-of-time Command Line Ergonomics](https://bugs.openjdk.org/browse/JDK-8350022), we should refactor the code that deals with the AOT options. Specifically, as we expect the JVM to be able to load from an "input AOT cache" and write to an "output AOT cache", we should clearly identify the input and output caches in separate APIs: >> >> >> const char* CDSConfig::input_static_archive_path(); >> const char* CDSConfig::input_dynamic_archive_path(); >> const char* CDSConfig::output_archive_path(); >> >> >> This PR also cleans up the code by: >> - renaming a few function to reflect what they actually do >> - moving more "config" management code into cdsConfig.cpp >> >> There's also a behavioral bug fix: before this PR, `-XX:AOTCache` was handled by the `ergo_init_classic_archive_paths()` function, which allows two files to be specified. E.g., `java -XX:AOTCache=static.jsa:dynamic.jsa`. That's because `-XX:AOTCache` was implemented as an alias of `-XX:SharedArchiveFile`, and the latter allows this usage. >> >> However, this behavior is not specified in JEP 483. Allowing two files in -XX:AOTCache will cause unnecessary complexity when we implement [JDK-8353598: Allow AOT cache to be used in training run](https://bugs.openjdk.org/browse/JDK-8353598). Therefore, I added new test cases to disallow the use of two files. This also means that we don't need to modify the already over-complicated `ergo_init_classic_archive_paths()` for the AOT use cases > > Ioi Lam has updated the pull request incrementally with one additional commit since the last revision: > > more clean up Changes requested by lmesnik (Reviewer). test/hotspot/jtreg/runtime/cds/appcds/AOTFlags.java line 28: > 26: * @test > 27: * @summary "AOT" aliases for traditional CDS command-line options > 28: * @requires vm.cds & vm.compMode != "Xcomp" The test completely ignore external VM flags, so it should have `@requires vm.flagless ` and no Xcomp exclusion is required. ------------- PR Review: https://git.openjdk.org/jdk/pull/24401#pullrequestreview-2740946504 PR Review Comment: https://git.openjdk.org/jdk/pull/24401#discussion_r2027621201 From iklam at openjdk.org Thu Apr 3 20:31:50 2025 From: iklam at openjdk.org (Ioi Lam) Date: Thu, 3 Apr 2025 20:31:50 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output [v3] In-Reply-To: References: Message-ID: > Since [JEP 483: Ahead-of-Time Class Loading & Linking](https://openjdk.org/jeps/483), VM options such as `-XX:AOTCache `are implemented as aliases of "classical" CDS options such as `-XX:SharedArchiveFile`. > > In anticipation of the [JEP: Ahead-of-time Command Line Ergonomics](https://bugs.openjdk.org/browse/JDK-8350022), we should refactor the code that deals with the AOT options. Specifically, as we expect the JVM to be able to load from an "input AOT cache" and write to an "output AOT cache", we should clearly identify the input and output caches in separate APIs: > > > const char* CDSConfig::input_static_archive_path(); > const char* CDSConfig::input_dynamic_archive_path(); > const char* CDSConfig::output_archive_path(); > > > This PR also cleans up the code by: > - renaming a few function to reflect what they actually do > - moving more "config" management code into cdsConfig.cpp > > There's also a behavioral bug fix: before this PR, `-XX:AOTCache` was handled by the `ergo_init_classic_archive_paths()` function, which allows two files to be specified. E.g., `java -XX:AOTCache=static.jsa:dynamic.jsa`. That's because `-XX:AOTCache` was implemented as an alias of `-XX:SharedArchiveFile`, and the latter allows this usage. > > However, this behavior is not specified in JEP 483. Allowing two files in -XX:AOTCache will cause unnecessary complexity when we implement [JDK-8353598: Allow AOT cache to be used in training run](https://bugs.openjdk.org/browse/JDK-8353598). Therefore, I added new test cases to disallow the use of two files. This also means that we don't need to modify the already over-complicated `ergo_init_classic_archive_paths()` for the AOT use cases Ioi Lam has updated the pull request incrementally with one additional commit since the last revision: @lmesnik comments ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24401/files - new: https://git.openjdk.org/jdk/pull/24401/files/3ad42a3e..90b4b688 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24401&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24401&range=01-02 Stats: 2 lines in 1 file changed: 1 ins; 0 del; 1 mod Patch: https://git.openjdk.org/jdk/pull/24401.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24401/head:pull/24401 PR: https://git.openjdk.org/jdk/pull/24401 From chris.plummer at oracle.com Thu Apr 3 20:32:50 2025 From: chris.plummer at oracle.com (Chris Plummer) Date: Thu, 3 Apr 2025 13:32:50 -0700 Subject: Debugger overhead for virtual threads creation In-Reply-To: References:

<5451248a-ded9-4b57-bc4d-23c336adca0f@jetbrains.com> Message-ID: <3ade7875-b4c8-4bf0-82cf-71452caf8df1@oracle.com> I got my prototype ThreadNode cleanup code working again. It seems to be doing the job of keeping the list down to a minimal size, with usually at most a few ThreadNodes on the list. However, that didn't help performance. The reason is because the debug agent normally does not need to traverse the list. It maps jthread to ThreadNode by using JVMTI thread local data, not by searching the list, and the list is doubly linked, so ThreadNodes can be removed quickly. After the above results I talked with Serguei and he confirmed that JVMTI also has a linked list, and there are some occasions where it needs to be traversed, so that likely the bottleneck here. Regarding my suggestion that we may be able to disable JVMTI_EVENT_VIRTUAL_THREAD_START/END, I did experiment with that and when disabled there are no performance issues, so I will look into getting this to work properly. The events need to be enable if there are any ThreadStartRequests or ThreadDeathRequests that do not have the PlatformThreadOnly filter enabled. https://docs.oracle.com/en/java/javase/24/docs/api/jdk.jdi/com/sun/jdi/request/ThreadStartRequest.html#addPlatformThreadsOnlyFilter() Jdb by default uses this filter, and I suspect that IDEA does also. If not, it would get a flood of ThreadStart and ThreadDeath events for all the virtual threads. Chris On 4/3/25 2:15 AM, Serguei Spitsyn wrote: > > Hi Egor, > > Thank you for reporting this scalability issue when debugger is enabled. > It looks like a JVMTI problem and we have some guesses but need some > investigation to identify it better. > > Thanks, > > Serguei > > *From: *serviceability-dev on > behalf of Egor Ushakov > *Date: *Wednesday, April 2, 2025 at 3:32?AM > *To: *Chris Plummer , serviceability-dev > > *Subject: *Re: Debugger overhead for virtual threads creation > > Thanks Chris! > > I've made the bug https://youtrack.jetbrains.com/issue/IDEA-365900 > visible, there's a reproducer there. > > Thanks, > Egor > > On 02.04.2025 01:39, Chris Plummer wrote: > > The short answer is yes. The debug agent needs to deal with > > JVMTI_EVENT_VIRTUAL_THREAD_START/END events for every virtual thread. > > What makes it worse is when there are a large number of virtual > > threads that are currently alive. They are tracked on a list of > > ThreadNodes that starts to slow down debug agent performance when it > > gets too long. I have a work in progress that proactively purges these > > ThreadNodes so the list does not get too big.?I've been meaning to > > revive this project for quite some time. If you have a test case I'd > > be willing to experiment with these changes some more. I could not > > access to the IDEA-365900 link you provided. > > > > Note I think after the work is done to purge ThreadNodes proactively > > it might not be that hard of step to move to not needing > > JVMTI_EVENT_VIRTUAL_THREAD_START/END events enabled, which will help > > performance a lot more. > > > > Chris > > > > On 4/1/25 10:14 AM, Egor Ushakov wrote: > >> Hi everyone! > >> > >> Is it expected that with the debugger attached creating virtual > >> threads is much slower? > >> We're getting bugs like: > >> https://youtrack.jetbrains.com/issue/IDEA-365900 > >> And I can reproduce it easily with jdb... > >> Just attaching the debugger immediately slows down virtual threads > >> creation significantly. > >> > >> >java > >> -agentlib:jdwp=transport=dt_shmem,server=y,suspend=n,address=8000 app > >> ... > >> 6808805 (1.2046688E7 threads per second) > >> ... > >> after >jdb -attach 8000 > >> ... > >> 30215 (95986.055 threads per second) > >> ... > >> > >> Thanks, > >> Egor > -------------- next part -------------- An HTML attachment was scrubbed... URL: From iklam at openjdk.org Thu Apr 3 21:10:51 2025 From: iklam at openjdk.org (Ioi Lam) Date: Thu, 3 Apr 2025 21:10:51 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output [v2] In-Reply-To: References:

Message-ID: On Thu, 3 Apr 2025 19:33:12 GMT, Leonid Mesnik wrote: >> Ioi Lam has updated the pull request incrementally with one additional commit since the last revision: >> >> more clean up > > test/hotspot/jtreg/runtime/cds/appcds/AOTFlags.java line 28: > >> 26: * @test >> 27: * @summary "AOT" aliases for traditional CDS command-line options >> 28: * @requires vm.cds & vm.compMode != "Xcomp" > > The test completely ignore external VM flags, so it should have > `@requires vm.flagless ` > and no Xcomp exclusion is required. Fixed. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24401#discussion_r2027743880 From iklam at openjdk.org Thu Apr 3 21:43:56 2025 From: iklam at openjdk.org (Ioi Lam) Date: Thu, 3 Apr 2025 21:43:56 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output [v2] In-Reply-To: References:

Message-ID: On Thu, 3 Apr 2025 15:56:02 GMT, Vladimir Kozlov wrote: > @iklam one annoying thing in current ergonomic setting for AOTCode flags in mainline is checking which phase we are executing. We agreed before that we should only save/load AOT code when `AOTClassLinking` is on because AOT code needs classes to be preloaded. > > I have to do next checks to enable AOTCode in `CDSConfig::check_vm_args_consistency()`: > > ``` > if (AOTClassLinking && is_using_archive() && !is_dumping_archive() && !FLAG_IS_DEFAULT(AOTCache)) { > FLAG_SET_ERGO_IF_DEFAULT(LoadAOTCode, true); > ... > if (AOTClassLinking && is_dumping_final_static_archive()) { > FLAG_SET_ERGO_IF_DEFAULT(StoreAOTCode, true); > ``` > > First, I am not sure these conditions are correct. > > Second, it would be nice to have simple checks instead: `is_dumping_aot_archive()` and `is_using_aot_archive()`. > > May be also consider it is error if both conditions are true (we don't support updating archive yet). There are a lot of dependencies between different AOT capabilities, and it's hard to control that using global variables. At the point of `CDSConfig::check_vm_args_consistency()`, we don't have complete knowledge whether the AOT cache exists, or whether the cache contains AOT code, or whether the GC compressed oops settings are compatible with the AOT code. In the handling of such "AOT capability flags", I have been using the following pattern: In `CDSConfig::check_vm_args_consistency()` we update the default values of the flags according to their dependencies on other flags. E.g., by specifying `-XX:AOTMode=create`, `AOTClassLinking` and `AOTInvokeDynamicLinking` are enabled by default. if (!FLAG_IS_DEFAULT(AOTMode)) { // Using any form of the new AOTMode switch enables enhanced optimizations. FLAG_SET_ERGO_IF_DEFAULT(AOTClassLinking, true); } if (AOTClassLinking) { // If AOTClassLinking is specified, enable all AOT optimizations by default. FLAG_SET_ERGO_IF_DEFAULT(AOTInvokeDynamicLinking, true); } else { // AOTInvokeDynamicLinking depends on AOTClassLinking. FLAG_SET_ERGO(AOTInvokeDynamicLinking, false); } However, the values of these flags are just advisory. Even if a flag is enabled, the underlying capability may be disabled. For example, `AOTClassLinking` requires the ability of dumping heap objects, which is not available if ZGC is used. Because the dependencies are complex, it's difficult to resolve them statically and set a global boolean variable for each capability. Instead, I have been expressing the dependencies programmatically using accessor functions: bool CDSConfig::is_dumping_aot_linked_classes() { if (is_dumping_preimage_static_archive()) { return false; } else if (is_dumping_dynamic_archive()) { return is_using_full_module_graph() && AOTClassLinking; } else if (is_dumping_static_archive()) { return is_dumping_full_module_graph() && AOTClassLinking; } else { return false; } } bool CDSConfig::is_dumping_invokedynamic() { // Requires is_dumping_aot_linked_classes(). Otherwise the classes of some archived heap // objects used by the archive indy callsites may be replaced at runtime. return AOTInvokeDynamicLinking && is_dumping_aot_linked_classes() && is_dumping_heap(); } I would suggest doing something like this for storing AOT code: bool CDSConfig::is_dumping_aot_code() { return StoreAOTCode && is_dumping_final_static_archive() && is_dumping_aot_linked_classes(); } For loading AOT code, it's simpler. We can do a definite check immediately after the AOT cache has been mapped. This also makes the run-time check efficient (whereas the assembly-time checks can take their time). if (LoadAOTCode && cache has AOT code && vm options are compatible) { CDSConfig::_is_using_aot_code = true; } else { CDSConfig::_is_using_aot_code = false; } inline bool CDSConfig::is_using_aot_code() { return CDSConfig::_is_using_aot_code; } ------------- PR Comment: https://git.openjdk.org/jdk/pull/24401#issuecomment-2776976568 From kvn at openjdk.org Thu Apr 3 22:04:54 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Thu, 3 Apr 2025 22:04:54 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output [v2] In-Reply-To: References:

Message-ID: On Thu, 3 Apr 2025 17:58:30 GMT, Leonid Mesnik wrote: >> Serguei Spitsyn has updated the pull request incrementally with one additional commit since the last revision: >> >> some cleanup > > src/hotspot/share/prims/jvmtiEnv.cpp line 1078: > >> 1076: JvmtiEnv::ResumeThread(jthread thread) { >> 1077: // resume thread with handshake >> 1078: ResumeThreadClosure op(/* single_resume */ true); > > Could you please explain how thread is protected from racing with mounting<->unmounting operations with resume_thread operations? > It might be unlikely happens for suspended threads, but for alive threads the results are not defined. Thank you for the question. The `JvmtiHanshake::execute()` has a `JvmtiVTMSTransitionDisabler` installed: JvmtiHandshake::execute(JvmtiUnitedHandshakeClosure* hs_cl, jthread target) { JavaThread* current = JavaThread::current(); HandleMark hm(current); JvmtiVTMSTransitionDisabler disabler(target); <= !!!!!!! . . . > src/hotspot/share/prims/jvmtiEnvBase.cpp line 1759: > >> 1757: Handle thread_h(current, thread_oop); >> 1758: bool is_virtual = java_lang_VirtualThread::is_instance(thread_h()); >> 1759: bool is_thread_carrying = is_thread_carrying_vthread(java_thread, thread_h()); > > I think that somewhere in this place should be an explanation of suspend<->resume synchronization. As I understand the hadshake can't be executed and clear suspend state while suspend_thread is done for the same thread. How it is guaranteed that suspend_thread flag cann't be updated? > It is not obvious and also put some restrictions on the suspend_thread implementation to keep this behaviour. Thank you for reviewing and this suggestion. Yes, you are right. I'll try to find a good place to add such a comment. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24269#discussion_r2028158825 PR Review Comment: https://git.openjdk.org/jdk/pull/24269#discussion_r2028161088 From varadam at openjdk.org Fri Apr 4 06:43:55 2025 From: varadam at openjdk.org (Varada M) Date: Fri, 4 Apr 2025 06:43:55 GMT Subject: RFR: 8352392: AIX: implement attach API v2 and streaming output [v4] In-Reply-To: References:

Message-ID: On Tue, 1 Apr 2025 10:30:56 GMT, Varada M wrote: >> AIX changes for attach API to support arbitrary length arguments and the streaming output support. >> serviceability/attach/AttachAPIv2/StreamingOutputTest.java test passes >> >> tier1, tier2 and tier3 testing is successful with fastdebug level >> >> JBS Issue : [JDK-8352392](https://bugs.openjdk.org/browse/JDK-8352392) > > Varada M has updated the pull request incrementally with one additional commit since the last revision: > > 8352392: AIX: implement attach API v2 and streaming output Thanks all, ------------- PR Comment: https://git.openjdk.org/jdk/pull/24177#issuecomment-2777687525 From varadam at openjdk.org Fri Apr 4 06:43:56 2025 From: varadam at openjdk.org (Varada M) Date: Fri, 4 Apr 2025 06:43:56 GMT Subject: Integrated: 8352392: AIX: implement attach API v2 and streaming output In-Reply-To: References: Message-ID: On Sun, 23 Mar 2025 14:33:36 GMT, Varada M wrote: > AIX changes for attach API to support arbitrary length arguments and the streaming output support. > serviceability/attach/AttachAPIv2/StreamingOutputTest.java test passes > > tier1, tier2 and tier3 testing is successful with fastdebug level > > JBS Issue : [JDK-8352392](https://bugs.openjdk.org/browse/JDK-8352392) This pull request has now been integrated. Changeset: 41d4a0d7 Author: Varada M URL: https://git.openjdk.org/jdk/commit/41d4a0d7bdda2a96af1e7f549c05d99d68c040dc Stats: 283 lines in 3 files changed: 64 ins; 203 del; 16 mod 8352392: AIX: implement attach API v2 and streaming output Reviewed-by: mdoerr, jkern, amenkov ------------- PR: https://git.openjdk.org/jdk/pull/24177 From serb at openjdk.org Fri Apr 4 09:29:56 2025 From: serb at openjdk.org (Sergey Bylokhov) Date: Fri, 4 Apr 2025 09:29:56 GMT Subject: RFR: 8344671: Few JFR streaming tests fail with application not alive error on MacOS 15 [v5] In-Reply-To: References: <3xUroXKNX4bBRb0L4r5WJ9V_TEJRbtS_hmdZ3AMCTFo=.86aaf7a8-d2c1-4f07-9f74-4e2cab2d0fa2@github.com> Message-ID: On Mon, 31 Mar 2025 18:02:12 GMT, Larry Cable wrote: >> on both Linux and MacOS libattach utilizes UNIX signal (QUIT) to cause a target JVM (attachee) to create the socket file used as transport for subsequent jcmds (and other attach based interactions) and to listen upon that for such. >> >> it should be noted that the default behavior for QUIT (if not blocked or caught) is to terminate the signalled process. >> >> during the early lifetime of a JVM, its signal handlers are not yet installed, and thus any signal such as QUIT will cause the >> default behavior to occur, in this case the JVM will be terminated. >> >> this is why some tests are failing with "not alive" >> >> the "fix" is similar in nature to that already implemented for linux (however using a different OS dependent mechanism to obtain the attachee JVM's signal masks: sysctl(2)). >> >> the method "checkCatchesAndSendQuitTo" will now obtain the "attachee" JVM signal masks and only kill(QUIT) if the >> current masks indicate that the JVM's signals are now being handled. >> >> the behavior in the success case is now identical to the previous implementation, however should the target JVM not >> become "ready" (signal handlers installed) prior to the attach "timeout" occurring the attach operation will throw an >> "AttachNotSupportedException" with a suitable error message. >> >> see also: https://bugs.openjdk.org/browse/JDK-8350766 > > Larry Cable has updated the pull request incrementally with two additional commits since the last revision: > > - Merge branch 'JDK-8344671' of github.com:larry-cable/jdk into JDK-8344671 > - JDK-8334671: minor changes requested by @dholmes src/jdk.attach/macosx/native/libattach/VirtualMachineImpl.c line 118: > 116: * Signature: (I)V > 117: */ > 118: JNIEXPORT jboolean JNICALL Java_sun_tools_attach_VirtualMachineImpl_checkCatchesAndSendQuitTo I?m just curious - why does this method return a boolean? It looks like the result is never actually checked, and the control flow is managed through exceptions or errors(on both linux and macos). ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24085#discussion_r2028449674 From iklam at openjdk.org Sun Apr 6 21:30:49 2025 From: iklam at openjdk.org (Ioi Lam) Date: Sun, 6 Apr 2025 21:30:49 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output [v2] In-Reply-To: References:

Message-ID: On Thu, 3 Apr 2025 21:40:48 GMT, Ioi Lam wrote: >> @iklam one annoying thing in current ergonomic setting for AOTCode flags in mainline is checking which phase we are executing. We agreed before that we should only save/load AOT code when `AOTClassLinking` is on because AOT code needs classes to be preloaded. >> >> I have to do next checks to enable AOTCode in `CDSConfig::check_vm_args_consistency()`: >> >> if (AOTClassLinking && is_using_archive() && !is_dumping_archive() && !FLAG_IS_DEFAULT(AOTCache)) { >> FLAG_SET_ERGO_IF_DEFAULT(LoadAOTCode, true); >> ... >> if (AOTClassLinking && is_dumping_final_static_archive()) { >> FLAG_SET_ERGO_IF_DEFAULT(StoreAOTCode, true); >> >> >> First, I am not sure these conditions are correct. >> >> Second, it would be nice to have simple checks instead: `is_dumping_aot_archive()` and `is_using_aot_archive()`. >> >> May be also consider it is error if both conditions are true (we don't support updating archive yet). > >> @iklam one annoying thing in current ergonomic setting for AOTCode flags in mainline is checking which phase we are executing. We agreed before that we should only save/load AOT code when `AOTClassLinking` is on because AOT code needs classes to be preloaded. >> >> I have to do next checks to enable AOTCode in `CDSConfig::check_vm_args_consistency()`: >> >> ``` >> if (AOTClassLinking && is_using_archive() && !is_dumping_archive() && !FLAG_IS_DEFAULT(AOTCache)) { >> FLAG_SET_ERGO_IF_DEFAULT(LoadAOTCode, true); >> ... >> if (AOTClassLinking && is_dumping_final_static_archive()) { >> FLAG_SET_ERGO_IF_DEFAULT(StoreAOTCode, true); >> ``` >> >> First, I am not sure these conditions are correct. >> >> Second, it would be nice to have simple checks instead: `is_dumping_aot_archive()` and `is_using_aot_archive()`. >> >> May be also consider it is error if both conditions are true (we don't support updating archive yet). > > There are a lot of dependencies between different AOT capabilities, and it's hard to control that using global variables. At the point of `CDSConfig::check_vm_args_consistency()`, we don't have complete knowledge whether the AOT cache exists, or whether the cache contains AOT code, or whether the GC compressed oops settings are compatible with the AOT code. > > In the handling of such "AOT capability flags", I have been using the following pattern: > > In `CDSConfig::check_vm_args_consistency()` we update the default values of the flags according to their dependencies on other flags. E.g., by specifying `-XX:AOTMode=create`, `AOTClassLinking` and `AOTInvokeDynamicLinking` are enabled by default. > > > if (!FLAG_IS_DEFAULT(AOTMode)) { > // Using any form of the new AOTMode switch enables enhanced optimizations. > FLAG_SET_ERGO_IF_DEFAULT(AOTClassLinking, true); > } > > if (AOTClassLinking) { > // If AOTClassLinking is specified, enable all AOT optimizations by default. > FLAG_SET_ERGO_IF_DEFAULT(AOTInvokeDynamicLinking, true); > } else { > // AOTInvokeDynamicLinking depends on AOTClassLinking. > FLAG_SET_ERGO(AOTInvokeDynamicLinking, false); > } > > > However, the values of these flags are just advisory. Even if a flag is enabled, the underlying capability may be disabled. For example, `AOTClassLinking` requires the ability of dumping heap objects, which is not available if ZGC is used. > > Because the dependencies are complex, it's difficult to resolve them statically and set a global boolean variable for each capability. Instead, I have been expres... > Thank you @iklam for explanation. I can do final adjustment to `Store|LoadAOTCode` flags values in `StoreAOTCode::initialize()` which is called from `initialize_shared_spaces()`: > > ``` > MetaspaceShared::initialize_shared_spaces() { > ... > static_mapinfo->patch_heap_embedded_pointers(); > ArchiveHeapLoader::finish_initialization(); > Universe::load_archived_object_instances(); > + AOTCodeCache::initialize(); > ``` > > The question: at this place are all CDS AOT flags are final (flags compatibility and cache presence are verified)? > > Note, `Store|LoadAOTCode` flags are diagnostic and disabled by default. I need to set them to `true` somewhere. Yes, at this point all configuration related to AOT should be final. You can set the final values for the `Store|LoadAOTCode` flags here. `StoreAOTCode` should be true only if `CDSConfig::is_dumping_final_static_archive()` is true. `LoadAOTCode` should be true only if `CDSConfig::is_loading_archive()` is true and the archive contains AOT code. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24401#issuecomment-2781681413 From dholmes at openjdk.org Mon Apr 7 07:39:54 2025 From: dholmes at openjdk.org (David Holmes) Date: Mon, 7 Apr 2025 07:39:54 GMT Subject: RFR: 8353231: Test com/sun/management/OperatingSystemMXBean/GetProcessCpuLoad still fails intermittently In-Reply-To: References: Message-ID: On Tue, 1 Apr 2025 10:27:39 GMT, Kevin Walls wrote: > Test com/sun/management/OperatingSystemMXBean/GetProcessCpuLoad still fails intermittently. > On failure, 10 attempts with sleep(200) each time, only read -1 from mbean.getProcessCpuLoad(). > The method is documented to return -1 when info is not available, but want to avoid the test accepting a -1 and masking real problems. > > Test failures are happening when multiple CPU load reding tests ran on the same host, at the same second. > Add a TEST.properties file containing: exclusiveAccess.dirs=. Okay lets give this a try. Thanks ------------- Marked as reviewed by dholmes (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24352#pullrequestreview-2745797954 From kevinw at openjdk.org Mon Apr 7 09:28:38 2025 From: kevinw at openjdk.org (Kevin Walls) Date: Mon, 7 Apr 2025 09:28:38 GMT Subject: RFR: 8353727: HeapDumpPath doesn't expand %p Message-ID: This is a long-standing oversight: HeapDumpPath does not recognise %p for pid expansion. The default filename uses a pid (e.g. java_pid1676937.hprof) but HeapDumpPath does not. It has always done a manual "root plus pid plus extension" on the default filename only, and should move to using Argument::copy_expand_pid() like we do with other such filenames. We also assumed the default filename is not a directory (which is very very likely, but doesn't have to be true). ------------- Commit messages: - 8353727: HeapDumpPath doesn't expand %p Changes: https://git.openjdk.org/jdk/pull/24482/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24482&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8353727 Stats: 73 lines in 2 files changed: 26 ins; 23 del; 24 mod Patch: https://git.openjdk.org/jdk/pull/24482.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24482/head:pull/24482 PR: https://git.openjdk.org/jdk/pull/24482 From jkern at openjdk.org Mon Apr 7 09:58:50 2025 From: jkern at openjdk.org (Joachim Kern) Date: Mon, 7 Apr 2025 09:58:50 GMT Subject: RFR: 8341491: Reserve and commit memory operations should be protected by NMT lock [v3] In-Reply-To: References:

Message-ID: On Thu, 3 Apr 2025 15:27:15 GMT, Robert Toyonaga wrote: >> ### Summary: >> This PR makes memory operations atomic with NMT accounting. >> >> ### The problem: >> In memory related functions like `os::commit_memory` and `os::reserve_memory` the OS memory operations are currently done before acquiring the the NMT mutex. And the the virtual memory accounting is done later in `MemTracker`, after the lock has been acquired. Doing the memory operations outside of the lock scope can lead to races. >> >> 1.1 Thread_1 releases range_A. >> 1.2 Thread_1 tells NMT "range_A has been released". >> >> 2.1 Thread_2 reserves (the now free) range_A. >> 2.2 Thread_2 tells NMT "range_A is reserved". >> >> Since the sequence (1.1) (1.2) is not atomic, if Thread_2 begins operating after (1.1), we can have (1.1) (2.1) (2.2) (1.2). The OS sees two valid subsequent calls (release range_A, followed by map range_A). But NMT sees "reserve range_A", "release range_A" and is now out of sync with the OS. >> >> ### Solution: >> Where memory operations such as reserve, commit, or release virtual memory happen, I've expanded the scope of `NmtVirtualMemoryLocker` to protect both the NMT accounting and the memory operation itself. >> >> ### Other notes: >> I also simplified this pattern found in many places: >> >> if (MemTracker::enabled()) { >> MemTracker::NmtVirtualMemoryLocker nvml; >> result = pd_some_operation(addr, bytes); >> if (result != nullptr) { >> MemTracker::record_some_operation(addr, bytes); >> } >> } else { >> result = pd_unmap_memory(addr, bytes); >> } >> ``` >> To: >> >> MemTracker::NmtVirtualMemoryLocker nvml; >> result = pd_unmap_memory(addr, bytes); >> MemTracker::record_some_operation(addr, bytes); >> ``` >> This is possible because `NmtVirtualMemoryLocker` now checks `MemTracker::enabled()`. `MemTracker::record_some_operation` already checks `MemTracker::enabled()` and checks against nullptr. This refactoring previously wasn't possible because `ThreadCritical` was used before https://github.com/openjdk/jdk/pull/22745 introduced `NmtVirtualMemoryLocker`. >> >> I considered moving the locking and NMT accounting down into platform specific code: Ex. lock around { munmap() + MemTracker::record }. The hope was that this would help reduce the size of the critical section. However, I found that the OS-specific "pd_" functions are already short and to-the-point, so doing this wasn't reducing the lock scope very much. Instead it just makes the code more messy by having to maintain the locking and NMT accounting in each platform specific i... > > Robert Toyonaga has updated the pull request incrementally with one additional commit since the last revision: > > exclude file mapping tests on AIX. I ran the tests over the weekend again and now they passed. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24084#issuecomment-2782761982 From kevinw at openjdk.org Mon Apr 7 11:36:55 2025 From: kevinw at openjdk.org (Kevin Walls) Date: Mon, 7 Apr 2025 11:36:55 GMT Subject: RFR: 8353231: Test com/sun/management/OperatingSystemMXBean/GetProcessCpuLoad still fails intermittently In-Reply-To: References: Message-ID: On Tue, 1 Apr 2025 10:27:39 GMT, Kevin Walls wrote: > Test com/sun/management/OperatingSystemMXBean/GetProcessCpuLoad still fails intermittently. > On failure, 10 attempts with sleep(200) each time, only read -1 from mbean.getProcessCpuLoad(). > The method is documented to return -1 when info is not available, but want to avoid the test accepting a -1 and masking real problems. > > Test failures are happening when multiple CPU load reding tests ran on the same host, at the same second. > Add a TEST.properties file containing: exclusiveAccess.dirs=. Thanks David! ------------- PR Comment: https://git.openjdk.org/jdk/pull/24352#issuecomment-2783014959 From kevinw at openjdk.org Mon Apr 7 11:36:55 2025 From: kevinw at openjdk.org (Kevin Walls) Date: Mon, 7 Apr 2025 11:36:55 GMT Subject: Integrated: 8353231: Test com/sun/management/OperatingSystemMXBean/GetProcessCpuLoad still fails intermittently In-Reply-To: References: Message-ID: On Tue, 1 Apr 2025 10:27:39 GMT, Kevin Walls wrote: > Test com/sun/management/OperatingSystemMXBean/GetProcessCpuLoad still fails intermittently. > On failure, 10 attempts with sleep(200) each time, only read -1 from mbean.getProcessCpuLoad(). > The method is documented to return -1 when info is not available, but want to avoid the test accepting a -1 and masking real problems. > > Test failures are happening when multiple CPU load reding tests ran on the same host, at the same second. > Add a TEST.properties file containing: exclusiveAccess.dirs=. This pull request has now been integrated. Changeset: e8c9e5c6 Author: Kevin Walls URL: https://git.openjdk.org/jdk/commit/e8c9e5c6cd3c844765c27c068022a018914fdf4e Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod 8353231: Test com/sun/management/OperatingSystemMXBean/GetProcessCpuLoad still fails intermittently Reviewed-by: dholmes ------------- PR: https://git.openjdk.org/jdk/pull/24352 From zgu at openjdk.org Mon Apr 7 12:35:53 2025 From: zgu at openjdk.org (Zhengyu Gu) Date: Mon, 7 Apr 2025 12:35:53 GMT Subject: RFR: 8353727: HeapDumpPath doesn't expand %p In-Reply-To: References: Message-ID: On Mon, 7 Apr 2025 09:05:34 GMT, Kevin Walls wrote: > This is a long-standing oversight: HeapDumpPath does not recognise %p for pid expansion. > The default filename uses a pid (e.g. java_pid1676937.hprof) but HeapDumpPath does not. > It has always done a manual "root plus pid plus extension" on the default filename only, and > should move to using Argument::copy_expand_pid() like we do with other such filenames. > > > We also assumed the default filename is not a directory (which is very very likely, but doesn't have to be true). @kevinjwalls I have [JDK-8349083](https://bugs.openjdk.org/browse/JDK-8349083) to address similar issues. AFAICT, there are 3 separate code to handle filename expansion and logging has the most complete support, It will be nice to unify them. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24482#issuecomment-2783181347 From kevinw at openjdk.org Mon Apr 7 15:01:34 2025 From: kevinw at openjdk.org (Kevin Walls) Date: Mon, 7 Apr 2025 15:01:34 GMT Subject: RFR: 8353727: HeapDumpPath doesn't expand %p In-Reply-To: References:

Message-ID: On Mon, 7 Apr 2025 12:32:52 GMT, Zhengyu Gu wrote: > @kevinjwalls I have [JDK-8349083](https://bugs.openjdk.org/browse/JDK-8349083) to address similar issues. > > AFAICT, there are 3 separate code to handle filename expansion and logging has the most complete support, It will be nice to unify them. Hi, thanks for the pointer. Yes, we have some duplication in this area... This change is quite small, and removes one duplicate, the manual "base+pid+extension" creation of the filename in HeapDumper. I am looking at the other PR, maybe we can make them share more in future... ------------- PR Comment: https://git.openjdk.org/jdk/pull/24482#issuecomment-2783639159 From heidinga at openjdk.org Mon Apr 7 17:01:56 2025 From: heidinga at openjdk.org (Dan Heidinga) Date: Mon, 7 Apr 2025 17:01:56 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output [v3] In-Reply-To: References:

Message-ID: <1ssLzAJVy1yrmNRCmTCz---JPV7_cLWOLwu0A3-yhZw=.02dbfc81-0a2e-4a4a-a7c1-bae8a93af621@github.com> On Thu, 3 Apr 2025 20:31:50 GMT, Ioi Lam wrote: >> Since [JEP 483: Ahead-of-Time Class Loading & Linking](https://openjdk.org/jeps/483), VM options such as `-XX:AOTCache `are implemented as aliases of "classical" CDS options such as `-XX:SharedArchiveFile`. >> >> In anticipation of the [JEP: Ahead-of-time Command Line Ergonomics](https://bugs.openjdk.org/browse/JDK-8350022), we should refactor the code that deals with the AOT options. Specifically, as we expect the JVM to be able to load from an "input AOT cache" and write to an "output AOT cache", we should clearly identify the input and output caches in separate APIs: >> >> >> const char* CDSConfig::input_static_archive_path(); >> const char* CDSConfig::input_dynamic_archive_path(); >> const char* CDSConfig::output_archive_path(); >> >> >> This PR also cleans up the code by: >> - renaming a few function to reflect what they actually do >> - moving more "config" management code into cdsConfig.cpp >> >> There's also a behavioral bug fix: before this PR, `-XX:AOTCache` was handled by the `ergo_init_classic_archive_paths()` function, which allows two files to be specified. E.g., `java -XX:AOTCache=static.jsa:dynamic.jsa`. That's because `-XX:AOTCache` was implemented as an alias of `-XX:SharedArchiveFile`, and the latter allows this usage. >> >> However, this behavior is not specified in JEP 483. Allowing two files in -XX:AOTCache will cause unnecessary complexity when we implement [JDK-8353598: Allow AOT cache to be used in training run](https://bugs.openjdk.org/browse/JDK-8353598). Therefore, I added new test cases to disallow the use of two files. This also means that we don't need to modify the already over-complicated `ergo_init_classic_archive_paths()` for the AOT use cases > > Ioi Lam has updated the pull request incrementally with one additional commit since the last revision: > > @lmesnik comments src/hotspot/share/cds/cdsConfig.cpp line 598: > 596: // - SharedArchiveFile is not specified and the VM doesn't have a compatible default archive > 597: > 598: #define __THEMSG " is unsupported when base CDS archive is not loaded. Run with -Xlog:cds for more info." Do we want to start transitioning existing `-Xlog:cds` options to be `:aot` options? I think making the switch would match out long term direction ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24401#discussion_r2031644463 From amenkov at openjdk.org Mon Apr 7 18:06:49 2025 From: amenkov at openjdk.org (Alex Menkov) Date: Mon, 7 Apr 2025 18:06:49 GMT Subject: RFR: 8353485: Jcms should allow to specify streaming_output mode Message-ID: The fix adds `--streaming_output` jcmd option to manage attach command streaming output. Testing: tier1..tier4,hs-tier5-svc ------------- Commit messages: - jcmd_streaming Changes: https://git.openjdk.org/jdk/pull/24494/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24494&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8353485 Stats: 172 lines in 9 files changed: 144 ins; 7 del; 21 mod Patch: https://git.openjdk.org/jdk/pull/24494.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24494/head:pull/24494 PR: https://git.openjdk.org/jdk/pull/24494 From cjplummer at openjdk.org Tue Apr 8 01:41:39 2025 From: cjplummer at openjdk.org (Chris Plummer) Date: Tue, 8 Apr 2025 01:41:39 GMT Subject: RFR: 8353727: HeapDumpPath doesn't expand %p In-Reply-To: References: Message-ID: On Mon, 7 Apr 2025 09:05:34 GMT, Kevin Walls wrote: > This is a long-standing oversight: HeapDumpPath does not recognise %p for pid expansion. > The default filename uses a pid (e.g. java_pid1676937.hprof) but HeapDumpPath does not. > It has always done a manual "root plus pid plus extension" on the default filename only, and > should move to using Argument::copy_expand_pid() like we do with other such filenames. > > > We also assumed the default filename is not a directory (which is very very likely, but doesn't have to be true). src/hotspot/share/services/heapDumper.cpp line 2772: > 2770: > 2771: // Set base path (name or directory, default or custom, without seq no), doing %p substitution. > 2772: const char *path_src = (HeapDumpPath && HeapDumpPath[0] != '\0') ? HeapDumpPath : dump_file_name; Should be `HeapDumpPath != nullptr`. src/hotspot/share/services/heapDumper.cpp line 2792: > 2790: // Path is a directory. Append the default name, with %p substitution. Use my_path temporarily. > 2791: if (!Arguments::copy_expand_pid(dump_file_name, strlen(dump_file_name), my_path, JVM_MAXPATHLEN)) { > 2792: warning("Cannot create heap dump file. HeapDumpPath is too long."); What is going to be the end result of this? A truncated file name? test/hotspot/jtreg/runtime/ErrorHandling/TestHeapDumpOnOutOfMemoryError.java line 101: > 99: File dump = new File(heapdumpFilename); > 100: Asserts.assertTrue(dump.exists() && dump.isFile(), "Could not find dump file " + dump.getAbsolutePath()); > 101: I think you can remove this empty line, especially since you don't have one in the similar code below. test/hotspot/jtreg/runtime/ErrorHandling/TestHeapDumpOnOutOfMemoryError.java line 113: > 111: TestHeapDumpOnOutOfMemoryError.class.getName(), type); > 112: > 113: Process proc = pb.start(); No need differ from the above code here. You can just use OutputAnalyzer.pid() to get the pid in the code below. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24482#discussion_r2032238479 PR Review Comment: https://git.openjdk.org/jdk/pull/24482#discussion_r2032243030 PR Review Comment: https://git.openjdk.org/jdk/pull/24482#discussion_r2032233983 PR Review Comment: https://git.openjdk.org/jdk/pull/24482#discussion_r2032234654 From jiangli at openjdk.org Tue Apr 8 02:31:57 2025 From: jiangli at openjdk.org (Jiangli Zhou) Date: Tue, 8 Apr 2025 02:31:57 GMT Subject: RFR: 8353938: hotspot/jtreg/serviceability/dcmd/jvmti/LoadAgentDcmdTest.java fails on static JDK Message-ID: Please review this PR that changes `LoadAgentDcmdTest.getLibInstrumentPath()` to not locate `libinstrument` shared library if running on static JDK, instead just returns "libinstrument." directly. Both test case #1 and #2 in `LoadAgentDcmdTest.run()` run ok on static JDK with the change: Commands: Test case #1: `JVMTI.agent_load libinstrument.so agent.jar` Test case #2: `JVMTI.agent_load libinstrument.so "agent.jar=foo=bar"` Additional notes/considerations: 1. I notice [JDKToolFinder.getJDKTool()](https://github.com/openjdk/jdk/blob/80ff7b9c9406c7845ecb3bc40910e92ccdd23ff2/test/lib/jdk/test/lib/JDKToolFinder.java#L41) is used by the test to find the `jcmd` tool. `JDKToolFinder.getJDKTool()` looks for the requested tool in both `test.jdk` and `compile.jdk`. When running the jtreg test on static JDK, it's able to locate the `jcmd` from `compile.jdk` even though the static JDK binary (`test.jdk`) does not provide the tool. For tools used by jtreg tests at runtime, how about change to always do that? 2. From https://docs.oracle.com/en/java/javase/21/docs/specs/man/jcmd.html: JVMTI.agent_load [arguments] Loads JVMTI native agent. Impact: Low arguments: library path: Absolute path of the JVMTI agent to load. (STRING, no default value) agent option: (Optional) Option string to pass the agent. (STRING, no default value) The command spec requires the absolute path of the JVMTI agent for the `library path` argument. On static JDK, if the agent library is built-in (statically linked), passing the shared library name works and allows the VM to find the built-in agent. There would be no need to specify the absolute path. Please see ?https://bugs.openjdk.org/browse/JDK-8353938?focusedId=14767737&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-14767737 for more details. ------------- Commit messages: - Replace "so" with Platform.sharedLibraryExt(). - Change LoadAgentDcmdTest.getLibInstrumentPath() to return "libinstrument.so" directly without locating the shared library if running on static JDK. Changes: https://git.openjdk.org/jdk/pull/24497/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24497&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8353938 Stats: 6 lines in 1 file changed: 6 ins; 0 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/24497.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24497/head:pull/24497 PR: https://git.openjdk.org/jdk/pull/24497 From sspitsyn at openjdk.org Tue Apr 8 03:16:30 2025 From: sspitsyn at openjdk.org (Serguei Spitsyn) Date: Tue, 8 Apr 2025 03:16:30 GMT Subject: RFR: 8316682: serviceability/jvmti/vthread/SelfSuspendDisablerTest timed out [v3] In-Reply-To: References: Message-ID: > This fixes the issue with lack of synchronization between JVMTI thread suspend and resume functions in a self-suspend case. More detailed fix description is in the first PR comment. > > Testing: Ran mach5 tiers 1-6. Serguei Spitsyn has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision: - Merge - some cleanup - 8316682: serviceability/jvmti/vthread/SelfSuspendDisablerTest timed out ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24269/files - new: https://git.openjdk.org/jdk/pull/24269/files/18944347..4a92986a Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24269&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24269&range=01-02 Stats: 68856 lines in 1040 files changed: 24634 ins; 41255 del; 2967 mod Patch: https://git.openjdk.org/jdk/pull/24269.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24269/head:pull/24269 PR: https://git.openjdk.org/jdk/pull/24269 From kevinw at openjdk.org Tue Apr 8 09:02:13 2025 From: kevinw at openjdk.org (Kevin Walls) Date: Tue, 8 Apr 2025 09:02:13 GMT Subject: RFR: 8353727: HeapDumpPath doesn't expand %p In-Reply-To: References:

Message-ID: On Tue, 8 Apr 2025 01:37:21 GMT, Chris Plummer wrote: >> This is a long-standing oversight: HeapDumpPath does not recognise %p for pid expansion. >> The default filename uses a pid (e.g. java_pid1676937.hprof) but HeapDumpPath does not. >> It has always done a manual "root plus pid plus extension" on the default filename only, and >> should move to using Argument::copy_expand_pid() like we do with other such filenames. >> >> >> We also assumed the default filename is not a directory (which is very very likely, but doesn't have to be true). > > src/hotspot/share/services/heapDumper.cpp line 2792: > >> 2790: // Path is a directory. Append the default name, with %p substitution. Use my_path temporarily. >> 2791: if (!Arguments::copy_expand_pid(dump_file_name, strlen(dump_file_name), my_path, JVM_MAXPATHLEN)) { >> 2792: warning("Cannot create heap dump file. HeapDumpPath is too long."); > > What is going to be the end result of this? A truncated file name? Yes the other warnings return - thanks. They all return without incrementing dump_seq, so will hit the same failure each time. Setting a HeapDumpPath near to 4k in length is not an efficient thing to do! ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24482#discussion_r2032733570 From rehn at openjdk.org Tue Apr 8 09:44:23 2025 From: rehn at openjdk.org (Robbin Ehn) Date: Tue, 8 Apr 2025 09:44:23 GMT Subject: RFR: 8352730: RISC-V: Disable tests in qemu-user [v3] In-Reply-To: <5sujqD7L_cmLUyDwYb4PhgOlEeiFwlkAV7RJoVMFTrM=.223437cd-bbb2-4ef3-a6fe-b13ce402e14b@github.com> References: <5sujqD7L_cmLUyDwYb4PhgOlEeiFwlkAV7RJoVMFTrM=.223437cd-bbb2-4ef3-a6fe-b13ce402e14b@github.com> Message-ID: On Mon, 31 Mar 2025 10:45:54 GMT, Robbin Ehn wrote: >> Hi, for you to consider. >> >> These tests constantly fails in qemu-user. >> Either the require host to be same arch explicit or implicit (sysroot). >> E.g. "ptrace(PTRACE_ATTACH, ..) failed for 405157: Function not implemented'" for SA tests. >> >> From bug: >>> qemu-user/rv64 sets uarch to "qemu" in /proc/cpuinfo (qemu-system do not do that). >>> We add this uarch to CPU feature string. >>> This means we can use jtreg 'require' with cpu string to filter out tests in qemu-user. >> >> Relevant qemu code: >> https://github.com/qemu/qemu/blob/170825d14d88a1ce7fae98d5a928480f2f329b22/linux-user/riscv/target_proc.h#L29 >> >> Relevant hotspot code: >> https://github.com/openjdk/jdk/blob/fa0b18bfde38ee2ffbab33a9eaac547fe8aa3c7c/src/hotspot/os_cpu/linux_riscv/vm_version_linux_riscv.cpp#L250 >> >> Tested that the require only filters out tests in qemu+riscv64. >> >> Thanks! >> >> /Robbin > > Robbin Ehn has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains seven additional commits since the last revision: > > - Merge branch 'master' into qemu-user-issues > - Revert > - Merge branch 'master' into qemu-user-issues > - Merge branch 'master' into qemu-user-issues > - more > - more > - native or very long Any takers? ------------- PR Comment: https://git.openjdk.org/jdk/pull/24229#issuecomment-2785851072 From kevinw at openjdk.org Tue Apr 8 11:43:20 2025 From: kevinw at openjdk.org (Kevin Walls) Date: Tue, 8 Apr 2025 11:43:20 GMT Subject: RFR: 8353727: HeapDumpPath doesn't expand %p [v2] In-Reply-To: References: Message-ID: > This is a long-standing oversight: HeapDumpPath does not recognise %p for pid expansion. > The default filename uses a pid (e.g. java_pid1676937.hprof) but HeapDumpPath does not. > It has always done a manual "root plus pid plus extension" on the default filename only, and > should move to using Argument::copy_expand_pid() like we do with other such filenames. > > > We also assumed the default filename is not a directory (which is very very likely, but doesn't have to be true). Kevin Walls has updated the pull request incrementally with two additional commits since the last revision: - length checking update - Chris feedback ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24482/files - new: https://git.openjdk.org/jdk/pull/24482/files/ab82116e..c32e4ca4 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24482&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24482&range=00-01 Stats: 23 lines in 2 files changed: 1 ins; 15 del; 7 mod Patch: https://git.openjdk.org/jdk/pull/24482.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24482/head:pull/24482 PR: https://git.openjdk.org/jdk/pull/24482 From kevinw at openjdk.org Tue Apr 8 11:47:20 2025 From: kevinw at openjdk.org (Kevin Walls) Date: Tue, 8 Apr 2025 11:47:20 GMT Subject: RFR: 8353727: HeapDumpPath doesn't expand %p [v2] In-Reply-To: References:

Message-ID: On Tue, 8 Apr 2025 11:44:52 GMT, Kevin Walls wrote: > Updated. Additionally, the total_length check at line 2760 is wrong now. But it is also redundant, we use copy_expand_pid to do our length checks on expansion. Use max_digit_chars to reduce buffer length in those copy_expand_pid calls, to leave room for possible later sequence numbers (this is very conservative). > > On longer path lengths, worth noting that using MAXPATHLEN (4k) is higher than outputStream::print_cr allows. This means we can get through all of HeapDumper::dump_heap(bool oome) and call HeapDumper::dump() which uses: 2606 out->print_cr("Dumping heap to %s ...", path); ..and will show a VM warning like "outputStream::do_vsnprintf output truncated" > > Again, very very long HeapDumpPaths are not efficient. 8-) I keep thinking that such a coding can benefit from using stringStream; it makes most of the associated buffer counting etc obsolete, supports dynamic buffers, or optionally can be laid over a fixed-sized buffer and then handles truncation. It does not yet provide a way to report truncation, but that can be added really easily. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24482#issuecomment-2786180952 From kevinw at openjdk.org Tue Apr 8 12:12:11 2025 From: kevinw at openjdk.org (Kevin Walls) Date: Tue, 8 Apr 2025 12:12:11 GMT Subject: RFR: 8353727: HeapDumpPath doesn't expand %p [v2] In-Reply-To: References:

Message-ID: On Tue, 8 Apr 2025 11:51:18 GMT, Thomas Stuefe wrote: > I keep thinking that such a coding can benefit from using stringStream; it makes most of the associated buffer counting etc obsolete, supports dynamic buffers, or optionally can be laid over a fixed-sized buffer and then handles truncation. It does not yet provide a way to report truncation, but that can be added really easily. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24482#issuecomment-2786226321 From stuefe at openjdk.org Tue Apr 8 13:16:20 2025 From: stuefe at openjdk.org (Thomas Stuefe) Date: Tue, 8 Apr 2025 13:16:20 GMT Subject: RFR: 8353727: HeapDumpPath doesn't expand %p [v2] In-Reply-To: References:

Message-ID: On Tue, 8 Apr 2025 13:13:25 GMT, Thomas Stuefe wrote: >> Kevin Walls has updated the pull request incrementally with two additional commits since the last revision: >> >> - length checking update >> - Chris feedback > > src/hotspot/share/services/heapDumper.cpp line 2779: > >> 2777: } >> 2778: // Then add the default name, with %p substitution. Use my_path temporarily. >> 2779: if (!Arguments::copy_expand_pid(dump_file_name, strlen(dump_file_name), my_path, JVM_MAXPATHLEN - max_digit_chars)) { > > IIUC there is a pre-existing bug, and if I am right one you should fix: this calculation assumes that there is only a single %p. There may be multiple. Many. E.g. as a malicious attempt to cause a buffer overflow. > > This is what I meant with stringStream. stringStream offers protection against stuff like that without the manual buffer counting headaches. I would give Arguments a method like this: > > print_expand_pid(outputStream* sink, const char* input); > > > and in there print to sink, with print or putc. This would never truncate. Then use it like this: > > > outputStream st(caller buffer, caller buffer size) > if (have HeapDumpPath) { > Arguments::print_expand_pid(st, HeapDumpPath); > if (st->was_truncated()) return with warning > // now st->base() ist der expanded heap path. Test if its a directory etc > } > // append file name > Arguments::print_expand_pid(st, dump_file_name); > if (st->was_truncated()) return with warning > > > Just a rough sketch. And fine for followup PRs, though I think it may make your life easier if you do it now. Thankfully copy_expand_pid does handle multiple %p replacements. It seems good to use that to check the buffer length, partly for that reason, as just knowing a max number of digits wasn't so flexible if many %p were present. Thanks for the other ideas! ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24482#discussion_r2033234374 From stefank at openjdk.org Tue Apr 8 14:22:27 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Tue, 8 Apr 2025 14:22:27 GMT Subject: RFR: 8341491: Reserve and commit memory operations should be protected by NMT lock [v3] In-Reply-To: References:

Message-ID: On Thu, 3 Apr 2025 15:27:15 GMT, Robert Toyonaga wrote: >> ### Update: >> After some discussion it was decided it's not necessary to expand the lock scope for reserve/commit. Instead, we are opting to add comments explaining the reasons for locking and the conditions to avoid which could lead to races. Some of the new tests can be kept because they are general enough to be useful outside of this context. >> >> ### Summary: >> This PR makes memory operations atomic with NMT accounting. >> >> ### The problem: >> In memory related functions like `os::commit_memory` and `os::reserve_memory` the OS memory operations are currently done before acquiring the the NMT mutex. And the the virtual memory accounting is done later in `MemTracker`, after the lock has been acquired. Doing the memory operations outside of the lock scope can lead to races. >> >> 1.1 Thread_1 releases range_A. >> 1.2 Thread_1 tells NMT "range_A has been released". >> >> 2.1 Thread_2 reserves (the now free) range_A. >> 2.2 Thread_2 tells NMT "range_A is reserved". >> >> Since the sequence (1.1) (1.2) is not atomic, if Thread_2 begins operating after (1.1), we can have (1.1) (2.1) (2.2) (1.2). The OS sees two valid subsequent calls (release range_A, followed by map range_A). But NMT sees "reserve range_A", "release range_A" and is now out of sync with the OS. >> >> ### Solution: >> Where memory operations such as reserve, commit, or release virtual memory happen, I've expanded the scope of `NmtVirtualMemoryLocker` to protect both the NMT accounting and the memory operation itself. >> >> ### Other notes: >> I also simplified this pattern found in many places: >> >> if (MemTracker::enabled()) { >> MemTracker::NmtVirtualMemoryLocker nvml; >> result = pd_some_operation(addr, bytes); >> if (result != nullptr) { >> MemTracker::record_some_operation(addr, bytes); >> } >> } else { >> result = pd_unmap_memory(addr, bytes); >> } >> ``` >> To: >> >> MemTracker::NmtVirtualMemoryLocker nvml; >> result = pd_unmap_memory(addr, bytes); >> MemTracker::record_some_operation(addr, bytes); >> ``` >> This is possible because `NmtVirtualMemoryLocker` now checks `MemTracker::enabled()`. `MemTracker::record_some_operation` already checks `MemTracker::enabled()` and checks against nullptr. This refactoring previously wasn't possible because `ThreadCritical` was used before https://github.com/openjdk/jdk/pull/22745 introduced `NmtVirtualMemoryLocker`. >> >> I considered moving the locking and NMT accounting down into platform specific code: Ex. lock around { munmap() + MemTracker:... > > Robert Toyonaga has updated the pull request incrementally with one additional commit since the last revision: > > exclude file mapping tests on AIX. I think this looks good to me, but please seek feedback from others as well. I've added a couple of suggestions. None of them are required, but I think they would be nice to do. src/hotspot/share/runtime/os.cpp line 2206: > 2204: // when it is actually committed. The opposite scenario is not guarded against. pd_commit_memory and > 2205: // record_virtual_memory_commit do not happen atomically. We assume that there is some external synchronization > 2206: // that prevents a region from being uncommitted before it is finished being committed. It's not a requirement, but you get kudos from me if you keep comments lines below 80 lines. I typically don't like code to be 80 lines, but comments tend to be nicer if they are. test/hotspot/gtest/runtime/test_os.cpp line 1123: > 1121: > 1122: char* base = os::reserve_memory(size, false, mtTest); > 1123: ASSERT_NE(base, (char*) nullptr); Suggestion: ASSERT_NOT_NULL(base); And the same in other places. test/hotspot/gtest/runtime/test_os.cpp line 1133: > 1131: } > 1132: > 1133: #if !defined(_AIX) Suggestion: #if !defined(_AIX) I suggest a blank line here because this ifdef spans multiple tests and not only the nearest test. Having a blank line makes it clearer that this is a large ifdef that is not only related to the test case that it is bunched up against. test/hotspot/gtest/runtime/test_os.cpp line 1145: > 1143: EXPECT_TRUE(result != nullptr); > 1144: > 1145: EXPECT_TRUE(strcmp(letters, result)==0); Suggestion: EXPECT_TRUE(strcmp(letters, result) == 0); but probably even better: Suggestion: EXPECT_EQ(strcmp(letters, result), 0); test/hotspot/gtest/runtime/test_os.cpp line 1184: > 1182: ::close(fd); > 1183: } > 1184: #endif Suggestion: #endif // !defined(_AIX) I suggest a blank line and a matching comment. I know some HotSpots devs tend to appreciate those comments. ------------- Marked as reviewed by stefank (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24084#pullrequestreview-2750137709 PR Review Comment: https://git.openjdk.org/jdk/pull/24084#discussion_r2033287481 PR Review Comment: https://git.openjdk.org/jdk/pull/24084#discussion_r2033292443 PR Review Comment: https://git.openjdk.org/jdk/pull/24084#discussion_r2033303666 PR Review Comment: https://git.openjdk.org/jdk/pull/24084#discussion_r2033294266 PR Review Comment: https://git.openjdk.org/jdk/pull/24084#discussion_r2033307030 From stefank at openjdk.org Tue Apr 8 14:22:27 2025 From: stefank at openjdk.org (Stefan Karlsson) Date: Tue, 8 Apr 2025 14:22:27 GMT Subject: RFR: 8341491: Reserve and commit memory operations should be protected by NMT lock [v3] In-Reply-To: References:

Message-ID: On Tue, 8 Apr 2025 14:11:21 GMT, Stefan Karlsson wrote: >> Robert Toyonaga has updated the pull request incrementally with one additional commit since the last revision: >> >> exclude file mapping tests on AIX. > > test/hotspot/gtest/runtime/test_os.cpp line 1145: > >> 1143: EXPECT_TRUE(result != nullptr); >> 1144: >> 1145: EXPECT_TRUE(strcmp(letters, result)==0); > > Suggestion: > > EXPECT_TRUE(strcmp(letters, result) == 0); > > but probably even better: > Suggestion: > > EXPECT_EQ(strcmp(letters, result), 0); There are more places like this. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24084#discussion_r2033296961 From alanb at openjdk.org Tue Apr 8 14:29:50 2025 From: alanb at openjdk.org (Alan Bateman) Date: Tue, 8 Apr 2025 14:29:50 GMT Subject: RFR: 8351927: Change VirtualThread implementation to use use FJP delayed task handling Message-ID: <7vE97S4zy2S1vuRwEapE4k9ZScS6yTJBQWyKymjdc0g=.5551d6d6-68ad-446e-84c7-70fd57fa57a6@github.com> Follow up to JDK-8319447 to change the VirtualThread implementation to use FJP's delayed task handling. The SPTE based implementation is not removed. It will continue to be used by tests. If custom schedulers are exposed in the future then they will use this implementation. For timed-Object.wait, waitTimeoutExpired is changed to use lazySubmit to avoid signalling and increase the chance that the unparked virtual thread will continue on the current carrier. For timed-park, the timeout task is changed to reduced form of unpark that also uses lazySubmit, for the same reason. `jcmd Thread.vthread_scheduler` is changed to no longer print the delay schedulers. Instead, the delayed task count will appear in the default scheduler output. ------------- Commit messages: - Merge branch 'master' into JDK-8351927 - Merge branch 'master' into JDK-8351927 - Merge branch 'master' into JDK-8351927 - Merge - Merge branch 'pull/23702' into JDK-8351927 - Typo - Address review comments - Merge branch 'openjdk:master' into JDK-8319447 - Address review comments - Update - ... and 51 more: https://git.openjdk.org/jdk/compare/867a0301...30eba776 Changes: https://git.openjdk.org/jdk/pull/24030/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24030&range=00 Issue: https://bugs.openjdk.org/browse/JDK-8351927 Stats: 405 lines in 8 files changed: 332 ins; 41 del; 32 mod Patch: https://git.openjdk.org/jdk/pull/24030.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24030/head:pull/24030 PR: https://git.openjdk.org/jdk/pull/24030 From mbaesken at openjdk.org Tue Apr 8 14:46:30 2025 From: mbaesken at openjdk.org (Matthias Baesken) Date: Tue, 8 Apr 2025 14:46:30 GMT Subject: RFR: 8349638: Build libjdwp with SIZE optimization In-Reply-To: References:

Message-ID: <-ut2L3mbuwzJxlZJGiGyhMl7_HCpd1QX-kuFOX8m1Lc=.38fc5800-855a-49cd-9f3e-3ccadd19885b@github.com> On Tue, 1 Apr 2025 11:59:07 GMT, Magnus Ihse Bursie wrote: > It would be interesting to also see how compilation times varies with optimization level. At least some kind of hint if HIGHEST is like 2x slower than LOW, or if SIZE is slower than LOW at all, etc. The relative speed difference is interesting, but so is it in absolute terms. If a library takes 0.5 seconds on LOW but 1.1 seconds on HIGH on a particular system, it is unlikely to matter much to overall build time anywhere. But if it goes from 15s to 30s on a fast machine, it might be a problem if such performance regressions stack up, especially on slower machines (which includes the ones running GHA). This is what I got from my Linux x86_64 system using gcc 13.2.0 devkit (opt build). Note that the build operates on a relatively slow filer, this will slow the build time somewhat but that is true for all opt-levels. rm -rf ./support/modules_libs/jdk.jdwp.agent/libjdwp.so ./jdk/lib/libjdwp.so ./support/native/jdk.jdwp.agent/libjdwp time make jdk.jdwp.agent-libs-only JOBS=1 gave me these times **default (LOW)** real 0m15.661s user 0m8.763s sys 0m2.012s **HIGHEST** real 0m15.201s user 0m9.005s sys 0m2.003s **SIZE** real 0m14.263s user 0m7.905s sys 0m1.891s So it looks like SIZE is a little faster than the other, and LOW and HIGHEST are rather similar. LOW is `-O2` on Linuxx86_64 and HIGHEST is `-O3` , those are maybe rather similar (LOW is a bit misleading because `-O2` is not really that 'low' the gcc docu says about it : 'Optimize even more. GCC performs nearly all supported optimizations that do not involve a space-speed tradeoff.'). ------------- PR Comment: https://git.openjdk.org/jdk/pull/23563#issuecomment-2786694168 From stuefe at openjdk.org Tue Apr 8 15:28:25 2025 From: stuefe at openjdk.org (Thomas Stuefe) Date: Tue, 8 Apr 2025 15:28:25 GMT Subject: RFR: 8353727: HeapDumpPath doesn't expand %p [v2] In-Reply-To: References:

Message-ID: On Tue, 8 Apr 2025 13:47:10 GMT, Kevin Walls wrote: >> src/hotspot/share/services/heapDumper.cpp line 2779: >> >>> 2777: } >>> 2778: // Then add the default name, with %p substitution. Use my_path temporarily. >>> 2779: if (!Arguments::copy_expand_pid(dump_file_name, strlen(dump_file_name), my_path, JVM_MAXPATHLEN - max_digit_chars)) { >> >> IIUC there is a pre-existing bug, and if I am right one you should fix: this calculation assumes that there is only a single %p. There may be multiple. Many. E.g. as a malicious attempt to cause a buffer overflow. >> >> This is what I meant with stringStream. stringStream offers protection against stuff like that without the manual buffer counting headaches. I would give Arguments a method like this: >> >> print_expand_pid(outputStream* sink, const char* input); >> >> >> and in there print to sink, with print or putc. This would never truncate. Then use it like this: >> >> >> outputStream st(caller buffer, caller buffer size) >> if (have HeapDumpPath) { >> Arguments::print_expand_pid(st, HeapDumpPath); >> if (st->was_truncated()) return with warning >> // now st->base() ist der expanded heap path. Test if its a directory etc >> } >> // append file name >> Arguments::print_expand_pid(st, dump_file_name); >> if (st->was_truncated()) return with warning >> >> >> Just a rough sketch. And fine for followup PRs, though I think it may make your life easier if you do it now. > > Thankfully copy_expand_pid does handle multiple %p replacements. It seems good to use that to check the buffer length, partly for that reason, as just knowing a max number of digits wasn't so flexible if many %p were present. > > Thanks for the other ideas! Ah okay, it checks for overflow. Okay, please disregard half of what I have written :) ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24482#discussion_r2033452540 From kvn at openjdk.org Tue Apr 8 16:00:21 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Tue, 8 Apr 2025 16:00:21 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output [v3] In-Reply-To: <1ssLzAJVy1yrmNRCmTCz---JPV7_cLWOLwu0A3-yhZw=.02dbfc81-0a2e-4a4a-a7c1-bae8a93af621@github.com> References:

<1ssLzAJVy1yrmNRCmTCz---JPV7_cLWOLwu0A3-yhZw=.02dbfc81-0a2e-4a4a-a7c1-bae8a93af621@github.com> Message-ID: On Mon, 7 Apr 2025 16:54:37 GMT, Dan Heidinga wrote: >> Ioi Lam has updated the pull request incrementally with one additional commit since the last revision: >> >> @lmesnik comments > > src/hotspot/share/cds/cdsConfig.cpp line 598: > >> 596: // - SharedArchiveFile is not specified and the VM doesn't have a compatible default archive >> 597: >> 598: #define __THEMSG " is unsupported when base CDS archive is not loaded. Run with -Xlog:cds for more info." > > Do we want to start transitioning existing `-Xlog:cds` options to be `:aot` options? I think making the switch would match out long term direction Yes, but I think we should do it only if `AOTClassLinking` is enabled. For legacy CDS we should continue use `-Xlog:cds`. I am using `-Xlog:aot+codecache` in AOT code caching. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24401#discussion_r2033524434 From iklam at openjdk.org Tue Apr 8 16:29:21 2025 From: iklam at openjdk.org (Ioi Lam) Date: Tue, 8 Apr 2025 16:29:21 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output [v3] In-Reply-To: References:

<1ssLzAJVy1yrmNRCmTCz---JPV7_cLWOLwu0A3-yhZw=.02dbfc81-0a2e-4a4a-a7c1-bae8a93af621@github.com> Message-ID: On Tue, 8 Apr 2025 15:57:46 GMT, Vladimir Kozlov wrote: >> src/hotspot/share/cds/cdsConfig.cpp line 598: >> >>> 596: // - SharedArchiveFile is not specified and the VM doesn't have a compatible default archive >>> 597: >>> 598: #define __THEMSG " is unsupported when base CDS archive is not loaded. Run with -Xlog:cds for more info." >> >> Do we want to start transitioning existing `-Xlog:cds` options to be `:aot` options? I think making the switch would match out long term direction > > Yes, but I think we should do it only if `AOTClassLinking` is enabled. For legacy CDS we should continue use `-Xlog:cds`. > I am using `-Xlog:aot+codecache` in AOT code caching. I created [JDK-8354055 - Change "cds" logging tag to "aot"](https://bugs.openjdk.org/browse/JDK-8354055). There are documentation/compatibility issues so we need to do some planning. This particular block of code is moved from dynamicArchive.cpp to cdsConfig.cpp and I kept the logging tag the same. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24401#discussion_r2033582112 From heidinga at openjdk.org Tue Apr 8 16:56:17 2025 From: heidinga at openjdk.org (Dan Heidinga) Date: Tue, 8 Apr 2025 16:56:17 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output [v3] In-Reply-To: References:

<1ssLzAJVy1yrmNRCmTCz---JPV7_cLWOLwu0A3-yhZw=.02dbfc81-0a2e-4a4a-a7c1-bae8a93af621@github.com>

Message-ID: On Tue, 8 Apr 2025 16:26:28 GMT, Ioi Lam wrote: >> Yes, but I think we should do it only if `AOTClassLinking` is enabled. For legacy CDS we should continue use `-Xlog:cds`. >> I am using `-Xlog:aot+codecache` in AOT code caching. > > I created [JDK-8354055 - Change "cds" logging tag to "aot"](https://bugs.openjdk.org/browse/JDK-8354055). There are documentation/compatibility issues so we need to do some planning. > > This particular block of code is moved from dynamicArchive.cpp to cdsConfig.cpp and I kept the logging tag the same. Thanks @iklam. I agree with the approach of doing this in a separate issue. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24401#discussion_r2033640450 From kvn at openjdk.org Tue Apr 8 17:28:26 2025 From: kvn at openjdk.org (Vladimir Kozlov) Date: Tue, 8 Apr 2025 17:28:26 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output [v3] In-Reply-To: References:

Message-ID: <2ZMtE0qPWzJ1jSC31oFURQ2b7w3l7d8CJsXgmWauyiY=.e4a74cdf-d76a-4862-8966-eb26a0b57dac@github.com> On Thu, 3 Apr 2025 20:31:50 GMT, Ioi Lam wrote: >> Since [JEP 483: Ahead-of-Time Class Loading & Linking](https://openjdk.org/jeps/483), VM options such as `-XX:AOTCache `are implemented as aliases of "classical" CDS options such as `-XX:SharedArchiveFile`. >> >> In anticipation of the [JEP: Ahead-of-time Command Line Ergonomics](https://bugs.openjdk.org/browse/JDK-8350022), we should refactor the code that deals with the AOT options. Specifically, as we expect the JVM to be able to load from an "input AOT cache" and write to an "output AOT cache", we should clearly identify the input and output caches in separate APIs: >> >> >> const char* CDSConfig::input_static_archive_path(); >> const char* CDSConfig::input_dynamic_archive_path(); >> const char* CDSConfig::output_archive_path(); >> >> >> This PR also cleans up the code by: >> - renaming a few function to reflect what they actually do >> - moving more "config" management code into cdsConfig.cpp >> >> There's also a behavioral bug fix: before this PR, `-XX:AOTCache` was handled by the `ergo_init_classic_archive_paths()` function, which allows two files to be specified. E.g., `java -XX:AOTCache=static.jsa:dynamic.jsa`. That's because `-XX:AOTCache` was implemented as an alias of `-XX:SharedArchiveFile`, and the latter allows this usage. >> >> However, this behavior is not specified in JEP 483. Allowing two files in -XX:AOTCache will cause unnecessary complexity when we implement [JDK-8353598: Allow AOT cache to be used in training run](https://bugs.openjdk.org/browse/JDK-8353598). Therefore, I added new test cases to disallow the use of two files. This also means that we don't need to modify the already over-complicated `ergo_init_classic_archive_paths()` for the AOT use cases > > Ioi Lam has updated the pull request incrementally with one additional commit since the last revision: > > @lmesnik comments Looks good to me. ------------- Marked as reviewed by kvn (Reviewer). PR Review: https://git.openjdk.org/jdk/pull/24401#pullrequestreview-2750854771 From jiangli at openjdk.org Tue Apr 8 18:55:21 2025 From: jiangli at openjdk.org (Jiangli Zhou) Date: Tue, 8 Apr 2025 18:55:21 GMT Subject: RFR: 8353938: hotspot/jtreg/serviceability/dcmd/jvmti/LoadAgentDcmdTest.java fails on static JDK In-Reply-To: References: Message-ID: On Tue, 8 Apr 2025 02:25:58 GMT, Jiangli Zhou wrote: > 2. From https://docs.oracle.com/en/java/javase/21/docs/specs/man/jcmd.html: > > ``` > JVMTI.agent_load [arguments] > Loads JVMTI native agent. > Impact: Low > arguments: > library path: Absolute path of the JVMTI agent to load. (STRING, no default value) > agent option: (Optional) Option string to pass the agent. (STRING, no default value) > ``` > > The command spec requires the absolute path of the JVMTI agent for the `library path` argument. On static JDK, if the agent library is built-in (statically linked), passing the shared library name works and allows the VM to find the built-in agent. There would be no need to specify the absolute path. Please see ?https://bugs.openjdk.org/browse/JDK-8353938?focusedId=14767737&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-14767737 for more details. I discussed the `JVMTI.agent_load` issue on static support with @AlanBateman this morning as part of the hermetic Java meeting. @AlanBateman suggested considering adding an alternative diagnostic command or argument for the static (built-in) agent load support. We also discussed the use case of statically-linked dynamic (attached) native agents loaded by `jcmd` tool on static, and considerations for resolving (or clarifying) the `JVMTI.agent_load` static support at a later point when usage requirements were more clear. Outside the jtreg tests, I haven't run into any such usages yet. Based on the discussion, I'll update this PR to skip the `LoadAgentDcmdTest` on static JDK for now. I'll file a separate bug for the `JVMTI.agent_load` issue on static support so we can revisit that when things are more clear. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24497#issuecomment-2787388528 From jiangli at openjdk.org Tue Apr 8 19:09:02 2025 From: jiangli at openjdk.org (Jiangli Zhou) Date: Tue, 8 Apr 2025 19:09:02 GMT Subject: RFR: 8353938: hotspot/jtreg/serviceability/dcmd/jvmti/LoadAgentDcmdTest.java fails on static JDK [v2] In-Reply-To: References: Message-ID: > Please review this PR that changes `LoadAgentDcmdTest.getLibInstrumentPath()` to not locate `libinstrument` shared library if running on static JDK, instead just returns "libinstrument." directly. Both test case #1 and #2 in `LoadAgentDcmdTest.run()` run ok on static JDK with the change: > > Commands: > Test case #1: `JVMTI.agent_load libinstrument.so agent.jar` > Test case #2: `JVMTI.agent_load libinstrument.so "agent.jar=foo=bar"` > > Additional notes/considerations: > > 1. I notice [JDKToolFinder.getJDKTool()](https://github.com/openjdk/jdk/blob/80ff7b9c9406c7845ecb3bc40910e92ccdd23ff2/test/lib/jdk/test/lib/JDKToolFinder.java#L41) is used by the test to find the `jcmd` tool. `JDKToolFinder.getJDKTool()` looks for the requested tool in both `test.jdk` and `compile.jdk`. When running the jtreg test on static JDK, it's able to locate the `jcmd` from `compile.jdk` even though the static JDK binary (`test.jdk`) does not provide the tool. For tools used by jtreg tests at runtime, how about change to always do that? > > 2. From https://docs.oracle.com/en/java/javase/21/docs/specs/man/jcmd.html: > > JVMTI.agent_load [arguments] > Loads JVMTI native agent. > Impact: Low > arguments: > library path: Absolute path of the JVMTI agent to load. (STRING, no default value) > agent option: (Optional) Option string to pass the agent. (STRING, no default value) > > The command spec requires the absolute path of the JVMTI agent for the `library path` argument. On static JDK, if the agent library is built-in (statically linked), passing the shared library name works and allows the VM to find the built-in agent. There would be no need to specify the absolute path. Please see ?https://bugs.openjdk.org/browse/JDK-8353938?focusedId=14767737&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-14767737 for more details. Jiangli Zhou has updated the pull request incrementally with one additional commit since the last revision: Skip LoadAgentDcmdTest on static JDK. ------------- Changes: - all: https://git.openjdk.org/jdk/pull/24497/files - new: https://git.openjdk.org/jdk/pull/24497/files/1e61a0dd..3b4ef50c Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=24497&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24497&range=00-01 Stats: 7 lines in 1 file changed: 1 ins; 6 del; 0 mod Patch: https://git.openjdk.org/jdk/pull/24497.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24497/head:pull/24497 PR: https://git.openjdk.org/jdk/pull/24497 From jiangli at openjdk.org Tue Apr 8 19:35:23 2025 From: jiangli at openjdk.org (Jiangli Zhou) Date: Tue, 8 Apr 2025 19:35:23 GMT Subject: RFR: 8353938: hotspot/jtreg/serviceability/dcmd/jvmti/LoadAgentDcmdTest.java fails on static JDK [v2] In-Reply-To: References:

Message-ID: On Tue, 8 Apr 2025 19:09:02 GMT, Jiangli Zhou wrote: >> Please review this PR that changes `LoadAgentDcmdTest.getLibInstrumentPath()` to not locate `libinstrument` shared library if running on static JDK, instead just returns "libinstrument." directly. Both test case #1 and #2 in `LoadAgentDcmdTest.run()` run ok on static JDK with the change: >> >> Commands: >> Test case #1: `JVMTI.agent_load libinstrument.so agent.jar` >> Test case #2: `JVMTI.agent_load libinstrument.so "agent.jar=foo=bar"` >> >> Additional notes/considerations: >> >> 1. I notice [JDKToolFinder.getJDKTool()](https://github.com/openjdk/jdk/blob/80ff7b9c9406c7845ecb3bc40910e92ccdd23ff2/test/lib/jdk/test/lib/JDKToolFinder.java#L41) is used by the test to find the `jcmd` tool. `JDKToolFinder.getJDKTool()` looks for the requested tool in both `test.jdk` and `compile.jdk`. When running the jtreg test on static JDK, it's able to locate the `jcmd` from `compile.jdk` even though the static JDK binary (`test.jdk`) does not provide the tool. For tools used by jtreg tests at runtime, how about change to always do that? >> >> 2. From https://docs.oracle.com/en/java/javase/21/docs/specs/man/jcmd.html: >> >> JVMTI.agent_load [arguments] >> Loads JVMTI native agent. >> Impact: Low >> arguments: >> library path: Absolute path of the JVMTI agent to load. (STRING, no default value) >> agent option: (Optional) Option string to pass the agent. (STRING, no default value) >> >> The command spec requires the absolute path of the JVMTI agent for the `library path` argument. On static JDK, if the agent library is built-in (statically linked), passing the shared library name works and allows the VM to find the built-in agent. There would be no need to specify the absolute path. Please see ?https://bugs.openjdk.org/browse/JDK-8353938?focusedId=14767737&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-14767737 for more details. > > Jiangli Zhou has updated the pull request incrementally with one additional commit since the last revision: > > Skip LoadAgentDcmdTest on static JDK. I filed https://bugs.openjdk.org/browse/JDK-8354069. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24497#issuecomment-2787473905 From vklang at openjdk.org Tue Apr 8 20:09:25 2025 From: vklang at openjdk.org (Viktor Klang) Date: Tue, 8 Apr 2025 20:09:25 GMT Subject: RFR: 8351927: Change VirtualThread implementation to use use FJP delayed task handling In-Reply-To: <7vE97S4zy2S1vuRwEapE4k9ZScS6yTJBQWyKymjdc0g=.5551d6d6-68ad-446e-84c7-70fd57fa57a6@github.com> References: <7vE97S4zy2S1vuRwEapE4k9ZScS6yTJBQWyKymjdc0g=.5551d6d6-68ad-446e-84c7-70fd57fa57a6@github.com> Message-ID: <64xe_dZIfIScyB_InbjJJn19DjAY_7qxmsB7FKKKIH4=.0c4ad76a-cbf8-475d-b3e8-6c2336ec1f29@github.com> On Thu, 13 Mar 2025 10:48:14 GMT, Alan Bateman wrote: > Follow up to JDK-8319447 to change the VirtualThread implementation to use FJP's delayed task handling. > > The SPTE based implementation is not removed. It will continue to be used by tests. If custom schedulers are exposed in the future then they will use this implementation. > > For timed-Object.wait, waitTimeoutExpired is changed to use lazySubmit to avoid signalling and increase the chance that the unparked virtual thread will continue on the current carrier. For timed-park, the timeout task is changed to reduced form of unpark that also uses lazySubmit, for the same reason. > > `jcmd Thread.vthread_scheduler` is changed to no longer print the delay schedulers. Instead, the delayed task count will appear in the default scheduler output. src/java.base/share/classes/java/lang/VirtualThread.java line 889: > 887: private void parkTimeoutExpired() { > 888: assert !VirtualThread.currentThread().isVirtual(); > 889: if (!getAndSetParkPermit(true) @AlanBateman Would it make sense to test whether the park-permit is false before the LOCK XCHG? src/java.base/share/classes/java/lang/VirtualThread.java line 1455: > 1453: return pool.schedule(command, delay, unit); > 1454: } else { > 1455: return DelayedTaskSchedulers.schedule(command, delay, unit); @AlanBateman Would it make sense to test if the Scheduler implements ScheduledExecutorService? src/java.base/share/classes/java/lang/VirtualThread.java line 1462: > 1460: * Supports scheduling a runnable task to run after a delay. It uses a number > 1461: * of ScheduledThreadPoolExecutor instances to reduce contention on the delayed > 1462: * work queue used. This class is used when using a custom scheduler. @AlanBateman It might make sense to instead require a custom Scheduler to implement ScheduledExecutorService? ? ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24030#discussion_r2033942539 PR Review Comment: https://git.openjdk.org/jdk/pull/24030#discussion_r2033943663 PR Review Comment: https://git.openjdk.org/jdk/pull/24030#discussion_r2033944680 From lmesnik at openjdk.org Wed Apr 9 00:06:41 2025 From: lmesnik at openjdk.org (Leonid Mesnik) Date: Wed, 9 Apr 2025 00:06:41 GMT Subject: RFR: 8353727: HeapDumpPath doesn't expand %p [v2] In-Reply-To: References:

Message-ID: On Tue, 8 Apr 2025 11:43:20 GMT, Kevin Walls wrote: >> This is a long-standing oversight: HeapDumpPath does not recognise %p for pid expansion. >> The default filename uses a pid (e.g. java_pid1676937.hprof) but HeapDumpPath does not. >> It has always done a manual "root plus pid plus extension" on the default filename only, and >> should move to using Argument::copy_expand_pid() like we do with other such filenames. >> >> >> We also assumed the default filename is not a directory (which is very very likely, but doesn't have to be true). > > Kevin Walls has updated the pull request incrementally with two additional commits since the last revision: > > - length checking update > - Chris feedback Marked as reviewed by lmesnik (Reviewer). test/hotspot/jtreg/runtime/ErrorHandling/TestHeapDumpOnOutOfMemoryError.java line 100: > 98: output.shouldContain("Dumping heap to " + type + ".hprof"); > 99: File dump = new File(heapdumpFilename); > 100: Asserts.assertTrue(dump.exists() && dump.isFile(), "Could not find dump file " + dump.getAbsolutePath()); I. think you could just update the test to use heapdumpFilename = type + ".%p.hprof"; we don't need test twice, it is quite expensive. test/hotspot/jtreg/runtime/ErrorHandling/TestHeapDumpOnOutOfMemoryError.java line 115: > 113: output.stdoutShouldNotBeEmpty(); > 114: String actualHeapdumpFilename = type + "." + output.pid() + ".hprof"; > 115: output.shouldContain("Dumping heap to " + actualHeapdumpFilename); This better to be something like expectedlHeapdumpFilename and "Expected heap dump file". Not very important, but make log cleaner. ------------- PR Review: https://git.openjdk.org/jdk/pull/24482#pullrequestreview-2751644078 PR Review Comment: https://git.openjdk.org/jdk/pull/24482#discussion_r2034187020 PR Review Comment: https://git.openjdk.org/jdk/pull/24482#discussion_r2034189152 From iklam at openjdk.org Wed Apr 9 02:18:41 2025 From: iklam at openjdk.org (Ioi Lam) Date: Wed, 9 Apr 2025 02:18:41 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output [v4] In-Reply-To: References: Message-ID: > Since [JEP 483: Ahead-of-Time Class Loading & Linking](https://openjdk.org/jeps/483), VM options such as `-XX:AOTCache `are implemented as aliases of "classical" CDS options such as `-XX:SharedArchiveFile`. > > In anticipation of the [JEP: Ahead-of-time Command Line Ergonomics](https://bugs.openjdk.org/browse/JDK-8350022), we should refactor the code that deals with the AOT options. Specifically, as we expect the JVM to be able to load from an "input AOT cache" and write to an "output AOT cache", we should clearly identify the input and output caches in separate APIs: > > > const char* CDSConfig::input_static_archive_path(); > const char* CDSConfig::input_dynamic_archive_path(); > const char* CDSConfig::output_archive_path(); > > > This PR also cleans up the code by: > - renaming a few function to reflect what they actually do > - moving more "config" management code into cdsConfig.cpp > > There's also a behavioral bug fix: before this PR, `-XX:AOTCache` was handled by the `ergo_init_classic_archive_paths()` function, which allows two files to be specified. E.g., `java -XX:AOTCache=static.jsa:dynamic.jsa`. That's because `-XX:AOTCache` was implemented as an alias of `-XX:SharedArchiveFile`, and the latter allows this usage. > > However, this behavior is not specified in JEP 483. Allowing two files in -XX:AOTCache will cause unnecessary complexity when we implement [JDK-8353598: Allow AOT cache to be used in training run](https://bugs.openjdk.org/browse/JDK-8353598). Therefore, I added new test cases to disallow the use of two files. This also means that we don't need to modify the already over-complicated `ergo_init_classic_archive_paths()` for the AOT use cases Ioi Lam has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains nine commits: - Merge branch 'master' into 8353597-refactor-aot-cache-input-output - @lmesnik comments - more clean up - Minimized changes in ergo_init_classic_archive_paths() - Clean up CDS input/output path handling - Refactored CollectClassesForLinking for simplification - Merge branch 'master' into 8353014-exclude-tooling-classes-from-aot-cache - Reverted some fixes in systemDictionaryShared.cpp that causes test failures - 8353014: Exclude AOT tooling classes from AOT cache ------------- Changes: https://git.openjdk.org/jdk/pull/24401/files Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=24401&range=03 Stats: 309 lines in 15 files changed: 161 ins; 55 del; 93 mod Patch: https://git.openjdk.org/jdk/pull/24401.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/24401/head:pull/24401 PR: https://git.openjdk.org/jdk/pull/24401 From fyang at openjdk.org Wed Apr 9 04:02:29 2025 From: fyang at openjdk.org (Fei Yang) Date: Wed, 9 Apr 2025 04:02:29 GMT Subject: RFR: 8352730: RISC-V: Disable tests in qemu-user [v2] In-Reply-To: <1pa1FDH5Z2quR3fE7o4qfZKwRrz8nXHbMSirSyiqhTw=.9c37d2a9-5b93-40dd-8b5a-a5822030ef48@github.com> References:

<1pa1FDH5Z2quR3fE7o4qfZKwRrz8nXHbMSirSyiqhTw=.9c37d2a9-5b93-40dd-8b5a-a5822030ef48@github.com> Message-ID: <9R7U8cL4aSOayHQzaXoTGx0nXSXqdkO4ZomONZnM0Ao=.c1c90e1c-3085-4656-911d-23c407cff74d@github.com> On Fri, 28 Mar 2025 06:53:15 GMT, Robbin Ehn wrote: > It's not some intermittently failure. The majority of them can't work as they use pstack, open core files, use PerfData, etc.. and expected it to be rv64. But core files, pstack are in host arch as we are running qemu-user. I can remove tests which timeouts and only keep test which simply can't work in qemu-user environment in this PR. Seems good? Hi, That make sense to me. And it doesn't seem to me to be riscv-specific issue, but rather one with qemu-user. Maybe we should update the title and changes to reflect that? I sometimes see people testing with qemu for other CPU platforms as well like ppc, s390, etc. Guess they might be helped with this too. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24229#issuecomment-2788216103 From asmehra at openjdk.org Wed Apr 9 04:06:30 2025 From: asmehra at openjdk.org (Ashutosh Mehra) Date: Wed, 9 Apr 2025 04:06:30 GMT Subject: RFR: 8353597: Refactor handling VM options for AOT cache input and output [v4] In-Reply-To: References:

Message-ID: On Wed, 9 Apr 2025 02:18:41 GMT, Ioi Lam wrote: >> Since [JEP 483: Ahead-of-Time Class Loading & Linking](https://openjdk.org/jeps/483), VM options such as `-XX:AOTCache `are implemented as aliases of "classical" CDS options such as `-XX:SharedArchiveFile`. >> >> In anticipation of the [JEP: Ahead-of-time Command Line Ergonomics](https://bugs.openjdk.org/browse/JDK-8350022), we should refactor the code that deals with the AOT options. Specifically, as we expect the JVM to be able to load from an "input AOT cache" and write to an "output AOT cache", we should clearly identify the input and output caches in separate APIs: >> >> >> const char* CDSConfig::input_static_archive_path(); >> const char* CDSConfig::input_dynamic_archive_path(); >> const char* CDSConfig::output_archive_path(); >> >> >> This PR also cleans up the code by: >> - renaming a few function to reflect what they actually do >> - moving more "config" management code into cdsConfig.cpp >> >> There's also a behavioral bug fix: before this PR, `-XX:AOTCache` was handled by the `ergo_init_classic_archive_paths()` function, which allows two files to be specified. E.g., `java -XX:AOTCache=static.jsa:dynamic.jsa`. That's because `-XX:AOTCache` was implemented as an alias of `-XX:SharedArchiveFile`, and the latter allows this usage. >> >> However, this behavior is not specified in JEP 483. Allowing two files in -XX:AOTCache will cause unnecessary complexity when we implement [JDK-8353598: Allow AOT cache to be used in training run](https://bugs.openjdk.org/browse/JDK-8353598). Therefore, I added new test cases to disallow the use of two files. This also means that we don't need to modify the already over-complicated `ergo_init_classic_archive_paths()` for the AOT use cases > > Ioi Lam has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains nine commits: > > - Merge branch 'master' into 8353597-refactor-aot-cache-input-output > - @lmesnik comments > - more clean up > - Minimized changes in ergo_init_classic_archive_paths() > - Clean up CDS input/output path handling > - Refactored CollectClassesForLinking for simplification > - Merge branch 'master' into 8353014-exclude-tooling-classes-from-aot-cache > - Reverted some fixes in systemDictionaryShared.cpp that causes test failures > - 8353014: Exclude AOT tooling classes from AOT cache lgtm ------------- Marked as reviewed by asmehra (Committer). PR Review: https://git.openjdk.org/jdk/pull/24401#pullrequestreview-2751975775 From alanb at openjdk.org Wed Apr 9 05:59:32 2025 From: alanb at openjdk.org (Alan Bateman) Date: Wed, 9 Apr 2025 05:59:32 GMT Subject: RFR: 8351927: Change VirtualThread implementation to use use FJP delayed task handling In-Reply-To: <64xe_dZIfIScyB_InbjJJn19DjAY_7qxmsB7FKKKIH4=.0c4ad76a-cbf8-475d-b3e8-6c2336ec1f29@github.com> References: <7vE97S4zy2S1vuRwEapE4k9ZScS6yTJBQWyKymjdc0g=.5551d6d6-68ad-446e-84c7-70fd57fa57a6@github.com> <64xe_dZIfIScyB_InbjJJn19DjAY_7qxmsB7FKKKIH4=.0c4ad76a-cbf8-475d-b3e8-6c2336ec1f29@github.com> Message-ID: On Tue, 8 Apr 2025 20:05:26 GMT, Viktor Klang wrote: >> Follow up to JDK-8319447 to change the VirtualThread implementation to use FJP's delayed task handling. >> >> The SPTE based implementation is not removed. It will continue to be used by tests. If custom schedulers are exposed in the future then they will use this implementation. >> >> For timed-Object.wait, waitTimeoutExpired is changed to use lazySubmit to avoid signalling and increase the chance that the unparked virtual thread will continue on the current carrier. For timed-park, the timeout task is changed to reduced form of unpark that also uses lazySubmit, for the same reason. >> >> `jcmd Thread.vthread_scheduler` is changed to no longer print the delay schedulers. Instead, the delayed task count will appear in the default scheduler output. > > src/java.base/share/classes/java/lang/VirtualThread.java line 889: > >> 887: private void parkTimeoutExpired() { >> 888: assert !VirtualThread.currentThread().isVirtual(); >> 889: if (!getAndSetParkPermit(true) > > @AlanBateman Would it make sense to test whether the park-permit is false before the LOCK XCHG? It already does, no CAS if the current value is the new value. > src/java.base/share/classes/java/lang/VirtualThread.java line 1455: > >> 1453: return pool.schedule(command, delay, unit); >> 1454: } else { >> 1455: return DelayedTaskSchedulers.schedule(command, delay, unit); > > @AlanBateman Would it make sense to test if the Scheduler implements ScheduledExecutorService? Not for now. If a custom scheduler feature is exposed some time then we can think about this topic, it may or may be that the custom scheduler supports delayed tasks. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24030#discussion_r2034500634 PR Review Comment: https://git.openjdk.org/jdk/pull/24030#discussion_r2034500533 From rehn at openjdk.org Wed Apr 9 06:34:30 2025 From: rehn at openjdk.org (Robbin Ehn) Date: Wed, 9 Apr 2025 06:34:30 GMT Subject: RFR: 8352730: RISC-V: Disable tests in qemu-user [v2] In-Reply-To: <9R7U8cL4aSOayHQzaXoTGx0nXSXqdkO4ZomONZnM0Ao=.c1c90e1c-3085-4656-911d-23c407cff74d@github.com> References:

<1pa1FDH5Z2quR3fE7o4qfZKwRrz8nXHbMSirSyiqhTw=.9c37d2a9-5b93-40dd-8b5a-a5822030ef48@github.com> <9R7U8cL4aSOayHQzaXoTGx0nXSXqdkO4ZomONZnM0Ao=.c1c90e1c-3085-4656-911d-23c407cff74d@github.com> Message-ID: On Wed, 9 Apr 2025 03:57:10 GMT, Fei Yang wrote: > > It's not some intermittently failure. The majority of them can't work as they use pstack, open core files, use PerfData, etc.. and expected it to be rv64. But core files, pstack are in host arch as we are running qemu-user. I can remove tests which timeouts and only keep test which simply can't work in qemu-user environment in this PR. Seems good? > > Hi, That make sense to me. And it doesn't seem to me to be riscv-specific issue, but rather one with qemu-user. Maybe we should update the title and changes to reflect that? I sometimes see people testing with qemu for other CPU platforms as well like ppc, s390, etc. Guess they might be helped with this too. Hey, thanks for considering. The default qemu /proc/cpu do not contain any information about this being qemu. And there is no standard way to find this out AFIAK. Some platforms have target specific /proc/cpu and put qemu in there, but it have no standard format. The whole proc -> uarch string -> jvm cpu string -> jtreg require is qemu/linux-user/riscv specific. ------------- PR Comment: https://git.openjdk.org/jdk/pull/24229#issuecomment-2788434073 From stuefe at openjdk.org Wed Apr 9 06:46:35 2025 From: stuefe at openjdk.org (Thomas Stuefe) Date: Wed, 9 Apr 2025 06:46:35 GMT Subject: RFR: 8353727: HeapDumpPath doesn't expand %p [v2] In-Reply-To: References:

Message-ID: <1hCrzMxrXFxiNGtP2T6tvgVhRyuCLRQpX5wsQNwCzNU=.a558b333-1057-4902-97f8-c8509ad85471@github.com> On Tue, 8 Apr 2025 11:43:20 GMT, Kevin Walls wrote: >> This is a long-standing oversight: HeapDumpPath does not recognise %p for pid expansion. >> The default filename uses a pid (e.g. java_pid1676937.hprof) but HeapDumpPath does not. >> It has always done a manual "root plus pid plus extension" on the default filename only, and >> should move to using Argument::copy_expand_pid() like we do with other such filenames. >> >> >> We also assumed the default filename is not a directory (which is very very likely, but doesn't have to be true). > > Kevin Walls has updated the pull request incrementally with two additional commits since the last revision: > > - length checking update > - Chris feedback src/hotspot/share/services/heapDumper.cpp line 2760: > 2758: if (dump_file_seq == 0) { // first time in, we initialize base_path > 2759: // Set base path (name or directory, default or custom, without seq no), doing %p substitution. > 2760: const char *path_src = (HeapDumpPath != nullptr && HeapDumpPath[0] != '\0') ? HeapDumpPath : dump_file_name; Why do you expand the dump file name here? If you want to minimize the expand calls, you could: - append the unexpanded dump_file_name to the unexpanded HeapDumpPath - expand - create dir (extract the directory name by temporarily setting the the last '/' to '\0; create dir; restore '/') now you are done. ------------- PR Review Comment: https://git.openjdk.org/jdk/pull/24482#discussion_r2034576102 From fyang at openjdk.org Wed Apr 9 07:35:40 2025 From: fyang at openjdk.org (Fei Yang) Date: Wed, 9 Apr 2025 07:35:40 GMT Subject: RFR: 8352730: RISC-V: Disable tests in qemu-user [v2] In-Reply-To: References:

<1pa1FDH5Z2quR3fE7o4qfZKwRrz8nXHbMSirSyiqhTw=.9c37d2a9-5b93-40dd-8b5a-a5822030ef48@github.com> <9R7U8cL4aSOayHQzaXoTGx0nXSXqdkO4ZomONZnM0Ao=.c1c90e1c-3085-4656-911d-23c407cff74d@github.com>