RFR: 8307365: JvmtiStressModule hit SIGSEGV in JvmtiEventControllerPrivate::recompute_thread_enabled

Serguei Spitsyn sspitsyn at openjdk.org
Tue May 16 00:54:48 UTC 2023


On Fri, 12 May 2023 02:14:00 GMT, Patricio Chilano Mateo <pchilanomate at openjdk.org> wrote:

> The following patch fixes a bug introduced while refactoring the VirtualThreadStart/End events. Specifically, the code to delete the JvmtiThreadState of a terminating vthread was moved before we start the VTMS transition. That allowed said code to run concurrently with recompute_enabled() leading to different crashing modes. I wrote the detailed sequence of events leading to the crash in the bug comments. 
> 
> To fix it I moved the cleanup code back after the call to VTMS_unmount_begin(). Now, since the rebinding of the JvmtiThreadState to that of the carrier has to be done after this cleanup code is executed, otherwise we would delete the wrong JvmtiThreadState state, I had to add a boolean argument to VTMS_unmount_begin() to differentiate the last unmount call from the other ones. This is unfortunate since ideally VTMS_unmount_begin() would be oblivious to these two cases as with VTMS_mount_end() where we don't need to check if this is the first mount.
> I looked for other ways to solve it instead of the extra boolean argument but wasn't convinced. One way would be to have another JvmtiExport::cleanup_thread() that would handle this case. Another way which is very simple is to move the rebind_to_jvmti_thread_state_of() call to VTMS_unmount_end() instead. But that means during the transition the _jvmti_thread_state field of the carrier would be either null or that of the vthread, unlike today which is always that of the carrier during the transitions. I didn't want to change that behavior in this fix but I can also explore that route.
> 
> I tested the patch with the reproducer I attached to the bug, plus I also run tiers1-3 in mach5.
> 
> Thanks,
> Patricio

Thank you for taking care about this issue!
Yes, clearing the `JvmtiThreadState` of a virtual thread has to be done
while in transition as it provides a needed synchronization.
This makes it a little bit ugly but I hope it can be simplified again after getting rid of the `rebind_to_jvmti_thread_state_of()` which is still on my TODO list.
Thanks,
Serguei

src/hotspot/share/prims/jvmtiThreadState.cpp line 559:

> 557:   VTMS_unmount_begin(vthread, /* last_unmount */ true);
> 558:   if (thread->jvmti_thread_state() != nullptr) {
> 559:     assert(thread->jvmti_thread_state()->is_virtual(), "wrong JvmtiThreadState");

We agreed with you to temporarily remove this assert as it triggers the bug:
[8308124](https://bugs.openjdk.org/browse/JDK-8308124) dynamic loading of a JVMTI agent has a race with JvmtiThreadState cleanup

A fix of the [8308124](https://bugs.openjdk.org/browse/JDK-8308124) will add this assert back.

-------------

Marked as reviewed by sspitsyn (Reviewer).

PR Review: https://git.openjdk.org/jdk/pull/13949#pullrequestreview-1427539217
PR Review Comment: https://git.openjdk.org/jdk/pull/13949#discussion_r1194485663


More information about the serviceability-dev mailing list