RFR(s): 8220774: Add HandshakeALot diag option
Daniel D. Daugherty
daniel.daugherty at oracle.com
Tue Mar 19 18:25:47 UTC 2019
On 3/19/19 12:48 PM, Robbin Ehn wrote:
> Hi all, please review.
>
> Changeset:
> http://cr.openjdk.java.net/~rehn/8220774/webrev/
src/hotspot/share/runtime/globals.hpp
No comments.
src/hotspot/share/runtime/thread.hpp
L1871: // Debug method asserting thread states
L1872: DEBUG_ONLY(void assert_states();)
The comment doesn't add anything beyond what you can see with
the line of code (except for the "thread" part). I think you
need a better function name and a better comment.
src/hotspot/share/runtime/thread.cpp
L2952: void JavaThread::assert_states() {
L2953: assert((!has_last_Java_frame() && java_call_counter() == 0) ||
L2954: (has_last_Java_frame() && java_call_counter() > 0),
"wrong java_sp info!");
The assert failure string doesn't really match the assertion
unless I missing something here...
Also what's the reason for this assertion?
src/hotspot/share/runtime/vmThread.hpp
No comments.
src/hotspot/share/runtime/vmThread.cpp
L437: class HandshakeALotTC : public ThreadClosure {
<snip>
L439: virtual void do_thread(Thread* thread) {
<snip>
L441: JavaThread* jt = (JavaThread*)thread;
L442: DEBUG_ONLY(jt->assert_states();)
Okay, but why are these thread states important here?
Also L441 could also be DEBUG_ONLY. In fact the whole function
body could be just #ifdef ASSERT ... #endif.
old L437: VM_Operation* VMThread::no_op_safepoint(bool check_time) {
<snip>
old L444: if (check_time) {
old L445: long interval_ms =
SafepointTracing::time_since_last_safepoint_ms();
old L446: bool max_time_exceeded = GuaranteedSafepointInterval
!= 0 &&
old L447: (interval_ms >
GuaranteedSafepointInterval);
old L448: if (!max_time_exceeded) {
old L449: return NULL;
old L450: }
old L451: }
In the existing code base, all callers to no_op_safepoint()
passed 'false' so L444-L451 was dead code. It looks like
before this changeset, we had both 'true' and 'false' callers:
changeset: 53895:b22d8ae270a2
user: rehn
date: Fri Feb 22 14:20:06 2019 +0100
summary: 8219436: Safepoint logs correction and misc
new L446: VM_Operation* VMThread::no_op_safepoint() {
new L447: long interval_ms =
SafepointTracing::time_since_last_safepoint_ms();
new L448: bool max_time_exceeded = GuaranteedSafepointInterval !=
0 &&
new L449: (interval_ms >=
GuaranteedSafepointInterval);
new L450: if (!max_time_exceeded) {
new L451: return NULL;
new L452: }
In the new code, L447-L452 are no longer dead and in fact the
block is now first and is always checked. This means that if
max_time_exceeded == false, we return NULL and there's no
safepoint.
This is a change in behavior for the SafepointALot flag since
this code block used to be first:
old L438: if (SafepointALot) {
old L439: return &safepointALot_op;
old L440: }
So unless I'm missing something SafepointALot now appears to
be broken since it will only happen if max_time_exceeded == true
and that's just a normal safepoint interval.
old L494: if (timedout && (_cur_vm_operation =
VMThread::no_op_safepoint(false)) != NULL) {
old L495: MutexUnlockerEx mul(VMOperationQueue_lock,
old L496: Mutex::_no_safepoint_check_flag);
new L506: MutexUnlockerEx mul(VMOperationQueue_lock,
Mutex::_no_safepoint_check_flag);
new L507: if (timedout && (_cur_vm_operation =
VMThread::no_op_safepoint()) != NULL) {
The old code only unlocked (and relocked) VMOperationQueue_lock
when we knew that we needed to force a safepoint. Now it is always
unlocked (and relocked) on this code path.
old L621: if ((_cur_vm_operation =
VMThread::no_op_safepoint(false)) != NULL) {
new L633: if ((_cur_vm_operation = VMThread::no_op_safepoint())
!= NULL) {
Before the fix for 8219436, this line used to be:
if (VMThread::no_op_safepoint_needed(true)) {
so the fix for 8219436 is what made old L444-L451 into
dead code.
Okay, so I think that this fix breaks the SafepointALot flag.
The fix for 8219436 broke the max_time_exceeded logic and
this fix solves that, but it breaks SafepointALot at the same
time so I think you need to rework things a bit.
Dan
> Issue:
> https://bugs.openjdk.java.net/browse/JDK-8220774
>
> We need an option to generate a lot of handshakes for testing.
> This adds -XX:+HandshakeALot, typical usage:
> -XX:+UnlockDiagnosticVMOptions -XX:GuaranteedSafepointInterval=200
> -XX:+HandshakeALot
> Will generate a extra handshake if there is no safepoint/handshake per
> 200ms.
>
> Thanks, Robbin
>
More information about the hotspot-runtime-dev
mailing list