From david.holmes at oracle.com  Thu Aug  1 06:50:46 2019
From: david.holmes at oracle.com (David Holmes)
Date: Thu, 1 Aug 2019 16:50:46 +1000
Subject: jmx-dev RFR: 8170299: Debugger does not stop inside the low
 memory notifications code
In-Reply-To: <E2BFE392-EF77-4072-9D7C-65C088D4F007@oracle.com>
References: <75B5F778-DC49-494B-AC12-270F301677CA@oracle.com>
 <60639d41-735a-00d3-c9db-1955f581b89a@oracle.com>
 <E2BFE392-EF77-4072-9D7C-65C088D4F007@oracle.com>
Message-ID: <9783ca89-0af8-2167-436a-e5ff2db631a3@oracle.com>

Hi Daniil,

On 25/07/2019 3:34 am, Daniil Titov wrote:
> Hi David,
> 
> Hope you had a great vacation!

I did thank you. Apologies again for taking so long to get back to this 
work.

> Please find below the latest version of the change . The only difference from the version 01 is
> the corrected ordering of include statements as Serguei suggested.
> 
> Webrev: https://cr.openjdk.java.net/~dtitov/8170299/webrev.02/
> Bug: https://bugs.openjdk.java.net/browse/JDK-8170299

I'm still remain concerned about introducing yet-another-thread to the 
system. The potential interactions with other threads is not at all clear.

I'm also concerned that this thread has to be visible so that you can 
debug the notification code, yet at the same time being visible makes it 
vulnerable to application level actions that don't impact the service 
thread - in particular if we suspend all threads then this thread will 
be suspended too, if we resume a thread that triggers a notification, 
the notification thread won't be able to respond to it as it is 
suspended. The user won't know that they need to explicitly resume this 
internal system thread.

Also note in serviceThread.cpp we have:

  129       // This ThreadBlockInVM object is not also considered to be
  130       // suspend-equivalent because ServiceThread is not visible to
  131       // external suspension.
  132
  133       ThreadBlockInVM tbivm(jt);

and you copied that across to notificationThread.cpp as:

   93       // Need state transition ThreadBlockInVM so that this thread
   94       // will be handled by safepoint correctly when this thread is
   95       // notified at a safepoint.
   96
   97       ThreadBlockInVM tbivm(jt);

so this will continue to not be a suspend-equivalent condition even 
though this thread is visible and suspendible! So something seems wrong 
there. I'm unclear why we need to use the ThreadBlockInVM rather than 
defining the NotificationLock as a safepoint-checks-always lock, rather 
than a safepoint-check-never lock? In fact with some recent changes to 
locks I'm not even sure it is legal for the notification thread to use a 
safepoint-check-never lock - have you re-based this recently?

Thanks,
David

> Thanks!
> --Daniil
> 
> ?On 7/3/19, 11:47 PM, "David Holmes" <david.holmes at oracle.com> wrote:
> 
>      Hi Daniil,
>      
>      On 4/07/2019 1:04 pm, Daniil Titov wrote:
>      > Please review the change the fixes the problem with the debugger not stopping in the low memory notification code.
>      >
>      > The problem here is that the ServiceThread that calls these MXBean listeners is hidden from the external view that prevents the debugger from stopping in it.
>      >
>      > The fix introduces new NotificationThread that is visible to the external view and offloads the ServiceThread from sending low memory and other notifications that could result in Java calls ( GC and diagnostic commands notifications) by moving these activities in this new NotificationThread.
>      
>      There is a long and unfortunate history with this bug.
>      
>      The original incarnation of this fix was introducing a new thread at the
>      Java library level, and I had some concerns about that:
>      
>      http://mail.openjdk.java.net/pipermail/serviceability-dev/2017-December/022612.html
>      
>      That effort was resurrected at:
>      
>      http://mail.openjdk.java.net/pipermail/serviceability-dev/2018-July/024466.html
>      
>      and
>      
>      http://mail.openjdk.java.net/pipermail/serviceability-dev/2018-August/024849.html
>      
>      but was left somewhat in limbo. There was a lot of doubt about the right
>      way to fix this bug and whether introducing a new thread was too disruptive.
>      
>      But introducing a new thread in the VM also has the same set of
>      concerns! This needs consideration by the runtime team before going
>      ahead. Introducing a new thread likes this needs to be examined in
>      detail - particularly the synchronization interactions with other
>      threads. It also introduces another monitor designated safepoint-never
>      at a time when we are in the process of cleaning up monitors so that
>      JavaThreads will only use safepoint-check-always monitors.
>      
>      Unfortunately I'm about to head out for two weeks vacation, and a number
>      of other key runtime folk are also on vacation. but I'd ask that you
>      hold off on this until we can look at it in more detail.
>      
>      Thanks,
>      David
>      -----
>      
>      > Testing: Mach5 tier1,tier2 and tier3 tests succeeded.
>      >
>      > Webrev: https://cr.openjdk.java.net/~dtitov/8170299/webrev.01/
>      > Bug: https://bugs.openjdk.java.net/browse/JDK-8170299
>      >
>      > Thanks!
>      > --Daniil
>      >
>      >
>      
> 
> 

From chris.plummer at oracle.com  Fri Aug  2 18:12:17 2019
From: chris.plummer at oracle.com (Chris Plummer)
Date: Fri, 2 Aug 2019 11:12:17 -0700
Subject: jmx-dev RFR: 8170299: Debugger does not stop inside the low
 memory notifications code
In-Reply-To: <9783ca89-0af8-2167-436a-e5ff2db631a3@oracle.com>
References: <75B5F778-DC49-494B-AC12-270F301677CA@oracle.com>
 <60639d41-735a-00d3-c9db-1955f581b89a@oracle.com>
 <E2BFE392-EF77-4072-9D7C-65C088D4F007@oracle.com>
 <9783ca89-0af8-2167-436a-e5ff2db631a3@oracle.com>
Message-ID: <9a805686-57bf-c158-a777-c3cb7e38f09f@oracle.com>

On 7/31/19 11:50 PM, David Holmes wrote:
> Hi Daniil,
>
> On 25/07/2019 3:34 am, Daniil Titov wrote:
>> Hi David,
>>
>> Hope you had a great vacation!
>
> I did thank you. Apologies again for taking so long to get back to 
> this work.
>
>> Please find below the latest version of the change . The only 
>> difference from the version 01 is
>> the corrected ordering of include statements as Serguei suggested.
>>
>> Webrev: https://cr.openjdk.java.net/~dtitov/8170299/webrev.02/
>> Bug: https://bugs.openjdk.java.net/browse/JDK-8170299
>
> I'm still remain concerned about introducing yet-another-thread to the 
> system. The potential interactions with other threads is not at all 
> clear.
>
> I'm also concerned that this thread has to be visible so that you can 
> debug the notification code, yet at the same time being visible makes 
> it vulnerable to application level actions that don't impact the 
> service thread - in particular if we suspend all threads then this 
> thread will be suspended too, if we resume a thread that triggers a 
> notification, the notification thread won't be able to respond to it 
> as it is suspended. The user won't know that they need to explicitly 
> resume this internal system thread.

This is indeed problematic, but it seems less of an issue than running 
java code on the service thread. BTW, are there any other cases where we 
run java code on the service thread? It seems running java code on a 
hidden thread is just asking for trouble. I assume if? you hit a 
breakpoint while doing this, it is simply ignored. Not exactly what the 
debugger user is expecting.

Chris

>
> Also note in serviceThread.cpp we have:
>
> ?129?????? // This ThreadBlockInVM object is not also considered to be
> ?130?????? // suspend-equivalent because ServiceThread is not visible to
> ?131?????? // external suspension.
> ?132
> ?133?????? ThreadBlockInVM tbivm(jt);
>
> and you copied that across to notificationThread.cpp as:
>
> ? 93?????? // Need state transition ThreadBlockInVM so that this thread
> ? 94?????? // will be handled by safepoint correctly when this thread is
> ? 95?????? // notified at a safepoint.
> ? 96
> ? 97?????? ThreadBlockInVM tbivm(jt);
>
> so this will continue to not be a suspend-equivalent condition even 
> though this thread is visible and suspendible! So something seems 
> wrong there. I'm unclear why we need to use the ThreadBlockInVM rather 
> than defining the NotificationLock as a safepoint-checks-always lock, 
> rather than a safepoint-check-never lock? In fact with some recent 
> changes to locks I'm not even sure it is legal for the notification 
> thread to use a safepoint-check-never lock - have you re-based this 
> recently?
>
> Thanks,
> David
>
>> Thanks!
>> --Daniil
>>
>> ?On 7/3/19, 11:47 PM, "David Holmes" <david.holmes at oracle.com> wrote:
>>
>> ???? Hi Daniil,
>> ???? ???? On 4/07/2019 1:04 pm, Daniil Titov wrote:
>> ???? > Please review the change the fixes the problem with the 
>> debugger not stopping in the low memory notification code.
>> ???? >
>> ???? > The problem here is that the ServiceThread that calls these 
>> MXBean listeners is hidden from the external view that prevents the 
>> debugger from stopping in it.
>> ???? >
>> ???? > The fix introduces new NotificationThread that is visible to 
>> the external view and offloads the ServiceThread from sending low 
>> memory and other notifications that could result in Java calls ( GC 
>> and diagnostic commands notifications) by moving these activities in 
>> this new NotificationThread.
>> ???? ???? There is a long and unfortunate history with this bug.
>> ???? ???? The original incarnation of this fix was introducing a new 
>> thread at the
>> ???? Java library level, and I had some concerns about that:
>> http://mail.openjdk.java.net/pipermail/serviceability-dev/2017-December/022612.html
>> ???? ???? That effort was resurrected at:
>> http://mail.openjdk.java.net/pipermail/serviceability-dev/2018-July/024466.html
>> ???? ???? and
>> http://mail.openjdk.java.net/pipermail/serviceability-dev/2018-August/024849.html
>> ???? ???? but was left somewhat in limbo. There was a lot of doubt 
>> about the right
>> ???? way to fix this bug and whether introducing a new thread was too 
>> disruptive.
>> ???? ???? But introducing a new thread in the VM also has the same 
>> set of
>> ???? concerns! This needs consideration by the runtime team before going
>> ???? ahead. Introducing a new thread likes this needs to be examined in
>> ???? detail - particularly the synchronization interactions with other
>> ???? threads. It also introduces another monitor designated 
>> safepoint-never
>> ???? at a time when we are in the process of cleaning up monitors so 
>> that
>> ???? JavaThreads will only use safepoint-check-always monitors.
>> ???? ???? Unfortunately I'm about to head out for two weeks vacation, 
>> and a number
>> ???? of other key runtime folk are also on vacation. but I'd ask that 
>> you
>> ???? hold off on this until we can look at it in more detail.
>> ???? ???? Thanks,
>> ???? David
>> ???? -----
>> ???? ???? > Testing: Mach5 tier1,tier2 and tier3 tests succeeded.
>> ???? >
>> ???? > Webrev: https://cr.openjdk.java.net/~dtitov/8170299/webrev.01/
>> ???? > Bug: https://bugs.openjdk.java.net/browse/JDK-8170299
>> ???? >
>> ???? > Thanks!
>> ???? > --Daniil
>> ???? >
>> ???? >
>>
>>


From daniil.x.titov at oracle.com  Fri Aug  2 22:16:05 2019
From: daniil.x.titov at oracle.com (Daniil Titov)
Date: Fri, 02 Aug 2019 15:16:05 -0700
Subject: jmx-dev RFR: 8185005: Improve performance of
 ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
In-Reply-To: <7e0ba39e-e5b7-f56b-66ea-820a0a35ec2c@oracle.com>
References: <4C4212D0-BFFF-4C85-ACC6-05200F220C3F@oracle.com>
 <2d6dede1-aa79-99ce-a823-773fa2e19827@oracle.com>
 <6E7B043A-4647-4931-977C-1854CA7EBEC1@oracle.com>
 <bc9fbb03-6591-16e0-6411-d80cd8f56877@oracle.com>
 <76BCC96D-DB5D-409A-95D5-3A64B893832D@oracle.com>
 <7e0ba39e-e5b7-f56b-66ea-820a0a35ec2c@oracle.com>
Message-ID: <87748188-3BD4-4A8B-938A-89DBC8F3C57A@oracle.com>

Hi David,

Thank you for your detailed review. Please review a new version of the fix that includes
the changes you suggested:
- ThreadTableCreate_lock scope is reduced to cover the creation of the table only;
- ThreadTableCreate_lock is made _safepoint_check_always;
- ServiceThread is no longer responsible for the resizing of the thread table, instead,
  the thread table is changed to grow on demand by the thread that is doing the addition;
- fixed nits and formatting issues.

>> The change also includes additional optimization for some callers of find_JavaThread_from_java_tid()
>>  as Daniel suggested.
> Not sure it's best to combine these, but if they are limited to the
> changes in management.cpp only then that may be okay.

The additional optimization for some callers of find_JavaThread_from_java_tid() is 
limited to management.cpp (plus a new test) so I left them in the webrev  but 
I also could move it in the separate issue if required.

  > src/hotspot/share/runtime/threadSMR.cpp
  >755     jlong tid = SharedRuntime::get_java_tid(thread);
  > 926     jlong tid = SharedRuntime::get_java_tid(thread); 
 >  I think it cleaner/better to just use
 > jlong tid = java_lang_Thread::thread_id(thread->threadObj());
 > as we know thread is not NULL, it is a JavaThread and it has to have a
 > non-null threadObj.

I had to leave this code unchanged since it turned out the threadObj is null 
when VM is destroyed:
  
V  [libjvm.so+0xe165d7]  oopDesc::long_field(int) const+0x67
V  [libjvm.so+0x16e06c6]  ThreadsSMRSupport::add_thread(JavaThread*)+0x116
V  [libjvm.so+0x16d1302]  Threads::add(JavaThread*, bool)+0x82
V  [libjvm.so+0xef8369]  attach_current_thread.part.197+0xc9
V  [libjvm.so+0xec136c]  jni_DestroyJavaVM+0x6c
C  [libjli.so+0x4333]  JavaMain+0x2c3
C  [libjli.so+0x8159]  ThreadJavaMain+0x9
 
> src/hotspot/share/services/threadTable.cpp
> 71     static uintx get_hash(Value const& value, bool* is_dead) {

> The is_dead parameter still bothers me here. I can't make enough sense
> out of the template code in ConcurrentHashtable to see why we have to
> have it, but I'm concerned that its very existence means we perhaps
> should not be trying to extend CHT in this context. ??

My understanding is that is_dead parameter provides a mechanism for
ConcurrentHashtable to remove stale entries that were not explicitly 
removed by calling  ConcurrentHashTable::remove() method. 
I think that just because in our case we don't use this mechanism doesn't 
mean we should not use ConcurrentHashTable.

> I would still want to see what impact this has on thread
> startup cost, both with and without the table being initialized.

I run a test that initializes the table by calling ThreadMXBean.get getThreadInfo(),
starts some threads as a worm-up, and then creates and starts 100,000 threads
(each thread just sleeps for 100 ms). In case when the thread table is enabled 
100,000 threads are created and started  for about 15200 ms. If the thread table 
is off the test takes about 14800 ms. Based on this information the enabled 
thread table makes the thread startup about 2.7% slower.

Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.04/ 
Bug: https://bugs.openjdk.java.net/browse/JDK-8185005 

Thanks!
--Daniil


?On 7/29/19, 12:53 AM, "David Holmes" <david.holmes at oracle.com> wrote:

    Hi Daniil,
    
    Overall I think this is a reasonable approach but I would still like to 
    see some performance and footprint numbers, both to verify it fixes the 
    problem reported, and that we are not getting penalized elsewhere.
    
    On 25/07/2019 3:21 am, Daniil Titov wrote:
    > Hi David, Daniel, and Serguei,
    > 
    > Please review the new version of the fix, that makes the thread table initialization on demand and
    > moves it inside ThreadsList::find_JavaThread_from_java_tid(). At the creation time the thread table
    >   is initialized with the threads from the current thread list. We don't want to hold Threads_lock
    > inside find_JavaThread_from_java_tid(),  thus new threads still could be created  while the thread
    > table is being initialized . Such threads will be found by the linear search and added to the thread table
    > later, in ThreadsList::find_JavaThread_from_java_tid().
    
    The initialization allows the created but unpopulated, or partially 
    populated, table to be seen by other threads - is that your intention? 
    It seems it should be okay as the other threads will then race with the 
    initializing thread to add specific entries, and this is a concurrent 
    map so that should be functionally correct. But if so then I think you 
    can also reduce the scope of the ThreadTableCreate_lock so that it 
    covers creation of the table only, not the initial population of the table.
    
    I like the approach of only initializing the table when needed and using 
    that to control when the add/remove-thread code needs to update the 
    table. But I would still want to see what impact this has on thread 
    startup cost, both with and without the table being initialized.
    
    > The change also includes additional optimization for some callers of find_JavaThread_from_java_tid()
    > as Daniel suggested.
    
    Not sure it's best to combine these, but if they are limited to the 
    changes in management.cpp only then that may be okay. It helps to be 
    able to focus on the table related changes without being distracted by 
    other optimizations.
    
    > That is correct that ResolvedMethodTable was used as a blueprint for the thread table, however, I tried
    > to strip it of the all functionality that is not required in the thread table case.
    
    The revised version seems better in that regard. But I still have a 
    concern, see below.
    
    > We need to have the thread table resizable and allow it to grow as the number of threads increases to avoid
    > reserving excessive memory a-priori or deteriorating lookup times. The ServiceThread is responsible for
    > growing the thread table when required.
    
    Yes but why? Why can't this table be grown on demand by the thread that 
    is doing the addition? For other tables we may have to delegate to the 
    service thread because the current thread cannot perform the action, or 
    it doesn't want to perform it at the time the need for the resize is 
    detected (e.g. its detected at a safepoint and you want the resize to 
    happen later outside the safepoint). It's not apparent to me that such 
    restrictions apply here.
    
    > There is no ConcurrentHashTable available in Java 8 and for backporting this fix to Java 8 another implementation
    > of the hash table, probably originally suggested in the patch attached to the JBS issue, should be used.  It will make
    > the backporting more complicated,  however, adding a new Implementation of the hash table in Java 14 while it
    > already has ConcurrentHashTable doesn't seem  reasonable for me.
    
    Ok.
    
    > Webrev: http://cr.openjdk.java.net/~dtitov/8185005/webrev.03
    
    Some specific code comments:
    
    src/hotspot/share/runtime/mutexLocker.cpp
    
    +   def(ThreadTableCreate_lock       , PaddedMutex  , special, 
    false, Monitor::_safepoint_check_never);
    
    I think this needs to be a _safepoint_check_always lock. The table will 
    be created by regular JavaThreads and they should (nearly) always be 
    checking for safepoints if they are going to block acquiring the lock. 
    And it isn't at all obvious that the thread doing the creation can't go 
    to a safepoint whilst this lock is held.
    
    ---
    
    src/hotspot/share/runtime/threadSMR.cpp
    
    Nit:
    
      618       JavaThread* thread = thread_at(i);
    
    you could reuse the new java_thread local you introduced at line 613 and 
    just rename that "new" variable to "thread" so you don't have to change 
    all other uses.
    
    628   } else if (java_thread != NULL && ...
    
    You don't need to check != NULL here as you only get here when 
    java_thread is not NULL.
    
      755     jlong tid = SharedRuntime::get_java_tid(thread);
      926     jlong tid = SharedRuntime::get_java_tid(thread);
    
    I think it cleaner/better to just use
    
    jlong tid = java_lang_Thread::thread_id(thread->threadObj());
    
    as we know thread is not NULL, it is a JavaThread and it has to have a 
    non-null threadObj.
    
    ---
    
    src/hotspot/share/services/management.cpp
    
    1323         if (THREAD->is_Java_thread()) {
    1324           JavaThread* current_thread = (JavaThread*)THREAD;
    
    These calls can only be made on a JavaThread so this be simplified to 
    remove the is_Java_thread() call. Similarly in other places.
    
    ---
    
    src/hotspot/share/services/threadTable.cpp
    
       55 class ThreadTableEntry : public CHeapObj<mtInternal> {
       56   private:
       57     jlong _tid;
    
    I believe hotspot style is to not indent the access modifiers in C++ 
    class declarations, so the above would just be:
    
       55 class ThreadTableEntry : public CHeapObj<mtInternal> {
       56 private:
       57   jlong _tid;
    
    etc.
    
      60     ThreadTableEntry(jlong tid, JavaThread* java_thread) :
      61     _tid(tid),_java_thread(java_thread) {}
    
    line 61 should be indented as it continues line 60.
    
       67 class ThreadTableConfig : public AllStatic {
       ...
       71     static uintx get_hash(Value const& value, bool* is_dead) {
    
    The is_dead parameter still bothers me here. I can't make enough sense 
    out of the template code in ConcurrentHashtable to see why we have to 
    have it, but I'm concerned that its very existence means we perhaps 
    should not be trying to extend CHT in this context. ??
    
      115   size_t start_size_log = size_log > DefaultThreadTableSizeLog
      116   ? size_log : DefaultThreadTableSizeLog;
    
    line 116 should be indented, though in this case I think a better layout 
    would be:
    
      115   size_t start_size_log =
      116       size_log > DefaultThreadTableSizeLog ? size_log : 
    DefaultThreadTableSizeLog;
    
      131 double ThreadTable::get_load_factor() {
      132   return (double)_items_count/_current_size;
      133 }
    
    Not sure that is doing what you want/expect. It will perform integer 
    division and then cast that whole integer to a double. If you want 
    double arithmetic you need:
    
    return ((double)_items_count)/_current_size;
    
    180     jlong          _tid;
    181     uintx         _hash;
    
    Nit: no need for all those spaces before the variable name.
    
      183     ThreadTableLookup(jlong tid)
      184     : _tid(tid), _hash(primitive_hash(tid)) {}
    
    line 184 should be indented.
    
    201     ThreadGet():_return(NULL) {}
    
    Nit: need space after :
    
      211    assert(_is_initialized, "Thread table is not initialized");
      212   _has_work = false;
    
    line 211 is indented one space too far.
    
    229     ThreadTableEntry* entry = new ThreadTableEntry(tid,java_thread);
    
    Nit: need space after ,
    
    252   return _local_table->remove(thread,lookup);
    
    Nit: need space after ,
    
    Thanks,
    David
    ------
    
    > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
    > 
    > Thanks!
    > --Daniil
    > 
    > 
    > ?On 7/8/19, 3:24 PM, "Daniel D. Daugherty" <daniel.daugherty at oracle.com> wrote:
    > 
    >      On 6/29/19 12:06 PM, Daniil Titov wrote:
    >      > Hi Serguei and David,
    >      >
    >      > Serguei is right, ThreadTable::find_thread(java_tid) cannot  return a JavaThread with an unmatched java_tid.
    >      >
    >      > Please find a new version of the fix that includes the changes Serguei suggested.
    >      >
    >      > Regarding the concern about the maintaining the thread table when it may never even be queried, one of
    >      > the options could be to add ThreadTable ::isEnabled flag, set it to "false" by default, and wrap the calls to the thread table
    >      > in ThreadsSMRSupport add_thread() and remove_thread() methods to check this flag.
    >      >
    >      > When ThreadsList::find_JavaThread_from_java_tid() is called for the first time it could check if ThreadTable ::isEnabled
    >      > Is on and if not then set it on and populate the thread table with all existing threads from the thread list.
    >      
    >      I have the same concerns as David H. about this new ThreadTable.
    >      ThreadsList::find_JavaThread_from_java_tid() is only called from code
    >      in src/hotspot/share/services/management.cpp so I think that table
    >      needs to enabled and populated only if it is going to be used.
    >      
    >      I've taken a look at the webrev below and I see that David has
    >      followed up with additional comments. Before I do a crawl through
    >      code review for this, I would like to see the ThreadTable stuff
    >      made optional and David's other comments addressed.
    >      
    >      Another possible optimization is for callers of
    >      find_JavaThread_from_java_tid() to save the calling thread's
    >      tid value before they loop and if the current tid == saved_tid
    >      then use the current JavaThread* instead of calling
    >      find_JavaThread_from_java_tid() to get the JavaThread*.
    >      
    >      Dan
    >      
    >      >
    >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.02/
    >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
    >      >
    >      > Thanks!
    >      > --Daniil
    >      >
    >      > From: <serguei.spitsyn at oracle.com>
    >      > Organization: Oracle Corporation
    >      > Date: Friday, June 28, 2019 at 7:56 PM
    >      > To: Daniil Titov <daniil.x.titov at oracle.com>, OpenJDK Serviceability <serviceability-dev at openjdk.java.net>, "hotspot-runtime-dev at openjdk.java.net" <hotspot-runtime-dev at openjdk.java.net>, "jmx-dev at openjdk.java.net" <jmx-dev at openjdk.java.net>
    >      > Subject: Re: RFR: 8185005: Improve performance of ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
    >      >
    >      > Hi Daniil,
    >      >
    >      > I have several quick comments.
    >      >
    >      > The indent in the hotspot c/c++ files has to be 2, not 4.
    >      >
    >      > https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/src/hotspot/share/runtime/threadSMR.cpp.frames.html
    >      > 614 JavaThread* ThreadsList::find_JavaThread_from_java_tid(jlong java_tid) const {
    >      >   615     JavaThread* java_thread = ThreadTable::find_thread(java_tid);
    >      >   616     if (java_thread == NULL && java_tid == PMIMORDIAL_JAVA_TID) {
    >      >   617         // ThreadsSMRSupport::add_thread() is not called for the primordial
    >      >   618         // thread. Thus, we find this thread with a linear search and add it
    >      >   619         // to the thread table.
    >      >   620         for (uint i = 0; i < length(); i++) {
    >      >   621             JavaThread* thread = thread_at(i);
    >      >   622             if (is_valid_java_thread(java_tid,thread)) {
    >      >   623                 ThreadTable::add_thread(java_tid, thread);
    >      >   624                 return thread;
    >      >   625             }
    >      >   626         }
    >      >   627     } else if (java_thread != NULL && is_valid_java_thread(java_tid, java_thread)) {
    >      >   628         return java_thread;
    >      >   629     }
    >      >   630     return NULL;
    >      >   631 }
    >      >   632 bool ThreadsList::is_valid_java_thread(jlong java_tid, JavaThread* java_thread) {
    >      >   633     oop tobj = java_thread->threadObj();
    >      >   634     // Ignore the thread if it hasn't run yet, has exited
    >      >   635     // or is starting to exit.
    >      >   636     return (tobj != NULL && !java_thread->is_exiting() &&
    >      >   637             java_tid == java_lang_Thread::thread_id(tobj));
    >      >   638 }
    >      >
    >      >   615     JavaThread* java_thread = ThreadTable::find_thread(java_tid);
    >      >
    >      >    I'd suggest to rename find_thread() to find_thread_by_tid().
    >      >
    >      > A space is missed after the comma:
    >      >    622 if (is_valid_java_thread(java_tid,thread)) {
    >      >
    >      > An empty line is needed before L632.
    >      >
    >      > The name 'is_valid_java_thread' looks wrong (or confusing) to me.
    >      > Something like 'is_alive_java_thread_with_tid()' would be better.
    >      > It'd better to list parameters in the opposite order.
    >      >
    >      > The call to is_valid_java_thread() is confusing:
    >      >     627 } else if (java_thread != NULL && is_valid_java_thread(java_tid, java_thread)) {
    >      >
    >      > Why would the call ThreadTable::find_thread(java_tid) return a JavaThread with an unmatched java_tid?
    >      >
    >      >
    >      > Thanks,
    >      > Serguei
    >      >
    >      > On 6/28/19, 9:40 PM, "David Holmes" <david.holmes at oracle.com> wrote:
    >      >
    >      >      Hi Daniil,
    >      >
    >      >      The definition and use of this hashtable (yet another hashtable
    >      >      implementation!) will need careful examination. We have to be concerned
    >      >      about the cost of maintaining it when it may never even be queried. You
    >      >      would need to look at footprint cost and performance impact.
    >      >
    >      >      Unfortunately I'm just about to board a plane and will be out for the
    >      >      next few days. I will try to look at this asap next week, but we will
    >      >      need a lot more data on it.
    >      >
    >      >      Thanks,
    >      >      David
    >      >
    >      > On 6/28/19 3:31 PM, Daniil Titov wrote:
    >      > Please review the change that improves performance of ThreadMXBean MXBean methods returning the
    >      > information for specific threads. The change introduces the thread table that uses ConcurrentHashTable
    >      > to store one-to-one the mapping between the thread ids and JavaThread objects and replaces the linear
    >      > search over the thread list in ThreadsList::find_JavaThread_from_java_tid(jlong tid) method with the lookup
    >      > in the thread table.
    >      >
    >      > Testing: Mach5 tier1,tier2 and tier3 tests successfully passed.
    >      >
    >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/
    >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
    >      >
    >      > Thanks!
    >      >
    >      > Best regards,
    >      > Daniil
    >      >
    >      >
    >      >
    >      >
    >      >
    >      >
    >      >
    >      
    >      
    > 
    > 
    

From david.holmes at oracle.com  Mon Aug  5 01:34:34 2019
From: david.holmes at oracle.com (David Holmes)
Date: Mon, 5 Aug 2019 11:34:34 +1000
Subject: jmx-dev RFR: 8170299: Debugger does not stop inside the low
 memory notifications code
In-Reply-To: <9a805686-57bf-c158-a777-c3cb7e38f09f@oracle.com>
References: <75B5F778-DC49-494B-AC12-270F301677CA@oracle.com>
 <60639d41-735a-00d3-c9db-1955f581b89a@oracle.com>
 <E2BFE392-EF77-4072-9D7C-65C088D4F007@oracle.com>
 <9783ca89-0af8-2167-436a-e5ff2db631a3@oracle.com>
 <9a805686-57bf-c158-a777-c3cb7e38f09f@oracle.com>
Message-ID: <194bd23a-0f16-19a9-a3e7-d02fa6d58369@oracle.com>

Hi Chris,

On 3/08/2019 4:12 am, Chris Plummer wrote:
> On 7/31/19 11:50 PM, David Holmes wrote:
>> Hi Daniil,
>>
>> On 25/07/2019 3:34 am, Daniil Titov wrote:
>>> Hi David,
>>>
>>> Hope you had a great vacation!
>>
>> I did thank you. Apologies again for taking so long to get back to 
>> this work.
>>
>>> Please find below the latest version of the change . The only 
>>> difference from the version 01 is
>>> the corrected ordering of include statements as Serguei suggested.
>>>
>>> Webrev: https://cr.openjdk.java.net/~dtitov/8170299/webrev.02/
>>> Bug: https://bugs.openjdk.java.net/browse/JDK-8170299
>>
>> I'm still remain concerned about introducing yet-another-thread to the 
>> system. The potential interactions with other threads is not at all 
>> clear.
>>
>> I'm also concerned that this thread has to be visible so that you can 
>> debug the notification code, yet at the same time being visible makes 
>> it vulnerable to application level actions that don't impact the 
>> service thread - in particular if we suspend all threads then this 
>> thread will be suspended too, if we resume a thread that triggers a 
>> notification, the notification thread won't be able to respond to it 
>> as it is suspended. The user won't know that they need to explicitly 
>> resume this internal system thread.
> 
> This is indeed problematic, but it seems less of an issue than running 
> java code on the service thread. BTW, are there any other cases where we 
> run java code on the service thread? It seems running java code on a 
> hidden thread is just asking for trouble. I assume if? you hit a 
> breakpoint while doing this, it is simply ignored. Not exactly what the 
> debugger user is expecting.

The ServiceThread was introduced as a generalization of the 
LowMemoryDetectorThread, specifically because it was needed to run Java 
code (in particular load Java classes) in response to other event 
notifications (ie compiler load events). See JDK-6766644 fixed in JDK 7.

So it has always been the case that the low memory notification code has 
executed in a hidden thread; and since JDK-6766644 other Java code has 
executed in this same hidden thread.

As I said at the start this bug has a long and complex history. It is 
far from clear that it can, or even should, be fixed, due to the other 
compatibility problems that the fix will introduce.

If you look at the docs for MemoryPoolMXBean:

https://docs.oracle.com/javase/10/docs/api/java/lang/management/MemoryPoolMXBean.html

it recommends doing minimal work in the actual notification code and 
instead hand off the real work to another thread:

"The handleNotification method should be designed to do a very minimal 
amount of work and return without delay to avoid causing delay in 
delivering subsequent notifications. Time-consuming actions should be 
performed by a separate thread."

with that approach you don't need to be able to debug the notification 
code executed by the service-thread because it should be trivial.

I would agree that the threading model for the notification system is 
under-specified and that these kinds of details should have been 
considered originally and any limitations clearly spelt out. Perhaps all 
we should do here is improve the documentation?

Either way any change in doc or behaviour will require a CSR request.

Cheers,
David
-----

> Chris
> 
>>
>> Also note in serviceThread.cpp we have:
>>
>> ?129?????? // This ThreadBlockInVM object is not also considered to be
>> ?130?????? // suspend-equivalent because ServiceThread is not visible to
>> ?131?????? // external suspension.
>> ?132
>> ?133?????? ThreadBlockInVM tbivm(jt);
>>
>> and you copied that across to notificationThread.cpp as:
>>
>> ? 93?????? // Need state transition ThreadBlockInVM so that this thread
>> ? 94?????? // will be handled by safepoint correctly when this thread is
>> ? 95?????? // notified at a safepoint.
>> ? 96
>> ? 97?????? ThreadBlockInVM tbivm(jt);
>>
>> so this will continue to not be a suspend-equivalent condition even 
>> though this thread is visible and suspendible! So something seems 
>> wrong there. I'm unclear why we need to use the ThreadBlockInVM rather 
>> than defining the NotificationLock as a safepoint-checks-always lock, 
>> rather than a safepoint-check-never lock? In fact with some recent 
>> changes to locks I'm not even sure it is legal for the notification 
>> thread to use a safepoint-check-never lock - have you re-based this 
>> recently?
>>
>> Thanks,
>> David
>>
>>> Thanks!
>>> --Daniil
>>>
>>> ?On 7/3/19, 11:47 PM, "David Holmes" <david.holmes at oracle.com> wrote:
>>>
>>> ???? Hi Daniil,
>>> ???? ???? On 4/07/2019 1:04 pm, Daniil Titov wrote:
>>> ???? > Please review the change the fixes the problem with the 
>>> debugger not stopping in the low memory notification code.
>>> ???? >
>>> ???? > The problem here is that the ServiceThread that calls these 
>>> MXBean listeners is hidden from the external view that prevents the 
>>> debugger from stopping in it.
>>> ???? >
>>> ???? > The fix introduces new NotificationThread that is visible to 
>>> the external view and offloads the ServiceThread from sending low 
>>> memory and other notifications that could result in Java calls ( GC 
>>> and diagnostic commands notifications) by moving these activities in 
>>> this new NotificationThread.
>>> ???? ???? There is a long and unfortunate history with this bug.
>>> ???? ???? The original incarnation of this fix was introducing a new 
>>> thread at the
>>> ???? Java library level, and I had some concerns about that:
>>> http://mail.openjdk.java.net/pipermail/serviceability-dev/2017-December/022612.html 
>>>
>>> ???? ???? That effort was resurrected at:
>>> http://mail.openjdk.java.net/pipermail/serviceability-dev/2018-July/024466.html 
>>>
>>> ???? ???? and
>>> http://mail.openjdk.java.net/pipermail/serviceability-dev/2018-August/024849.html 
>>>
>>> ???? ???? but was left somewhat in limbo. There was a lot of doubt 
>>> about the right
>>> ???? way to fix this bug and whether introducing a new thread was too 
>>> disruptive.
>>> ???? ???? But introducing a new thread in the VM also has the same 
>>> set of
>>> ???? concerns! This needs consideration by the runtime team before going
>>> ???? ahead. Introducing a new thread likes this needs to be examined in
>>> ???? detail - particularly the synchronization interactions with other
>>> ???? threads. It also introduces another monitor designated 
>>> safepoint-never
>>> ???? at a time when we are in the process of cleaning up monitors so 
>>> that
>>> ???? JavaThreads will only use safepoint-check-always monitors.
>>> ???? ???? Unfortunately I'm about to head out for two weeks vacation, 
>>> and a number
>>> ???? of other key runtime folk are also on vacation. but I'd ask that 
>>> you
>>> ???? hold off on this until we can look at it in more detail.
>>> ???? ???? Thanks,
>>> ???? David
>>> ???? -----
>>> ???? ???? > Testing: Mach5 tier1,tier2 and tier3 tests succeeded.
>>> ???? >
>>> ???? > Webrev: https://cr.openjdk.java.net/~dtitov/8170299/webrev.01/
>>> ???? > Bug: https://bugs.openjdk.java.net/browse/JDK-8170299
>>> ???? >
>>> ???? > Thanks!
>>> ???? > --Daniil
>>> ???? >
>>> ???? >
>>>
>>>
> 
> 

From david.holmes at oracle.com  Mon Aug  5 02:54:07 2019
From: david.holmes at oracle.com (David Holmes)
Date: Mon, 5 Aug 2019 12:54:07 +1000
Subject: jmx-dev RFR: 8185005: Improve performance of
 ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
In-Reply-To: <87748188-3BD4-4A8B-938A-89DBC8F3C57A@oracle.com>
References: <4C4212D0-BFFF-4C85-ACC6-05200F220C3F@oracle.com>
 <2d6dede1-aa79-99ce-a823-773fa2e19827@oracle.com>
 <6E7B043A-4647-4931-977C-1854CA7EBEC1@oracle.com>
 <bc9fbb03-6591-16e0-6411-d80cd8f56877@oracle.com>
 <76BCC96D-DB5D-409A-95D5-3A64B893832D@oracle.com>
 <7e0ba39e-e5b7-f56b-66ea-820a0a35ec2c@oracle.com>
 <87748188-3BD4-4A8B-938A-89DBC8F3C57A@oracle.com>
Message-ID: <acfdf23e-4d6f-aa2b-3711-1da17a7f5f2c@oracle.com>

Hi Daniil,

On 3/08/2019 8:16 am, Daniil Titov wrote:
> Hi David,
> 
> Thank you for your detailed review. Please review a new version of the fix that includes
> the changes you suggested:
> - ThreadTableCreate_lock scope is reduced to cover the creation of the table only;
> - ThreadTableCreate_lock is made _safepoint_check_always;

Okay.

> - ServiceThread is no longer responsible for the resizing of the thread table, instead,
>    the thread table is changed to grow on demand by the thread that is doing the addition;

Okay - I'm happy to get the serviceThread out of the picture here.

> - fixed nits and formatting issues.

Okay.

>>> The change also includes additional optimization for some callers of find_JavaThread_from_java_tid()
>>>   as Daniel suggested.
>> Not sure it's best to combine these, but if they are limited to the
>> changes in management.cpp only then that may be okay.
> 
> The additional optimization for some callers of find_JavaThread_from_java_tid() is
> limited to management.cpp (plus a new test) so I left them in the webrev  but
> I also could move it in the separate issue if required.

I'd prefer this part of be separated out, but won't insist. Let's see if 
Dan or Serguei have a strong opinion.

>    > src/hotspot/share/runtime/threadSMR.cpp
>    >755     jlong tid = SharedRuntime::get_java_tid(thread);
>    > 926     jlong tid = SharedRuntime::get_java_tid(thread);
>   >  I think it cleaner/better to just use
>   > jlong tid = java_lang_Thread::thread_id(thread->threadObj());
>   > as we know thread is not NULL, it is a JavaThread and it has to have a
>   > non-null threadObj.
> 
> I had to leave this code unchanged since it turned out the threadObj is null
> when VM is destroyed:
>    
> V  [libjvm.so+0xe165d7]  oopDesc::long_field(int) const+0x67
> V  [libjvm.so+0x16e06c6]  ThreadsSMRSupport::add_thread(JavaThread*)+0x116
> V  [libjvm.so+0x16d1302]  Threads::add(JavaThread*, bool)+0x82
> V  [libjvm.so+0xef8369]  attach_current_thread.part.197+0xc9
> V  [libjvm.so+0xec136c]  jni_DestroyJavaVM+0x6c
> C  [libjli.so+0x4333]  JavaMain+0x2c3
> C  [libjli.so+0x8159]  ThreadJavaMain+0x9

This is actually nothing to do with the VM being destroyed, but is an 
issue with JNI_AttachCurrentThread and its interaction with the 
ThreadSMR iterators. The attach process is:
- create JavaThread
- mark as "is attaching via jni"
- add to ThreadsList
- create java.lang.Thread object (you can only execute Java code after 
you are attached)
- mark as "attach completed"

So while a thread "is attaching" it will be seen by the ThreadSMR thread 
iterator but will have a NULL java.lang.Thread object.

We special-case attaching threads in a number of places in the VM and I 
think we should be explicitly doing something here to filter out 
attaching threads, rather than just being tolerant of a NULL j.l.Thread 
object. Specifically in ThreadsSMRSupport::add_thread:

if (ThreadTable::is_initialized() && !thread->is_attaching_via_jni()) {
   jlong tid = java_lang_Thread::thread_id(thread->threadObj());
   ThreadTable::add_thread(tid, thread);
}

Note that in ThreadsSMRSupport::remove_thread we can use the same guard, 
which covers the case the JNI attach encountered an error trying to 
create the j.l.Thread object.

>> src/hotspot/share/services/threadTable.cpp
>> 71     static uintx get_hash(Value const& value, bool* is_dead) {
> 
>> The is_dead parameter still bothers me here. I can't make enough sense
>> out of the template code in ConcurrentHashtable to see why we have to
>> have it, but I'm concerned that its very existence means we perhaps
>> should not be trying to extend CHT in this context. ??
> 
> My understanding is that is_dead parameter provides a mechanism for
> ConcurrentHashtable to remove stale entries that were not explicitly
> removed by calling  ConcurrentHashTable::remove() method.
> I think that just because in our case we don't use this mechanism doesn't
> mean we should not use ConcurrentHashTable.

Can you confirm that this usage is okay with Robbin Ehn please. He's 
back from vacation this week.

>> I would still want to see what impact this has on thread
>> startup cost, both with and without the table being initialized.
> 
> I run a test that initializes the table by calling ThreadMXBean.get getThreadInfo(),
> starts some threads as a worm-up, and then creates and starts 100,000 threads
> (each thread just sleeps for 100 ms). In case when the thread table is enabled
> 100,000 threads are created and started  for about 15200 ms. If the thread table
> is off the test takes about 14800 ms. Based on this information the enabled
> thread table makes the thread startup about 2.7% slower.

That doesn't sound very good. I think we may need to Claes involved to 
help investigate overall performance impact here.

> Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.04/
> Bug: https://bugs.openjdk.java.net/browse/JDK-8185005

No further code comments.

I didn't look at the test in detail.

Thanks,
David

> Thanks!
> --Daniil
> 
> 
> ?On 7/29/19, 12:53 AM, "David Holmes" <david.holmes at oracle.com> wrote:
> 
>      Hi Daniil,
>      
>      Overall I think this is a reasonable approach but I would still like to
>      see some performance and footprint numbers, both to verify it fixes the
>      problem reported, and that we are not getting penalized elsewhere.
>      
>      On 25/07/2019 3:21 am, Daniil Titov wrote:
>      > Hi David, Daniel, and Serguei,
>      >
>      > Please review the new version of the fix, that makes the thread table initialization on demand and
>      > moves it inside ThreadsList::find_JavaThread_from_java_tid(). At the creation time the thread table
>      >   is initialized with the threads from the current thread list. We don't want to hold Threads_lock
>      > inside find_JavaThread_from_java_tid(),  thus new threads still could be created  while the thread
>      > table is being initialized . Such threads will be found by the linear search and added to the thread table
>      > later, in ThreadsList::find_JavaThread_from_java_tid().
>      
>      The initialization allows the created but unpopulated, or partially
>      populated, table to be seen by other threads - is that your intention?
>      It seems it should be okay as the other threads will then race with the
>      initializing thread to add specific entries, and this is a concurrent
>      map so that should be functionally correct. But if so then I think you
>      can also reduce the scope of the ThreadTableCreate_lock so that it
>      covers creation of the table only, not the initial population of the table.
>      
>      I like the approach of only initializing the table when needed and using
>      that to control when the add/remove-thread code needs to update the
>      table. But I would still want to see what impact this has on thread
>      startup cost, both with and without the table being initialized.
>      
>      > The change also includes additional optimization for some callers of find_JavaThread_from_java_tid()
>      > as Daniel suggested.
>      
>      Not sure it's best to combine these, but if they are limited to the
>      changes in management.cpp only then that may be okay. It helps to be
>      able to focus on the table related changes without being distracted by
>      other optimizations.
>      
>      > That is correct that ResolvedMethodTable was used as a blueprint for the thread table, however, I tried
>      > to strip it of the all functionality that is not required in the thread table case.
>      
>      The revised version seems better in that regard. But I still have a
>      concern, see below.
>      
>      > We need to have the thread table resizable and allow it to grow as the number of threads increases to avoid
>      > reserving excessive memory a-priori or deteriorating lookup times. The ServiceThread is responsible for
>      > growing the thread table when required.
>      
>      Yes but why? Why can't this table be grown on demand by the thread that
>      is doing the addition? For other tables we may have to delegate to the
>      service thread because the current thread cannot perform the action, or
>      it doesn't want to perform it at the time the need for the resize is
>      detected (e.g. its detected at a safepoint and you want the resize to
>      happen later outside the safepoint). It's not apparent to me that such
>      restrictions apply here.
>      
>      > There is no ConcurrentHashTable available in Java 8 and for backporting this fix to Java 8 another implementation
>      > of the hash table, probably originally suggested in the patch attached to the JBS issue, should be used.  It will make
>      > the backporting more complicated,  however, adding a new Implementation of the hash table in Java 14 while it
>      > already has ConcurrentHashTable doesn't seem  reasonable for me.
>      
>      Ok.
>      
>      > Webrev: http://cr.openjdk.java.net/~dtitov/8185005/webrev.03
>      
>      Some specific code comments:
>      
>      src/hotspot/share/runtime/mutexLocker.cpp
>      
>      +   def(ThreadTableCreate_lock       , PaddedMutex  , special,
>      false, Monitor::_safepoint_check_never);
>      
>      I think this needs to be a _safepoint_check_always lock. The table will
>      be created by regular JavaThreads and they should (nearly) always be
>      checking for safepoints if they are going to block acquiring the lock.
>      And it isn't at all obvious that the thread doing the creation can't go
>      to a safepoint whilst this lock is held.
>      
>      ---
>      
>      src/hotspot/share/runtime/threadSMR.cpp
>      
>      Nit:
>      
>        618       JavaThread* thread = thread_at(i);
>      
>      you could reuse the new java_thread local you introduced at line 613 and
>      just rename that "new" variable to "thread" so you don't have to change
>      all other uses.
>      
>      628   } else if (java_thread != NULL && ...
>      
>      You don't need to check != NULL here as you only get here when
>      java_thread is not NULL.
>      
>        755     jlong tid = SharedRuntime::get_java_tid(thread);
>        926     jlong tid = SharedRuntime::get_java_tid(thread);
>      
>      I think it cleaner/better to just use
>      
>      jlong tid = java_lang_Thread::thread_id(thread->threadObj());
>      
>      as we know thread is not NULL, it is a JavaThread and it has to have a
>      non-null threadObj.
>      
>      ---
>      
>      src/hotspot/share/services/management.cpp
>      
>      1323         if (THREAD->is_Java_thread()) {
>      1324           JavaThread* current_thread = (JavaThread*)THREAD;
>      
>      These calls can only be made on a JavaThread so this be simplified to
>      remove the is_Java_thread() call. Similarly in other places.
>      
>      ---
>      
>      src/hotspot/share/services/threadTable.cpp
>      
>         55 class ThreadTableEntry : public CHeapObj<mtInternal> {
>         56   private:
>         57     jlong _tid;
>      
>      I believe hotspot style is to not indent the access modifiers in C++
>      class declarations, so the above would just be:
>      
>         55 class ThreadTableEntry : public CHeapObj<mtInternal> {
>         56 private:
>         57   jlong _tid;
>      
>      etc.
>      
>        60     ThreadTableEntry(jlong tid, JavaThread* java_thread) :
>        61     _tid(tid),_java_thread(java_thread) {}
>      
>      line 61 should be indented as it continues line 60.
>      
>         67 class ThreadTableConfig : public AllStatic {
>         ...
>         71     static uintx get_hash(Value const& value, bool* is_dead) {
>      
>      The is_dead parameter still bothers me here. I can't make enough sense
>      out of the template code in ConcurrentHashtable to see why we have to
>      have it, but I'm concerned that its very existence means we perhaps
>      should not be trying to extend CHT in this context. ??
>      
>        115   size_t start_size_log = size_log > DefaultThreadTableSizeLog
>        116   ? size_log : DefaultThreadTableSizeLog;
>      
>      line 116 should be indented, though in this case I think a better layout
>      would be:
>      
>        115   size_t start_size_log =
>        116       size_log > DefaultThreadTableSizeLog ? size_log :
>      DefaultThreadTableSizeLog;
>      
>        131 double ThreadTable::get_load_factor() {
>        132   return (double)_items_count/_current_size;
>        133 }
>      
>      Not sure that is doing what you want/expect. It will perform integer
>      division and then cast that whole integer to a double. If you want
>      double arithmetic you need:
>      
>      return ((double)_items_count)/_current_size;
>      
>      180     jlong          _tid;
>      181     uintx         _hash;
>      
>      Nit: no need for all those spaces before the variable name.
>      
>        183     ThreadTableLookup(jlong tid)
>        184     : _tid(tid), _hash(primitive_hash(tid)) {}
>      
>      line 184 should be indented.
>      
>      201     ThreadGet():_return(NULL) {}
>      
>      Nit: need space after :
>      
>        211    assert(_is_initialized, "Thread table is not initialized");
>        212   _has_work = false;
>      
>      line 211 is indented one space too far.
>      
>      229     ThreadTableEntry* entry = new ThreadTableEntry(tid,java_thread);
>      
>      Nit: need space after ,
>      
>      252   return _local_table->remove(thread,lookup);
>      
>      Nit: need space after ,
>      
>      Thanks,
>      David
>      ------
>      
>      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>      >
>      > Thanks!
>      > --Daniil
>      >
>      >
>      > ?On 7/8/19, 3:24 PM, "Daniel D. Daugherty" <daniel.daugherty at oracle.com> wrote:
>      >
>      >      On 6/29/19 12:06 PM, Daniil Titov wrote:
>      >      > Hi Serguei and David,
>      >      >
>      >      > Serguei is right, ThreadTable::find_thread(java_tid) cannot  return a JavaThread with an unmatched java_tid.
>      >      >
>      >      > Please find a new version of the fix that includes the changes Serguei suggested.
>      >      >
>      >      > Regarding the concern about the maintaining the thread table when it may never even be queried, one of
>      >      > the options could be to add ThreadTable ::isEnabled flag, set it to "false" by default, and wrap the calls to the thread table
>      >      > in ThreadsSMRSupport add_thread() and remove_thread() methods to check this flag.
>      >      >
>      >      > When ThreadsList::find_JavaThread_from_java_tid() is called for the first time it could check if ThreadTable ::isEnabled
>      >      > Is on and if not then set it on and populate the thread table with all existing threads from the thread list.
>      >
>      >      I have the same concerns as David H. about this new ThreadTable.
>      >      ThreadsList::find_JavaThread_from_java_tid() is only called from code
>      >      in src/hotspot/share/services/management.cpp so I think that table
>      >      needs to enabled and populated only if it is going to be used.
>      >
>      >      I've taken a look at the webrev below and I see that David has
>      >      followed up with additional comments. Before I do a crawl through
>      >      code review for this, I would like to see the ThreadTable stuff
>      >      made optional and David's other comments addressed.
>      >
>      >      Another possible optimization is for callers of
>      >      find_JavaThread_from_java_tid() to save the calling thread's
>      >      tid value before they loop and if the current tid == saved_tid
>      >      then use the current JavaThread* instead of calling
>      >      find_JavaThread_from_java_tid() to get the JavaThread*.
>      >
>      >      Dan
>      >
>      >      >
>      >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.02/
>      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>      >      >
>      >      > Thanks!
>      >      > --Daniil
>      >      >
>      >      > From: <serguei.spitsyn at oracle.com>
>      >      > Organization: Oracle Corporation
>      >      > Date: Friday, June 28, 2019 at 7:56 PM
>      >      > To: Daniil Titov <daniil.x.titov at oracle.com>, OpenJDK Serviceability <serviceability-dev at openjdk.java.net>, "hotspot-runtime-dev at openjdk.java.net" <hotspot-runtime-dev at openjdk.java.net>, "jmx-dev at openjdk.java.net" <jmx-dev at openjdk.java.net>
>      >      > Subject: Re: RFR: 8185005: Improve performance of ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
>      >      >
>      >      > Hi Daniil,
>      >      >
>      >      > I have several quick comments.
>      >      >
>      >      > The indent in the hotspot c/c++ files has to be 2, not 4.
>      >      >
>      >      > https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/src/hotspot/share/runtime/threadSMR.cpp.frames.html
>      >      > 614 JavaThread* ThreadsList::find_JavaThread_from_java_tid(jlong java_tid) const {
>      >      >   615     JavaThread* java_thread = ThreadTable::find_thread(java_tid);
>      >      >   616     if (java_thread == NULL && java_tid == PMIMORDIAL_JAVA_TID) {
>      >      >   617         // ThreadsSMRSupport::add_thread() is not called for the primordial
>      >      >   618         // thread. Thus, we find this thread with a linear search and add it
>      >      >   619         // to the thread table.
>      >      >   620         for (uint i = 0; i < length(); i++) {
>      >      >   621             JavaThread* thread = thread_at(i);
>      >      >   622             if (is_valid_java_thread(java_tid,thread)) {
>      >      >   623                 ThreadTable::add_thread(java_tid, thread);
>      >      >   624                 return thread;
>      >      >   625             }
>      >      >   626         }
>      >      >   627     } else if (java_thread != NULL && is_valid_java_thread(java_tid, java_thread)) {
>      >      >   628         return java_thread;
>      >      >   629     }
>      >      >   630     return NULL;
>      >      >   631 }
>      >      >   632 bool ThreadsList::is_valid_java_thread(jlong java_tid, JavaThread* java_thread) {
>      >      >   633     oop tobj = java_thread->threadObj();
>      >      >   634     // Ignore the thread if it hasn't run yet, has exited
>      >      >   635     // or is starting to exit.
>      >      >   636     return (tobj != NULL && !java_thread->is_exiting() &&
>      >      >   637             java_tid == java_lang_Thread::thread_id(tobj));
>      >      >   638 }
>      >      >
>      >      >   615     JavaThread* java_thread = ThreadTable::find_thread(java_tid);
>      >      >
>      >      >    I'd suggest to rename find_thread() to find_thread_by_tid().
>      >      >
>      >      > A space is missed after the comma:
>      >      >    622 if (is_valid_java_thread(java_tid,thread)) {
>      >      >
>      >      > An empty line is needed before L632.
>      >      >
>      >      > The name 'is_valid_java_thread' looks wrong (or confusing) to me.
>      >      > Something like 'is_alive_java_thread_with_tid()' would be better.
>      >      > It'd better to list parameters in the opposite order.
>      >      >
>      >      > The call to is_valid_java_thread() is confusing:
>      >      >     627 } else if (java_thread != NULL && is_valid_java_thread(java_tid, java_thread)) {
>      >      >
>      >      > Why would the call ThreadTable::find_thread(java_tid) return a JavaThread with an unmatched java_tid?
>      >      >
>      >      >
>      >      > Thanks,
>      >      > Serguei
>      >      >
>      >      > On 6/28/19, 9:40 PM, "David Holmes" <david.holmes at oracle.com> wrote:
>      >      >
>      >      >      Hi Daniil,
>      >      >
>      >      >      The definition and use of this hashtable (yet another hashtable
>      >      >      implementation!) will need careful examination. We have to be concerned
>      >      >      about the cost of maintaining it when it may never even be queried. You
>      >      >      would need to look at footprint cost and performance impact.
>      >      >
>      >      >      Unfortunately I'm just about to board a plane and will be out for the
>      >      >      next few days. I will try to look at this asap next week, but we will
>      >      >      need a lot more data on it.
>      >      >
>      >      >      Thanks,
>      >      >      David
>      >      >
>      >      > On 6/28/19 3:31 PM, Daniil Titov wrote:
>      >      > Please review the change that improves performance of ThreadMXBean MXBean methods returning the
>      >      > information for specific threads. The change introduces the thread table that uses ConcurrentHashTable
>      >      > to store one-to-one the mapping between the thread ids and JavaThread objects and replaces the linear
>      >      > search over the thread list in ThreadsList::find_JavaThread_from_java_tid(jlong tid) method with the lookup
>      >      > in the thread table.
>      >      >
>      >      > Testing: Mach5 tier1,tier2 and tier3 tests successfully passed.
>      >      >
>      >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/
>      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>      >      >
>      >      > Thanks!
>      >      >
>      >      > Best regards,
>      >      > Daniil
>      >      >
>      >      >
>      >      >
>      >      >
>      >      >
>      >      >
>      >      >
>      >
>      >
>      >
>      >
>      
> 
> 

From robbin.ehn at oracle.com  Wed Aug  7 10:16:44 2019
From: robbin.ehn at oracle.com (Robbin Ehn)
Date: Wed, 7 Aug 2019 12:16:44 +0200
Subject: jmx-dev RFR: 8185005: Improve performance of
 ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
In-Reply-To: <87748188-3BD4-4A8B-938A-89DBC8F3C57A@oracle.com>
References: <4C4212D0-BFFF-4C85-ACC6-05200F220C3F@oracle.com>
 <2d6dede1-aa79-99ce-a823-773fa2e19827@oracle.com>
 <6E7B043A-4647-4931-977C-1854CA7EBEC1@oracle.com>
 <bc9fbb03-6591-16e0-6411-d80cd8f56877@oracle.com>
 <76BCC96D-DB5D-409A-95D5-3A64B893832D@oracle.com>
 <7e0ba39e-e5b7-f56b-66ea-820a0a35ec2c@oracle.com>
 <87748188-3BD4-4A8B-938A-89DBC8F3C57A@oracle.com>
Message-ID: <30776902-2164-9654-3c08-0656d30d495c@oracle.com>

Hi Daniil,

On 8/3/19 12:16 AM, Daniil Titov wrote:
>> The is_dead parameter still bothers me here. I can't make enough sense
>> out of the template code in ConcurrentHashtable to see why we have to
>> have it, but I'm concerned that its very existence means we perhaps
>> should not be trying to extend CHT in this context. ??
> 
> My understanding is that is_dead parameter provides a mechanism for
> ConcurrentHashtable to remove stale entries that were not explicitly
> removed by calling  ConcurrentHashTable::remove() method.
> I think that just because in our case we don't use this mechanism doesn't
> mean we should not use ConcurrentHashTable.

is_dead is an optimization for cleaning on inserts.
When we have indirect to oops as values, they can be dead when loading them.
But feeding back this information we reduce the chance of having to do a table
scan to remove does dead entries. Should be refactored.
CHT set it default to false, if you don't touch it it will just be false and
unused, thus compiler can remove that code.

> 
>> I would still want to see what impact this has on thread
>> startup cost, both with and without the table being initialized.
> 
> I run a test that initializes the table by calling ThreadMXBean.get getThreadInfo(),
> starts some threads as a worm-up, and then creates and starts 100,000 threads
> (each thread just sleeps for 100 ms). In case when the thread table is enabled
> 100,000 threads are created and started  for about 15200 ms. If the thread table
> is off the test takes about 14800 ms. Based on this information the enabled
> thread table makes the thread startup about 2.7% slower.
> 
> Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.04/

Seems good!

Thanks, Robbin

> Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
> 
> Thanks!
> --Daniil
> 
> 
> ?On 7/29/19, 12:53 AM, "David Holmes" <david.holmes at oracle.com> wrote:
> 
>      Hi Daniil,
>      
>      Overall I think this is a reasonable approach but I would still like to
>      see some performance and footprint numbers, both to verify it fixes the
>      problem reported, and that we are not getting penalized elsewhere.
>      
>      On 25/07/2019 3:21 am, Daniil Titov wrote:
>      > Hi David, Daniel, and Serguei,
>      >
>      > Please review the new version of the fix, that makes the thread table initialization on demand and
>      > moves it inside ThreadsList::find_JavaThread_from_java_tid(). At the creation time the thread table
>      >   is initialized with the threads from the current thread list. We don't want to hold Threads_lock
>      > inside find_JavaThread_from_java_tid(),  thus new threads still could be created  while the thread
>      > table is being initialized . Such threads will be found by the linear search and added to the thread table
>      > later, in ThreadsList::find_JavaThread_from_java_tid().
>      
>      The initialization allows the created but unpopulated, or partially
>      populated, table to be seen by other threads - is that your intention?
>      It seems it should be okay as the other threads will then race with the
>      initializing thread to add specific entries, and this is a concurrent
>      map so that should be functionally correct. But if so then I think you
>      can also reduce the scope of the ThreadTableCreate_lock so that it
>      covers creation of the table only, not the initial population of the table.
>      
>      I like the approach of only initializing the table when needed and using
>      that to control when the add/remove-thread code needs to update the
>      table. But I would still want to see what impact this has on thread
>      startup cost, both with and without the table being initialized.
>      
>      > The change also includes additional optimization for some callers of find_JavaThread_from_java_tid()
>      > as Daniel suggested.
>      
>      Not sure it's best to combine these, but if they are limited to the
>      changes in management.cpp only then that may be okay. It helps to be
>      able to focus on the table related changes without being distracted by
>      other optimizations.
>      
>      > That is correct that ResolvedMethodTable was used as a blueprint for the thread table, however, I tried
>      > to strip it of the all functionality that is not required in the thread table case.
>      
>      The revised version seems better in that regard. But I still have a
>      concern, see below.
>      
>      > We need to have the thread table resizable and allow it to grow as the number of threads increases to avoid
>      > reserving excessive memory a-priori or deteriorating lookup times. The ServiceThread is responsible for
>      > growing the thread table when required.
>      
>      Yes but why? Why can't this table be grown on demand by the thread that
>      is doing the addition? For other tables we may have to delegate to the
>      service thread because the current thread cannot perform the action, or
>      it doesn't want to perform it at the time the need for the resize is
>      detected (e.g. its detected at a safepoint and you want the resize to
>      happen later outside the safepoint). It's not apparent to me that such
>      restrictions apply here.
>      
>      > There is no ConcurrentHashTable available in Java 8 and for backporting this fix to Java 8 another implementation
>      > of the hash table, probably originally suggested in the patch attached to the JBS issue, should be used.  It will make
>      > the backporting more complicated,  however, adding a new Implementation of the hash table in Java 14 while it
>      > already has ConcurrentHashTable doesn't seem  reasonable for me.
>      
>      Ok.
>      
>      > Webrev: http://cr.openjdk.java.net/~dtitov/8185005/webrev.03
>      
>      Some specific code comments:
>      
>      src/hotspot/share/runtime/mutexLocker.cpp
>      
>      +   def(ThreadTableCreate_lock       , PaddedMutex  , special,
>      false, Monitor::_safepoint_check_never);
>      
>      I think this needs to be a _safepoint_check_always lock. The table will
>      be created by regular JavaThreads and they should (nearly) always be
>      checking for safepoints if they are going to block acquiring the lock.
>      And it isn't at all obvious that the thread doing the creation can't go
>      to a safepoint whilst this lock is held.
>      
>      ---
>      
>      src/hotspot/share/runtime/threadSMR.cpp
>      
>      Nit:
>      
>        618       JavaThread* thread = thread_at(i);
>      
>      you could reuse the new java_thread local you introduced at line 613 and
>      just rename that "new" variable to "thread" so you don't have to change
>      all other uses.
>      
>      628   } else if (java_thread != NULL && ...
>      
>      You don't need to check != NULL here as you only get here when
>      java_thread is not NULL.
>      
>        755     jlong tid = SharedRuntime::get_java_tid(thread);
>        926     jlong tid = SharedRuntime::get_java_tid(thread);
>      
>      I think it cleaner/better to just use
>      
>      jlong tid = java_lang_Thread::thread_id(thread->threadObj());
>      
>      as we know thread is not NULL, it is a JavaThread and it has to have a
>      non-null threadObj.
>      
>      ---
>      
>      src/hotspot/share/services/management.cpp
>      
>      1323         if (THREAD->is_Java_thread()) {
>      1324           JavaThread* current_thread = (JavaThread*)THREAD;
>      
>      These calls can only be made on a JavaThread so this be simplified to
>      remove the is_Java_thread() call. Similarly in other places.
>      
>      ---
>      
>      src/hotspot/share/services/threadTable.cpp
>      
>         55 class ThreadTableEntry : public CHeapObj<mtInternal> {
>         56   private:
>         57     jlong _tid;
>      
>      I believe hotspot style is to not indent the access modifiers in C++
>      class declarations, so the above would just be:
>      
>         55 class ThreadTableEntry : public CHeapObj<mtInternal> {
>         56 private:
>         57   jlong _tid;
>      
>      etc.
>      
>        60     ThreadTableEntry(jlong tid, JavaThread* java_thread) :
>        61     _tid(tid),_java_thread(java_thread) {}
>      
>      line 61 should be indented as it continues line 60.
>      
>         67 class ThreadTableConfig : public AllStatic {
>         ...
>         71     static uintx get_hash(Value const& value, bool* is_dead) {
>      
>      The is_dead parameter still bothers me here. I can't make enough sense
>      out of the template code in ConcurrentHashtable to see why we have to
>      have it, but I'm concerned that its very existence means we perhaps
>      should not be trying to extend CHT in this context. ??
>      
>        115   size_t start_size_log = size_log > DefaultThreadTableSizeLog
>        116   ? size_log : DefaultThreadTableSizeLog;
>      
>      line 116 should be indented, though in this case I think a better layout
>      would be:
>      
>        115   size_t start_size_log =
>        116       size_log > DefaultThreadTableSizeLog ? size_log :
>      DefaultThreadTableSizeLog;
>      
>        131 double ThreadTable::get_load_factor() {
>        132   return (double)_items_count/_current_size;
>        133 }
>      
>      Not sure that is doing what you want/expect. It will perform integer
>      division and then cast that whole integer to a double. If you want
>      double arithmetic you need:
>      
>      return ((double)_items_count)/_current_size;
>      
>      180     jlong          _tid;
>      181     uintx         _hash;
>      
>      Nit: no need for all those spaces before the variable name.
>      
>        183     ThreadTableLookup(jlong tid)
>        184     : _tid(tid), _hash(primitive_hash(tid)) {}
>      
>      line 184 should be indented.
>      
>      201     ThreadGet():_return(NULL) {}
>      
>      Nit: need space after :
>      
>        211    assert(_is_initialized, "Thread table is not initialized");
>        212   _has_work = false;
>      
>      line 211 is indented one space too far.
>      
>      229     ThreadTableEntry* entry = new ThreadTableEntry(tid,java_thread);
>      
>      Nit: need space after ,
>      
>      252   return _local_table->remove(thread,lookup);
>      
>      Nit: need space after ,
>      
>      Thanks,
>      David
>      ------
>      
>      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>      >
>      > Thanks!
>      > --Daniil
>      >
>      >
>      > ?On 7/8/19, 3:24 PM, "Daniel D. Daugherty" <daniel.daugherty at oracle.com> wrote:
>      >
>      >      On 6/29/19 12:06 PM, Daniil Titov wrote:
>      >      > Hi Serguei and David,
>      >      >
>      >      > Serguei is right, ThreadTable::find_thread(java_tid) cannot  return a JavaThread with an unmatched java_tid.
>      >      >
>      >      > Please find a new version of the fix that includes the changes Serguei suggested.
>      >      >
>      >      > Regarding the concern about the maintaining the thread table when it may never even be queried, one of
>      >      > the options could be to add ThreadTable ::isEnabled flag, set it to "false" by default, and wrap the calls to the thread table
>      >      > in ThreadsSMRSupport add_thread() and remove_thread() methods to check this flag.
>      >      >
>      >      > When ThreadsList::find_JavaThread_from_java_tid() is called for the first time it could check if ThreadTable ::isEnabled
>      >      > Is on and if not then set it on and populate the thread table with all existing threads from the thread list.
>      >
>      >      I have the same concerns as David H. about this new ThreadTable.
>      >      ThreadsList::find_JavaThread_from_java_tid() is only called from code
>      >      in src/hotspot/share/services/management.cpp so I think that table
>      >      needs to enabled and populated only if it is going to be used.
>      >
>      >      I've taken a look at the webrev below and I see that David has
>      >      followed up with additional comments. Before I do a crawl through
>      >      code review for this, I would like to see the ThreadTable stuff
>      >      made optional and David's other comments addressed.
>      >
>      >      Another possible optimization is for callers of
>      >      find_JavaThread_from_java_tid() to save the calling thread's
>      >      tid value before they loop and if the current tid == saved_tid
>      >      then use the current JavaThread* instead of calling
>      >      find_JavaThread_from_java_tid() to get the JavaThread*.
>      >
>      >      Dan
>      >
>      >      >
>      >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.02/
>      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>      >      >
>      >      > Thanks!
>      >      > --Daniil
>      >      >
>      >      > From: <serguei.spitsyn at oracle.com>
>      >      > Organization: Oracle Corporation
>      >      > Date: Friday, June 28, 2019 at 7:56 PM
>      >      > To: Daniil Titov <daniil.x.titov at oracle.com>, OpenJDK Serviceability <serviceability-dev at openjdk.java.net>, "hotspot-runtime-dev at openjdk.java.net" <hotspot-runtime-dev at openjdk.java.net>, "jmx-dev at openjdk.java.net" <jmx-dev at openjdk.java.net>
>      >      > Subject: Re: RFR: 8185005: Improve performance of ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
>      >      >
>      >      > Hi Daniil,
>      >      >
>      >      > I have several quick comments.
>      >      >
>      >      > The indent in the hotspot c/c++ files has to be 2, not 4.
>      >      >
>      >      > https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/src/hotspot/share/runtime/threadSMR.cpp.frames.html
>      >      > 614 JavaThread* ThreadsList::find_JavaThread_from_java_tid(jlong java_tid) const {
>      >      >   615     JavaThread* java_thread = ThreadTable::find_thread(java_tid);
>      >      >   616     if (java_thread == NULL && java_tid == PMIMORDIAL_JAVA_TID) {
>      >      >   617         // ThreadsSMRSupport::add_thread() is not called for the primordial
>      >      >   618         // thread. Thus, we find this thread with a linear search and add it
>      >      >   619         // to the thread table.
>      >      >   620         for (uint i = 0; i < length(); i++) {
>      >      >   621             JavaThread* thread = thread_at(i);
>      >      >   622             if (is_valid_java_thread(java_tid,thread)) {
>      >      >   623                 ThreadTable::add_thread(java_tid, thread);
>      >      >   624                 return thread;
>      >      >   625             }
>      >      >   626         }
>      >      >   627     } else if (java_thread != NULL && is_valid_java_thread(java_tid, java_thread)) {
>      >      >   628         return java_thread;
>      >      >   629     }
>      >      >   630     return NULL;
>      >      >   631 }
>      >      >   632 bool ThreadsList::is_valid_java_thread(jlong java_tid, JavaThread* java_thread) {
>      >      >   633     oop tobj = java_thread->threadObj();
>      >      >   634     // Ignore the thread if it hasn't run yet, has exited
>      >      >   635     // or is starting to exit.
>      >      >   636     return (tobj != NULL && !java_thread->is_exiting() &&
>      >      >   637             java_tid == java_lang_Thread::thread_id(tobj));
>      >      >   638 }
>      >      >
>      >      >   615     JavaThread* java_thread = ThreadTable::find_thread(java_tid);
>      >      >
>      >      >    I'd suggest to rename find_thread() to find_thread_by_tid().
>      >      >
>      >      > A space is missed after the comma:
>      >      >    622 if (is_valid_java_thread(java_tid,thread)) {
>      >      >
>      >      > An empty line is needed before L632.
>      >      >
>      >      > The name 'is_valid_java_thread' looks wrong (or confusing) to me.
>      >      > Something like 'is_alive_java_thread_with_tid()' would be better.
>      >      > It'd better to list parameters in the opposite order.
>      >      >
>      >      > The call to is_valid_java_thread() is confusing:
>      >      >     627 } else if (java_thread != NULL && is_valid_java_thread(java_tid, java_thread)) {
>      >      >
>      >      > Why would the call ThreadTable::find_thread(java_tid) return a JavaThread with an unmatched java_tid?
>      >      >
>      >      >
>      >      > Thanks,
>      >      > Serguei
>      >      >
>      >      > On 6/28/19, 9:40 PM, "David Holmes" <david.holmes at oracle.com> wrote:
>      >      >
>      >      >      Hi Daniil,
>      >      >
>      >      >      The definition and use of this hashtable (yet another hashtable
>      >      >      implementation!) will need careful examination. We have to be concerned
>      >      >      about the cost of maintaining it when it may never even be queried. You
>      >      >      would need to look at footprint cost and performance impact.
>      >      >
>      >      >      Unfortunately I'm just about to board a plane and will be out for the
>      >      >      next few days. I will try to look at this asap next week, but we will
>      >      >      need a lot more data on it.
>      >      >
>      >      >      Thanks,
>      >      >      David
>      >      >
>      >      > On 6/28/19 3:31 PM, Daniil Titov wrote:
>      >      > Please review the change that improves performance of ThreadMXBean MXBean methods returning the
>      >      > information for specific threads. The change introduces the thread table that uses ConcurrentHashTable
>      >      > to store one-to-one the mapping between the thread ids and JavaThread objects and replaces the linear
>      >      > search over the thread list in ThreadsList::find_JavaThread_from_java_tid(jlong tid) method with the lookup
>      >      > in the thread table.
>      >      >
>      >      > Testing: Mach5 tier1,tier2 and tier3 tests successfully passed.
>      >      >
>      >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/
>      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>      >      >
>      >      > Thanks!
>      >      >
>      >      > Best regards,
>      >      > Daniil
>      >      >
>      >      >
>      >      >
>      >      >
>      >      >
>      >      >
>      >      >
>      >
>      >
>      >
>      >
>      
> 
> 

From daniil.x.titov at oracle.com  Mon Aug 12 03:49:57 2019
From: daniil.x.titov at oracle.com (Daniil Titov)
Date: Sun, 11 Aug 2019 20:49:57 -0700
Subject: jmx-dev RFR: 8185005: Improve performance of
 ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
In-Reply-To: <acfdf23e-4d6f-aa2b-3711-1da17a7f5f2c@oracle.com>
References: <4C4212D0-BFFF-4C85-ACC6-05200F220C3F@oracle.com>
 <2d6dede1-aa79-99ce-a823-773fa2e19827@oracle.com>
 <6E7B043A-4647-4931-977C-1854CA7EBEC1@oracle.com>
 <bc9fbb03-6591-16e0-6411-d80cd8f56877@oracle.com>
 <76BCC96D-DB5D-409A-95D5-3A64B893832D@oracle.com>
 <7e0ba39e-e5b7-f56b-66ea-820a0a35ec2c@oracle.com>
 <87748188-3BD4-4A8B-938A-89DBC8F3C57A@oracle.com>
 <acfdf23e-4d6f-aa2b-3711-1da17a7f5f2c@oracle.com>
Message-ID: <A6311A8C-738A-4928-90C0-A11CB8B9487D@oracle.com>

Hi David, Robbin, Daniel, and Serguei,

Please review a new version of the fix.

As David suggested I created a separated Jira issue [1] to cover  additional optimization for 
some callers of find_JavaThread_from_java_tid() and this version of the fix no longer includes
changes in management.cpp ( and the test related with these changes).

Regarding the impact the previous version of the fix had on the thread startup time at heavy load (e.g. 
when 5000 threads are created and destroyed every second) I tried a different approach that makes
calls to ThreadTable::add_thread  and ThreadTable::remove_thread  asynchronous and offloads the 
work for actual modifications of the thread table to a periodic task that runs every 5 seconds. With the
same  stress test scenario (the  test does some warm-up and then measures the time it takes to create
and start 100,000 threads; every  thread just sleeps  for 100 ms) the impact on the thread startup time
was reduced to 1.2% ( from 2.7%).

The cause of this impact in this stress test scenario is that as soon as the thread table is initialized,
an additional work to insert  and delete entries in the thread table should be performed, even if
com.sun.management.ThreadMXBean methods are no longer called. For example, In the stress test 
mentioned above, every second about 5000 entries had to be inserted in the table and then deleted.

That doesn't look right and the new version of the fix uses the different approach: the thread is added to
the thread table only when this thread is requested by com.sun.management.ThreadMXBean bean. Every 
time when find_JavaThread_from_java_tid() is called for a new tid, the thread  is found by the iterating over
the thread list and added to the thread table. All consequent calls to find_JavaThread_from_java_tid() for 
the same tid returns the thread from the thread table. 

Running stress test for the cases when the thread table is enabled and not showed no difference in the
average thread startup times.

[1] : https://bugs.openjdk.java.net/browse/JDK-8229391

Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.05/ 

Thanks,
Daniil

?On 8/4/19, 7:54 PM, "David Holmes" <david.holmes at oracle.com> wrote:

    Hi Daniil,
    
    On 3/08/2019 8:16 am, Daniil Titov wrote:
    > Hi David,
    > 
    > Thank you for your detailed review. Please review a new version of the fix that includes
    > the changes you suggested:
    > - ThreadTableCreate_lock scope is reduced to cover the creation of the table only;
    > - ThreadTableCreate_lock is made _safepoint_check_always;
    
    Okay.
    
    > - ServiceThread is no longer responsible for the resizing of the thread table, instead,
    >    the thread table is changed to grow on demand by the thread that is doing the addition;
    
    Okay - I'm happy to get the serviceThread out of the picture here.
    
    > - fixed nits and formatting issues.
    
    Okay.
    
    >>> The change also includes additional optimization for some callers of find_JavaThread_from_java_tid()
    >>>   as Daniel suggested.
    >> Not sure it's best to combine these, but if they are limited to the
    >> changes in management.cpp only then that may be okay.
    > 
    > The additional optimization for some callers of find_JavaThread_from_java_tid() is
    > limited to management.cpp (plus a new test) so I left them in the webrev  but
    > I also could move it in the separate issue if required.
    
    I'd prefer this part of be separated out, but won't insist. Let's see if 
    Dan or Serguei have a strong opinion.
    
    >    > src/hotspot/share/runtime/threadSMR.cpp
    >    >755     jlong tid = SharedRuntime::get_java_tid(thread);
    >    > 926     jlong tid = SharedRuntime::get_java_tid(thread);
    >   >  I think it cleaner/better to just use
    >   > jlong tid = java_lang_Thread::thread_id(thread->threadObj());
    >   > as we know thread is not NULL, it is a JavaThread and it has to have a
    >   > non-null threadObj.
    > 
    > I had to leave this code unchanged since it turned out the threadObj is null
    > when VM is destroyed:
    >    
    > V  [libjvm.so+0xe165d7]  oopDesc::long_field(int) const+0x67
    > V  [libjvm.so+0x16e06c6]  ThreadsSMRSupport::add_thread(JavaThread*)+0x116
    > V  [libjvm.so+0x16d1302]  Threads::add(JavaThread*, bool)+0x82
    > V  [libjvm.so+0xef8369]  attach_current_thread.part.197+0xc9
    > V  [libjvm.so+0xec136c]  jni_DestroyJavaVM+0x6c
    > C  [libjli.so+0x4333]  JavaMain+0x2c3
    > C  [libjli.so+0x8159]  ThreadJavaMain+0x9
    
    This is actually nothing to do with the VM being destroyed, but is an 
    issue with JNI_AttachCurrentThread and its interaction with the 
    ThreadSMR iterators. The attach process is:
    - create JavaThread
    - mark as "is attaching via jni"
    - add to ThreadsList
    - create java.lang.Thread object (you can only execute Java code after 
    you are attached)
    - mark as "attach completed"
    
    So while a thread "is attaching" it will be seen by the ThreadSMR thread 
    iterator but will have a NULL java.lang.Thread object.
    
    We special-case attaching threads in a number of places in the VM and I 
    think we should be explicitly doing something here to filter out 
    attaching threads, rather than just being tolerant of a NULL j.l.Thread 
    object. Specifically in ThreadsSMRSupport::add_thread:
    
    if (ThreadTable::is_initialized() && !thread->is_attaching_via_jni()) {
       jlong tid = java_lang_Thread::thread_id(thread->threadObj());
       ThreadTable::add_thread(tid, thread);
    }
    
    Note that in ThreadsSMRSupport::remove_thread we can use the same guard, 
    which covers the case the JNI attach encountered an error trying to 
    create the j.l.Thread object.
    
    >> src/hotspot/share/services/threadTable.cpp
    >> 71     static uintx get_hash(Value const& value, bool* is_dead) {
    > 
    >> The is_dead parameter still bothers me here. I can't make enough sense
    >> out of the template code in ConcurrentHashtable to see why we have to
    >> have it, but I'm concerned that its very existence means we perhaps
    >> should not be trying to extend CHT in this context. ??
    > 
    > My understanding is that is_dead parameter provides a mechanism for
    > ConcurrentHashtable to remove stale entries that were not explicitly
    > removed by calling  ConcurrentHashTable::remove() method.
    > I think that just because in our case we don't use this mechanism doesn't
    > mean we should not use ConcurrentHashTable.
    
    Can you confirm that this usage is okay with Robbin Ehn please. He's 
    back from vacation this week.
    
    >> I would still want to see what impact this has on thread
    >> startup cost, both with and without the table being initialized.
    > 
    > I run a test that initializes the table by calling ThreadMXBean.get getThreadInfo(),
    > starts some threads as a worm-up, and then creates and starts 100,000 threads
    > (each thread just sleeps for 100 ms). In case when the thread table is enabled
    > 100,000 threads are created and started  for about 15200 ms. If the thread table
    > is off the test takes about 14800 ms. Based on this information the enabled
    > thread table makes the thread startup about 2.7% slower.
    
    That doesn't sound very good. I think we may need to Claes involved to 
    help investigate overall performance impact here.
    
    > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.04/
    > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
    
    No further code comments.
    
    I didn't look at the test in detail.
    
    Thanks,
    David
    
    > Thanks!
    > --Daniil
    > 
    > 
    > ?On 7/29/19, 12:53 AM, "David Holmes" <david.holmes at oracle.com> wrote:
    > 
    >      Hi Daniil,
    >      
    >      Overall I think this is a reasonable approach but I would still like to
    >      see some performance and footprint numbers, both to verify it fixes the
    >      problem reported, and that we are not getting penalized elsewhere.
    >      
    >      On 25/07/2019 3:21 am, Daniil Titov wrote:
    >      > Hi David, Daniel, and Serguei,
    >      >
    >      > Please review the new version of the fix, that makes the thread table initialization on demand and
    >      > moves it inside ThreadsList::find_JavaThread_from_java_tid(). At the creation time the thread table
    >      >   is initialized with the threads from the current thread list. We don't want to hold Threads_lock
    >      > inside find_JavaThread_from_java_tid(),  thus new threads still could be created  while the thread
    >      > table is being initialized . Such threads will be found by the linear search and added to the thread table
    >      > later, in ThreadsList::find_JavaThread_from_java_tid().
    >      
    >      The initialization allows the created but unpopulated, or partially
    >      populated, table to be seen by other threads - is that your intention?
    >      It seems it should be okay as the other threads will then race with the
    >      initializing thread to add specific entries, and this is a concurrent
    >      map so that should be functionally correct. But if so then I think you
    >      can also reduce the scope of the ThreadTableCreate_lock so that it
    >      covers creation of the table only, not the initial population of the table.
    >      
    >      I like the approach of only initializing the table when needed and using
    >      that to control when the add/remove-thread code needs to update the
    >      table. But I would still want to see what impact this has on thread
    >      startup cost, both with and without the table being initialized.
    >      
    >      > The change also includes additional optimization for some callers of find_JavaThread_from_java_tid()
    >      > as Daniel suggested.
    >      
    >      Not sure it's best to combine these, but if they are limited to the
    >      changes in management.cpp only then that may be okay. It helps to be
    >      able to focus on the table related changes without being distracted by
    >      other optimizations.
    >      
    >      > That is correct that ResolvedMethodTable was used as a blueprint for the thread table, however, I tried
    >      > to strip it of the all functionality that is not required in the thread table case.
    >      
    >      The revised version seems better in that regard. But I still have a
    >      concern, see below.
    >      
    >      > We need to have the thread table resizable and allow it to grow as the number of threads increases to avoid
    >      > reserving excessive memory a-priori or deteriorating lookup times. The ServiceThread is responsible for
    >      > growing the thread table when required.
    >      
    >      Yes but why? Why can't this table be grown on demand by the thread that
    >      is doing the addition? For other tables we may have to delegate to the
    >      service thread because the current thread cannot perform the action, or
    >      it doesn't want to perform it at the time the need for the resize is
    >      detected (e.g. its detected at a safepoint and you want the resize to
    >      happen later outside the safepoint). It's not apparent to me that such
    >      restrictions apply here.
    >      
    >      > There is no ConcurrentHashTable available in Java 8 and for backporting this fix to Java 8 another implementation
    >      > of the hash table, probably originally suggested in the patch attached to the JBS issue, should be used.  It will make
    >      > the backporting more complicated,  however, adding a new Implementation of the hash table in Java 14 while it
    >      > already has ConcurrentHashTable doesn't seem  reasonable for me.
    >      
    >      Ok.
    >      
    >      > Webrev: http://cr.openjdk.java.net/~dtitov/8185005/webrev.03
    >      
    >      Some specific code comments:
    >      
    >      src/hotspot/share/runtime/mutexLocker.cpp
    >      
    >      +   def(ThreadTableCreate_lock       , PaddedMutex  , special,
    >      false, Monitor::_safepoint_check_never);
    >      
    >      I think this needs to be a _safepoint_check_always lock. The table will
    >      be created by regular JavaThreads and they should (nearly) always be
    >      checking for safepoints if they are going to block acquiring the lock.
    >      And it isn't at all obvious that the thread doing the creation can't go
    >      to a safepoint whilst this lock is held.
    >      
    >      ---
    >      
    >      src/hotspot/share/runtime/threadSMR.cpp
    >      
    >      Nit:
    >      
    >        618       JavaThread* thread = thread_at(i);
    >      
    >      you could reuse the new java_thread local you introduced at line 613 and
    >      just rename that "new" variable to "thread" so you don't have to change
    >      all other uses.
    >      
    >      628   } else if (java_thread != NULL && ...
    >      
    >      You don't need to check != NULL here as you only get here when
    >      java_thread is not NULL.
    >      
    >        755     jlong tid = SharedRuntime::get_java_tid(thread);
    >        926     jlong tid = SharedRuntime::get_java_tid(thread);
    >      
    >      I think it cleaner/better to just use
    >      
    >      jlong tid = java_lang_Thread::thread_id(thread->threadObj());
    >      
    >      as we know thread is not NULL, it is a JavaThread and it has to have a
    >      non-null threadObj.
    >      
    >      ---
    >      
    >      src/hotspot/share/services/management.cpp
    >      
    >      1323         if (THREAD->is_Java_thread()) {
    >      1324           JavaThread* current_thread = (JavaThread*)THREAD;
    >      
    >      These calls can only be made on a JavaThread so this be simplified to
    >      remove the is_Java_thread() call. Similarly in other places.
    >      
    >      ---
    >      
    >      src/hotspot/share/services/threadTable.cpp
    >      
    >         55 class ThreadTableEntry : public CHeapObj<mtInternal> {
    >         56   private:
    >         57     jlong _tid;
    >      
    >      I believe hotspot style is to not indent the access modifiers in C++
    >      class declarations, so the above would just be:
    >      
    >         55 class ThreadTableEntry : public CHeapObj<mtInternal> {
    >         56 private:
    >         57   jlong _tid;
    >      
    >      etc.
    >      
    >        60     ThreadTableEntry(jlong tid, JavaThread* java_thread) :
    >        61     _tid(tid),_java_thread(java_thread) {}
    >      
    >      line 61 should be indented as it continues line 60.
    >      
    >         67 class ThreadTableConfig : public AllStatic {
    >         ...
    >         71     static uintx get_hash(Value const& value, bool* is_dead) {
    >      
    >      The is_dead parameter still bothers me here. I can't make enough sense
    >      out of the template code in ConcurrentHashtable to see why we have to
    >      have it, but I'm concerned that its very existence means we perhaps
    >      should not be trying to extend CHT in this context. ??
    >      
    >        115   size_t start_size_log = size_log > DefaultThreadTableSizeLog
    >        116   ? size_log : DefaultThreadTableSizeLog;
    >      
    >      line 116 should be indented, though in this case I think a better layout
    >      would be:
    >      
    >        115   size_t start_size_log =
    >        116       size_log > DefaultThreadTableSizeLog ? size_log :
    >      DefaultThreadTableSizeLog;
    >      
    >        131 double ThreadTable::get_load_factor() {
    >        132   return (double)_items_count/_current_size;
    >        133 }
    >      
    >      Not sure that is doing what you want/expect. It will perform integer
    >      division and then cast that whole integer to a double. If you want
    >      double arithmetic you need:
    >      
    >      return ((double)_items_count)/_current_size;
    >      
    >      180     jlong          _tid;
    >      181     uintx         _hash;
    >      
    >      Nit: no need for all those spaces before the variable name.
    >      
    >        183     ThreadTableLookup(jlong tid)
    >        184     : _tid(tid), _hash(primitive_hash(tid)) {}
    >      
    >      line 184 should be indented.
    >      
    >      201     ThreadGet():_return(NULL) {}
    >      
    >      Nit: need space after :
    >      
    >        211    assert(_is_initialized, "Thread table is not initialized");
    >        212   _has_work = false;
    >      
    >      line 211 is indented one space too far.
    >      
    >      229     ThreadTableEntry* entry = new ThreadTableEntry(tid,java_thread);
    >      
    >      Nit: need space after ,
    >      
    >      252   return _local_table->remove(thread,lookup);
    >      
    >      Nit: need space after ,
    >      
    >      Thanks,
    >      David
    >      ------
    >      
    >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
    >      >
    >      > Thanks!
    >      > --Daniil
    >      >
    >      >
    >      > ?On 7/8/19, 3:24 PM, "Daniel D. Daugherty" <daniel.daugherty at oracle.com> wrote:
    >      >
    >      >      On 6/29/19 12:06 PM, Daniil Titov wrote:
    >      >      > Hi Serguei and David,
    >      >      >
    >      >      > Serguei is right, ThreadTable::find_thread(java_tid) cannot  return a JavaThread with an unmatched java_tid.
    >      >      >
    >      >      > Please find a new version of the fix that includes the changes Serguei suggested.
    >      >      >
    >      >      > Regarding the concern about the maintaining the thread table when it may never even be queried, one of
    >      >      > the options could be to add ThreadTable ::isEnabled flag, set it to "false" by default, and wrap the calls to the thread table
    >      >      > in ThreadsSMRSupport add_thread() and remove_thread() methods to check this flag.
    >      >      >
    >      >      > When ThreadsList::find_JavaThread_from_java_tid() is called for the first time it could check if ThreadTable ::isEnabled
    >      >      > Is on and if not then set it on and populate the thread table with all existing threads from the thread list.
    >      >
    >      >      I have the same concerns as David H. about this new ThreadTable.
    >      >      ThreadsList::find_JavaThread_from_java_tid() is only called from code
    >      >      in src/hotspot/share/services/management.cpp so I think that table
    >      >      needs to enabled and populated only if it is going to be used.
    >      >
    >      >      I've taken a look at the webrev below and I see that David has
    >      >      followed up with additional comments. Before I do a crawl through
    >      >      code review for this, I would like to see the ThreadTable stuff
    >      >      made optional and David's other comments addressed.
    >      >
    >      >      Another possible optimization is for callers of
    >      >      find_JavaThread_from_java_tid() to save the calling thread's
    >      >      tid value before they loop and if the current tid == saved_tid
    >      >      then use the current JavaThread* instead of calling
    >      >      find_JavaThread_from_java_tid() to get the JavaThread*.
    >      >
    >      >      Dan
    >      >
    >      >      >
    >      >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.02/
    >      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
    >      >      >
    >      >      > Thanks!
    >      >      > --Daniil
    >      >      >
    >      >      > From: <serguei.spitsyn at oracle.com>
    >      >      > Organization: Oracle Corporation
    >      >      > Date: Friday, June 28, 2019 at 7:56 PM
    >      >      > To: Daniil Titov <daniil.x.titov at oracle.com>, OpenJDK Serviceability <serviceability-dev at openjdk.java.net>, "hotspot-runtime-dev at openjdk.java.net" <hotspot-runtime-dev at openjdk.java.net>, "jmx-dev at openjdk.java.net" <jmx-dev at openjdk.java.net>
    >      >      > Subject: Re: RFR: 8185005: Improve performance of ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
    >      >      >
    >      >      > Hi Daniil,
    >      >      >
    >      >      > I have several quick comments.
    >      >      >
    >      >      > The indent in the hotspot c/c++ files has to be 2, not 4.
    >      >      >
    >      >      > https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/src/hotspot/share/runtime/threadSMR.cpp.frames.html
    >      >      > 614 JavaThread* ThreadsList::find_JavaThread_from_java_tid(jlong java_tid) const {
    >      >      >   615     JavaThread* java_thread = ThreadTable::find_thread(java_tid);
    >      >      >   616     if (java_thread == NULL && java_tid == PMIMORDIAL_JAVA_TID) {
    >      >      >   617         // ThreadsSMRSupport::add_thread() is not called for the primordial
    >      >      >   618         // thread. Thus, we find this thread with a linear search and add it
    >      >      >   619         // to the thread table.
    >      >      >   620         for (uint i = 0; i < length(); i++) {
    >      >      >   621             JavaThread* thread = thread_at(i);
    >      >      >   622             if (is_valid_java_thread(java_tid,thread)) {
    >      >      >   623                 ThreadTable::add_thread(java_tid, thread);
    >      >      >   624                 return thread;
    >      >      >   625             }
    >      >      >   626         }
    >      >      >   627     } else if (java_thread != NULL && is_valid_java_thread(java_tid, java_thread)) {
    >      >      >   628         return java_thread;
    >      >      >   629     }
    >      >      >   630     return NULL;
    >      >      >   631 }
    >      >      >   632 bool ThreadsList::is_valid_java_thread(jlong java_tid, JavaThread* java_thread) {
    >      >      >   633     oop tobj = java_thread->threadObj();
    >      >      >   634     // Ignore the thread if it hasn't run yet, has exited
    >      >      >   635     // or is starting to exit.
    >      >      >   636     return (tobj != NULL && !java_thread->is_exiting() &&
    >      >      >   637             java_tid == java_lang_Thread::thread_id(tobj));
    >      >      >   638 }
    >      >      >
    >      >      >   615     JavaThread* java_thread = ThreadTable::find_thread(java_tid);
    >      >      >
    >      >      >    I'd suggest to rename find_thread() to find_thread_by_tid().
    >      >      >
    >      >      > A space is missed after the comma:
    >      >      >    622 if (is_valid_java_thread(java_tid,thread)) {
    >      >      >
    >      >      > An empty line is needed before L632.
    >      >      >
    >      >      > The name 'is_valid_java_thread' looks wrong (or confusing) to me.
    >      >      > Something like 'is_alive_java_thread_with_tid()' would be better.
    >      >      > It'd better to list parameters in the opposite order.
    >      >      >
    >      >      > The call to is_valid_java_thread() is confusing:
    >      >      >     627 } else if (java_thread != NULL && is_valid_java_thread(java_tid, java_thread)) {
    >      >      >
    >      >      > Why would the call ThreadTable::find_thread(java_tid) return a JavaThread with an unmatched java_tid?
    >      >      >
    >      >      >
    >      >      > Thanks,
    >      >      > Serguei
    >      >      >
    >      >      > On 6/28/19, 9:40 PM, "David Holmes" <david.holmes at oracle.com> wrote:
    >      >      >
    >      >      >      Hi Daniil,
    >      >      >
    >      >      >      The definition and use of this hashtable (yet another hashtable
    >      >      >      implementation!) will need careful examination. We have to be concerned
    >      >      >      about the cost of maintaining it when it may never even be queried. You
    >      >      >      would need to look at footprint cost and performance impact.
    >      >      >
    >      >      >      Unfortunately I'm just about to board a plane and will be out for the
    >      >      >      next few days. I will try to look at this asap next week, but we will
    >      >      >      need a lot more data on it.
    >      >      >
    >      >      >      Thanks,
    >      >      >      David
    >      >      >
    >      >      > On 6/28/19 3:31 PM, Daniil Titov wrote:
    >      >      > Please review the change that improves performance of ThreadMXBean MXBean methods returning the
    >      >      > information for specific threads. The change introduces the thread table that uses ConcurrentHashTable
    >      >      > to store one-to-one the mapping between the thread ids and JavaThread objects and replaces the linear
    >      >      > search over the thread list in ThreadsList::find_JavaThread_from_java_tid(jlong tid) method with the lookup
    >      >      > in the thread table.
    >      >      >
    >      >      > Testing: Mach5 tier1,tier2 and tier3 tests successfully passed.
    >      >      >
    >      >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/
    >      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
    >      >      >
    >      >      > Thanks!
    >      >      >
    >      >      > Best regards,
    >      >      > Daniil
    >      >      >
    >      >      >
    >      >      >
    >      >      >
    >      >      >
    >      >      >
    >      >      >
    >      >
    >      >
    >      >
    >      >
    >      
    > 
    > 
    

From robbin.ehn at oracle.com  Mon Aug 12 12:22:40 2019
From: robbin.ehn at oracle.com (Robbin Ehn)
Date: Mon, 12 Aug 2019 14:22:40 +0200
Subject: jmx-dev RFR: 8185005: Improve performance of
 ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
In-Reply-To: <A6311A8C-738A-4928-90C0-A11CB8B9487D@oracle.com>
References: <4C4212D0-BFFF-4C85-ACC6-05200F220C3F@oracle.com>
 <2d6dede1-aa79-99ce-a823-773fa2e19827@oracle.com>
 <6E7B043A-4647-4931-977C-1854CA7EBEC1@oracle.com>
 <bc9fbb03-6591-16e0-6411-d80cd8f56877@oracle.com>
 <76BCC96D-DB5D-409A-95D5-3A64B893832D@oracle.com>
 <7e0ba39e-e5b7-f56b-66ea-820a0a35ec2c@oracle.com>
 <87748188-3BD4-4A8B-938A-89DBC8F3C57A@oracle.com>
 <acfdf23e-4d6f-aa2b-3711-1da17a7f5f2c@oracle.com>
 <A6311A8C-738A-4928-90C0-A11CB8B9487D@oracle.com>
Message-ID: <bc83c941-1d3e-8107-12be-4e30b13707b5@oracle.com>

Hi Daniil,

I took a new deeper dive into this.

This line seems to have some issues:

if (ThreadTable::is_initialized() && thread->in_thread_table() && 
!thread->is_attaching_via_jni()) {

If you create new threads which attaches and then dies, the table will just keep
growing. So you must remove them also ?

Secondly you should not use volatile semantics for _in_thread_table.
The load in the if-statement can be reordered with _is_initialized.
Which could lead to a leak, rogue pointer in the table.

So both "static volatile bool _is_initialized;" and "volatile bool 
_in_thread_table; "
should be stored with store_release and loaded with load_acquire.

Unfortunately it looks like there still would be races if
ThreadTable::add_thread e.g. context switch at:

if (_local_table->insert(thread, lookup, entry)) {
// HERE
   java_thread->set_in_thread_table(true);

*Remove side can pass the if-statement without removing.

Since this thread also maybe exiting at any moment, e.g. context switch:

       if (tobj != NULL && !thread->is_exiting() &&
           java_tid == java_lang_Thread::thread_id(tobj)) {
	// HERE
         ThreadTable::add_thread(java_tid, thread);

*Add side can add a thread that is exiting.

Mixing in a third thread looking up a random tid and getting a JavaThread*, it
must validate it against it's ThreadsList. Making the hashtable useless.

So I think the only one adding and removing should be the thread itself.
1:Add to ThreadsList
2:Add to ThreadTable
3:Remove from ThreadTable
4:Remove ThreadsList

Between 1-2 and 3-4 the thread would be looked-up via linear scan.
I don't see an easy way around the start-up issue with this.

Maybe have the cache in Java.
Pass in the thread obj into a
java_sun_management_ThreadImpl_getThreadTotalCpuTime3 instead,
thus skipping any look-ups in native.

Thanks, Robbin


On 8/12/19 5:49 AM, Daniil Titov wrote:
> Hi David, Robbin, Daniel, and Serguei,
> 
> Please review a new version of the fix.
> 
> As David suggested I created a separated Jira issue [1] to cover  additional optimization for
> some callers of find_JavaThread_from_java_tid() and this version of the fix no longer includes
> changes in management.cpp ( and the test related with these changes).
> 
> Regarding the impact the previous version of the fix had on the thread startup time at heavy load (e.g.
> when 5000 threads are created and destroyed every second) I tried a different approach that makes
> calls to ThreadTable::add_thread  and ThreadTable::remove_thread  asynchronous and offloads the
> work for actual modifications of the thread table to a periodic task that runs every 5 seconds. With the
> same  stress test scenario (the  test does some warm-up and then measures the time it takes to create
> and start 100,000 threads; every  thread just sleeps  for 100 ms) the impact on the thread startup time
> was reduced to 1.2% ( from 2.7%).
> 
> The cause of this impact in this stress test scenario is that as soon as the thread table is initialized,
> an additional work to insert  and delete entries in the thread table should be performed, even if
> com.sun.management.ThreadMXBean methods are no longer called. For example, In the stress test
> mentioned above, every second about 5000 entries had to be inserted in the table and then deleted.
> 
> That doesn't look right and the new version of the fix uses the different approach: the thread is added to
> the thread table only when this thread is requested by com.sun.management.ThreadMXBean bean. Every
> time when find_JavaThread_from_java_tid() is called for a new tid, the thread  is found by the iterating over
> the thread list and added to the thread table. All consequent calls to find_JavaThread_from_java_tid() for
> the same tid returns the thread from the thread table.
> 
> Running stress test for the cases when the thread table is enabled and not showed no difference in the
> average thread startup times.
> 
> [1] : https://bugs.openjdk.java.net/browse/JDK-8229391
> 
> Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
> Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.05/
> 
> Thanks,
> Daniil
> 
> ?On 8/4/19, 7:54 PM, "David Holmes" <david.holmes at oracle.com> wrote:
> 
>      Hi Daniil,
>      
>      On 3/08/2019 8:16 am, Daniil Titov wrote:
>      > Hi David,
>      >
>      > Thank you for your detailed review. Please review a new version of the fix that includes
>      > the changes you suggested:
>      > - ThreadTableCreate_lock scope is reduced to cover the creation of the table only;
>      > - ThreadTableCreate_lock is made _safepoint_check_always;
>      
>      Okay.
>      
>      > - ServiceThread is no longer responsible for the resizing of the thread table, instead,
>      >    the thread table is changed to grow on demand by the thread that is doing the addition;
>      
>      Okay - I'm happy to get the serviceThread out of the picture here.
>      
>      > - fixed nits and formatting issues.
>      
>      Okay.
>      
>      >>> The change also includes additional optimization for some callers of find_JavaThread_from_java_tid()
>      >>>   as Daniel suggested.
>      >> Not sure it's best to combine these, but if they are limited to the
>      >> changes in management.cpp only then that may be okay.
>      >
>      > The additional optimization for some callers of find_JavaThread_from_java_tid() is
>      > limited to management.cpp (plus a new test) so I left them in the webrev  but
>      > I also could move it in the separate issue if required.
>      
>      I'd prefer this part of be separated out, but won't insist. Let's see if
>      Dan or Serguei have a strong opinion.
>      
>      >    > src/hotspot/share/runtime/threadSMR.cpp
>      >    >755     jlong tid = SharedRuntime::get_java_tid(thread);
>      >    > 926     jlong tid = SharedRuntime::get_java_tid(thread);
>      >   >  I think it cleaner/better to just use
>      >   > jlong tid = java_lang_Thread::thread_id(thread->threadObj());
>      >   > as we know thread is not NULL, it is a JavaThread and it has to have a
>      >   > non-null threadObj.
>      >
>      > I had to leave this code unchanged since it turned out the threadObj is null
>      > when VM is destroyed:
>      >
>      > V  [libjvm.so+0xe165d7]  oopDesc::long_field(int) const+0x67
>      > V  [libjvm.so+0x16e06c6]  ThreadsSMRSupport::add_thread(JavaThread*)+0x116
>      > V  [libjvm.so+0x16d1302]  Threads::add(JavaThread*, bool)+0x82
>      > V  [libjvm.so+0xef8369]  attach_current_thread.part.197+0xc9
>      > V  [libjvm.so+0xec136c]  jni_DestroyJavaVM+0x6c
>      > C  [libjli.so+0x4333]  JavaMain+0x2c3
>      > C  [libjli.so+0x8159]  ThreadJavaMain+0x9
>      
>      This is actually nothing to do with the VM being destroyed, but is an
>      issue with JNI_AttachCurrentThread and its interaction with the
>      ThreadSMR iterators. The attach process is:
>      - create JavaThread
>      - mark as "is attaching via jni"
>      - add to ThreadsList
>      - create java.lang.Thread object (you can only execute Java code after
>      you are attached)
>      - mark as "attach completed"
>      
>      So while a thread "is attaching" it will be seen by the ThreadSMR thread
>      iterator but will have a NULL java.lang.Thread object.
>      
>      We special-case attaching threads in a number of places in the VM and I
>      think we should be explicitly doing something here to filter out
>      attaching threads, rather than just being tolerant of a NULL j.l.Thread
>      object. Specifically in ThreadsSMRSupport::add_thread:
>      
>      if (ThreadTable::is_initialized() && !thread->is_attaching_via_jni()) {
>         jlong tid = java_lang_Thread::thread_id(thread->threadObj());
>         ThreadTable::add_thread(tid, thread);
>      }
>      
>      Note that in ThreadsSMRSupport::remove_thread we can use the same guard,
>      which covers the case the JNI attach encountered an error trying to
>      create the j.l.Thread object.
>      
>      >> src/hotspot/share/services/threadTable.cpp
>      >> 71     static uintx get_hash(Value const& value, bool* is_dead) {
>      >
>      >> The is_dead parameter still bothers me here. I can't make enough sense
>      >> out of the template code in ConcurrentHashtable to see why we have to
>      >> have it, but I'm concerned that its very existence means we perhaps
>      >> should not be trying to extend CHT in this context. ??
>      >
>      > My understanding is that is_dead parameter provides a mechanism for
>      > ConcurrentHashtable to remove stale entries that were not explicitly
>      > removed by calling  ConcurrentHashTable::remove() method.
>      > I think that just because in our case we don't use this mechanism doesn't
>      > mean we should not use ConcurrentHashTable.
>      
>      Can you confirm that this usage is okay with Robbin Ehn please. He's
>      back from vacation this week.
>      
>      >> I would still want to see what impact this has on thread
>      >> startup cost, both with and without the table being initialized.
>      >
>      > I run a test that initializes the table by calling ThreadMXBean.get getThreadInfo(),
>      > starts some threads as a worm-up, and then creates and starts 100,000 threads
>      > (each thread just sleeps for 100 ms). In case when the thread table is enabled
>      > 100,000 threads are created and started  for about 15200 ms. If the thread table
>      > is off the test takes about 14800 ms. Based on this information the enabled
>      > thread table makes the thread startup about 2.7% slower.
>      
>      That doesn't sound very good. I think we may need to Claes involved to
>      help investigate overall performance impact here.
>      
>      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.04/
>      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>      
>      No further code comments.
>      
>      I didn't look at the test in detail.
>      
>      Thanks,
>      David
>      
>      > Thanks!
>      > --Daniil
>      >
>      >
>      > ?On 7/29/19, 12:53 AM, "David Holmes" <david.holmes at oracle.com> wrote:
>      >
>      >      Hi Daniil,
>      >
>      >      Overall I think this is a reasonable approach but I would still like to
>      >      see some performance and footprint numbers, both to verify it fixes the
>      >      problem reported, and that we are not getting penalized elsewhere.
>      >
>      >      On 25/07/2019 3:21 am, Daniil Titov wrote:
>      >      > Hi David, Daniel, and Serguei,
>      >      >
>      >      > Please review the new version of the fix, that makes the thread table initialization on demand and
>      >      > moves it inside ThreadsList::find_JavaThread_from_java_tid(). At the creation time the thread table
>      >      >   is initialized with the threads from the current thread list. We don't want to hold Threads_lock
>      >      > inside find_JavaThread_from_java_tid(),  thus new threads still could be created  while the thread
>      >      > table is being initialized . Such threads will be found by the linear search and added to the thread table
>      >      > later, in ThreadsList::find_JavaThread_from_java_tid().
>      >
>      >      The initialization allows the created but unpopulated, or partially
>      >      populated, table to be seen by other threads - is that your intention?
>      >      It seems it should be okay as the other threads will then race with the
>      >      initializing thread to add specific entries, and this is a concurrent
>      >      map so that should be functionally correct. But if so then I think you
>      >      can also reduce the scope of the ThreadTableCreate_lock so that it
>      >      covers creation of the table only, not the initial population of the table.
>      >
>      >      I like the approach of only initializing the table when needed and using
>      >      that to control when the add/remove-thread code needs to update the
>      >      table. But I would still want to see what impact this has on thread
>      >      startup cost, both with and without the table being initialized.
>      >
>      >      > The change also includes additional optimization for some callers of find_JavaThread_from_java_tid()
>      >      > as Daniel suggested.
>      >
>      >      Not sure it's best to combine these, but if they are limited to the
>      >      changes in management.cpp only then that may be okay. It helps to be
>      >      able to focus on the table related changes without being distracted by
>      >      other optimizations.
>      >
>      >      > That is correct that ResolvedMethodTable was used as a blueprint for the thread table, however, I tried
>      >      > to strip it of the all functionality that is not required in the thread table case.
>      >
>      >      The revised version seems better in that regard. But I still have a
>      >      concern, see below.
>      >
>      >      > We need to have the thread table resizable and allow it to grow as the number of threads increases to avoid
>      >      > reserving excessive memory a-priori or deteriorating lookup times. The ServiceThread is responsible for
>      >      > growing the thread table when required.
>      >
>      >      Yes but why? Why can't this table be grown on demand by the thread that
>      >      is doing the addition? For other tables we may have to delegate to the
>      >      service thread because the current thread cannot perform the action, or
>      >      it doesn't want to perform it at the time the need for the resize is
>      >      detected (e.g. its detected at a safepoint and you want the resize to
>      >      happen later outside the safepoint). It's not apparent to me that such
>      >      restrictions apply here.
>      >
>      >      > There is no ConcurrentHashTable available in Java 8 and for backporting this fix to Java 8 another implementation
>      >      > of the hash table, probably originally suggested in the patch attached to the JBS issue, should be used.  It will make
>      >      > the backporting more complicated,  however, adding a new Implementation of the hash table in Java 14 while it
>      >      > already has ConcurrentHashTable doesn't seem  reasonable for me.
>      >
>      >      Ok.
>      >
>      >      > Webrev: http://cr.openjdk.java.net/~dtitov/8185005/webrev.03
>      >
>      >      Some specific code comments:
>      >
>      >      src/hotspot/share/runtime/mutexLocker.cpp
>      >
>      >      +   def(ThreadTableCreate_lock       , PaddedMutex  , special,
>      >      false, Monitor::_safepoint_check_never);
>      >
>      >      I think this needs to be a _safepoint_check_always lock. The table will
>      >      be created by regular JavaThreads and they should (nearly) always be
>      >      checking for safepoints if they are going to block acquiring the lock.
>      >      And it isn't at all obvious that the thread doing the creation can't go
>      >      to a safepoint whilst this lock is held.
>      >
>      >      ---
>      >
>      >      src/hotspot/share/runtime/threadSMR.cpp
>      >
>      >      Nit:
>      >
>      >        618       JavaThread* thread = thread_at(i);
>      >
>      >      you could reuse the new java_thread local you introduced at line 613 and
>      >      just rename that "new" variable to "thread" so you don't have to change
>      >      all other uses.
>      >
>      >      628   } else if (java_thread != NULL && ...
>      >
>      >      You don't need to check != NULL here as you only get here when
>      >      java_thread is not NULL.
>      >
>      >        755     jlong tid = SharedRuntime::get_java_tid(thread);
>      >        926     jlong tid = SharedRuntime::get_java_tid(thread);
>      >
>      >      I think it cleaner/better to just use
>      >
>      >      jlong tid = java_lang_Thread::thread_id(thread->threadObj());
>      >
>      >      as we know thread is not NULL, it is a JavaThread and it has to have a
>      >      non-null threadObj.
>      >
>      >      ---
>      >
>      >      src/hotspot/share/services/management.cpp
>      >
>      >      1323         if (THREAD->is_Java_thread()) {
>      >      1324           JavaThread* current_thread = (JavaThread*)THREAD;
>      >
>      >      These calls can only be made on a JavaThread so this be simplified to
>      >      remove the is_Java_thread() call. Similarly in other places.
>      >
>      >      ---
>      >
>      >      src/hotspot/share/services/threadTable.cpp
>      >
>      >         55 class ThreadTableEntry : public CHeapObj<mtInternal> {
>      >         56   private:
>      >         57     jlong _tid;
>      >
>      >      I believe hotspot style is to not indent the access modifiers in C++
>      >      class declarations, so the above would just be:
>      >
>      >         55 class ThreadTableEntry : public CHeapObj<mtInternal> {
>      >         56 private:
>      >         57   jlong _tid;
>      >
>      >      etc.
>      >
>      >        60     ThreadTableEntry(jlong tid, JavaThread* java_thread) :
>      >        61     _tid(tid),_java_thread(java_thread) {}
>      >
>      >      line 61 should be indented as it continues line 60.
>      >
>      >         67 class ThreadTableConfig : public AllStatic {
>      >         ...
>      >         71     static uintx get_hash(Value const& value, bool* is_dead) {
>      >
>      >      The is_dead parameter still bothers me here. I can't make enough sense
>      >      out of the template code in ConcurrentHashtable to see why we have to
>      >      have it, but I'm concerned that its very existence means we perhaps
>      >      should not be trying to extend CHT in this context. ??
>      >
>      >        115   size_t start_size_log = size_log > DefaultThreadTableSizeLog
>      >        116   ? size_log : DefaultThreadTableSizeLog;
>      >
>      >      line 116 should be indented, though in this case I think a better layout
>      >      would be:
>      >
>      >        115   size_t start_size_log =
>      >        116       size_log > DefaultThreadTableSizeLog ? size_log :
>      >      DefaultThreadTableSizeLog;
>      >
>      >        131 double ThreadTable::get_load_factor() {
>      >        132   return (double)_items_count/_current_size;
>      >        133 }
>      >
>      >      Not sure that is doing what you want/expect. It will perform integer
>      >      division and then cast that whole integer to a double. If you want
>      >      double arithmetic you need:
>      >
>      >      return ((double)_items_count)/_current_size;
>      >
>      >      180     jlong          _tid;
>      >      181     uintx         _hash;
>      >
>      >      Nit: no need for all those spaces before the variable name.
>      >
>      >        183     ThreadTableLookup(jlong tid)
>      >        184     : _tid(tid), _hash(primitive_hash(tid)) {}
>      >
>      >      line 184 should be indented.
>      >
>      >      201     ThreadGet():_return(NULL) {}
>      >
>      >      Nit: need space after :
>      >
>      >        211    assert(_is_initialized, "Thread table is not initialized");
>      >        212   _has_work = false;
>      >
>      >      line 211 is indented one space too far.
>      >
>      >      229     ThreadTableEntry* entry = new ThreadTableEntry(tid,java_thread);
>      >
>      >      Nit: need space after ,
>      >
>      >      252   return _local_table->remove(thread,lookup);
>      >
>      >      Nit: need space after ,
>      >
>      >      Thanks,
>      >      David
>      >      ------
>      >
>      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>      >      >
>      >      > Thanks!
>      >      > --Daniil
>      >      >
>      >      >
>      >      > ?On 7/8/19, 3:24 PM, "Daniel D. Daugherty" <daniel.daugherty at oracle.com> wrote:
>      >      >
>      >      >      On 6/29/19 12:06 PM, Daniil Titov wrote:
>      >      >      > Hi Serguei and David,
>      >      >      >
>      >      >      > Serguei is right, ThreadTable::find_thread(java_tid) cannot  return a JavaThread with an unmatched java_tid.
>      >      >      >
>      >      >      > Please find a new version of the fix that includes the changes Serguei suggested.
>      >      >      >
>      >      >      > Regarding the concern about the maintaining the thread table when it may never even be queried, one of
>      >      >      > the options could be to add ThreadTable ::isEnabled flag, set it to "false" by default, and wrap the calls to the thread table
>      >      >      > in ThreadsSMRSupport add_thread() and remove_thread() methods to check this flag.
>      >      >      >
>      >      >      > When ThreadsList::find_JavaThread_from_java_tid() is called for the first time it could check if ThreadTable ::isEnabled
>      >      >      > Is on and if not then set it on and populate the thread table with all existing threads from the thread list.
>      >      >
>      >      >      I have the same concerns as David H. about this new ThreadTable.
>      >      >      ThreadsList::find_JavaThread_from_java_tid() is only called from code
>      >      >      in src/hotspot/share/services/management.cpp so I think that table
>      >      >      needs to enabled and populated only if it is going to be used.
>      >      >
>      >      >      I've taken a look at the webrev below and I see that David has
>      >      >      followed up with additional comments. Before I do a crawl through
>      >      >      code review for this, I would like to see the ThreadTable stuff
>      >      >      made optional and David's other comments addressed.
>      >      >
>      >      >      Another possible optimization is for callers of
>      >      >      find_JavaThread_from_java_tid() to save the calling thread's
>      >      >      tid value before they loop and if the current tid == saved_tid
>      >      >      then use the current JavaThread* instead of calling
>      >      >      find_JavaThread_from_java_tid() to get the JavaThread*.
>      >      >
>      >      >      Dan
>      >      >
>      >      >      >
>      >      >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.02/
>      >      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>      >      >      >
>      >      >      > Thanks!
>      >      >      > --Daniil
>      >      >      >
>      >      >      > From: <serguei.spitsyn at oracle.com>
>      >      >      > Organization: Oracle Corporation
>      >      >      > Date: Friday, June 28, 2019 at 7:56 PM
>      >      >      > To: Daniil Titov <daniil.x.titov at oracle.com>, OpenJDK Serviceability <serviceability-dev at openjdk.java.net>, "hotspot-runtime-dev at openjdk.java.net" <hotspot-runtime-dev at openjdk.java.net>, "jmx-dev at openjdk.java.net" <jmx-dev at openjdk.java.net>
>      >      >      > Subject: Re: RFR: 8185005: Improve performance of ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
>      >      >      >
>      >      >      > Hi Daniil,
>      >      >      >
>      >      >      > I have several quick comments.
>      >      >      >
>      >      >      > The indent in the hotspot c/c++ files has to be 2, not 4.
>      >      >      >
>      >      >      > https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/src/hotspot/share/runtime/threadSMR.cpp.frames.html
>      >      >      > 614 JavaThread* ThreadsList::find_JavaThread_from_java_tid(jlong java_tid) const {
>      >      >      >   615     JavaThread* java_thread = ThreadTable::find_thread(java_tid);
>      >      >      >   616     if (java_thread == NULL && java_tid == PMIMORDIAL_JAVA_TID) {
>      >      >      >   617         // ThreadsSMRSupport::add_thread() is not called for the primordial
>      >      >      >   618         // thread. Thus, we find this thread with a linear search and add it
>      >      >      >   619         // to the thread table.
>      >      >      >   620         for (uint i = 0; i < length(); i++) {
>      >      >      >   621             JavaThread* thread = thread_at(i);
>      >      >      >   622             if (is_valid_java_thread(java_tid,thread)) {
>      >      >      >   623                 ThreadTable::add_thread(java_tid, thread);
>      >      >      >   624                 return thread;
>      >      >      >   625             }
>      >      >      >   626         }
>      >      >      >   627     } else if (java_thread != NULL && is_valid_java_thread(java_tid, java_thread)) {
>      >      >      >   628         return java_thread;
>      >      >      >   629     }
>      >      >      >   630     return NULL;
>      >      >      >   631 }
>      >      >      >   632 bool ThreadsList::is_valid_java_thread(jlong java_tid, JavaThread* java_thread) {
>      >      >      >   633     oop tobj = java_thread->threadObj();
>      >      >      >   634     // Ignore the thread if it hasn't run yet, has exited
>      >      >      >   635     // or is starting to exit.
>      >      >      >   636     return (tobj != NULL && !java_thread->is_exiting() &&
>      >      >      >   637             java_tid == java_lang_Thread::thread_id(tobj));
>      >      >      >   638 }
>      >      >      >
>      >      >      >   615     JavaThread* java_thread = ThreadTable::find_thread(java_tid);
>      >      >      >
>      >      >      >    I'd suggest to rename find_thread() to find_thread_by_tid().
>      >      >      >
>      >      >      > A space is missed after the comma:
>      >      >      >    622 if (is_valid_java_thread(java_tid,thread)) {
>      >      >      >
>      >      >      > An empty line is needed before L632.
>      >      >      >
>      >      >      > The name 'is_valid_java_thread' looks wrong (or confusing) to me.
>      >      >      > Something like 'is_alive_java_thread_with_tid()' would be better.
>      >      >      > It'd better to list parameters in the opposite order.
>      >      >      >
>      >      >      > The call to is_valid_java_thread() is confusing:
>      >      >      >     627 } else if (java_thread != NULL && is_valid_java_thread(java_tid, java_thread)) {
>      >      >      >
>      >      >      > Why would the call ThreadTable::find_thread(java_tid) return a JavaThread with an unmatched java_tid?
>      >      >      >
>      >      >      >
>      >      >      > Thanks,
>      >      >      > Serguei
>      >      >      >
>      >      >      > On 6/28/19, 9:40 PM, "David Holmes" <david.holmes at oracle.com> wrote:
>      >      >      >
>      >      >      >      Hi Daniil,
>      >      >      >
>      >      >      >      The definition and use of this hashtable (yet another hashtable
>      >      >      >      implementation!) will need careful examination. We have to be concerned
>      >      >      >      about the cost of maintaining it when it may never even be queried. You
>      >      >      >      would need to look at footprint cost and performance impact.
>      >      >      >
>      >      >      >      Unfortunately I'm just about to board a plane and will be out for the
>      >      >      >      next few days. I will try to look at this asap next week, but we will
>      >      >      >      need a lot more data on it.
>      >      >      >
>      >      >      >      Thanks,
>      >      >      >      David
>      >      >      >
>      >      >      > On 6/28/19 3:31 PM, Daniil Titov wrote:
>      >      >      > Please review the change that improves performance of ThreadMXBean MXBean methods returning the
>      >      >      > information for specific threads. The change introduces the thread table that uses ConcurrentHashTable
>      >      >      > to store one-to-one the mapping between the thread ids and JavaThread objects and replaces the linear
>      >      >      > search over the thread list in ThreadsList::find_JavaThread_from_java_tid(jlong tid) method with the lookup
>      >      >      > in the thread table.
>      >      >      >
>      >      >      > Testing: Mach5 tier1,tier2 and tier3 tests successfully passed.
>      >      >      >
>      >      >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/
>      >      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>      >      >      >
>      >      >      > Thanks!
>      >      >      >
>      >      >      > Best regards,
>      >      >      > Daniil
>      >      >      >
>      >      >      >
>      >      >      >
>      >      >      >
>      >      >      >
>      >      >      >
>      >      >      >
>      >      >
>      >      >
>      >      >
>      >      >
>      >
>      >
>      >
>      
> 
> 

From daniil.x.titov at oracle.com  Mon Aug 12 23:24:30 2019
From: daniil.x.titov at oracle.com (Daniil Titov)
Date: Mon, 12 Aug 2019 16:24:30 -0700
Subject: jmx-dev 8185005: Improve performance of
 ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
In-Reply-To: <bc83c941-1d3e-8107-12be-4e30b13707b5@oracle.com>
References: <4C4212D0-BFFF-4C85-ACC6-05200F220C3F@oracle.com>
 <2d6dede1-aa79-99ce-a823-773fa2e19827@oracle.com>
 <6E7B043A-4647-4931-977C-1854CA7EBEC1@oracle.com>
 <bc9fbb03-6591-16e0-6411-d80cd8f56877@oracle.com>
 <76BCC96D-DB5D-409A-95D5-3A64B893832D@oracle.com>
 <7e0ba39e-e5b7-f56b-66ea-820a0a35ec2c@oracle.com>
 <87748188-3BD4-4A8B-938A-89DBC8F3C57A@oracle.com>
 <acfdf23e-4d6f-aa2b-3711-1da17a7f5f2c@oracle.com>
 <A6311A8C-738A-4928-90C0-A11CB8B9487D@oracle.com>
 <bc83c941-1d3e-8107-12be-4e30b13707b5@oracle.com>
Message-ID: <48311B39-43F9-49E5-BEC7-A64F9D2588AF@oracle.com>

Hi Robbin,

Thank you very much for reviewing this version of the fix!  Based on your findings
it seems as it makes sense to make a step back and  continue with the 
approach we took before in the previous version of the webrev (webrev.04),
and get more information about the impact on the startup time it has. I will 
consult with Claus regarding this and then share the findings.

Thanks again, 
--Daniil


?On 8/12/19, 5:22 AM, "Robbin Ehn" <robbin.ehn at oracle.com> wrote:

    Hi Daniil,
    
    I took a new deeper dive into this.
    
    This line seems to have some issues:
    
    if (ThreadTable::is_initialized() && thread->in_thread_table() && 
    !thread->is_attaching_via_jni()) {
    
    If you create new threads which attaches and then dies, the table will just keep
    growing. So you must remove them also ?
    
    Secondly you should not use volatile semantics for _in_thread_table.
    The load in the if-statement can be reordered with _is_initialized.
    Which could lead to a leak, rogue pointer in the table.
    
    So both "static volatile bool _is_initialized;" and "volatile bool 
    _in_thread_table; "
    should be stored with store_release and loaded with load_acquire.
    
    Unfortunately it looks like there still would be races if
    ThreadTable::add_thread e.g. context switch at:
    
    if (_local_table->insert(thread, lookup, entry)) {
    // HERE
       java_thread->set_in_thread_table(true);
    
    *Remove side can pass the if-statement without removing.
    
    Since this thread also maybe exiting at any moment, e.g. context switch:
    
           if (tobj != NULL && !thread->is_exiting() &&
               java_tid == java_lang_Thread::thread_id(tobj)) {
    	// HERE
             ThreadTable::add_thread(java_tid, thread);
    
    *Add side can add a thread that is exiting.
    
    Mixing in a third thread looking up a random tid and getting a JavaThread*, it
    must validate it against it's ThreadsList. Making the hashtable useless.
    
    So I think the only one adding and removing should be the thread itself.
    1:Add to ThreadsList
    2:Add to ThreadTable
    3:Remove from ThreadTable
    4:Remove ThreadsList
    
    Between 1-2 and 3-4 the thread would be looked-up via linear scan.
    I don't see an easy way around the start-up issue with this.
    
    Maybe have the cache in Java.
    Pass in the thread obj into a
    java_sun_management_ThreadImpl_getThreadTotalCpuTime3 instead,
    thus skipping any look-ups in native.
    
    Thanks, Robbin
    
    
    On 8/12/19 5:49 AM, Daniil Titov wrote:
    > Hi David, Robbin, Daniel, and Serguei,
    > 
    > Please review a new version of the fix.
    > 
    > As David suggested I created a separated Jira issue [1] to cover  additional optimization for
    > some callers of find_JavaThread_from_java_tid() and this version of the fix no longer includes
    > changes in management.cpp ( and the test related with these changes).
    > 
    > Regarding the impact the previous version of the fix had on the thread startup time at heavy load (e.g.
    > when 5000 threads are created and destroyed every second) I tried a different approach that makes
    > calls to ThreadTable::add_thread  and ThreadTable::remove_thread  asynchronous and offloads the
    > work for actual modifications of the thread table to a periodic task that runs every 5 seconds. With the
    > same  stress test scenario (the  test does some warm-up and then measures the time it takes to create
    > and start 100,000 threads; every  thread just sleeps  for 100 ms) the impact on the thread startup time
    > was reduced to 1.2% ( from 2.7%).
    > 
    > The cause of this impact in this stress test scenario is that as soon as the thread table is initialized,
    > an additional work to insert  and delete entries in the thread table should be performed, even if
    > com.sun.management.ThreadMXBean methods are no longer called. For example, In the stress test
    > mentioned above, every second about 5000 entries had to be inserted in the table and then deleted.
    > 
    > That doesn't look right and the new version of the fix uses the different approach: the thread is added to
    > the thread table only when this thread is requested by com.sun.management.ThreadMXBean bean. Every
    > time when find_JavaThread_from_java_tid() is called for a new tid, the thread  is found by the iterating over
    > the thread list and added to the thread table. All consequent calls to find_JavaThread_from_java_tid() for
    > the same tid returns the thread from the thread table.
    > 
    > Running stress test for the cases when the thread table is enabled and not showed no difference in the
    > average thread startup times.
    > 
    > [1] : https://bugs.openjdk.java.net/browse/JDK-8229391
    > 
    > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
    > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.05/
    > 
    > Thanks,
    > Daniil
    > 
    > ?On 8/4/19, 7:54 PM, "David Holmes" <david.holmes at oracle.com> wrote:
    > 
    >      Hi Daniil,
    >      
    >      On 3/08/2019 8:16 am, Daniil Titov wrote:
    >      > Hi David,
    >      >
    >      > Thank you for your detailed review. Please review a new version of the fix that includes
    >      > the changes you suggested:
    >      > - ThreadTableCreate_lock scope is reduced to cover the creation of the table only;
    >      > - ThreadTableCreate_lock is made _safepoint_check_always;
    >      
    >      Okay.
    >      
    >      > - ServiceThread is no longer responsible for the resizing of the thread table, instead,
    >      >    the thread table is changed to grow on demand by the thread that is doing the addition;
    >      
    >      Okay - I'm happy to get the serviceThread out of the picture here.
    >      
    >      > - fixed nits and formatting issues.
    >      
    >      Okay.
    >      
    >      >>> The change also includes additional optimization for some callers of find_JavaThread_from_java_tid()
    >      >>>   as Daniel suggested.
    >      >> Not sure it's best to combine these, but if they are limited to the
    >      >> changes in management.cpp only then that may be okay.
    >      >
    >      > The additional optimization for some callers of find_JavaThread_from_java_tid() is
    >      > limited to management.cpp (plus a new test) so I left them in the webrev  but
    >      > I also could move it in the separate issue if required.
    >      
    >      I'd prefer this part of be separated out, but won't insist. Let's see if
    >      Dan or Serguei have a strong opinion.
    >      
    >      >    > src/hotspot/share/runtime/threadSMR.cpp
    >      >    >755     jlong tid = SharedRuntime::get_java_tid(thread);
    >      >    > 926     jlong tid = SharedRuntime::get_java_tid(thread);
    >      >   >  I think it cleaner/better to just use
    >      >   > jlong tid = java_lang_Thread::thread_id(thread->threadObj());
    >      >   > as we know thread is not NULL, it is a JavaThread and it has to have a
    >      >   > non-null threadObj.
    >      >
    >      > I had to leave this code unchanged since it turned out the threadObj is null
    >      > when VM is destroyed:
    >      >
    >      > V  [libjvm.so+0xe165d7]  oopDesc::long_field(int) const+0x67
    >      > V  [libjvm.so+0x16e06c6]  ThreadsSMRSupport::add_thread(JavaThread*)+0x116
    >      > V  [libjvm.so+0x16d1302]  Threads::add(JavaThread*, bool)+0x82
    >      > V  [libjvm.so+0xef8369]  attach_current_thread.part.197+0xc9
    >      > V  [libjvm.so+0xec136c]  jni_DestroyJavaVM+0x6c
    >      > C  [libjli.so+0x4333]  JavaMain+0x2c3
    >      > C  [libjli.so+0x8159]  ThreadJavaMain+0x9
    >      
    >      This is actually nothing to do with the VM being destroyed, but is an
    >      issue with JNI_AttachCurrentThread and its interaction with the
    >      ThreadSMR iterators. The attach process is:
    >      - create JavaThread
    >      - mark as "is attaching via jni"
    >      - add to ThreadsList
    >      - create java.lang.Thread object (you can only execute Java code after
    >      you are attached)
    >      - mark as "attach completed"
    >      
    >      So while a thread "is attaching" it will be seen by the ThreadSMR thread
    >      iterator but will have a NULL java.lang.Thread object.
    >      
    >      We special-case attaching threads in a number of places in the VM and I
    >      think we should be explicitly doing something here to filter out
    >      attaching threads, rather than just being tolerant of a NULL j.l.Thread
    >      object. Specifically in ThreadsSMRSupport::add_thread:
    >      
    >      if (ThreadTable::is_initialized() && !thread->is_attaching_via_jni()) {
    >         jlong tid = java_lang_Thread::thread_id(thread->threadObj());
    >         ThreadTable::add_thread(tid, thread);
    >      }
    >      
    >      Note that in ThreadsSMRSupport::remove_thread we can use the same guard,
    >      which covers the case the JNI attach encountered an error trying to
    >      create the j.l.Thread object.
    >      
    >      >> src/hotspot/share/services/threadTable.cpp
    >      >> 71     static uintx get_hash(Value const& value, bool* is_dead) {
    >      >
    >      >> The is_dead parameter still bothers me here. I can't make enough sense
    >      >> out of the template code in ConcurrentHashtable to see why we have to
    >      >> have it, but I'm concerned that its very existence means we perhaps
    >      >> should not be trying to extend CHT in this context. ??
    >      >
    >      > My understanding is that is_dead parameter provides a mechanism for
    >      > ConcurrentHashtable to remove stale entries that were not explicitly
    >      > removed by calling  ConcurrentHashTable::remove() method.
    >      > I think that just because in our case we don't use this mechanism doesn't
    >      > mean we should not use ConcurrentHashTable.
    >      
    >      Can you confirm that this usage is okay with Robbin Ehn please. He's
    >      back from vacation this week.
    >      
    >      >> I would still want to see what impact this has on thread
    >      >> startup cost, both with and without the table being initialized.
    >      >
    >      > I run a test that initializes the table by calling ThreadMXBean.get getThreadInfo(),
    >      > starts some threads as a worm-up, and then creates and starts 100,000 threads
    >      > (each thread just sleeps for 100 ms). In case when the thread table is enabled
    >      > 100,000 threads are created and started  for about 15200 ms. If the thread table
    >      > is off the test takes about 14800 ms. Based on this information the enabled
    >      > thread table makes the thread startup about 2.7% slower.
    >      
    >      That doesn't sound very good. I think we may need to Claes involved to
    >      help investigate overall performance impact here.
    >      
    >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.04/
    >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
    >      
    >      No further code comments.
    >      
    >      I didn't look at the test in detail.
    >      
    >      Thanks,
    >      David
    >      
    >      > Thanks!
    >      > --Daniil
    >      >
    >      >
    >      > ?On 7/29/19, 12:53 AM, "David Holmes" <david.holmes at oracle.com> wrote:
    >      >
    >      >      Hi Daniil,
    >      >
    >      >      Overall I think this is a reasonable approach but I would still like to
    >      >      see some performance and footprint numbers, both to verify it fixes the
    >      >      problem reported, and that we are not getting penalized elsewhere.
    >      >
    >      >      On 25/07/2019 3:21 am, Daniil Titov wrote:
    >      >      > Hi David, Daniel, and Serguei,
    >      >      >
    >      >      > Please review the new version of the fix, that makes the thread table initialization on demand and
    >      >      > moves it inside ThreadsList::find_JavaThread_from_java_tid(). At the creation time the thread table
    >      >      >   is initialized with the threads from the current thread list. We don't want to hold Threads_lock
    >      >      > inside find_JavaThread_from_java_tid(),  thus new threads still could be created  while the thread
    >      >      > table is being initialized . Such threads will be found by the linear search and added to the thread table
    >      >      > later, in ThreadsList::find_JavaThread_from_java_tid().
    >      >
    >      >      The initialization allows the created but unpopulated, or partially
    >      >      populated, table to be seen by other threads - is that your intention?
    >      >      It seems it should be okay as the other threads will then race with the
    >      >      initializing thread to add specific entries, and this is a concurrent
    >      >      map so that should be functionally correct. But if so then I think you
    >      >      can also reduce the scope of the ThreadTableCreate_lock so that it
    >      >      covers creation of the table only, not the initial population of the table.
    >      >
    >      >      I like the approach of only initializing the table when needed and using
    >      >      that to control when the add/remove-thread code needs to update the
    >      >      table. But I would still want to see what impact this has on thread
    >      >      startup cost, both with and without the table being initialized.
    >      >
    >      >      > The change also includes additional optimization for some callers of find_JavaThread_from_java_tid()
    >      >      > as Daniel suggested.
    >      >
    >      >      Not sure it's best to combine these, but if they are limited to the
    >      >      changes in management.cpp only then that may be okay. It helps to be
    >      >      able to focus on the table related changes without being distracted by
    >      >      other optimizations.
    >      >
    >      >      > That is correct that ResolvedMethodTable was used as a blueprint for the thread table, however, I tried
    >      >      > to strip it of the all functionality that is not required in the thread table case.
    >      >
    >      >      The revised version seems better in that regard. But I still have a
    >      >      concern, see below.
    >      >
    >      >      > We need to have the thread table resizable and allow it to grow as the number of threads increases to avoid
    >      >      > reserving excessive memory a-priori or deteriorating lookup times. The ServiceThread is responsible for
    >      >      > growing the thread table when required.
    >      >
    >      >      Yes but why? Why can't this table be grown on demand by the thread that
    >      >      is doing the addition? For other tables we may have to delegate to the
    >      >      service thread because the current thread cannot perform the action, or
    >      >      it doesn't want to perform it at the time the need for the resize is
    >      >      detected (e.g. its detected at a safepoint and you want the resize to
    >      >      happen later outside the safepoint). It's not apparent to me that such
    >      >      restrictions apply here.
    >      >
    >      >      > There is no ConcurrentHashTable available in Java 8 and for backporting this fix to Java 8 another implementation
    >      >      > of the hash table, probably originally suggested in the patch attached to the JBS issue, should be used.  It will make
    >      >      > the backporting more complicated,  however, adding a new Implementation of the hash table in Java 14 while it
    >      >      > already has ConcurrentHashTable doesn't seem  reasonable for me.
    >      >
    >      >      Ok.
    >      >
    >      >      > Webrev: http://cr.openjdk.java.net/~dtitov/8185005/webrev.03
    >      >
    >      >      Some specific code comments:
    >      >
    >      >      src/hotspot/share/runtime/mutexLocker.cpp
    >      >
    >      >      +   def(ThreadTableCreate_lock       , PaddedMutex  , special,
    >      >      false, Monitor::_safepoint_check_never);
    >      >
    >      >      I think this needs to be a _safepoint_check_always lock. The table will
    >      >      be created by regular JavaThreads and they should (nearly) always be
    >      >      checking for safepoints if they are going to block acquiring the lock.
    >      >      And it isn't at all obvious that the thread doing the creation can't go
    >      >      to a safepoint whilst this lock is held.
    >      >
    >      >      ---
    >      >
    >      >      src/hotspot/share/runtime/threadSMR.cpp
    >      >
    >      >      Nit:
    >      >
    >      >        618       JavaThread* thread = thread_at(i);
    >      >
    >      >      you could reuse the new java_thread local you introduced at line 613 and
    >      >      just rename that "new" variable to "thread" so you don't have to change
    >      >      all other uses.
    >      >
    >      >      628   } else if (java_thread != NULL && ...
    >      >
    >      >      You don't need to check != NULL here as you only get here when
    >      >      java_thread is not NULL.
    >      >
    >      >        755     jlong tid = SharedRuntime::get_java_tid(thread);
    >      >        926     jlong tid = SharedRuntime::get_java_tid(thread);
    >      >
    >      >      I think it cleaner/better to just use
    >      >
    >      >      jlong tid = java_lang_Thread::thread_id(thread->threadObj());
    >      >
    >      >      as we know thread is not NULL, it is a JavaThread and it has to have a
    >      >      non-null threadObj.
    >      >
    >      >      ---
    >      >
    >      >      src/hotspot/share/services/management.cpp
    >      >
    >      >      1323         if (THREAD->is_Java_thread()) {
    >      >      1324           JavaThread* current_thread = (JavaThread*)THREAD;
    >      >
    >      >      These calls can only be made on a JavaThread so this be simplified to
    >      >      remove the is_Java_thread() call. Similarly in other places.
    >      >
    >      >      ---
    >      >
    >      >      src/hotspot/share/services/threadTable.cpp
    >      >
    >      >         55 class ThreadTableEntry : public CHeapObj<mtInternal> {
    >      >         56   private:
    >      >         57     jlong _tid;
    >      >
    >      >      I believe hotspot style is to not indent the access modifiers in C++
    >      >      class declarations, so the above would just be:
    >      >
    >      >         55 class ThreadTableEntry : public CHeapObj<mtInternal> {
    >      >         56 private:
    >      >         57   jlong _tid;
    >      >
    >      >      etc.
    >      >
    >      >        60     ThreadTableEntry(jlong tid, JavaThread* java_thread) :
    >      >        61     _tid(tid),_java_thread(java_thread) {}
    >      >
    >      >      line 61 should be indented as it continues line 60.
    >      >
    >      >         67 class ThreadTableConfig : public AllStatic {
    >      >         ...
    >      >         71     static uintx get_hash(Value const& value, bool* is_dead) {
    >      >
    >      >      The is_dead parameter still bothers me here. I can't make enough sense
    >      >      out of the template code in ConcurrentHashtable to see why we have to
    >      >      have it, but I'm concerned that its very existence means we perhaps
    >      >      should not be trying to extend CHT in this context. ??
    >      >
    >      >        115   size_t start_size_log = size_log > DefaultThreadTableSizeLog
    >      >        116   ? size_log : DefaultThreadTableSizeLog;
    >      >
    >      >      line 116 should be indented, though in this case I think a better layout
    >      >      would be:
    >      >
    >      >        115   size_t start_size_log =
    >      >        116       size_log > DefaultThreadTableSizeLog ? size_log :
    >      >      DefaultThreadTableSizeLog;
    >      >
    >      >        131 double ThreadTable::get_load_factor() {
    >      >        132   return (double)_items_count/_current_size;
    >      >        133 }
    >      >
    >      >      Not sure that is doing what you want/expect. It will perform integer
    >      >      division and then cast that whole integer to a double. If you want
    >      >      double arithmetic you need:
    >      >
    >      >      return ((double)_items_count)/_current_size;
    >      >
    >      >      180     jlong          _tid;
    >      >      181     uintx         _hash;
    >      >
    >      >      Nit: no need for all those spaces before the variable name.
    >      >
    >      >        183     ThreadTableLookup(jlong tid)
    >      >        184     : _tid(tid), _hash(primitive_hash(tid)) {}
    >      >
    >      >      line 184 should be indented.
    >      >
    >      >      201     ThreadGet():_return(NULL) {}
    >      >
    >      >      Nit: need space after :
    >      >
    >      >        211    assert(_is_initialized, "Thread table is not initialized");
    >      >        212   _has_work = false;
    >      >
    >      >      line 211 is indented one space too far.
    >      >
    >      >      229     ThreadTableEntry* entry = new ThreadTableEntry(tid,java_thread);
    >      >
    >      >      Nit: need space after ,
    >      >
    >      >      252   return _local_table->remove(thread,lookup);
    >      >
    >      >      Nit: need space after ,
    >      >
    >      >      Thanks,
    >      >      David
    >      >      ------
    >      >
    >      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
    >      >      >
    >      >      > Thanks!
    >      >      > --Daniil
    >      >      >
    >      >      >
    >      >      > ?On 7/8/19, 3:24 PM, "Daniel D. Daugherty" <daniel.daugherty at oracle.com> wrote:
    >      >      >
    >      >      >      On 6/29/19 12:06 PM, Daniil Titov wrote:
    >      >      >      > Hi Serguei and David,
    >      >      >      >
    >      >      >      > Serguei is right, ThreadTable::find_thread(java_tid) cannot  return a JavaThread with an unmatched java_tid.
    >      >      >      >
    >      >      >      > Please find a new version of the fix that includes the changes Serguei suggested.
    >      >      >      >
    >      >      >      > Regarding the concern about the maintaining the thread table when it may never even be queried, one of
    >      >      >      > the options could be to add ThreadTable ::isEnabled flag, set it to "false" by default, and wrap the calls to the thread table
    >      >      >      > in ThreadsSMRSupport add_thread() and remove_thread() methods to check this flag.
    >      >      >      >
    >      >      >      > When ThreadsList::find_JavaThread_from_java_tid() is called for the first time it could check if ThreadTable ::isEnabled
    >      >      >      > Is on and if not then set it on and populate the thread table with all existing threads from the thread list.
    >      >      >
    >      >      >      I have the same concerns as David H. about this new ThreadTable.
    >      >      >      ThreadsList::find_JavaThread_from_java_tid() is only called from code
    >      >      >      in src/hotspot/share/services/management.cpp so I think that table
    >      >      >      needs to enabled and populated only if it is going to be used.
    >      >      >
    >      >      >      I've taken a look at the webrev below and I see that David has
    >      >      >      followed up with additional comments. Before I do a crawl through
    >      >      >      code review for this, I would like to see the ThreadTable stuff
    >      >      >      made optional and David's other comments addressed.
    >      >      >
    >      >      >      Another possible optimization is for callers of
    >      >      >      find_JavaThread_from_java_tid() to save the calling thread's
    >      >      >      tid value before they loop and if the current tid == saved_tid
    >      >      >      then use the current JavaThread* instead of calling
    >      >      >      find_JavaThread_from_java_tid() to get the JavaThread*.
    >      >      >
    >      >      >      Dan
    >      >      >
    >      >      >      >
    >      >      >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.02/
    >      >      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
    >      >      >      >
    >      >      >      > Thanks!
    >      >      >      > --Daniil
    >      >      >      >
    >      >      >      > From: <serguei.spitsyn at oracle.com>
    >      >      >      > Organization: Oracle Corporation
    >      >      >      > Date: Friday, June 28, 2019 at 7:56 PM
    >      >      >      > To: Daniil Titov <daniil.x.titov at oracle.com>, OpenJDK Serviceability <serviceability-dev at openjdk.java.net>, "hotspot-runtime-dev at openjdk.java.net" <hotspot-runtime-dev at openjdk.java.net>, "jmx-dev at openjdk.java.net" <jmx-dev at openjdk.java.net>
    >      >      >      > Subject: Re: RFR: 8185005: Improve performance of ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
    >      >      >      >
    >      >      >      > Hi Daniil,
    >      >      >      >
    >      >      >      > I have several quick comments.
    >      >      >      >
    >      >      >      > The indent in the hotspot c/c++ files has to be 2, not 4.
    >      >      >      >
    >      >      >      > https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/src/hotspot/share/runtime/threadSMR.cpp.frames.html
    >      >      >      > 614 JavaThread* ThreadsList::find_JavaThread_from_java_tid(jlong java_tid) const {
    >      >      >      >   615     JavaThread* java_thread = ThreadTable::find_thread(java_tid);
    >      >      >      >   616     if (java_thread == NULL && java_tid == PMIMORDIAL_JAVA_TID) {
    >      >      >      >   617         // ThreadsSMRSupport::add_thread() is not called for the primordial
    >      >      >      >   618         // thread. Thus, we find this thread with a linear search and add it
    >      >      >      >   619         // to the thread table.
    >      >      >      >   620         for (uint i = 0; i < length(); i++) {
    >      >      >      >   621             JavaThread* thread = thread_at(i);
    >      >      >      >   622             if (is_valid_java_thread(java_tid,thread)) {
    >      >      >      >   623                 ThreadTable::add_thread(java_tid, thread);
    >      >      >      >   624                 return thread;
    >      >      >      >   625             }
    >      >      >      >   626         }
    >      >      >      >   627     } else if (java_thread != NULL && is_valid_java_thread(java_tid, java_thread)) {
    >      >      >      >   628         return java_thread;
    >      >      >      >   629     }
    >      >      >      >   630     return NULL;
    >      >      >      >   631 }
    >      >      >      >   632 bool ThreadsList::is_valid_java_thread(jlong java_tid, JavaThread* java_thread) {
    >      >      >      >   633     oop tobj = java_thread->threadObj();
    >      >      >      >   634     // Ignore the thread if it hasn't run yet, has exited
    >      >      >      >   635     // or is starting to exit.
    >      >      >      >   636     return (tobj != NULL && !java_thread->is_exiting() &&
    >      >      >      >   637             java_tid == java_lang_Thread::thread_id(tobj));
    >      >      >      >   638 }
    >      >      >      >
    >      >      >      >   615     JavaThread* java_thread = ThreadTable::find_thread(java_tid);
    >      >      >      >
    >      >      >      >    I'd suggest to rename find_thread() to find_thread_by_tid().
    >      >      >      >
    >      >      >      > A space is missed after the comma:
    >      >      >      >    622 if (is_valid_java_thread(java_tid,thread)) {
    >      >      >      >
    >      >      >      > An empty line is needed before L632.
    >      >      >      >
    >      >      >      > The name 'is_valid_java_thread' looks wrong (or confusing) to me.
    >      >      >      > Something like 'is_alive_java_thread_with_tid()' would be better.
    >      >      >      > It'd better to list parameters in the opposite order.
    >      >      >      >
    >      >      >      > The call to is_valid_java_thread() is confusing:
    >      >      >      >     627 } else if (java_thread != NULL && is_valid_java_thread(java_tid, java_thread)) {
    >      >      >      >
    >      >      >      > Why would the call ThreadTable::find_thread(java_tid) return a JavaThread with an unmatched java_tid?
    >      >      >      >
    >      >      >      >
    >      >      >      > Thanks,
    >      >      >      > Serguei
    >      >      >      >
    >      >      >      > On 6/28/19, 9:40 PM, "David Holmes" <david.holmes at oracle.com> wrote:
    >      >      >      >
    >      >      >      >      Hi Daniil,
    >      >      >      >
    >      >      >      >      The definition and use of this hashtable (yet another hashtable
    >      >      >      >      implementation!) will need careful examination. We have to be concerned
    >      >      >      >      about the cost of maintaining it when it may never even be queried. You
    >      >      >      >      would need to look at footprint cost and performance impact.
    >      >      >      >
    >      >      >      >      Unfortunately I'm just about to board a plane and will be out for the
    >      >      >      >      next few days. I will try to look at this asap next week, but we will
    >      >      >      >      need a lot more data on it.
    >      >      >      >
    >      >      >      >      Thanks,
    >      >      >      >      David
    >      >      >      >
    >      >      >      > On 6/28/19 3:31 PM, Daniil Titov wrote:
    >      >      >      > Please review the change that improves performance of ThreadMXBean MXBean methods returning the
    >      >      >      > information for specific threads. The change introduces the thread table that uses ConcurrentHashTable
    >      >      >      > to store one-to-one the mapping between the thread ids and JavaThread objects and replaces the linear
    >      >      >      > search over the thread list in ThreadsList::find_JavaThread_from_java_tid(jlong tid) method with the lookup
    >      >      >      > in the thread table.
    >      >      >      >
    >      >      >      > Testing: Mach5 tier1,tier2 and tier3 tests successfully passed.
    >      >      >      >
    >      >      >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/
    >      >      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
    >      >      >      >
    >      >      >      > Thanks!
    >      >      >      >
    >      >      >      > Best regards,
    >      >      >      > Daniil
    >      >      >      >
    >      >      >      >
    >      >      >      >
    >      >      >      >
    >      >      >      >
    >      >      >      >
    >      >      >      >
    >      >      >
    >      >      >
    >      >      >
    >      >      >
    >      >
    >      >
    >      >
    >      
    > 
    > 
    

From david.holmes at oracle.com  Mon Aug 12 23:34:54 2019
From: david.holmes at oracle.com (David Holmes)
Date: Tue, 13 Aug 2019 09:34:54 +1000
Subject: jmx-dev 8185005: Improve performance of
 ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
In-Reply-To: <48311B39-43F9-49E5-BEC7-A64F9D2588AF@oracle.com>
References: <4C4212D0-BFFF-4C85-ACC6-05200F220C3F@oracle.com>
 <2d6dede1-aa79-99ce-a823-773fa2e19827@oracle.com>
 <6E7B043A-4647-4931-977C-1854CA7EBEC1@oracle.com>
 <bc9fbb03-6591-16e0-6411-d80cd8f56877@oracle.com>
 <76BCC96D-DB5D-409A-95D5-3A64B893832D@oracle.com>
 <7e0ba39e-e5b7-f56b-66ea-820a0a35ec2c@oracle.com>
 <87748188-3BD4-4A8B-938A-89DBC8F3C57A@oracle.com>
 <acfdf23e-4d6f-aa2b-3711-1da17a7f5f2c@oracle.com>
 <A6311A8C-738A-4928-90C0-A11CB8B9487D@oracle.com>
 <bc83c941-1d3e-8107-12be-4e30b13707b5@oracle.com>
 <48311B39-43F9-49E5-BEC7-A64F9D2588AF@oracle.com>
Message-ID: <81f607e5-4a65-9e14-8ad0-fdcb8e03c6a4@oracle.com>

Hi Daniil,

On 13/08/2019 9:24 am, Daniil Titov wrote:
> Hi Robbin,
> 
> Thank you very much for reviewing this version of the fix!  Based on your findings
> it seems as it makes sense to make a step back and  continue with the
> approach we took before in the previous version of the webrev (webrev.04),
> and get more information about the impact on the startup time it has. I will
> consult with Claus regarding this and then share the findings.

That seems a good approach to me. It wasn't at all clear to me that the 
latest proposed approach would actually solve the original problem in a 
satisfactory way - it would depend on how constant the set of threads 
being queried was.

There is no perfect solution here as any fix to the reported problem 
incurs overhead elsewhere. Even evaluating the merits of the different 
trade-offs is hard to do - we could end up with a compromise solution 
that fails to satisfy anyone.

David
-----

> Thanks again,
> --Daniil
> 
> 
> 
> 
> 
> 
> ?On 8/12/19, 5:22 AM, "Robbin Ehn" <robbin.ehn at oracle.com> wrote:
> 
>      Hi Daniil,
>      
>      I took a new deeper dive into this.
>      
>      This line seems to have some issues:
>      
>      if (ThreadTable::is_initialized() && thread->in_thread_table() &&
>      !thread->is_attaching_via_jni()) {
>      
>      If you create new threads which attaches and then dies, the table will just keep
>      growing. So you must remove them also ?
>      
>      Secondly you should not use volatile semantics for _in_thread_table.
>      The load in the if-statement can be reordered with _is_initialized.
>      Which could lead to a leak, rogue pointer in the table.
>      
>      So both "static volatile bool _is_initialized;" and "volatile bool
>      _in_thread_table; "
>      should be stored with store_release and loaded with load_acquire.
>      
>      Unfortunately it looks like there still would be races if
>      ThreadTable::add_thread e.g. context switch at:
>      
>      if (_local_table->insert(thread, lookup, entry)) {
>      // HERE
>         java_thread->set_in_thread_table(true);
>      
>      *Remove side can pass the if-statement without removing.
>      
>      Since this thread also maybe exiting at any moment, e.g. context switch:
>      
>             if (tobj != NULL && !thread->is_exiting() &&
>                 java_tid == java_lang_Thread::thread_id(tobj)) {
>      	// HERE
>               ThreadTable::add_thread(java_tid, thread);
>      
>      *Add side can add a thread that is exiting.
>      
>      Mixing in a third thread looking up a random tid and getting a JavaThread*, it
>      must validate it against it's ThreadsList. Making the hashtable useless.
>      
>      So I think the only one adding and removing should be the thread itself.
>      1:Add to ThreadsList
>      2:Add to ThreadTable
>      3:Remove from ThreadTable
>      4:Remove ThreadsList
>      
>      Between 1-2 and 3-4 the thread would be looked-up via linear scan.
>      I don't see an easy way around the start-up issue with this.
>      
>      Maybe have the cache in Java.
>      Pass in the thread obj into a
>      java_sun_management_ThreadImpl_getThreadTotalCpuTime3 instead,
>      thus skipping any look-ups in native.
>      
>      Thanks, Robbin
>      
>      
>      On 8/12/19 5:49 AM, Daniil Titov wrote:
>      > Hi David, Robbin, Daniel, and Serguei,
>      >
>      > Please review a new version of the fix.
>      >
>      > As David suggested I created a separated Jira issue [1] to cover  additional optimization for
>      > some callers of find_JavaThread_from_java_tid() and this version of the fix no longer includes
>      > changes in management.cpp ( and the test related with these changes).
>      >
>      > Regarding the impact the previous version of the fix had on the thread startup time at heavy load (e.g.
>      > when 5000 threads are created and destroyed every second) I tried a different approach that makes
>      > calls to ThreadTable::add_thread  and ThreadTable::remove_thread  asynchronous and offloads the
>      > work for actual modifications of the thread table to a periodic task that runs every 5 seconds. With the
>      > same  stress test scenario (the  test does some warm-up and then measures the time it takes to create
>      > and start 100,000 threads; every  thread just sleeps  for 100 ms) the impact on the thread startup time
>      > was reduced to 1.2% ( from 2.7%).
>      >
>      > The cause of this impact in this stress test scenario is that as soon as the thread table is initialized,
>      > an additional work to insert  and delete entries in the thread table should be performed, even if
>      > com.sun.management.ThreadMXBean methods are no longer called. For example, In the stress test
>      > mentioned above, every second about 5000 entries had to be inserted in the table and then deleted.
>      >
>      > That doesn't look right and the new version of the fix uses the different approach: the thread is added to
>      > the thread table only when this thread is requested by com.sun.management.ThreadMXBean bean. Every
>      > time when find_JavaThread_from_java_tid() is called for a new tid, the thread  is found by the iterating over
>      > the thread list and added to the thread table. All consequent calls to find_JavaThread_from_java_tid() for
>      > the same tid returns the thread from the thread table.
>      >
>      > Running stress test for the cases when the thread table is enabled and not showed no difference in the
>      > average thread startup times.
>      >
>      > [1] : https://bugs.openjdk.java.net/browse/JDK-8229391
>      >
>      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.05/
>      >
>      > Thanks,
>      > Daniil
>      >
>      > ?On 8/4/19, 7:54 PM, "David Holmes" <david.holmes at oracle.com> wrote:
>      >
>      >      Hi Daniil,
>      >
>      >      On 3/08/2019 8:16 am, Daniil Titov wrote:
>      >      > Hi David,
>      >      >
>      >      > Thank you for your detailed review. Please review a new version of the fix that includes
>      >      > the changes you suggested:
>      >      > - ThreadTableCreate_lock scope is reduced to cover the creation of the table only;
>      >      > - ThreadTableCreate_lock is made _safepoint_check_always;
>      >
>      >      Okay.
>      >
>      >      > - ServiceThread is no longer responsible for the resizing of the thread table, instead,
>      >      >    the thread table is changed to grow on demand by the thread that is doing the addition;
>      >
>      >      Okay - I'm happy to get the serviceThread out of the picture here.
>      >
>      >      > - fixed nits and formatting issues.
>      >
>      >      Okay.
>      >
>      >      >>> The change also includes additional optimization for some callers of find_JavaThread_from_java_tid()
>      >      >>>   as Daniel suggested.
>      >      >> Not sure it's best to combine these, but if they are limited to the
>      >      >> changes in management.cpp only then that may be okay.
>      >      >
>      >      > The additional optimization for some callers of find_JavaThread_from_java_tid() is
>      >      > limited to management.cpp (plus a new test) so I left them in the webrev  but
>      >      > I also could move it in the separate issue if required.
>      >
>      >      I'd prefer this part of be separated out, but won't insist. Let's see if
>      >      Dan or Serguei have a strong opinion.
>      >
>      >      >    > src/hotspot/share/runtime/threadSMR.cpp
>      >      >    >755     jlong tid = SharedRuntime::get_java_tid(thread);
>      >      >    > 926     jlong tid = SharedRuntime::get_java_tid(thread);
>      >      >   >  I think it cleaner/better to just use
>      >      >   > jlong tid = java_lang_Thread::thread_id(thread->threadObj());
>      >      >   > as we know thread is not NULL, it is a JavaThread and it has to have a
>      >      >   > non-null threadObj.
>      >      >
>      >      > I had to leave this code unchanged since it turned out the threadObj is null
>      >      > when VM is destroyed:
>      >      >
>      >      > V  [libjvm.so+0xe165d7]  oopDesc::long_field(int) const+0x67
>      >      > V  [libjvm.so+0x16e06c6]  ThreadsSMRSupport::add_thread(JavaThread*)+0x116
>      >      > V  [libjvm.so+0x16d1302]  Threads::add(JavaThread*, bool)+0x82
>      >      > V  [libjvm.so+0xef8369]  attach_current_thread.part.197+0xc9
>      >      > V  [libjvm.so+0xec136c]  jni_DestroyJavaVM+0x6c
>      >      > C  [libjli.so+0x4333]  JavaMain+0x2c3
>      >      > C  [libjli.so+0x8159]  ThreadJavaMain+0x9
>      >
>      >      This is actually nothing to do with the VM being destroyed, but is an
>      >      issue with JNI_AttachCurrentThread and its interaction with the
>      >      ThreadSMR iterators. The attach process is:
>      >      - create JavaThread
>      >      - mark as "is attaching via jni"
>      >      - add to ThreadsList
>      >      - create java.lang.Thread object (you can only execute Java code after
>      >      you are attached)
>      >      - mark as "attach completed"
>      >
>      >      So while a thread "is attaching" it will be seen by the ThreadSMR thread
>      >      iterator but will have a NULL java.lang.Thread object.
>      >
>      >      We special-case attaching threads in a number of places in the VM and I
>      >      think we should be explicitly doing something here to filter out
>      >      attaching threads, rather than just being tolerant of a NULL j.l.Thread
>      >      object. Specifically in ThreadsSMRSupport::add_thread:
>      >
>      >      if (ThreadTable::is_initialized() && !thread->is_attaching_via_jni()) {
>      >         jlong tid = java_lang_Thread::thread_id(thread->threadObj());
>      >         ThreadTable::add_thread(tid, thread);
>      >      }
>      >
>      >      Note that in ThreadsSMRSupport::remove_thread we can use the same guard,
>      >      which covers the case the JNI attach encountered an error trying to
>      >      create the j.l.Thread object.
>      >
>      >      >> src/hotspot/share/services/threadTable.cpp
>      >      >> 71     static uintx get_hash(Value const& value, bool* is_dead) {
>      >      >
>      >      >> The is_dead parameter still bothers me here. I can't make enough sense
>      >      >> out of the template code in ConcurrentHashtable to see why we have to
>      >      >> have it, but I'm concerned that its very existence means we perhaps
>      >      >> should not be trying to extend CHT in this context. ??
>      >      >
>      >      > My understanding is that is_dead parameter provides a mechanism for
>      >      > ConcurrentHashtable to remove stale entries that were not explicitly
>      >      > removed by calling  ConcurrentHashTable::remove() method.
>      >      > I think that just because in our case we don't use this mechanism doesn't
>      >      > mean we should not use ConcurrentHashTable.
>      >
>      >      Can you confirm that this usage is okay with Robbin Ehn please. He's
>      >      back from vacation this week.
>      >
>      >      >> I would still want to see what impact this has on thread
>      >      >> startup cost, both with and without the table being initialized.
>      >      >
>      >      > I run a test that initializes the table by calling ThreadMXBean.get getThreadInfo(),
>      >      > starts some threads as a worm-up, and then creates and starts 100,000 threads
>      >      > (each thread just sleeps for 100 ms). In case when the thread table is enabled
>      >      > 100,000 threads are created and started  for about 15200 ms. If the thread table
>      >      > is off the test takes about 14800 ms. Based on this information the enabled
>      >      > thread table makes the thread startup about 2.7% slower.
>      >
>      >      That doesn't sound very good. I think we may need to Claes involved to
>      >      help investigate overall performance impact here.
>      >
>      >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.04/
>      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>      >
>      >      No further code comments.
>      >
>      >      I didn't look at the test in detail.
>      >
>      >      Thanks,
>      >      David
>      >
>      >      > Thanks!
>      >      > --Daniil
>      >      >
>      >      >
>      >      > ?On 7/29/19, 12:53 AM, "David Holmes" <david.holmes at oracle.com> wrote:
>      >      >
>      >      >      Hi Daniil,
>      >      >
>      >      >      Overall I think this is a reasonable approach but I would still like to
>      >      >      see some performance and footprint numbers, both to verify it fixes the
>      >      >      problem reported, and that we are not getting penalized elsewhere.
>      >      >
>      >      >      On 25/07/2019 3:21 am, Daniil Titov wrote:
>      >      >      > Hi David, Daniel, and Serguei,
>      >      >      >
>      >      >      > Please review the new version of the fix, that makes the thread table initialization on demand and
>      >      >      > moves it inside ThreadsList::find_JavaThread_from_java_tid(). At the creation time the thread table
>      >      >      >   is initialized with the threads from the current thread list. We don't want to hold Threads_lock
>      >      >      > inside find_JavaThread_from_java_tid(),  thus new threads still could be created  while the thread
>      >      >      > table is being initialized . Such threads will be found by the linear search and added to the thread table
>      >      >      > later, in ThreadsList::find_JavaThread_from_java_tid().
>      >      >
>      >      >      The initialization allows the created but unpopulated, or partially
>      >      >      populated, table to be seen by other threads - is that your intention?
>      >      >      It seems it should be okay as the other threads will then race with the
>      >      >      initializing thread to add specific entries, and this is a concurrent
>      >      >      map so that should be functionally correct. But if so then I think you
>      >      >      can also reduce the scope of the ThreadTableCreate_lock so that it
>      >      >      covers creation of the table only, not the initial population of the table.
>      >      >
>      >      >      I like the approach of only initializing the table when needed and using
>      >      >      that to control when the add/remove-thread code needs to update the
>      >      >      table. But I would still want to see what impact this has on thread
>      >      >      startup cost, both with and without the table being initialized.
>      >      >
>      >      >      > The change also includes additional optimization for some callers of find_JavaThread_from_java_tid()
>      >      >      > as Daniel suggested.
>      >      >
>      >      >      Not sure it's best to combine these, but if they are limited to the
>      >      >      changes in management.cpp only then that may be okay. It helps to be
>      >      >      able to focus on the table related changes without being distracted by
>      >      >      other optimizations.
>      >      >
>      >      >      > That is correct that ResolvedMethodTable was used as a blueprint for the thread table, however, I tried
>      >      >      > to strip it of the all functionality that is not required in the thread table case.
>      >      >
>      >      >      The revised version seems better in that regard. But I still have a
>      >      >      concern, see below.
>      >      >
>      >      >      > We need to have the thread table resizable and allow it to grow as the number of threads increases to avoid
>      >      >      > reserving excessive memory a-priori or deteriorating lookup times. The ServiceThread is responsible for
>      >      >      > growing the thread table when required.
>      >      >
>      >      >      Yes but why? Why can't this table be grown on demand by the thread that
>      >      >      is doing the addition? For other tables we may have to delegate to the
>      >      >      service thread because the current thread cannot perform the action, or
>      >      >      it doesn't want to perform it at the time the need for the resize is
>      >      >      detected (e.g. its detected at a safepoint and you want the resize to
>      >      >      happen later outside the safepoint). It's not apparent to me that such
>      >      >      restrictions apply here.
>      >      >
>      >      >      > There is no ConcurrentHashTable available in Java 8 and for backporting this fix to Java 8 another implementation
>      >      >      > of the hash table, probably originally suggested in the patch attached to the JBS issue, should be used.  It will make
>      >      >      > the backporting more complicated,  however, adding a new Implementation of the hash table in Java 14 while it
>      >      >      > already has ConcurrentHashTable doesn't seem  reasonable for me.
>      >      >
>      >      >      Ok.
>      >      >
>      >      >      > Webrev: http://cr.openjdk.java.net/~dtitov/8185005/webrev.03
>      >      >
>      >      >      Some specific code comments:
>      >      >
>      >      >      src/hotspot/share/runtime/mutexLocker.cpp
>      >      >
>      >      >      +   def(ThreadTableCreate_lock       , PaddedMutex  , special,
>      >      >      false, Monitor::_safepoint_check_never);
>      >      >
>      >      >      I think this needs to be a _safepoint_check_always lock. The table will
>      >      >      be created by regular JavaThreads and they should (nearly) always be
>      >      >      checking for safepoints if they are going to block acquiring the lock.
>      >      >      And it isn't at all obvious that the thread doing the creation can't go
>      >      >      to a safepoint whilst this lock is held.
>      >      >
>      >      >      ---
>      >      >
>      >      >      src/hotspot/share/runtime/threadSMR.cpp
>      >      >
>      >      >      Nit:
>      >      >
>      >      >        618       JavaThread* thread = thread_at(i);
>      >      >
>      >      >      you could reuse the new java_thread local you introduced at line 613 and
>      >      >      just rename that "new" variable to "thread" so you don't have to change
>      >      >      all other uses.
>      >      >
>      >      >      628   } else if (java_thread != NULL && ...
>      >      >
>      >      >      You don't need to check != NULL here as you only get here when
>      >      >      java_thread is not NULL.
>      >      >
>      >      >        755     jlong tid = SharedRuntime::get_java_tid(thread);
>      >      >        926     jlong tid = SharedRuntime::get_java_tid(thread);
>      >      >
>      >      >      I think it cleaner/better to just use
>      >      >
>      >      >      jlong tid = java_lang_Thread::thread_id(thread->threadObj());
>      >      >
>      >      >      as we know thread is not NULL, it is a JavaThread and it has to have a
>      >      >      non-null threadObj.
>      >      >
>      >      >      ---
>      >      >
>      >      >      src/hotspot/share/services/management.cpp
>      >      >
>      >      >      1323         if (THREAD->is_Java_thread()) {
>      >      >      1324           JavaThread* current_thread = (JavaThread*)THREAD;
>      >      >
>      >      >      These calls can only be made on a JavaThread so this be simplified to
>      >      >      remove the is_Java_thread() call. Similarly in other places.
>      >      >
>      >      >      ---
>      >      >
>      >      >      src/hotspot/share/services/threadTable.cpp
>      >      >
>      >      >         55 class ThreadTableEntry : public CHeapObj<mtInternal> {
>      >      >         56   private:
>      >      >         57     jlong _tid;
>      >      >
>      >      >      I believe hotspot style is to not indent the access modifiers in C++
>      >      >      class declarations, so the above would just be:
>      >      >
>      >      >         55 class ThreadTableEntry : public CHeapObj<mtInternal> {
>      >      >         56 private:
>      >      >         57   jlong _tid;
>      >      >
>      >      >      etc.
>      >      >
>      >      >        60     ThreadTableEntry(jlong tid, JavaThread* java_thread) :
>      >      >        61     _tid(tid),_java_thread(java_thread) {}
>      >      >
>      >      >      line 61 should be indented as it continues line 60.
>      >      >
>      >      >         67 class ThreadTableConfig : public AllStatic {
>      >      >         ...
>      >      >         71     static uintx get_hash(Value const& value, bool* is_dead) {
>      >      >
>      >      >      The is_dead parameter still bothers me here. I can't make enough sense
>      >      >      out of the template code in ConcurrentHashtable to see why we have to
>      >      >      have it, but I'm concerned that its very existence means we perhaps
>      >      >      should not be trying to extend CHT in this context. ??
>      >      >
>      >      >        115   size_t start_size_log = size_log > DefaultThreadTableSizeLog
>      >      >        116   ? size_log : DefaultThreadTableSizeLog;
>      >      >
>      >      >      line 116 should be indented, though in this case I think a better layout
>      >      >      would be:
>      >      >
>      >      >        115   size_t start_size_log =
>      >      >        116       size_log > DefaultThreadTableSizeLog ? size_log :
>      >      >      DefaultThreadTableSizeLog;
>      >      >
>      >      >        131 double ThreadTable::get_load_factor() {
>      >      >        132   return (double)_items_count/_current_size;
>      >      >        133 }
>      >      >
>      >      >      Not sure that is doing what you want/expect. It will perform integer
>      >      >      division and then cast that whole integer to a double. If you want
>      >      >      double arithmetic you need:
>      >      >
>      >      >      return ((double)_items_count)/_current_size;
>      >      >
>      >      >      180     jlong          _tid;
>      >      >      181     uintx         _hash;
>      >      >
>      >      >      Nit: no need for all those spaces before the variable name.
>      >      >
>      >      >        183     ThreadTableLookup(jlong tid)
>      >      >        184     : _tid(tid), _hash(primitive_hash(tid)) {}
>      >      >
>      >      >      line 184 should be indented.
>      >      >
>      >      >      201     ThreadGet():_return(NULL) {}
>      >      >
>      >      >      Nit: need space after :
>      >      >
>      >      >        211    assert(_is_initialized, "Thread table is not initialized");
>      >      >        212   _has_work = false;
>      >      >
>      >      >      line 211 is indented one space too far.
>      >      >
>      >      >      229     ThreadTableEntry* entry = new ThreadTableEntry(tid,java_thread);
>      >      >
>      >      >      Nit: need space after ,
>      >      >
>      >      >      252   return _local_table->remove(thread,lookup);
>      >      >
>      >      >      Nit: need space after ,
>      >      >
>      >      >      Thanks,
>      >      >      David
>      >      >      ------
>      >      >
>      >      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>      >      >      >
>      >      >      > Thanks!
>      >      >      > --Daniil
>      >      >      >
>      >      >      >
>      >      >      > ?On 7/8/19, 3:24 PM, "Daniel D. Daugherty" <daniel.daugherty at oracle.com> wrote:
>      >      >      >
>      >      >      >      On 6/29/19 12:06 PM, Daniil Titov wrote:
>      >      >      >      > Hi Serguei and David,
>      >      >      >      >
>      >      >      >      > Serguei is right, ThreadTable::find_thread(java_tid) cannot  return a JavaThread with an unmatched java_tid.
>      >      >      >      >
>      >      >      >      > Please find a new version of the fix that includes the changes Serguei suggested.
>      >      >      >      >
>      >      >      >      > Regarding the concern about the maintaining the thread table when it may never even be queried, one of
>      >      >      >      > the options could be to add ThreadTable ::isEnabled flag, set it to "false" by default, and wrap the calls to the thread table
>      >      >      >      > in ThreadsSMRSupport add_thread() and remove_thread() methods to check this flag.
>      >      >      >      >
>      >      >      >      > When ThreadsList::find_JavaThread_from_java_tid() is called for the first time it could check if ThreadTable ::isEnabled
>      >      >      >      > Is on and if not then set it on and populate the thread table with all existing threads from the thread list.
>      >      >      >
>      >      >      >      I have the same concerns as David H. about this new ThreadTable.
>      >      >      >      ThreadsList::find_JavaThread_from_java_tid() is only called from code
>      >      >      >      in src/hotspot/share/services/management.cpp so I think that table
>      >      >      >      needs to enabled and populated only if it is going to be used.
>      >      >      >
>      >      >      >      I've taken a look at the webrev below and I see that David has
>      >      >      >      followed up with additional comments. Before I do a crawl through
>      >      >      >      code review for this, I would like to see the ThreadTable stuff
>      >      >      >      made optional and David's other comments addressed.
>      >      >      >
>      >      >      >      Another possible optimization is for callers of
>      >      >      >      find_JavaThread_from_java_tid() to save the calling thread's
>      >      >      >      tid value before they loop and if the current tid == saved_tid
>      >      >      >      then use the current JavaThread* instead of calling
>      >      >      >      find_JavaThread_from_java_tid() to get the JavaThread*.
>      >      >      >
>      >      >      >      Dan
>      >      >      >
>      >      >      >      >
>      >      >      >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.02/
>      >      >      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>      >      >      >      >
>      >      >      >      > Thanks!
>      >      >      >      > --Daniil
>      >      >      >      >
>      >      >      >      > From: <serguei.spitsyn at oracle.com>
>      >      >      >      > Organization: Oracle Corporation
>      >      >      >      > Date: Friday, June 28, 2019 at 7:56 PM
>      >      >      >      > To: Daniil Titov <daniil.x.titov at oracle.com>, OpenJDK Serviceability <serviceability-dev at openjdk.java.net>, "hotspot-runtime-dev at openjdk.java.net" <hotspot-runtime-dev at openjdk.java.net>, "jmx-dev at openjdk.java.net" <jmx-dev at openjdk.java.net>
>      >      >      >      > Subject: Re: RFR: 8185005: Improve performance of ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
>      >      >      >      >
>      >      >      >      > Hi Daniil,
>      >      >      >      >
>      >      >      >      > I have several quick comments.
>      >      >      >      >
>      >      >      >      > The indent in the hotspot c/c++ files has to be 2, not 4.
>      >      >      >      >
>      >      >      >      > https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/src/hotspot/share/runtime/threadSMR.cpp.frames.html
>      >      >      >      > 614 JavaThread* ThreadsList::find_JavaThread_from_java_tid(jlong java_tid) const {
>      >      >      >      >   615     JavaThread* java_thread = ThreadTable::find_thread(java_tid);
>      >      >      >      >   616     if (java_thread == NULL && java_tid == PMIMORDIAL_JAVA_TID) {
>      >      >      >      >   617         // ThreadsSMRSupport::add_thread() is not called for the primordial
>      >      >      >      >   618         // thread. Thus, we find this thread with a linear search and add it
>      >      >      >      >   619         // to the thread table.
>      >      >      >      >   620         for (uint i = 0; i < length(); i++) {
>      >      >      >      >   621             JavaThread* thread = thread_at(i);
>      >      >      >      >   622             if (is_valid_java_thread(java_tid,thread)) {
>      >      >      >      >   623                 ThreadTable::add_thread(java_tid, thread);
>      >      >      >      >   624                 return thread;
>      >      >      >      >   625             }
>      >      >      >      >   626         }
>      >      >      >      >   627     } else if (java_thread != NULL && is_valid_java_thread(java_tid, java_thread)) {
>      >      >      >      >   628         return java_thread;
>      >      >      >      >   629     }
>      >      >      >      >   630     return NULL;
>      >      >      >      >   631 }
>      >      >      >      >   632 bool ThreadsList::is_valid_java_thread(jlong java_tid, JavaThread* java_thread) {
>      >      >      >      >   633     oop tobj = java_thread->threadObj();
>      >      >      >      >   634     // Ignore the thread if it hasn't run yet, has exited
>      >      >      >      >   635     // or is starting to exit.
>      >      >      >      >   636     return (tobj != NULL && !java_thread->is_exiting() &&
>      >      >      >      >   637             java_tid == java_lang_Thread::thread_id(tobj));
>      >      >      >      >   638 }
>      >      >      >      >
>      >      >      >      >   615     JavaThread* java_thread = ThreadTable::find_thread(java_tid);
>      >      >      >      >
>      >      >      >      >    I'd suggest to rename find_thread() to find_thread_by_tid().
>      >      >      >      >
>      >      >      >      > A space is missed after the comma:
>      >      >      >      >    622 if (is_valid_java_thread(java_tid,thread)) {
>      >      >      >      >
>      >      >      >      > An empty line is needed before L632.
>      >      >      >      >
>      >      >      >      > The name 'is_valid_java_thread' looks wrong (or confusing) to me.
>      >      >      >      > Something like 'is_alive_java_thread_with_tid()' would be better.
>      >      >      >      > It'd better to list parameters in the opposite order.
>      >      >      >      >
>      >      >      >      > The call to is_valid_java_thread() is confusing:
>      >      >      >      >     627 } else if (java_thread != NULL && is_valid_java_thread(java_tid, java_thread)) {
>      >      >      >      >
>      >      >      >      > Why would the call ThreadTable::find_thread(java_tid) return a JavaThread with an unmatched java_tid?
>      >      >      >      >
>      >      >      >      >
>      >      >      >      > Thanks,
>      >      >      >      > Serguei
>      >      >      >      >
>      >      >      >      > On 6/28/19, 9:40 PM, "David Holmes" <david.holmes at oracle.com> wrote:
>      >      >      >      >
>      >      >      >      >      Hi Daniil,
>      >      >      >      >
>      >      >      >      >      The definition and use of this hashtable (yet another hashtable
>      >      >      >      >      implementation!) will need careful examination. We have to be concerned
>      >      >      >      >      about the cost of maintaining it when it may never even be queried. You
>      >      >      >      >      would need to look at footprint cost and performance impact.
>      >      >      >      >
>      >      >      >      >      Unfortunately I'm just about to board a plane and will be out for the
>      >      >      >      >      next few days. I will try to look at this asap next week, but we will
>      >      >      >      >      need a lot more data on it.
>      >      >      >      >
>      >      >      >      >      Thanks,
>      >      >      >      >      David
>      >      >      >      >
>      >      >      >      > On 6/28/19 3:31 PM, Daniil Titov wrote:
>      >      >      >      > Please review the change that improves performance of ThreadMXBean MXBean methods returning the
>      >      >      >      > information for specific threads. The change introduces the thread table that uses ConcurrentHashTable
>      >      >      >      > to store one-to-one the mapping between the thread ids and JavaThread objects and replaces the linear
>      >      >      >      > search over the thread list in ThreadsList::find_JavaThread_from_java_tid(jlong tid) method with the lookup
>      >      >      >      > in the thread table.
>      >      >      >      >
>      >      >      >      > Testing: Mach5 tier1,tier2 and tier3 tests successfully passed.
>      >      >      >      >
>      >      >      >      > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/
>      >      >      >      > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>      >      >      >      >
>      >      >      >      > Thanks!
>      >      >      >      >
>      >      >      >      > Best regards,
>      >      >      >      > Daniil
>      >      >      >      >
>      >      >      >      >
>      >      >      >      >
>      >      >      >      >
>      >      >      >      >
>      >      >      >      >
>      >      >      >      >
>      >      >      >
>      >      >      >
>      >      >      >
>      >      >      >
>      >      >
>      >      >
>      >      >
>      >
>      >
>      >
>      
> 
> 

From daniel.daugherty at oracle.com  Wed Aug 14 18:25:07 2019
From: daniel.daugherty at oracle.com (Daniel D. Daugherty)
Date: Wed, 14 Aug 2019 14:25:07 -0400
Subject: jmx-dev 8185005: Improve performance of
 ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
In-Reply-To: <81f607e5-4a65-9e14-8ad0-fdcb8e03c6a4@oracle.com>
References: <4C4212D0-BFFF-4C85-ACC6-05200F220C3F@oracle.com>
 <2d6dede1-aa79-99ce-a823-773fa2e19827@oracle.com>
 <6E7B043A-4647-4931-977C-1854CA7EBEC1@oracle.com>
 <bc9fbb03-6591-16e0-6411-d80cd8f56877@oracle.com>
 <76BCC96D-DB5D-409A-95D5-3A64B893832D@oracle.com>
 <7e0ba39e-e5b7-f56b-66ea-820a0a35ec2c@oracle.com>
 <87748188-3BD4-4A8B-938A-89DBC8F3C57A@oracle.com>
 <acfdf23e-4d6f-aa2b-3711-1da17a7f5f2c@oracle.com>
 <A6311A8C-738A-4928-90C0-A11CB8B9487D@oracle.com>
 <bc83c941-1d3e-8107-12be-4e30b13707b5@oracle.com>
 <48311B39-43F9-49E5-BEC7-A64F9D2588AF@oracle.com>
 <81f607e5-4a65-9e14-8ad0-fdcb8e03c6a4@oracle.com>
Message-ID: <2bf948a5-1397-3459-030a-be9598b4e7fc@oracle.com>

So based on the last three message on this thread:

webrev.05 is withdrawn for the moment
webrev.04 is the current webrev, but needs to have some startup time
 ?? ? ? ?? issues resolved before moving forward.

I'm going to hold off on re-reviewing anything at the moment until
the dust settles...

Dan


On 8/12/19 7:34 PM, David Holmes wrote:
> Hi Daniil,
>
> On 13/08/2019 9:24 am, Daniil Titov wrote:
>> Hi Robbin,
>>
>> Thank you very much for reviewing this version of the fix! Based on 
>> your findings
>> it seems as it makes sense to make a step back and? continue with the
>> approach we took before in the previous version of the webrev 
>> (webrev.04),
>> and get more information about the impact on the startup time it has. 
>> I will
>> consult with Claus regarding this and then share the findings.
>
> That seems a good approach to me. It wasn't at all clear to me that 
> the latest proposed approach would actually solve the original problem 
> in a satisfactory way - it would depend on how constant the set of 
> threads being queried was.
>
> There is no perfect solution here as any fix to the reported problem 
> incurs overhead elsewhere. Even evaluating the merits of the different 
> trade-offs is hard to do - we could end up with a compromise solution 
> that fails to satisfy anyone.
>
> David
> -----
>
>> Thanks again,
>> --Daniil
>>
>>
>>
>>
>>
>>
>> ?On 8/12/19, 5:22 AM, "Robbin Ehn" <robbin.ehn at oracle.com> wrote:
>>
>> ???? Hi Daniil,
>> ???? ???? I took a new deeper dive into this.
>> ???? ???? This line seems to have some issues:
>> ???? ???? if (ThreadTable::is_initialized() && 
>> thread->in_thread_table() &&
>> ???? !thread->is_attaching_via_jni()) {
>> ???? ???? If you create new threads which attaches and then dies, the 
>> table will just keep
>> ???? growing. So you must remove them also ?
>> ???? ???? Secondly you should not use volatile semantics for 
>> _in_thread_table.
>> ???? The load in the if-statement can be reordered with _is_initialized.
>> ???? Which could lead to a leak, rogue pointer in the table.
>> ???? ???? So both "static volatile bool _is_initialized;" and 
>> "volatile bool
>> ???? _in_thread_table; "
>> ???? should be stored with store_release and loaded with load_acquire.
>> ???? ???? Unfortunately it looks like there still would be races if
>> ???? ThreadTable::add_thread e.g. context switch at:
>> ???? ???? if (_local_table->insert(thread, lookup, entry)) {
>> ???? // HERE
>> ??????? java_thread->set_in_thread_table(true);
>> ???? ???? *Remove side can pass the if-statement without removing.
>> ???? ???? Since this thread also maybe exiting at any moment, e.g. 
>> context switch:
>> ???? ??????????? if (tobj != NULL && !thread->is_exiting() &&
>> ??????????????? java_tid == java_lang_Thread::thread_id(tobj)) {
>> ???????? // HERE
>> ????????????? ThreadTable::add_thread(java_tid, thread);
>> ???? ???? *Add side can add a thread that is exiting.
>> ???? ???? Mixing in a third thread looking up a random tid and 
>> getting a JavaThread*, it
>> ???? must validate it against it's ThreadsList. Making the hashtable 
>> useless.
>> ???? ???? So I think the only one adding and removing should be the 
>> thread itself.
>> ???? 1:Add to ThreadsList
>> ???? 2:Add to ThreadTable
>> ???? 3:Remove from ThreadTable
>> ???? 4:Remove ThreadsList
>> ???? ???? Between 1-2 and 3-4 the thread would be looked-up via 
>> linear scan.
>> ???? I don't see an easy way around the start-up issue with this.
>> ???? ???? Maybe have the cache in Java.
>> ???? Pass in the thread obj into a
>> ???? java_sun_management_ThreadImpl_getThreadTotalCpuTime3 instead,
>> ???? thus skipping any look-ups in native.
>> ???? ???? Thanks, Robbin
>> ???? ???? ???? On 8/12/19 5:49 AM, Daniil Titov wrote:
>> ???? > Hi David, Robbin, Daniel, and Serguei,
>> ???? >
>> ???? > Please review a new version of the fix.
>> ???? >
>> ???? > As David suggested I created a separated Jira issue [1] to 
>> cover? additional optimization for
>> ???? > some callers of find_JavaThread_from_java_tid() and this 
>> version of the fix no longer includes
>> ???? > changes in management.cpp ( and the test related with these 
>> changes).
>> ???? >
>> ???? > Regarding the impact the previous version of the fix had on 
>> the thread startup time at heavy load (e.g.
>> ???? > when 5000 threads are created and destroyed every second) I 
>> tried a different approach that makes
>> ???? > calls to ThreadTable::add_thread? and 
>> ThreadTable::remove_thread? asynchronous and offloads the
>> ???? > work for actual modifications of the thread table to a 
>> periodic task that runs every 5 seconds. With the
>> ???? > same? stress test scenario (the? test does some warm-up and 
>> then measures the time it takes to create
>> ???? > and start 100,000 threads; every? thread just sleeps for 100 
>> ms) the impact on the thread startup time
>> ???? > was reduced to 1.2% ( from 2.7%).
>> ???? >
>> ???? > The cause of this impact in this stress test scenario is that 
>> as soon as the thread table is initialized,
>> ???? > an additional work to insert? and delete entries in the thread 
>> table should be performed, even if
>> ???? > com.sun.management.ThreadMXBean methods are no longer called. 
>> For example, In the stress test
>> ???? > mentioned above, every second about 5000 entries had to be 
>> inserted in the table and then deleted.
>> ???? >
>> ???? > That doesn't look right and the new version of the fix uses 
>> the different approach: the thread is added to
>> ???? > the thread table only when this thread is requested by 
>> com.sun.management.ThreadMXBean bean. Every
>> ???? > time when find_JavaThread_from_java_tid() is called for a new 
>> tid, the thread? is found by the iterating over
>> ???? > the thread list and added to the thread table. All consequent 
>> calls to find_JavaThread_from_java_tid() for
>> ???? > the same tid returns the thread from the thread table.
>> ???? >
>> ???? > Running stress test for the cases when the thread table is 
>> enabled and not showed no difference in the
>> ???? > average thread startup times.
>> ???? >
>> ???? > [1] : https://bugs.openjdk.java.net/browse/JDK-8229391
>> ???? >
>> ???? > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>> ???? > Webrev: https://cr.openjdk.java.net/~dtitov/8185005/webrev.05/
>> ???? >
>> ???? > Thanks,
>> ???? > Daniil
>> ???? >
>> ???? > ?On 8/4/19, 7:54 PM, "David Holmes" <david.holmes at oracle.com> 
>> wrote:
>> ???? >
>> ???? >????? Hi Daniil,
>> ???? >
>> ???? >????? On 3/08/2019 8:16 am, Daniil Titov wrote:
>> ???? >????? > Hi David,
>> ???? >????? >
>> ???? >????? > Thank you for your detailed review. Please review a new 
>> version of the fix that includes
>> ???? >????? > the changes you suggested:
>> ???? >????? > - ThreadTableCreate_lock scope is reduced to cover the 
>> creation of the table only;
>> ???? >????? > - ThreadTableCreate_lock is made _safepoint_check_always;
>> ???? >
>> ???? >????? Okay.
>> ???? >
>> ???? >????? > - ServiceThread is no longer responsible for the 
>> resizing of the thread table, instead,
>> ???? >????? >??? the thread table is changed to grow on demand by the 
>> thread that is doing the addition;
>> ???? >
>> ???? >????? Okay - I'm happy to get the serviceThread out of the 
>> picture here.
>> ???? >
>> ???? >????? > - fixed nits and formatting issues.
>> ???? >
>> ???? >????? Okay.
>> ???? >
>> ???? >????? >>> The change also includes additional optimization for 
>> some callers of find_JavaThread_from_java_tid()
>> ???? >????? >>>?? as Daniel suggested.
>> ???? >????? >> Not sure it's best to combine these, but if they are 
>> limited to the
>> ???? >????? >> changes in management.cpp only then that may be okay.
>> ???? >????? >
>> ???? >????? > The additional optimization for some callers of 
>> find_JavaThread_from_java_tid() is
>> ???? >????? > limited to management.cpp (plus a new test) so I left 
>> them in the webrev? but
>> ???? >????? > I also could move it in the separate issue if required.
>> ???? >
>> ???? >????? I'd prefer this part of be separated out, but won't 
>> insist. Let's see if
>> ???? >????? Dan or Serguei have a strong opinion.
>> ???? >
>> ???? >????? >??? > src/hotspot/share/runtime/threadSMR.cpp
>> ???? >????? >??? >755???? jlong tid = 
>> SharedRuntime::get_java_tid(thread);
>> ???? >????? >??? > 926???? jlong tid = 
>> SharedRuntime::get_java_tid(thread);
>> ???? >????? >?? >? I think it cleaner/better to just use
>> ???? >????? >?? > jlong tid = 
>> java_lang_Thread::thread_id(thread->threadObj());
>> ???? >????? >?? > as we know thread is not NULL, it is a JavaThread 
>> and it has to have a
>> ???? >????? >?? > non-null threadObj.
>> ???? >????? >
>> ???? >????? > I had to leave this code unchanged since it turned out 
>> the threadObj is null
>> ???? >????? > when VM is destroyed:
>> ???? >????? >
>> ???? >????? > V? [libjvm.so+0xe165d7] oopDesc::long_field(int) 
>> const+0x67
>> ???? >????? > V? [libjvm.so+0x16e06c6] 
>> ThreadsSMRSupport::add_thread(JavaThread*)+0x116
>> ???? >????? > V? [libjvm.so+0x16d1302] Threads::add(JavaThread*, 
>> bool)+0x82
>> ???? >????? > V? [libjvm.so+0xef8369] 
>> attach_current_thread.part.197+0xc9
>> ???? >????? > V? [libjvm.so+0xec136c] jni_DestroyJavaVM+0x6c
>> ???? >????? > C? [libjli.so+0x4333]? JavaMain+0x2c3
>> ???? >????? > C? [libjli.so+0x8159]? ThreadJavaMain+0x9
>> ???? >
>> ???? >????? This is actually nothing to do with the VM being 
>> destroyed, but is an
>> ???? >????? issue with JNI_AttachCurrentThread and its interaction 
>> with the
>> ???? >????? ThreadSMR iterators. The attach process is:
>> ???? >????? - create JavaThread
>> ???? >????? - mark as "is attaching via jni"
>> ???? >????? - add to ThreadsList
>> ???? >????? - create java.lang.Thread object (you can only execute 
>> Java code after
>> ???? >????? you are attached)
>> ???? >????? - mark as "attach completed"
>> ???? >
>> ???? >????? So while a thread "is attaching" it will be seen by the 
>> ThreadSMR thread
>> ???? >????? iterator but will have a NULL java.lang.Thread object.
>> ???? >
>> ???? >????? We special-case attaching threads in a number of places 
>> in the VM and I
>> ???? >????? think we should be explicitly doing something here to 
>> filter out
>> ???? >????? attaching threads, rather than just being tolerant of a 
>> NULL j.l.Thread
>> ???? >????? object. Specifically in ThreadsSMRSupport::add_thread:
>> ???? >
>> ???? >????? if (ThreadTable::is_initialized() && 
>> !thread->is_attaching_via_jni()) {
>> ???? >???????? jlong tid = 
>> java_lang_Thread::thread_id(thread->threadObj());
>> ???? >???????? ThreadTable::add_thread(tid, thread);
>> ???? >????? }
>> ???? >
>> ???? >????? Note that in ThreadsSMRSupport::remove_thread we can use 
>> the same guard,
>> ???? >????? which covers the case the JNI attach encountered an error 
>> trying to
>> ???? >????? create the j.l.Thread object.
>> ???? >
>> ???? >????? >> src/hotspot/share/services/threadTable.cpp
>> ???? >????? >> 71???? static uintx get_hash(Value const& value, bool* 
>> is_dead) {
>> ???? >????? >
>> ???? >????? >> The is_dead parameter still bothers me here. I can't 
>> make enough sense
>> ???? >????? >> out of the template code in ConcurrentHashtable to see 
>> why we have to
>> ???? >????? >> have it, but I'm concerned that its very existence 
>> means we perhaps
>> ???? >????? >> should not be trying to extend CHT in this context. ??
>> ???? >????? >
>> ???? >????? > My understanding is that is_dead parameter provides a 
>> mechanism for
>> ???? >????? > ConcurrentHashtable to remove stale entries that were 
>> not explicitly
>> ???? >????? > removed by calling ConcurrentHashTable::remove() method.
>> ???? >????? > I think that just because in our case we don't use this 
>> mechanism doesn't
>> ???? >????? > mean we should not use ConcurrentHashTable.
>> ???? >
>> ???? >????? Can you confirm that this usage is okay with Robbin Ehn 
>> please. He's
>> ???? >????? back from vacation this week.
>> ???? >
>> ???? >????? >> I would still want to see what impact this has on thread
>> ???? >????? >> startup cost, both with and without the table being 
>> initialized.
>> ???? >????? >
>> ???? >????? > I run a test that initializes the table by calling 
>> ThreadMXBean.get getThreadInfo(),
>> ???? >????? > starts some threads as a worm-up, and then creates and 
>> starts 100,000 threads
>> ???? >????? > (each thread just sleeps for 100 ms). In case when the 
>> thread table is enabled
>> ???? >????? > 100,000 threads are created and started? for about 
>> 15200 ms. If the thread table
>> ???? >????? > is off the test takes about 14800 ms. Based on this 
>> information the enabled
>> ???? >????? > thread table makes the thread startup about 2.7% slower.
>> ???? >
>> ???? >????? That doesn't sound very good. I think we may need to 
>> Claes involved to
>> ???? >????? help investigate overall performance impact here.
>> ???? >
>> ???? >????? > Webrev: 
>> https://cr.openjdk.java.net/~dtitov/8185005/webrev.04/
>> ???? >????? > Bug: https://bugs.openjdk.java.net/browse/JDK-8185005
>> ???? >
>> ???? >????? No further code comments.
>> ???? >
>> ???? >????? I didn't look at the test in detail.
>> ???? >
>> ???? >????? Thanks,
>> ???? >????? David
>> ???? >
>> ???? >????? > Thanks!
>> ???? >????? > --Daniil
>> ???? >????? >
>> ???? >????? >
>> ???? >????? > ?On 7/29/19, 12:53 AM, "David Holmes" 
>> <david.holmes at oracle.com> wrote:
>> ???? >????? >
>> ???? >????? >????? Hi Daniil,
>> ???? >????? >
>> ???? >????? >????? Overall I think this is a reasonable approach but 
>> I would still like to
>> ???? >????? >????? see some performance and footprint numbers, both 
>> to verify it fixes the
>> ???? >????? >????? problem reported, and that we are not getting 
>> penalized elsewhere.
>> ???? >????? >
>> ???? >????? >????? On 25/07/2019 3:21 am, Daniil Titov wrote:
>> ???? >????? >????? > Hi David, Daniel, and Serguei,
>> ???? >????? >????? >
>> ???? >????? >????? > Please review the new version of the fix, that 
>> makes the thread table initialization on demand and
>> ???? >????? >????? > moves it inside 
>> ThreadsList::find_JavaThread_from_java_tid(). At the creation time 
>> the thread table
>> ???? >????? >????? >?? is initialized with the threads from the 
>> current thread list. We don't want to hold Threads_lock
>> ???? >????? >????? > inside find_JavaThread_from_java_tid(),? thus 
>> new threads still could be created? while the thread
>> ???? >????? >????? > table is being initialized . Such threads will 
>> be found by the linear search and added to the thread table
>> ???? >????? >????? > later, in 
>> ThreadsList::find_JavaThread_from_java_tid().
>> ???? >????? >
>> ???? >????? >????? The initialization allows the created but 
>> unpopulated, or partially
>> ???? >????? >????? populated, table to be seen by other threads - is 
>> that your intention?
>> ???? >????? >????? It seems it should be okay as the other threads 
>> will then race with the
>> ???? >????? >????? initializing thread to add specific entries, and 
>> this is a concurrent
>> ???? >????? >????? map so that should be functionally correct. But if 
>> so then I think you
>> ???? >????? >????? can also reduce the scope of the 
>> ThreadTableCreate_lock so that it
>> ???? >????? >????? covers creation of the table only, not the initial 
>> population of the table.
>> ???? >????? >
>> ???? >????? >????? I like the approach of only initializing the table 
>> when needed and using
>> ???? >????? >????? that to control when the add/remove-thread code 
>> needs to update the
>> ???? >????? >????? table. But I would still want to see what impact 
>> this has on thread
>> ???? >????? >????? startup cost, both with and without the table 
>> being initialized.
>> ???? >????? >
>> ???? >????? >????? > The change also includes additional optimization 
>> for some callers of find_JavaThread_from_java_tid()
>> ???? >????? >????? > as Daniel suggested.
>> ???? >????? >
>> ???? >????? >????? Not sure it's best to combine these, but if they 
>> are limited to the
>> ???? >????? >????? changes in management.cpp only then that may be 
>> okay. It helps to be
>> ???? >????? >????? able to focus on the table related changes without 
>> being distracted by
>> ???? >????? >????? other optimizations.
>> ???? >????? >
>> ???? >????? >????? > That is correct that ResolvedMethodTable was 
>> used as a blueprint for the thread table, however, I tried
>> ???? >????? >????? > to strip it of the all functionality that is not 
>> required in the thread table case.
>> ???? >????? >
>> ???? >????? >????? The revised version seems better in that regard. 
>> But I still have a
>> ???? >????? >????? concern, see below.
>> ???? >????? >
>> ???? >????? >????? > We need to have the thread table resizable and 
>> allow it to grow as the number of threads increases to avoid
>> ???? >????? >????? > reserving excessive memory a-priori or 
>> deteriorating lookup times. The ServiceThread is responsible for
>> ???? >????? >????? > growing the thread table when required.
>> ???? >????? >
>> ???? >????? >????? Yes but why? Why can't this table be grown on 
>> demand by the thread that
>> ???? >????? >????? is doing the addition? For other tables we may 
>> have to delegate to the
>> ???? >????? >????? service thread because the current thread cannot 
>> perform the action, or
>> ???? >????? >????? it doesn't want to perform it at the time the need 
>> for the resize is
>> ???? >????? >????? detected (e.g. its detected at a safepoint and you 
>> want the resize to
>> ???? >????? >????? happen later outside the safepoint). It's not 
>> apparent to me that such
>> ???? >????? >????? restrictions apply here.
>> ???? >????? >
>> ???? >????? >????? > There is no ConcurrentHashTable available in 
>> Java 8 and for backporting this fix to Java 8 another implementation
>> ???? >????? >????? > of the hash table, probably originally suggested 
>> in the patch attached to the JBS issue, should be used.? It will make
>> ???? >????? >????? > the backporting more complicated, however, 
>> adding a new Implementation of the hash table in Java 14 while it
>> ???? >????? >????? > already has ConcurrentHashTable doesn't seem? 
>> reasonable for me.
>> ???? >????? >
>> ???? >????? >????? Ok.
>> ???? >????? >
>> ???? >????? >????? > Webrev: 
>> http://cr.openjdk.java.net/~dtitov/8185005/webrev.03
>> ???? >????? >
>> ???? >????? >????? Some specific code comments:
>> ???? >????? >
>> ???? >????? > src/hotspot/share/runtime/mutexLocker.cpp
>> ???? >????? >
>> ???? >????? >????? +?? def(ThreadTableCreate_lock?????? , 
>> PaddedMutex? , special,
>> ???? >????? >????? false, Monitor::_safepoint_check_never);
>> ???? >????? >
>> ???? >????? >????? I think this needs to be a _safepoint_check_always 
>> lock. The table will
>> ???? >????? >????? be created by regular JavaThreads and they should 
>> (nearly) always be
>> ???? >????? >????? checking for safepoints if they are going to block 
>> acquiring the lock.
>> ???? >????? >????? And it isn't at all obvious that the thread doing 
>> the creation can't go
>> ???? >????? >????? to a safepoint whilst this lock is held.
>> ???? >????? >
>> ???? >????? >????? ---
>> ???? >????? >
>> ???? >????? >????? src/hotspot/share/runtime/threadSMR.cpp
>> ???? >????? >
>> ???? >????? >????? Nit:
>> ???? >????? >
>> ???? >????? >??????? 618?????? JavaThread* thread = thread_at(i);
>> ???? >????? >
>> ???? >????? >????? you could reuse the new java_thread local you 
>> introduced at line 613 and
>> ???? >????? >????? just rename that "new" variable to "thread" so you 
>> don't have to change
>> ???? >????? >????? all other uses.
>> ???? >????? >
>> ???? >????? >????? 628?? } else if (java_thread != NULL && ...
>> ???? >????? >
>> ???? >????? >????? You don't need to check != NULL here as you only 
>> get here when
>> ???? >????? >????? java_thread is not NULL.
>> ???? >????? >
>> ???? >????? >??????? 755???? jlong tid = 
>> SharedRuntime::get_java_tid(thread);
>> ???? >????? >??????? 926???? jlong tid = 
>> SharedRuntime::get_java_tid(thread);
>> ???? >????? >
>> ???? >????? >????? I think it cleaner/better to just use
>> ???? >????? >
>> ???? >????? >????? jlong tid = 
>> java_lang_Thread::thread_id(thread->threadObj());
>> ???? >????? >
>> ???? >????? >????? as we know thread is not NULL, it is a JavaThread 
>> and it has to have a
>> ???? >????? >????? non-null threadObj.
>> ???? >????? >
>> ???? >????? >????? ---
>> ???? >????? >
>> ???? >????? > src/hotspot/share/services/management.cpp
>> ???? >????? >
>> ???? >????? >????? 1323???????? if (THREAD->is_Java_thread()) {
>> ???? >????? >????? 1324?????????? JavaThread* current_thread = 
>> (JavaThread*)THREAD;
>> ???? >????? >
>> ???? >????? >????? These calls can only be made on a JavaThread so 
>> this be simplified to
>> ???? >????? >????? remove the is_Java_thread() call. Similarly in 
>> other places.
>> ???? >????? >
>> ???? >????? >????? ---
>> ???? >????? >
>> ???? >????? > src/hotspot/share/services/threadTable.cpp
>> ???? >????? >
>> ???? >????? >???????? 55 class ThreadTableEntry : public 
>> CHeapObj<mtInternal> {
>> ???? >????? >???????? 56?? private:
>> ???? >????? >???????? 57???? jlong _tid;
>> ???? >????? >
>> ???? >????? >????? I believe hotspot style is to not indent the 
>> access modifiers in C++
>> ???? >????? >????? class declarations, so the above would just be:
>> ???? >????? >
>> ???? >????? >???????? 55 class ThreadTableEntry : public 
>> CHeapObj<mtInternal> {
>> ???? >????? >???????? 56 private:
>> ???? >????? >???????? 57?? jlong _tid;
>> ???? >????? >
>> ???? >????? >????? etc.
>> ???? >????? >
>> ???? >????? >??????? 60???? ThreadTableEntry(jlong tid, JavaThread* 
>> java_thread) :
>> ???? >????? >??????? 61 _tid(tid),_java_thread(java_thread) {}
>> ???? >????? >
>> ???? >????? >????? line 61 should be indented as it continues line 60.
>> ???? >????? >
>> ???? >????? >???????? 67 class ThreadTableConfig : public AllStatic {
>> ???? >????? >???????? ...
>> ???? >????? >???????? 71???? static uintx get_hash(Value const& 
>> value, bool* is_dead) {
>> ???? >????? >
>> ???? >????? >????? The is_dead parameter still bothers me here. I 
>> can't make enough sense
>> ???? >????? >????? out of the template code in ConcurrentHashtable to 
>> see why we have to
>> ???? >????? >????? have it, but I'm concerned that its very existence 
>> means we perhaps
>> ???? >????? >????? should not be trying to extend CHT in this 
>> context. ??
>> ???? >????? >
>> ???? >????? >??????? 115?? size_t start_size_log = size_log > 
>> DefaultThreadTableSizeLog
>> ???? >????? >??????? 116?? ? size_log : DefaultThreadTableSizeLog;
>> ???? >????? >
>> ???? >????? >????? line 116 should be indented, though in this case I 
>> think a better layout
>> ???? >????? >????? would be:
>> ???? >????? >
>> ???? >????? >??????? 115?? size_t start_size_log =
>> ???? >????? >??????? 116?????? size_log > DefaultThreadTableSizeLog ? 
>> size_log :
>> ???? >????? >????? DefaultThreadTableSizeLog;
>> ???? >????? >
>> ???? >????? >??????? 131 double ThreadTable::get_load_factor() {
>> ???? >????? >??????? 132?? return (double)_items_count/_current_size;
>> ???? >????? >??????? 133 }
>> ???? >????? >
>> ???? >????? >????? Not sure that is doing what you want/expect. It 
>> will perform integer
>> ???? >????? >????? division and then cast that whole integer to a 
>> double. If you want
>> ???? >????? >????? double arithmetic you need:
>> ???? >????? >
>> ???? >????? >????? return ((double)_items_count)/_current_size;
>> ???? >????? >
>> ???? >????? >????? 180???? jlong????????? _tid;
>> ???? >????? >????? 181???? uintx???????? _hash;
>> ???? >????? >
>> ???? >????? >????? Nit: no need for all those spaces before the 
>> variable name.
>> ???? >????? >
>> ???? >????? >??????? 183???? ThreadTableLookup(jlong tid)
>> ???? >????? >??????? 184???? : _tid(tid), _hash(primitive_hash(tid)) {}
>> ???? >????? >
>> ???? >????? >????? line 184 should be indented.
>> ???? >????? >
>> ???? >????? >????? 201???? ThreadGet():_return(NULL) {}
>> ???? >????? >
>> ???? >????? >????? Nit: need space after :
>> ???? >????? >
>> ???? >????? >??????? 211??? assert(_is_initialized, "Thread table is 
>> not initialized");
>> ???? >????? >??????? 212?? _has_work = false;
>> ???? >????? >
>> ???? >????? >????? line 211 is indented one space too far.
>> ???? >????? >
>> ???? >????? >????? 229???? ThreadTableEntry* entry = new 
>> ThreadTableEntry(tid,java_thread);
>> ???? >????? >
>> ???? >????? >????? Nit: need space after ,
>> ???? >????? >
>> ???? >????? >????? 252?? return _local_table->remove(thread,lookup);
>> ???? >????? >
>> ???? >????? >????? Nit: need space after ,
>> ???? >????? >
>> ???? >????? >????? Thanks,
>> ???? >????? >????? David
>> ???? >????? >????? ------
>> ???? >????? >
>> ???? >????? >????? > Bug: 
>> https://bugs.openjdk.java.net/browse/JDK-8185005
>> ???? >????? >????? >
>> ???? >????? >????? > Thanks!
>> ???? >????? >????? > --Daniil
>> ???? >????? >????? >
>> ???? >????? >????? >
>> ???? >????? >????? > ?On 7/8/19, 3:24 PM, "Daniel D. Daugherty" 
>> <daniel.daugherty at oracle.com> wrote:
>> ???? >????? >????? >
>> ???? >????? >????? >????? On 6/29/19 12:06 PM, Daniil Titov wrote:
>> ???? >????? >????? >????? > Hi Serguei and David,
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > Serguei is right, 
>> ThreadTable::find_thread(java_tid) cannot? return a JavaThread with 
>> an unmatched java_tid.
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > Please find a new version of the fix that 
>> includes the changes Serguei suggested.
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > Regarding the concern about the 
>> maintaining the thread table when it may never even be queried, one of
>> ???? >????? >????? >????? > the options could be to add ThreadTable 
>> ::isEnabled flag, set it to "false" by default, and wrap the calls to 
>> the thread table
>> ???? >????? >????? >????? > in ThreadsSMRSupport add_thread() and 
>> remove_thread() methods to check this flag.
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > When 
>> ThreadsList::find_JavaThread_from_java_tid() is called for the first 
>> time it could check if ThreadTable ::isEnabled
>> ???? >????? >????? >????? > Is on and if not then set it on and 
>> populate the thread table with all existing threads from the thread 
>> list.
>> ???? >????? >????? >
>> ???? >????? >????? >????? I have the same concerns as David H. about 
>> this new ThreadTable.
>> ???? >????? >????? > ThreadsList::find_JavaThread_from_java_tid() is 
>> only called from code
>> ???? >????? >????? >????? in 
>> src/hotspot/share/services/management.cpp so I think that table
>> ???? >????? >????? >????? needs to enabled and populated only if it 
>> is going to be used.
>> ???? >????? >????? >
>> ???? >????? >????? >????? I've taken a look at the webrev below and I 
>> see that David has
>> ???? >????? >????? >????? followed up with additional comments. 
>> Before I do a crawl through
>> ???? >????? >????? >????? code review for this, I would like to see 
>> the ThreadTable stuff
>> ???? >????? >????? >????? made optional and David's other comments 
>> addressed.
>> ???? >????? >????? >
>> ???? >????? >????? >????? Another possible optimization is for 
>> callers of
>> ???? >????? >????? > find_JavaThread_from_java_tid() to save the 
>> calling thread's
>> ???? >????? >????? >????? tid value before they loop and if the 
>> current tid == saved_tid
>> ???? >????? >????? >????? then use the current JavaThread* instead of 
>> calling
>> ???? >????? >????? > find_JavaThread_from_java_tid() to get the 
>> JavaThread*.
>> ???? >????? >????? >
>> ???? >????? >????? >????? Dan
>> ???? >????? >????? >
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > Webrev: 
>> https://cr.openjdk.java.net/~dtitov/8185005/webrev.02/
>> ???? >????? >????? >????? > Bug: 
>> https://bugs.openjdk.java.net/browse/JDK-8185005
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > Thanks!
>> ???? >????? >????? >????? > --Daniil
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > From: <serguei.spitsyn at oracle.com>
>> ???? >????? >????? >????? > Organization: Oracle Corporation
>> ???? >????? >????? >????? > Date: Friday, June 28, 2019 at 7:56 PM
>> ???? >????? >????? >????? > To: Daniil Titov 
>> <daniil.x.titov at oracle.com>, OpenJDK Serviceability 
>> <serviceability-dev at openjdk.java.net>, 
>> "hotspot-runtime-dev at openjdk.java.net" 
>> <hotspot-runtime-dev at openjdk.java.net>, "jmx-dev at openjdk.java.net" 
>> <jmx-dev at openjdk.java.net>
>> ???? >????? >????? >????? > Subject: Re: RFR: 8185005: Improve 
>> performance of ThreadMXBean.getThreadInfo(long ids[], int maxDepth)
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > Hi Daniil,
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > I have several quick comments.
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > The indent in the hotspot c/c++ files has 
>> to be 2, not 4.
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > 
>> https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/src/hotspot/share/runtime/threadSMR.cpp.frames.html
>> ???? >????? >????? >????? > 614 JavaThread* 
>> ThreadsList::find_JavaThread_from_java_tid(jlong java_tid) const {
>> ???? >????? >????? >????? >?? 615???? JavaThread* java_thread = 
>> ThreadTable::find_thread(java_tid);
>> ???? >????? >????? >????? >?? 616???? if (java_thread == NULL && 
>> java_tid == PMIMORDIAL_JAVA_TID) {
>> ???? >????? >????? >????? >?? 617???????? // 
>> ThreadsSMRSupport::add_thread() is not called for the primordial
>> ???? >????? >????? >????? >?? 618???????? // thread. Thus, we find 
>> this thread with a linear search and add it
>> ???? >????? >????? >????? >?? 619???????? // to the thread table.
>> ???? >????? >????? >????? >?? 620???????? for (uint i = 0; i < 
>> length(); i++) {
>> ???? >????? >????? >????? >?? 621 JavaThread* thread = thread_at(i);
>> ???? >????? >????? >????? >?? 622???????????? if 
>> (is_valid_java_thread(java_tid,thread)) {
>> ???? >????? >????? >????? >?? 623 ThreadTable::add_thread(java_tid, 
>> thread);
>> ???? >????? >????? >????? >?? 624 return thread;
>> ???? >????? >????? >????? >?? 625???????????? }
>> ???? >????? >????? >????? >?? 626???????? }
>> ???? >????? >????? >????? >?? 627???? } else if (java_thread != NULL 
>> && is_valid_java_thread(java_tid, java_thread)) {
>> ???? >????? >????? >????? >?? 628???????? return java_thread;
>> ???? >????? >????? >????? >?? 629???? }
>> ???? >????? >????? >????? >?? 630???? return NULL;
>> ???? >????? >????? >????? >?? 631 }
>> ???? >????? >????? >????? >?? 632 bool 
>> ThreadsList::is_valid_java_thread(jlong java_tid, JavaThread* 
>> java_thread) {
>> ???? >????? >????? >????? >?? 633???? oop tobj = 
>> java_thread->threadObj();
>> ???? >????? >????? >????? >?? 634???? // Ignore the thread if it 
>> hasn't run yet, has exited
>> ???? >????? >????? >????? >?? 635???? // or is starting to exit.
>> ???? >????? >????? >????? >?? 636???? return (tobj != NULL && 
>> !java_thread->is_exiting() &&
>> ???? >????? >????? >????? >?? 637 java_tid == 
>> java_lang_Thread::thread_id(tobj));
>> ???? >????? >????? >????? >?? 638 }
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? >?? 615???? JavaThread* java_thread = 
>> ThreadTable::find_thread(java_tid);
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? >??? I'd suggest to rename find_thread() to 
>> find_thread_by_tid().
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > A space is missed after the comma:
>> ???? >????? >????? >????? >??? 622 if 
>> (is_valid_java_thread(java_tid,thread)) {
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > An empty line is needed before L632.
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > The name 'is_valid_java_thread' looks 
>> wrong (or confusing) to me.
>> ???? >????? >????? >????? > Something like 
>> 'is_alive_java_thread_with_tid()' would be better.
>> ???? >????? >????? >????? > It'd better to list parameters in the 
>> opposite order.
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > The call to is_valid_java_thread() is 
>> confusing:
>> ???? >????? >????? >????? >???? 627 } else if (java_thread != NULL && 
>> is_valid_java_thread(java_tid, java_thread)) {
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > Why would the call 
>> ThreadTable::find_thread(java_tid) return a JavaThread with an 
>> unmatched java_tid?
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > Thanks,
>> ???? >????? >????? >????? > Serguei
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > On 6/28/19, 9:40 PM, "David Holmes" 
>> <david.holmes at oracle.com> wrote:
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? >????? Hi Daniil,
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? >????? The definition and use of this 
>> hashtable (yet another hashtable
>> ???? >????? >????? >????? >????? implementation!) will need careful 
>> examination. We have to be concerned
>> ???? >????? >????? >????? >????? about the cost of maintaining it 
>> when it may never even be queried. You
>> ???? >????? >????? >????? >????? would need to look at footprint cost 
>> and performance impact.
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? >????? Unfortunately I'm just about to 
>> board a plane and will be out for the
>> ???? >????? >????? >????? >????? next few days. I will try to look at 
>> this asap next week, but we will
>> ???? >????? >????? >????? >????? need a lot more data on it.
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? >????? Thanks,
>> ???? >????? >????? >????? >????? David
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > On 6/28/19 3:31 PM, Daniil Titov wrote:
>> ???? >????? >????? >????? > Please review the change that improves 
>> performance of ThreadMXBean MXBean methods returning the
>> ???? >????? >????? >????? > information for specific threads. The 
>> change introduces the thread table that uses ConcurrentHashTable
>> ???? >????? >????? >????? > to store one-to-one the mapping between 
>> the thread ids and JavaThread objects and replaces the linear
>> ???? >????? >????? >????? > search over the thread list in 
>> ThreadsList::find_JavaThread_from_java_tid(jlong tid) method with the 
>> lookup
>> ???? >????? >????? >????? > in the thread table.
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > Testing: Mach5 tier1,tier2 and tier3 
>> tests successfully passed.
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > Webrev: 
>> https://cr.openjdk.java.net/~dtitov/8185005/webrev.01/
>> ???? >????? >????? >????? > Bug: 
>> https://bugs.openjdk.java.net/browse/JDK-8185005
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > Thanks!
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? > Best regards,
>> ???? >????? >????? >????? > Daniil
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? >
>> ???? >????? >????? >????? >
>> ???? >????? >????? >
>> ???? >????? >????? >
>> ???? >????? >????? >
>> ???? >????? >????? >
>> ???? >????? >
>> ???? >????? >
>> ???? >????? >
>> ???? >
>> ???? >
>> ???? >
>>
>>