[External] : Re: Virtual thread memory leak because TrackingRootContainer keeps threads
Robert Engels
robaho at icloud.com
Thu Jul 25 13:52:43 UTC 2024
If it is blocked reading a socket, it is not on a CarrierThread (i.e. blocked not running) so there is no strong reference from the CarrierThread to the VirtualThread in this case.
I am fairly certain that the scheduler/blocking handler is what maintains the strong thread references in this case.
> On Jul 25, 2024, at 8:49 AM, Matthew Swift <matthew.swift at gmail.com> wrote:
>
> Thanks for the quick response Alan. A couple of comments:
>
> > You can specify a ThreadFactory to newThreadPerTaskExecutor so you can
> > set the thread name if you need it. Virtual threads are meant to be
> > lightweight and numerous so don't get a name by default. For many cases,
> > this is okay as the they have a thread ID which will identity them in
> > thread dumps and elsewhere.
>
> That may be so, but it's not very flexible, unless I'm missing something, because it's not possible to derive the thread name from the task itself. Said otherwise, a ThreadFactory lets you create very generic thread names like "HTTP Connection #1" ahead of time, but doesn't let you derive a rich name from the Runnable itself, like "HTTP Connection from 1.2.3.4 to 5.6.7.8 on port 8080". I tried cheating a bit by relying on the toString() of the Runnable, but sadly this doesn't work because the executor wraps the provided Runnable before calling the thread factory (see ThreadPerTaskExecutor#start(java.lang.Runnable)).
>
> > A virtual thread that is blocked reading from a Socket will continue
> > when there are bytes to read, the peer closes the connection, some I/O
> > exception, or the thread is interrupted. So it will be strongly reachable.
>
> Ah, of course, silly me. And, following from that, the VT is strongly reachable when running because it is directly referenced by the carrier thread itself.
>
> Thanks
> Matt
>
>
> On Thu, 25 Jul 2024 at 14:55, Alan Bateman <Alan.Bateman at oracle.com <mailto:Alan.Bateman at oracle.com>> wrote:
>
>
> On 25/07/2024 13:21, Matthew Swift wrote:
> > Hmm, I'm starting to think I may have fallen into the same trap here
> > as Michal.
> >
> > I've been using virtual threads similar to platform threads for
> > performing IO tasks asynchronously.
> >
> > Background: originally, I was using
> > Executors#newVirtualThreadPerTaskExecutor to run these tasks, which
> > was fine because the Executor tracks the virtual threads it creates in
> > order to support shutdown. However, from an observability point of
> > view I've found the executor to be a bit frustrating because it is
> > impossible to set the thread name before the thread is scheduled to
> > run[1].
> You can specify a ThreadFactory to newThreadPerTaskExecutor so you can
> set the thread name if you need it. Virtual threads are meant to be
> lightweight and numerous so don't get a name by default. For many cases,
> this is okay as the they have a thread ID which will identity them in
> thread dumps and elsewhere.
>
>
> > This means that under heavy load where the FJ pool is busy a thread
> > dump shows many unnamed threads, which are waiting to be scheduled for
> > the first time. Admittedly, this is only a minor annoyance because I'm
> > more interested in the threads that are clogging up the FJ pool than
> > the ones which are waiting to use it, even so it'd be nice to have an
> > overall picture of what's active and what's queued (note also thread
> > names are included in JFR events, which is super helpful).
>
> With the Loom EA builds [1] you can use `jcmd <pid>
> Thread.vthread_summary` to get a summary view of all thread grouping so
> you get thread counts, a summary of the scheduler, timers, and any
> socket I/O that it outstanding. We would like to bring this diagnostic
> command into the main line JDK at some point.
>
> >
> > To remedy this, I've switched away from using an Executor and now I
> > just use "Thread.ofVirtual().name(initialName).start(task)". However,
> > I don't think all of the tasks are strongly reachable - some are "fire
> > and forget" tasks (e.g. async resource cleanup), so I may be
> > inadvertently relying on the JVM's observability support to keep these
> > tasks alive until they complete, which seems a bit brittle. In fact,
> > now that I think of it, it may not even be limited to fire and forget
> > tasks. A VT that is reading messages from a Socket could be GC'd
> A virtual thread that is blocked reading from a Socket will continue
> when there are bytes to read, the peer closes the connection, some I/O
> exception, or the thread is interrupted. So it will be strongly reachable.
>
> -Alan
>
> [1] https://jdk.java.net/loom/ <https://jdk.java.net/loom/>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.openjdk.org/pipermail/loom-dev/attachments/20240725/2de7c69a/attachment.htm>
More information about the loom-dev
mailing list