RFR: 8373096: JFR leak profiler: path-to-gc-roots search should be non-recursive
Thomas Stuefe
stuefe at openjdk.org
Tue Dec 9 05:55:57 UTC 2025
On Tue, 9 Dec 2025 01:19:16 GMT, Robert Toyonaga <duke at openjdk.org> wrote:
>> A customer reported a crash when producing a JFR recording with `path-to-gc-roots=true`. It was a native stack overflow that occurred during the recursive path-to-gc-root search performed in the context of PathToGcRootsOperation.
>>
>> We try to avoid this by limiting the maximum search depth (DFSClosure::max_dfs_depth). That solution is brittle, however, since recursion depth is not a good proxy for thread stack usage: it depends on many factors, e.g., compiler inlining decisions and platform specifics. In this case, the VMThread's stack was too small.
>>
>> This RFE changes the algorithm to be non-recursive.
>>
>> Note that as a result of this change, the order in which oop maps are walked per oop is reversed : last oops are processed first. That should not matter for the end result, however. The search is still depth-first.
>>
>> Note that after this patch, we could easily remove the max_depth limitation altogether. I left it in however since this was not the scope of this RFE.
>>
>> Testing:
>>
>> - Tested manually with very small (256K) thread stack size for the VMThread - the patched version works where the old version crashes out
>> - Compared JFR recordings from both an unpatched version (with a large enough VMThread stack size) and a patched version; made sure that the content of "Old Object Sample" was identical
>> - Ran locally all jtreg tests in jdk/jfr
>> - GHAs
>
> src/hotspot/share/jfr/leakprofiler/chains/dfsClosure.cpp line 103:
>
>> 101: } else {
>> 102: if (_mark_bits->is_marked(pointee)) {
>> 103: return;
>
> I think this improvement is a good idea! But maybe this line should be replaced with a `continue`, otherwise we can terminate the DFS prematurely and skip evaluation of other chains extending from other references already pushed to the stack.
Good catch!
-------------
PR Review Comment: https://git.openjdk.org/jdk/pull/28659#discussion_r2601196718
More information about the hotspot-jfr-dev
mailing list