RFR: 8285447: StackWalker minimal batch size should be optimized for getCallerClass [v2]
Mandy Chung
mchung at openjdk.org
Mon Sep 11 17:34:40 UTC 2023
On Mon, 11 Sep 2023 17:27:15 GMT, Mandy Chung <mchung at openjdk.org> wrote:
>> Typically it will find the caller class at the second stack frame from the frame of executing `StackWalker::getCallerClass`. The initial size of the buffer can be changed from 8 to 4 (the top 2 elements are reserved for implementation use). If it fetches another batch, `getCallerClass` may be invoked via core reflection, so subsequent batches can be increased to a larger size. This PR also moves the benchmark for `getCallerClass` in its own file because it does not need to test with different depth and can be separated from StackWalkBench.
>
> Mandy Chung has updated the pull request incrementally with one additional commit since the last revision:
>
> Review feedback
> In general looks good. But once you on this, I suggest to add the following additional optimizations:
>
> * `FrameBuffer.START_POS` is `2` but as far as I can see, currently only the 0th element is reserved for passing a "magic" object passed between the JVM and Java code. So this should be set to `1`.
> ....
>
> With these changes you would:
>
> * save one more frame for the `getCallerClass()` case
Note that this only affects the number of elements allocated. `getCallerClass` only walks 2 frames in the first batch regardless of the number of reserved elements.
Benchmark Mode Cnt Score Error Units
CallerClassBench.getCallerClass avgt 15 0.361 ? 0.003 us/op // num of reserved elements = 1
CallerClassBench.getCallerClass avgt 15 0.370 ? 0.009 us/op // num of reserved elements = 2
-------------
PR Comment: https://git.openjdk.org/jdk/pull/15642#issuecomment-1714303844
More information about the core-libs-dev
mailing list