RFR: Save/load nmethod without going through CodeBuffer [v2]
Ashutosh Mehra
asmehra at openjdk.org
Mon Mar 17 12:04:18 UTC 2025
On Mon, 17 Mar 2025 11:53:32 GMT, Ashutosh Mehra <asmehra at openjdk.org> wrote:
>> This is the prototype for storing and loading nmethods without going through the CodeBuffer.
>> The new implementation is protected by the flag -XX:+UseNewCode2.
>>
>> Some numbers using this implementation:
>> spring-boot-getting-started [0] shows startup improvement of ~ 7.5%% and quarkus-getting-started [1] shows improvement of around ~3.5%.
>>
>> Numbers for Springboot
>>
>> Old build = /home/asmehra/data/ashu-mehra/leyden/build/nmethod-single-copy-load-release/images/jdk with options -XX:+UnlockDiagnosticVMOptions -XX:-UseNewCode2
>> New build = /home/asmehra/data/ashu-mehra/leyden/build/nmethod-single-copy-load-release/images/jdk with options -XX:+UnlockDiagnosticVMOptions -XX:+UseNewCode2
>> Run,Old CDS + AOT,New CDS + AOT
>> 1,544,512
>> 2,554,525
>> 3,550,508
>> 4,571,515
>> 5,550,506
>> 6,552,515
>> 7,568,510
>> 8,554,517
>> 9,551,506
>> 10,556,508
>> Geomean,554.94,512.17
>> Stdev,7.90,5.65
>>
>> Numbers for Quarkus:
>>
>> Old build = /home/asmehra/data/ashu-mehra/leyden/build/nmethod-single-copy-load-release/images/jdk with options -XX:+UnlockDiagnosticVMOptions -XX:-UseNewCode2
>> New build = /home/asmehra/data/ashu-mehra/leyden/build/nmethod-single-copy-load-release/images/jdk with options -XX:+UnlockDiagnosticVMOptions -XX:+UseNewCode2
>> Run,Old CDS + AOT,New CDS + AOT
>> 1,359,346
>> 2,360,353
>> 3,373,357
>> 4,376,356
>> 5,366,353
>> 6,360,356
>> 7,361,361
>> 8,347,349
>> 9,355,336
>> 10,374,342
>> Geomean,363.00,350.82
>> Stdev,8.70,7.27
>>
>>
>> -Xlog:init logs the load time from AOT code cache at JVM exit.
>> For spring-boot-getting-started without `UseNewCode2`:
>>
>> [3.459s][info][init] SC Load Time: 0.202 s
>> [3.459s][info][init] nmethod register: 0.135 s
>> [3.459s][info][init] find cached code: 0.007 s
>>
>>
>> For spring-boot-getting-started with `UseNewCode2`:
>>
>> [3.192s][info][init]
>> [3.192s][info][init] SC Load Time: 0.138 s
>> [3.192s][info][init] nmethod register: 0.111 s
>> [3.192s][info][init] find cached code: 0.006 s
>>
>>
>> For quarkus-getting-started without `UseNewCode2`
>>
>> [0.392s][info][init] SC Load Time: 0.060 s
>> [0.392s][info][init] nmethod register: 0.039 s
>> [0.392s][info][init] find cached code: 0.002 s
>>
>>
>> For quarkus-getting-started with `UseNewCode2`
>>
>> [0.386s][info][init] SC Load Time: 0.033 s
>> [0.386s][info][init] nmethod register: 0.027 s
>> [0.386s][info][init] find cached code: ...
>
> Ashutosh Mehra has updated the pull request incrementally with one additional commit since the last revision:
>
> Minor cleanup
>
> Signed-off-by: Ashutosh Mehra <asmehra at redhat.com>
I have updated this PR to remove `SCnmethod` and store `nmethod` directly into AOT code cache.
Perf numbers have changed compared to the last prototype.
For spring boot:
Old build = /home/asmehra/data/ashu-mehra/leyden/build/premain-release/images/jdk with options
New build = /home/asmehra/data/ashu-mehra/leyden/build/nmethod-single-copy-load-release/images/jdk with options
Run,Old CDS + AOT,New CDS + AOT
1,544,532
2,548,526
3,558,534
4,550,529
5,541,532
6,540,540
7,539,542
8,554,526
9,540,532
10,544,530
Geomean,545.77,532.28
Stdev,6.18,5.02
For quarkus:
Old build = /home/asmehra/data/ashu-mehra/leyden/build/premain-release/images/jdk with options
New build = /home/asmehra/data/ashu-mehra/leyden/build/nmethod-single-copy-load-release/images/jdk with options
Run,Old CDS + AOT,New CDS + AOT
1,358,349
2,355,356
3,360,347
4,372,342
5,357,353
6,344,336
7,356,353
8,352,339
9,359,338
10,347,340
Geomean,355.93,345.23
Stdev,7.27,6.84
This shows an improvement of 2.5~3%.
`SC Load Time` as reported by `-Xlog:init` shows it is cut down to almost half.
Springboot without this change:
[0.810s][info][init] SC Load Time: 0.127 s
[0.810s][info][init] nmethod register: 0.091 s
[0.810s][info][init] find cached code: 0.003 s
Springboot with this change:
[0.800s][info][init] SC Load Time: 0.064 s
[0.800s][info][init] nmethod register: 0.055 s
[0.800s][info][init] find cached code: 0.003 s
Quarkus without this change:
[0.392s][info][init] SC Load Time: 0.056 s
[0.392s][info][init] nmethod register: 0.040 s
[0.392s][info][init] find cached code: 0.002 s
Quarkus with this change:
[0.383s][info][init] SC Load Time: 0.027 s
[0.383s][info][init] nmethod register: 0.024 s
[0.383s][info][init] find cached code: 0.002 s
-------------
PR Comment: https://git.openjdk.org/leyden/pull/27#issuecomment-2729228757
More information about the leyden-dev
mailing list