RFR: 8284578: Relax InterpreterCodelet stub alignment [v2]

Aleksey Shipilev shade at openjdk.java.net
Thu Apr 14 07:33:12 UTC 2022


On Wed, 13 Apr 2022 09:15:14 GMT, Aleksey Shipilev <shade at openjdk.org> wrote:

>> `InterpreterCodelet` is aligned by `CodeEntryAlignment` (`CAE`) twice. First, the entire stub is aligned, which aligns its data section. Then, the code section in the stub is aligned. Since `CAE` is usually larger than the size of `InterpreterCodelet`, we are wasting quite a bit of space for each codelet. In the extreme cases, like PPC that defaults to `CAE=128`, we have 16 bytes of codelet data effectively taking 128 bytes!
>> 
>> This can be made better by relaxing the `InterpreterCodelet` stub alignment to `HeapWordSize`, while leaving its code section alignment the same.
>> 
>> This tangentially touches the only other user for `StubQueue`: `ICStub`. Unfortunately, we cannot do the same kind of relaxation there, because there is a reverse lookup function that needs to reach data section from the code section, which forces us to keep the same alignment for both.
>> 
>> Interpreter sizes on Linux x86_64 release:
>> 
>> 
>> # Baseline, CEA=32 (default)
>> code size        =     94K bytes
>> avg codelet size =    356  bytes
>> 
>> # Baseline, CEA=128 (PPC-like)
>> code size        =    133K bytes
>> avg codelet size =    501  bytes
>> 
>> # Patched, CEA=32 (default)
>> code size        =     89K bytes
>> avg codelet size =    338  bytes
>> 
>> # Patched, CEA=128 (PPC-like)
>> code size        =    100K bytes
>> avg codelet size =    380  bytes
>> 
>> 
>> Point performance runs (SPECjvm2008:serial with `-Xint` on Linux x86_64 release):
>> 
>> 
>> Benchmark     Mode  Cnt   Score   Error  Units
>> 
>> # Baseline, CEA=32
>> Serial.test  thrpt    9  73.427 ± 0.152  ops/s
>> 
>> # Baseline, CEA=128
>> Serial.test  thrpt    9  70.999 ± 0.246  ops/s
>> 
>> # Patched, CEA=32
>> Serial.test  thrpt    9  73.991 ± 0.860  ops/s
>> 
>> # Patched, CEA=128
>> Serial.test  thrpt    9  72.981 ± 0.301  ops/s
>> 
>> 
>> Additional testing:
>>  - [x] Linux x86_64 fastdebug `tier1`
>>  - [x] Linux x86_64 fastdebug `tier2`
>>  - [x] Linux x86_64 fastdebug `tier3`
>
> Aleksey Shipilev has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision:
> 
>  - Relax slack space a bit
>  - Merge branch 'master' into JDK-8284578-intcodelet-align
>  - Initial implementation

Thanks!

-------------

PR: https://git.openjdk.java.net/jdk/pull/8159


More information about the hotspot-dev mailing list