Integrated: 8370473: C2: Better Aligment of Vector Spill Slots

Richard Reingruber rrich at openjdk.org
Wed Dec 3 10:32:25 UTC 2025


On Fri, 24 Oct 2025 07:36:57 GMT, Richard Reingruber <rrich at openjdk.org> wrote:

> With this change c2 will allocate spill slots for vectors with sp offsets aligned to the size of the vectors. Maximum alignment is StackAlignmentInBytes.
> 
> It also updates comments that have never been changed to describe how register allocation works for sizes larger than 64 bit.
> 
> The change helps to produce better spill code on AARCH64 and PPC64 where an additional add instruction is emitted if the offset of a vector un-/spill is not aligned.
> 
> The change is rather a cleanup than an optimization. In most cases the sp offsets will already be properly aligned.
> Only with incoming stack arguments unaligned offsets can be generated. But also then alignment padding is only added if vector registers larger than 64 bit are used.
> 
> So the costs are effectively zero. Especially because extra padding won't enlarge the frame since only virtual registers are allocated which are mapped to the caller frame (see `pad0` in the [diagram](https://github.com/openjdk/jdk/blob/92e380c59c2498b1bc94e26658b07b383deae59a/src/hotspot/cpu/aarch64/aarch64.ad#L3829))
> 
> There's a risk though that with the extra virtual registers allocated for `pad0` the limit of registers a `RegMask` can represent is reached (occurs with excessive spilling). If this happens the compilation would fail. It could be retried with smaller alignment for vector spilling though. I havn't implemented it as I thought the risk is negligible.
> 
> Note that the sp offset of the accesses should be aligned rather than the effective address. So it could even be argued that the maximum alignment could be higher than StackAlignmentInBytes.
> 
> ##### Testing with fastdebug builds on AARCH64 and PPC64:
> 
> hotspot_vector_1
> hotspot_vector_2
> jdk_vector
> jdk_vector_sanity
> 
> ##### The change passed our CI testing:
> Tier 1-4 of hotspot and jdk. All of langtools and jaxp. Renaissance Suite and SAP specific tests.
> Testing was done on the main platforms and also on Linux/PPC64le and AIX.
> 
> C2 compilation of `jdk.internal.vm.vector.VectorSupport::rearrangeOp` has unaligned spill offsets. It is covered by the following tests:
> 
> compiler/vectorapi/VectorRearrangeTest.java
> jdk/incubator/vector/Byte128VectorLoadStoreTests.java
> jdk/incubator/vector/Double256VectorLoadStoreTests.java
> jdk/incubator/vector/Float128VectorTests.java
> jdk/incubator/vector/Long256VectorLoadStoreTests.java
> jdk/incubator/vector/Short128VectorLoadStoreTests.java
> jdk/incubator/vector/Vector64ConversionTests.java

This pull request has now been integrated.

Changeset: 804ce0a2
Author:    Richard Reingruber <rrich at openjdk.org>
URL:       https://git.openjdk.org/jdk/commit/804ce0a2394cb3f837441976e5ef6eb4b9cab257
Stats:     203 lines in 7 files changed: 157 ins; 29 del; 17 mod

8370473: C2: Better Aligment of Vector Spill Slots

Reviewed-by: goetz, mdoerr

-------------

PR: https://git.openjdk.org/jdk/pull/27969


More information about the hotspot-compiler-dev mailing list