RFR: 8256061: RegisterSaver::save_live_registers() omits upper halves of ZMM0-15 registers
Sandhya Viswanathan
sviswanathan at openjdk.java.net
Mon Nov 9 21:28:56 UTC 2020
On Mon, 9 Nov 2020 16:44:23 GMT, Vladimir Ivanov <vlivanov at openjdk.org> wrote:
> `YMM0-15` registers are handled specially when CPU registers are saved. They are split in 2 parts (128-bit each) and put in different parts of the frame (see `RegisterSaver::layout` for details). AVX512 adds 16 more vector registers (ZMM16-31) and those are saved full-sized in a separate region. But `RegisterSaver::save_live_registers()` doesn't do anything special for `ZMM0-15` and their upper halves are lost (though there's space reserved for them in the frame).
>
> The fix adds missing logic which saves upper halves (256-bit in size) of ZMM0-15 registers. Thus every ZMM0-15 register ends up split into 3 parts which are stored independently in the frame.
>
> Testing (with some other relevant patches):
> - [x] jdk/incubator/vector w/ -XX:+DeoptimizeALot and -XX:UseAVX=3 on AVX512-capable hardware
> - [x] hs-precheckin-comp, hs-tier1, hs-tier2
It looks like the upper 256 bits of ZMM0-15 are already being saved as part of the following statements in sharedRuntime_x86_64.cpp:
191 if (save_vectors) {
192 // Save upper half of YMM registers(0..15)
193 int base_addr = XSAVE_AREA_YMM_BEGIN;
194 for (int n = 0; n < 16; n++) {
195 __ vextractf128_high(Address(rsp, base_addr+n*16), as_XMMRegister(n));
196 }
197 if (VM_Version::supports_evex()) {
198 // Save upper half of ZMM registers(0..15)
199 base_addr = XSAVE_AREA_ZMM_BEGIN;
200 for (int n = 0; n < 16; n++) {
201 __ vextractf64x4_high(Address(rsp, base_addr+n*32), as_XMMRegister(n));
202 }
203 // Save full ZMM registers(16..num_xmm_regs)
204 base_addr = XSAVE_AREA_UPPERBANK;
205 off = 0;
206 int vector_len = Assembler::AVX_512bit;
207 for (int n = 16; n < num_xmm_regs; n++) {
208 __ evmovdqul(Address(rsp, base_addr+(off++*64)), as_XMMRegister(n), vector_le
n);
209 }
210 }
211 }
-------------
PR: https://git.openjdk.java.net/jdk/pull/1131
More information about the hotspot-compiler-dev
mailing list