RFR: 8282204: Use lea instructions for arithmetic operations on x86_64 [v11]
Jie Fu
jiefu at openjdk.java.net
Sat Mar 12 02:31:42 UTC 2022
On Sat, 12 Mar 2022 02:17:40 GMT, Quan Anh Mai <duke at openjdk.java.net> wrote:
>> src/hotspot/cpu/x86/assembler_x86.cpp line 2393:
>>
>>> 2391: InstructionMark im(this);
>>> 2392: #ifdef _LP64
>>> 2393: emit_int8(0x67); // addr32
>>
>> By removing the 0x67 prefix, we are now doing 64 bit computation even for 32 bit inputs. Wouldn't this cause partial register stalls (see section 3.5.2.4 end of section).
>
> Partial writes only happen with data size smaller than 32-bit since 32-bit writes are zero-extended to full register width. As a result, there should be no additional stalls here.
> By removing the 0x67 prefix, we are now doing 64 bit computation even for 32 bit inputs. Wouldn't this cause partial register stalls (see section 3.5.2.4 end of section).
I agree with @merykitty .
The manual mentions
Note that in Intel 64 architecture, an update to the lower 32 bits of a 64 bit integer register is architecturally defined to zero extend the upper 32 bits. While this action may be logically viewed as a 32 bit
update, it is really a 64 bit update (and therefore does not cause a partial stall).
TBH, I didn't know this opt rule before.
Thanks @sviswa7 for sharing this.
-------------
PR: https://git.openjdk.java.net/jdk/pull/7560
More information about the hotspot-compiler-dev
mailing list