some questions about vm code implement

Jorn Vernee jorn.vernee at oracle.com
Fri Sep 16 15:23:43 UTC 2022


Hi Ziai,

(Sorry, I think I mistyped your name last time)

The principle of design of the VM code is that it should make no 
assumptions about the native ABI that is being used (though it can make 
assumptions about the Java ABI). How to link against an ABI should 
instead be entirely communicated from the Java side, through e.g. 
ABIDescriptor.

Since at the moment we don't pass information about which register parts 
should be saved, x86_64 conservatively saves the entire register if it 
is not volatile according to the ABI. This is perhaps a missed 
optimization opportunity. More information could be added to 
ABIDescriptor to describe which parts of a register should be saved.

The AArch64 code is less conservative, and only saves the lower 8 bytes 
of a FloatRegister. In practice this works out for the all the ABIs we 
target, which is currently only the platform's C ABI. But, this is 
essentially a hard-coded assumption, and should ideally be communicated 
through ABIDescriptor. (x86_64 also makes such an assumption about the 
mxcsr status register).

I think this area could be improved. I've filed: JDK-8293947 [1] to keep 
track of that.

For now, when adding support for new architectures I recommend 
conservatively saving the entire register if the ABIDescriptor says it 
is non-volatile (i.e. callee-saved).

HTH,
Jorn

[1]: https://bugs.openjdk.org/browse/JDK-8293947

On 16/09/2022 04:45, ziai li wrote:
> Hi all:
>
> In the process of reading the code, I have some doubts about the 
> implementation of vm code
> for example in cpu/aarch64/upcallLinker_aarch64.cpp
>
> // for callee saved regs, according to the caller's ABI
> static int compute_reg_save_area_size(const ABIDescriptor& abi) {
>
>   for (int i = 0; i < FloatRegisterImpl::number_of_registers; i++) {
>     FloatRegister reg = as_FloatRegister(i);
>     if (!abi.is_volatile_reg(reg)) {
>       // Only the lower 64 bits of vector registers need to be preserved.
>       size += 8; // bytes
>     }
>   }
> }
>
> static void preserve_callee_saved_registers(MacroAssembler* _masm, 
> const ABIDescriptor& abi, int reg_save_area_offset) {
>
>   for (int i = 0; i < FloatRegisterImpl::number_of_registers; i++) {
>     FloatRegister reg = as_FloatRegister(i);
>     if (!abi.is_volatile_reg(reg)) {
>       __ strd(reg, Address(sp, offset));
>       offset += 8;
>     }
>   }
> }
>
> static void restore_callee_saved_registers(MacroAssembler* _masm, 
> const ABIDescriptor& abi, int reg_save_area_offset) {
>
>   for (int i = 0; i < FloatRegisterImpl::number_of_registers; i++) {
>     FloatRegister reg = as_FloatRegister(i);
>     if (!abi.is_volatile_reg(reg)) {
>       __ ldrd(reg, Address(sp, offset));
>       offset += 8;
>     }
>   }
> }
>
> In the comments of the function compute_reg_save_area_size, only the 
> lower 64 bits of vector registers need to be preserved, so the size is 
> set to 8. Is this due to the calling convention of aarch64 or is it 
> because only floating point data needs to be saved? I have the same 
> question about the restore_callee_saved_registers and 
> preserve_callee_saved_registers functions.
> The x86 code is implemented like this:
>
> static int compute_reg_save_area_size(const ABIDescriptor& abi) {
>
>   for (XMMRegister reg = as_XMMRegister(0); is_valid_XMM(reg); reg = 
> reg->successor()) {
>     if (!abi.is_volatile_reg(reg)) {
>       if (UseAVX >= 3) {
>         size += 64; // bytes
>       } else if (UseAVX >= 1) {
>         size += 32;
>       } else {
>         size += 16;
>       }
>     }
>   }
> }
>
> So I don't know what principle the value of size is set according to, 
> such as riscv, what should the value of size be, 8 or 16 or what?
>
> Looking forward to your reply
>
> ziai li


More information about the panama-dev mailing list