RFR: JDK-8214527 AArch64: ZGC for Aarch64

Stuart Monteith stuart.monteith at linaro.org
Mon Jun 10 21:38:49 UTC 2019


Thanks for looking at this.

With ZGC we are emitting more lea macro instructions. In some
circumstances I found that a post indexed address was being passed to
LEA, and so we have to cover that circumstance. In this case, the
effective address is just the base address, as it is changed after the
address is calculated.

rscratch2 is loaded with the effective source address, which you'll
see is passed to the load barrier routine, further down in the method
- would you like a comment clarifying?
     // call_VM_leaf uses rscratch1.
     __ call_VM_leaf(ZBarrierSetRuntime::load_barrier_on_oop_field_preloaded_addr(decorators),
dst, rscratch2);

As for the vector registers, I should be saving the full 128 bits of v0 to v31?
 ZBarrierSetAssembler::load_at on x86 saves 8 SSE XMM registers -
would they be considered the same as the save-on-call d0-d7 registers
on Aarch64?

I notice that z_x86_64.ad will kill xmm, ymm and zmm registers (SSE,
AVX, AVX-512 registers?), depending on the machine it is on. I presume
the risk we have here is that during autovectorization these registers
will be lost if we are unlucky enough to have the barrier code
overwrite them.

BR,
   Stuart

On Mon, 10 Jun 2019 at 17:39, Andrew Haley <aph at redhat.com> wrote:
>
> On 6/10/19 3:02 PM, Stuart Monteith wrote:
> > Nils' patch for "ZGC Late Barrier Insertion" has been merged
> > (http://hg.openjdk.java.net/jdk/jdk/rev/ed12027517c0). I'm now running
> > a fresh jtreg test against it with my updated patch here:
> >
> >      http://cr.openjdk.java.net/~smonteith/8214527/webrev.4/
> >
> > The difference from before are some additions to z_aarch64.ad in order
> > to implement the new nodes required by Nils' patch. Running against
> > specjbb2015 and Lucene don't throw up any errors.
>
> This hunk is very weird.
>
> --- old/src/hotspot/cpu/aarch64/assembler_aarch64.cpp   2019-06-10 14:25:39.274238301 +0100
> +++ new/src/hotspot/cpu/aarch64/assembler_aarch64.cpp   2019-06-10 14:25:39.026235784 +0100
> @@ -1265,6 +1265,13 @@
>        __ movptr(r, (uint64_t)target());
>      break;
>    }
> +  case post: {
> +    // Post-indexed, just copy the contents of the register. Offset added afterwards.
> +    if (_base == r) // it's a nop
> +      break;
> +    __ mov(r, _base);
> +    break;
> +  }
>    default:
>      ShouldNotReachHere();
>    }
>
> What is going on here:
>
> +
> +  // rscratch1 can be passed as src or dst, so don't use it.
> +  RegSet savedRegs = RegSet::of(rscratch2, rheapbase);
> +
> +  Label done;
> +  assert_different_registers(rheapbase, rscratch2, dst);
> +  assert_different_registers(rheapbase, rscratch2, src.base());
> +
> +  __ push(savedRegs, sp);
> +
> +  // Load bad mask into scratch register.
> +  __ ldr(rheapbase, address_bad_mask_from_thread(rthread));
> +  __ lea(rscratch2, src);
>
> You load an address into rscratch2 but you do not use rscratch2.
>
> Barrier stubs save int registers but not vectors. Why is that?
>
> Surely this file is nearly identical to x86:
>
> --- /dev/null   2019-06-10 08:42:37.317240407 +0100
> +++ new/src/hotspot/os_cpu/linux_aarch64/gc/z/zBackingFile_linux_aarch64.cpp    2019-06-10 14:25:44.374290036 +0100
> @@ -0,0 +1,590 @@
>
> --
> Andrew Haley
> Java Platform Lead Engineer
> Red Hat UK Ltd. <https://www.redhat.com>
> https://keybase.io/andrewhaley
> EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671



More information about the hotspot-gc-dev mailing list