[foreign-memaccess+abi] Integrated: 8275646: Implement optimized upcall stubs on AArch64
Nick Gasson
ngasson at openjdk.java.net
Tue Nov 9 09:08:03 UTC 2021
On Fri, 5 Nov 2021 06:47:19 GMT, Nick Gasson <ngasson at openjdk.org> wrote:
> This is a fairly direct port of the X86 code. The changes to
> frame_aarch64 and foreign_globals_aarch64 are identical: perhaps
> ForeignGlobals::parse_call_regs_impl() could be moved to shared code?
>
> The X86 version has a call to reinit_heapbase() before the return from
> the stub. I think this is redundant because the heap base register will
> be clobbered immediately by the native code?
>
> I hit a really weird bug in release builds where the first few
> instructions in the code buffer were overwritten by the fields of
> OptimizedEntryBlob. I think we need to pass sizeof(OptimizedEntryBlob)
> instead of sizeof(BufferBlob) as the header_size argument to the
> RuntimeBlob constructor (none of the other subclasses of BufferBlob have
> extra fields). I added a header_size argument to BufferBlob's
> constructor to thread this through.
>
> I removed the calls to change the W^X state in on_entry/on_exit calls:
> in the on_entry case the stub must already be executable because we
> called into the VM from there, and for on_exit we need the code to be
> executable not writable otherwise we'll get a SIGBUS as soon as we
> return to the stub. The newly added stack tests in TestUpcall hit
> JDK-8275584 on MacOS/AArch64 so I've problem-listed that for now.
>
> JMH results from org.openjdk.bench.jdk.incubator.foreign.Upcalls before:
>
>
> Benchmark Mode Cnt Score Error Units
> Upcalls.jni_args10 avgt 30 450.417 ? 4.755 ns/op
> Upcalls.jni_args5 avgt 30 245.898 ? 3.171 ns/op
> Upcalls.jni_blank avgt 30 195.606 ? 5.459 ns/op
> Upcalls.jni_identity avgt 30 369.788 ? 15.165 ns/op
> Upcalls.panama_args10 avgt 30 1253.189 ? 62.261 ns/op
> Upcalls.panama_args5 avgt 30 927.101 ? 35.369 ns/op
> Upcalls.panama_blank avgt 30 637.708 ? 11.353 ns/op
> Upcalls.panama_identity avgt 30 697.109 ? 9.971 ns/op
>
>
> After:
>
>
> Benchmark Mode Cnt Score Error Units
> Upcalls.jni_args10 avgt 30 455.304 ? 21.838 ns/op
> Upcalls.jni_args5 avgt 30 247.279 ? 2.513 ns/op
> Upcalls.jni_blank avgt 30 194.113 ? 4.317 ns/op
> Upcalls.jni_identity avgt 30 366.145 ? 4.912 ns/op
> Upcalls.panama_args10 avgt 30 236.337 ? 11.072 ns/op
> Upcalls.panama_args5 avgt 30 223.858 ? 12.345 ns/op
> Upcalls.panama_blank avgt 30 203.631 ? 8.840 ns/op
> Upcalls.panama_identity avgt 30 208.783 ? 9.914 ns/op
>
>
> Tested tier1 and jdk_foreign on Linux/AArch64 and MacOS/AArch64.
This pull request has now been integrated.
Changeset: 9e9ba2e9
Author: Nick Gasson <ngasson at openjdk.org>
URL: https://git.openjdk.java.net/panama-foreign/commit/9e9ba2e99d3218f37b89e9cd0dae431f471be487
Stats: 370 lines in 9 files changed: 336 ins; 4 del; 30 mod
8275646: Implement optimized upcall stubs on AArch64
Reviewed-by: jvernee
-------------
PR: https://git.openjdk.java.net/panama-foreign/pull/610
More information about the panama-dev
mailing list