RFR: 8267663: AArch64: Add unsigned comparison operators on AArch64

Andrew Haley aph at openjdk.java.net
Sat Jun 5 08:02:59 UTC 2021


On Fri, 4 Jun 2021 11:35:16 GMT, Eric Liu <eliu at openjdk.org> wrote:

> This patch implements unsigned vector comparison on AArch64. In my
> local runs of Byte128Vector.java[1], unsigned comparison performance
> improves by about 4x to 5x.
> 
> Before:
> Benchmark                               Score(op/ms)     Error
> Byte128Vector.UNSIGNED_GE#size(1024)    99.953           6.17
> Byte128Vector.UNSIGNED_GT#size(1024)    95.334           8.865
> Byte128Vector.UNSIGNED_LE#size(1024)    76.908           24.332
> Byte128Vector.UNSIGNED_LT#size(1024)    78.362           23.507
> 
> After:
> Benchmark                               Score(op/ms)     Error
> Byte128Vector.UNSIGNED_GE#size(1024)    421.809          25.57
> Byte128Vector.UNSIGNED_GT#size(1024)    420.653          26.779
> Byte128Vector.UNSIGNED_LE#size(1024)    316.754          92.889
> Byte128Vector.UNSIGNED_LT#size(1024)    423.683          26.508
> 
> [Test]
> - All vector API test cases passed with no new failures. 8265312[2]
>   has already implemented this on x86 and supplied sufficient test
>   cases for all vector types.
> - No performance regression for other comparisons.
> - libjvm.so shrinks by about 200KB after this patch because the
>   vector compare rules are combined.
> 
> [1] https://github.com/openjdk/panama-vector/blob/vectorIntrinsics/test/jdk/jdk/incubator/vector/benchmark/src/main/java/benchmark/jdk/incubator/vector/Byte128Vector.java#L1198
> [2] https://github.com/openjdk/panama-vector/pull/68

src/hotspot/cpu/aarch64/aarch64_neon_ad.m4 line 889:

> 887: 
> 888: static void neon_compare(C2_MacroAssembler masm, FloatRegister dst, BasicType bt,
> 889:                          FloatRegister src1, FloatRegister src2, int cond, bool isX) {

Passing masm by value, rather than reference, is unconventional. There's no reason to do that.
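
To illustrate the difference, here is a minimal, self-contained sketch. ToyAssembler, toy_copies, compare_by_value and compare_by_ref are hypothetical names invented for this example; it is not the HotSpot C2_MacroAssembler and makes no claim about how that class behaves when copied. It only shows that a by-value parameter invokes the copy constructor and that the callee then works on the copy, while a reference parameter works on the caller's object.

```cpp
// Hypothetical stand-in for an assembler object; NOT the HotSpot class.
int toy_copies = 0;  // counts how many times a ToyAssembler is copied

struct ToyAssembler {
    int emitted = 0;  // number of "instructions" emitted through this object

    ToyAssembler() = default;
    ToyAssembler(const ToyAssembler& other) : emitted(other.emitted) {
        ++toy_copies;  // record every copy construction
    }

    void emit() { ++emitted; }
};

// By value: the argument is copy-constructed, and emit() updates the copy.
void compare_by_value(ToyAssembler masm) { masm.emit(); }

// By reference: no copy is made, and emit() updates the caller's object.
void compare_by_ref(ToyAssembler& masm) { masm.emit(); }
```

In this toy, calling compare_by_value(a) on a fresh object bumps toy_copies and leaves a.emitted unchanged, whereas compare_by_ref(a) makes no copy and updates a directly. Under that reading, the conventional signature for the helper quoted above would take `C2_MacroAssembler& masm`.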

src/hotspot/cpu/aarch64/aarch64_neon_ad.m4 line 930:

> 928: }
> 929: %}
> 930: 

This stuff should be in C2_MacroAssembler.

-------------

PR: https://git.openjdk.java.net/jdk/pull/4358

