RFR: 8302906: AArch64: Add SVE backend support for vector unsigned comparison
Andrew Haley
aph at openjdk.org
Thu Feb 23 11:08:05 UTC 2023
On Thu, 23 Feb 2023 07:05:43 GMT, Chang Peng <duke at openjdk.org> wrote:
> This patch implements unsigned vector comparison on SVE.
>
> 1: Test:
> All vector API test cases[1][2] passed without new failure. Existing test cases can cover all unsigned comparison conditions for all kinds of vector.
>
> 2: Performance:
> (1): Benchmark:
> As existing benchmarks in panama repo (such as [3]) have some issues [4] (We will fix them in a separate patch.), I collected performance data with a reduced jmh benchmark [5]. e.g. for ByteVector unsigned comparison:
>
>
> @Benchmark
> public void byteVectorUnsignedCompare() {
> for (int j = 0; j < 200; j++) {
> for (int i = 0; i < bspecies.length(); i++) {
> ByteVector av = ByteVector.fromArray(bspecies, ba, i);
> ByteVector ca = ByteVector.fromArray(bspecies, bb, i);
> av.compare(VectorOperators.UNSIGNED_GT, ca).intoArray(br, i);
> }
> }
> }
>
>
> (2): Performance data
>
> Before:
>
>
> Benchmark Score(op/ms) Error
> ByteVector.UNSIGNED_GT#size(1024) 4.846 3.419
> ShortVector.UNSIGNED_GE#size(1024) 3.055 1.369
> IntVector.UNSIGNED_LT#size(1024) 3.475 1.269
> LongVector.UNSIGNED_LE#size(1024) 4.515 1.812
>
>
> After:
>
>
> Benchmark Score(op/ms) Error
> ByteVector.UNSIGNED_GT#size(1024) 493.937 1.389
> ShortVector.UNSIGNED_GE#size(1024) 5308.796 20.557
> IntVector.UNSIGNED_LT#size(1024) 4944.744 10.606
> LongVector.UNSIGNED_LE#size(1024) 8459.605 28.683
>
>
> [1] https://github.com/openjdk/jdk/tree/master/test/jdk/jdk/incubator/vector
> [2] https://github.com/openjdk/jdk/tree/master/test/hotspot/jtreg/compiler/vectorapi
> [3] https://github.com/openjdk/panama-vector/blob/2aade73adeabdf6a924136b17fd96ccc95c1d160/test/micro/org/openjdk/bench/jdk/incubator/vector/operation/ByteMaxVector.java#L1459
> [4] https://bugs.openjdk.org/browse/JDK-8282850
> [5] https://gist.github.com/changpeng1997/d311127e1015c107197f9b56a92b0fae
src/hotspot/cpu/aarch64/c2_MacroAssembler_aarch64.cpp line 1123:
> 1121: case BoolTest::uge: condition = Assembler::HS; break;
> 1122: case BoolTest::ult: z1 = zm; z2 = zn; condition = Assembler::HI; break;
> 1123: case BoolTest::ule: z1 = zm; z2 = zn; condition = Assembler::HS; break;
I'm sure you already know what to do here.
-------------
PR: https://git.openjdk.org/jdk/pull/12725
More information about the hotspot-compiler-dev
mailing list