Integrated: 8261542: X86 slice and unslice intrinsics for 256-bit byte/short vectors
Sandhya Viswanathan
sviswanathan at openjdk.java.net
Fri Feb 19 18:13:41 UTC 2021
On Thu, 11 Feb 2021 02:37:35 GMT, Sandhya Viswanathan <sviswanathan at openjdk.org> wrote:
> The slice and unslice intrinsics for 256-bit byte/short vectors can be implemented for x86 platforms supporting AVX2 using a sequence of instructions.
>
> JBS: https://bugs.openjdk.java.net/browse/JDK-8261542
>
> The PerfSliceOrigin.java JMH test attached to the JBS issue shows the following performance on an AVX2 platform.
>
> Before:
> Benchmark                                 (size)   Mode  Cnt      Score      Error   Units
> PerfSliceOrigin.vectorSliceOrigin           1024  thrpt    5     18.887 ±   1.128  ops/ms
> PerfSliceOrigin.vectorSliceUnsliceOrigin    1024  thrpt    5      9.374 ±   0.370  ops/ms
>
> After:
> Benchmark                                 (size)   Mode  Cnt      Score      Error   Units
> PerfSliceOrigin.vectorSliceOrigin           1024  thrpt    5  13861.420 ±  19.071  ops/ms
> PerfSliceOrigin.vectorSliceUnsliceOrigin    1024  thrpt    5   7895.199 ± 142.580  ops/ms
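
For readers unfamiliar with the two operations, the following is a minimal scalar sketch of what the single-argument slice(origin) and unslice(origin) operations of the Vector API compute, lane by lane. It is not the intrinsic from this change and does not show the AVX2 instruction sequence; the class name SliceSketch and the use of a plain byte[] as a stand-in for a 256-bit byte vector (32 lanes) are assumptions made for illustration only.

```java
import java.util.Arrays;

// Hypothetical scalar model; a byte[] stands in for the lanes of one vector.
final class SliceSketch {

    // slice(origin): result lane i is a[origin + i]; lanes that would read
    // past the end of the vector are filled with zero.
    public static byte[] slice(byte[] a, int origin) {
        checkOrigin(a, origin);
        byte[] r = new byte[a.length];              // zero background
        System.arraycopy(a, origin, r, 0, a.length - origin);
        return r;
    }

    // unslice(origin): result lane i is a[i - origin] for i >= origin;
    // lanes below origin are zero, and the top lanes of a are dropped.
    public static byte[] unslice(byte[] a, int origin) {
        checkOrigin(a, origin);
        byte[] r = new byte[a.length];              // zero background
        System.arraycopy(a, 0, r, origin, a.length - origin);
        return r;
    }

    // The origin is assumed valid in the inclusive range 0..length.
    private static void checkOrigin(byte[] a, int origin) {
        if (origin < 0 || origin > a.length) {
            throw new IndexOutOfBoundsException("origin out of range: " + origin);
        }
    }

    public static void main(String[] args) {
        byte[] v = new byte[32];                    // 32 byte lanes = 256 bits
        for (int i = 0; i < v.length; i++) {
            v[i] = (byte) (i + 1);
        }
        System.out.println(Arrays.toString(slice(v, 4)));
        System.out.println(Arrays.toString(unslice(v, 4)));
    }
}
```

Without an intrinsic, an operation like this is carried out lane by lane in Java code; the change above lets C2 emit vector instructions for the 256-bit byte/short case instead.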
This pull request has now been integrated.
Changeset: c53acc2a
Author: Sandhya Viswanathan <sviswanathan at openjdk.org>
URL: https://git.openjdk.java.net/jdk/commit/c53acc2a
Stats: 120 lines in 7 files changed: 100 ins; 5 del; 15 mod
8261542: X86 slice and unslice intrinsics for 256-bit byte/short vectors
Reviewed-by: kvn, neliasso
-------------
PR: https://git.openjdk.java.net/jdk/pull/2520