Integrated: 8297753: AArch64: Add optimized rules for vector compare with zero on NEON

changpeng1997 duke at openjdk.org
Fri Mar 3 12:14:40 UTC 2023


On Tue, 3 Jan 2023 08:24:50 GMT, changpeng1997 <duke at openjdk.org> wrote:

> We can use compare-with-zero instructions such as cmgt(zero)[1] directly, avoiding the extra scalar2vector operations.
> 
> The following instruction sequence
> 
> movi  v16.4s, #0x0
> cmgt  v16.4s, v17.4s, v16.4s
> 
> can be optimized to:
> 
> cmgt  v16.4s, v17.4s, #0x0
> 
> This patch does the following:
> 1. Add NEON floating-point compare-with-zero instructions.
> 2. Add optimized match rules to generate the compare-with-zero instructions.
> 
> [1]: https://developer.arm.com/documentation/ddi0602/2022-06/SIMD-FP-Instructions/CMGT--zero---Compare-signed-Greater-than-zero--vector--
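
For readers who want to check the equivalence of the two sequences quoted above, here is a minimal scalar sketch in Java. It is not code from the patch: the class and method names (CmgtZeroSketch, cmgt, cmgtZero) are hypothetical, and plain int arrays stand in for the four 32-bit lanes of a NEON register. It assumes only the documented lane semantics of CMGT, where a lane is set to all ones when the signed comparison holds and to zero otherwise.

```java
import java.util.Arrays;

// Hypothetical illustration only; none of these names come from the JDK patch.
public class CmgtZeroSketch {

    // Emulates "cmgt vd, vn, vm": a lane is all ones (-1) if a[i] > b[i] (signed), else 0.
    public static int[] cmgt(int[] a, int[] b) {
        int[] r = new int[a.length];
        for (int i = 0; i < a.length; i++) {
            r[i] = a[i] > b[i] ? -1 : 0;
        }
        return r;
    }

    // Emulates "cmgt vd, vn, #0": same comparison, with zero as an implicit operand.
    public static int[] cmgtZero(int[] a) {
        int[] r = new int[a.length];
        for (int i = 0; i < a.length; i++) {
            r[i] = a[i] > 0 ? -1 : 0;
        }
        return r;
    }

    public static void main(String[] args) {
        int[] v17 = {5, 0, -3, Integer.MAX_VALUE};

        // Two-instruction form: materialize a zero vector (movi), then compare.
        int[] zero = new int[v17.length];
        int[] twoInsn = cmgt(v17, zero);

        // One-instruction form: compare against the immediate zero.
        int[] oneInsn = cmgtZero(v17);

        System.out.println(Arrays.toString(oneInsn));        // prints [-1, 0, 0, -1]
        System.out.println(Arrays.equals(twoInsn, oneInsn)); // prints true
    }
}
```

Both forms yield the same lane mask, so the zero vector never needs to be materialized in a register.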

This pull request has now been integrated.

Changeset: d23a8bfb
Author:    changpeng1997 <Chang.Peng at arm.com>
Committer: Andrew Dinn <adinn at openjdk.org>
URL:       https://git.openjdk.org/jdk/commit/d23a8bfb14037460731fb6ca1890b03278b84b1a
Stats:     1054 lines in 11 files changed: 548 ins; 6 del; 500 mod

8297753: AArch64: Add optimized rules for vector compare with zero on NEON

Reviewed-by: aph

-------------

PR: https://git.openjdk.org/jdk/pull/11822

