[vectorIntrinsics] RFR: Improve mask reduction operations on AVX [v4]

Tue Nov 16 08:16:11 UTC 2021

> Hi,
> This patch improves the logic of vector mask reduction operations on AVX, especially int, float, long, double, by using vmovmskpd and vmovmskps instructions. I also do a little refactoring to reduce duplication in toLong. The patch temporarily disables these operations on Neon, though.
> Thank you very much.

Mai Đặng Quân Anh has updated the pull request incrementally with one additional commit since the last revision:

  use duplicate dst instead of noreg, use lzcnt for hardwares support it

-------------

Changes:
  - all: https://git.openjdk.java.net/panama-vector/pull/158/files
  - new: https://git.openjdk.java.net/panama-vector/pull/158/files/d3249aee..de9de8f5

Webrevs:
 - full: https://webrevs.openjdk.java.net/?repo=panama-vector&pr=158&range=03
 - incr: https://webrevs.openjdk.java.net/?repo=panama-vector&pr=158&range=02-03

  Stats: 37 lines in 3 files changed: 7 ins; 1 del; 29 mod
  Patch: https://git.openjdk.java.net/panama-vector/pull/158.diff
  Fetch: git fetch https://git.openjdk.java.net/panama-vector pull/158/head:pull/158

PR: https://git.openjdk.java.net/panama-vector/pull/158