[vectorIntrinsics] RFR: Improve mask reduction operations on AVX [v3]

Tue Nov 9 16:14:26 UTC 2021

> Hi,
> This patch improves the logic of vector mask reduction operations on AVX, especially int, float, long, double, by using vmovmskpd and vmovmskps instructions. I also do a little refactoring to reduce duplication in toLong. The patch temporarily disables these operations on Neon, though.
> Thank you very much.

Mai Đặng Quân Anh has updated the pull request incrementally with two additional commits since the last revision:

 - support for non-bmi, some refinement
 - restore VectorStoreMaskNode, move logic to backend

-------------

Changes:
  - all: https://git.openjdk.java.net/panama-vector/pull/158/files
  - new: https://git.openjdk.java.net/panama-vector/pull/158/files/4d0e7936..d3249aee

Webrevs:
 - full: https://webrevs.openjdk.java.net/?repo=panama-vector&pr=158&range=02
 - incr: https://webrevs.openjdk.java.net/?repo=panama-vector&pr=158&range=01-02

  Stats: 142 lines in 6 files changed: 93 ins; 16 del; 33 mod
  Patch: https://git.openjdk.java.net/panama-vector/pull/158.diff
  Fetch: git fetch https://git.openjdk.java.net/panama-vector pull/158/head:pull/158

PR: https://git.openjdk.java.net/panama-vector/pull/158