[vectorIntrinsics] RFR: Improve mask reduction operations on AVX [v3]
Mai Đặng Quân Anh
duke at openjdk.java.net
Tue Nov 9 16:14:26 UTC 2021
> Hi,
> This patch improves the logic of vector mask reduction operations on AVX, especially int, float, long, double, by using vmovmskpd and vmovmskps instructions. I also do a little refactoring to reduce duplication in toLong. The patch temporarily disables these operations on Neon, though.
> Thank you very much.
Mai Đặng Quân Anh has updated the pull request incrementally with two additional commits since the last revision:
- support for non-bmi, some refinement
- restore VectorStoreMaskNode, move logic to backend
-------------
Changes:
- all: https://git.openjdk.java.net/panama-vector/pull/158/files
- new: https://git.openjdk.java.net/panama-vector/pull/158/files/4d0e7936..d3249aee
Webrevs:
- full: https://webrevs.openjdk.java.net/?repo=panama-vector&pr=158&range=02
- incr: https://webrevs.openjdk.java.net/?repo=panama-vector&pr=158&range=01-02
Stats: 142 lines in 6 files changed: 93 ins; 16 del; 33 mod
Patch: https://git.openjdk.java.net/panama-vector/pull/158.diff
Fetch: git fetch https://git.openjdk.java.net/panama-vector pull/158/head:pull/158
PR: https://git.openjdk.java.net/panama-vector/pull/158
More information about the panama-dev
mailing list