[vectorIntrinsics] RFR: Improve mask reduction operations on AVX [v4]
Mai Đặng Quân Anh
duke at openjdk.java.net
Tue Nov 16 08:16:11 UTC 2021
> Hi,
> This patch improves the logic of vector mask reduction operations on AVX, especially int, float, long, double, by using vmovmskpd and vmovmskps instructions. I also do a little refactoring to reduce duplication in toLong. The patch temporarily disables these operations on Neon, though.
> Thank you very much.
Mai Đặng Quân Anh has updated the pull request incrementally with one additional commit since the last revision:
use duplicate dst instead of noreg, use lzcnt for hardwares support it
-------------
Changes:
- all: https://git.openjdk.java.net/panama-vector/pull/158/files
- new: https://git.openjdk.java.net/panama-vector/pull/158/files/d3249aee..de9de8f5
Webrevs:
- full: https://webrevs.openjdk.java.net/?repo=panama-vector&pr=158&range=03
- incr: https://webrevs.openjdk.java.net/?repo=panama-vector&pr=158&range=02-03
Stats: 37 lines in 3 files changed: 7 ins; 1 del; 29 mod
Patch: https://git.openjdk.java.net/panama-vector/pull/158.diff
Fetch: git fetch https://git.openjdk.java.net/panama-vector pull/158/head:pull/158
PR: https://git.openjdk.java.net/panama-vector/pull/158
More information about the panama-dev
mailing list