RFR: 8325991: Accelerate Poly1305 on x86_64 using AVX2 instructions [v12]
Srinivas Vamsi Parasa
duke at openjdk.org
Fri Mar 8 17:17:19 UTC 2024
> The goal of this PR is to accelerate the Poly1305 algorithm using AVX2 instructions (including IFMA) for x86_64 CPUs.
>
> This implementation is directly based on the AVX2 Poly1305 hash computation as implemented in Intel(R) Multi-Buffer Crypto for IPsec Library (url: https://github.com/intel/intel-ipsec-mb/blob/main/lib/avx2_t3/poly_fma_avx2.asm)
>
> This PR shows upto 19x speedup on buffer sizes of 1MB.
Srinivas Vamsi Parasa has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 27 commits:
- add missing avx_ifma in amd64.java
- Merge branch 'master' of https://github.com/vamsi-parasa/jdk into jdk_poly
- update asserts for vpmadd52l/hq
- Update description of Poly1305 algo
- add cpuinfo test for avx_ifma
- fix checks for vpmadd52*
- fix use_vl to true for vpmadd52* instrs
- fix merge issues with avx_ifma
- Merge branch 'master' of https://git.openjdk.java.net/jdk into jdk_poly
- removed unused merge, faster and, redundant mov
- ... and 17 more: https://git.openjdk.org/jdk/compare/5aae8030...35d39dc5
-------------
Changes: https://git.openjdk.org/jdk/pull/17881/files
Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=17881&range=11
Stats: 810 lines in 10 files changed: 800 ins; 0 del; 10 mod
Patch: https://git.openjdk.org/jdk/pull/17881.diff
Fetch: git fetch https://git.openjdk.org/jdk.git pull/17881/head:pull/17881
PR: https://git.openjdk.org/jdk/pull/17881
More information about the hotspot-compiler-dev
mailing list