RFR: 8325991: Accelerate Poly1305 on x86_64 using AVX2 instructions [v12]

Srinivas Vamsi Parasa duke at openjdk.org
Fri Mar 8 17:17:19 UTC 2024


> The goal of this PR is to accelerate the Poly1305 algorithm using AVX2 instructions (including IFMA) for x86_64 CPUs.
> 
> This implementation is directly based on the AVX2 Poly1305 hash computation as implemented in Intel(R) Multi-Buffer Crypto for IPsec Library (url: https://github.com/intel/intel-ipsec-mb/blob/main/lib/avx2_t3/poly_fma_avx2.asm)
> 
> This PR shows upto 19x speedup on buffer sizes of 1MB.

Srinivas Vamsi Parasa has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 27 commits:

 - add missing avx_ifma in amd64.java
 - Merge branch 'master' of https://github.com/vamsi-parasa/jdk into jdk_poly
 - update asserts for vpmadd52l/hq
 - Update description of Poly1305 algo
 - add cpuinfo test for avx_ifma
 - fix checks for vpmadd52*
 - fix use_vl to true for vpmadd52* instrs
 - fix merge issues with avx_ifma
 - Merge branch 'master' of https://git.openjdk.java.net/jdk into jdk_poly
 - removed unused merge, faster and, redundant mov
 - ... and 17 more: https://git.openjdk.org/jdk/compare/5aae8030...35d39dc5

-------------

Changes: https://git.openjdk.org/jdk/pull/17881/files
 Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=17881&range=11
  Stats: 810 lines in 10 files changed: 800 ins; 0 del; 10 mod
  Patch: https://git.openjdk.org/jdk/pull/17881.diff
  Fetch: git fetch https://git.openjdk.org/jdk.git pull/17881/head:pull/17881

PR: https://git.openjdk.org/jdk/pull/17881


More information about the hotspot-compiler-dev mailing list