RFR: 8360934: Add AVX-512 intrinsics for ML-KEM - enhancement on AVX512_VBMI and AVX512_VBMI2 [v2]
Shawn M Emery
duke at openjdk.org
Sat Jan 3 00:23:13 UTC 2026
> This change allows use of the AVX512_VBMI/VMBI2 instruction set to further optimize decompression/parsing of polynomial coefficients for ML-KEM. The speedup gained in the ML-KEM benchmarks for key generation is between 0.2 to 0.5%, encapsulation is 0.3 to 1.5%, and decapsulation is 0 to 0.9%.
>
> Thank you to @sviswa7 and @ferakocz for their help in working through the early stages of this code with me.
Shawn M Emery has updated the pull request incrementally with one additional commit since the last revision:
Update copyright year
-------------
Changes:
- all: https://git.openjdk.org/jdk/pull/28815/files
- new: https://git.openjdk.org/jdk/pull/28815/files/d2cadaf9..7cd8de53
Webrevs:
- full: https://webrevs.openjdk.org/?repo=jdk&pr=28815&range=01
- incr: https://webrevs.openjdk.org/?repo=jdk&pr=28815&range=00-01
Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod
Patch: https://git.openjdk.org/jdk/pull/28815.diff
Fetch: git fetch https://git.openjdk.org/jdk.git pull/28815/head:pull/28815
PR: https://git.openjdk.org/jdk/pull/28815
More information about the hotspot-compiler-dev
mailing list