RFR: 8360934: Add AVX-512 intrinsics for ML-KEM - enhancement on AVX512_VBMI [v3]
Volodymyr Paprotski
vpaprotski at openjdk.org
Thu Jan 8 16:28:10 UTC 2026
On Thu, 8 Jan 2026 00:24:11 GMT, Shawn M Emery <duke at openjdk.org> wrote:
>> This change allows use of the AVX512_VBMI instruction set to further optimize decompression/parsing of polynomial coefficients for ML-KEM. The speedup gained in the ML-KEM benchmarks for key generation is between 0.3 to 0.6%, encapsulation is 0.4 to 1.7%, and decapsulation is 0.3 to 1.9%.
>>
>> Thank you to @sviswa7 and @ferakocz for their help in working through the early stages of this code with me.
>
> Shawn M Emery has updated the pull request incrementally with one additional commit since the last revision:
>
> 8360934: Add AVX-512 intrinsics for ML-KEM - enhancement on AVX512_VBMI
> Change Swap to Dup named function/variable
> Check for only VBMI support (not VBMI2)
Looks good to me
-------------
Marked as reviewed by vpaprotski (Committer).
PR Review: https://git.openjdk.org/jdk/pull/28815#pullrequestreview-3640216667
More information about the hotspot-compiler-dev
mailing list