RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v10]
Volodymyr Paprotski
vpaprotski at openjdk.org
Mon Mar 24 15:19:25 UTC 2025
On Sat, 22 Mar 2025 16:45:31 GMT, Volodymyr Paprotski <vpaprotski at openjdk.org> wrote:
>> Ferenc Rakoczi has updated the pull request incrementally with one additional commit since the last revision:
>>
>> Fix windows build
>
> src/hotspot/cpu/x86/stubGenerator_x86_64_dilithium.cpp line 216:
>
>> 214: // Zmm8-Zmm23 used as scratch registers
>> 215: // result goes to Zmm0-Zmm7
>> 216: static void montMulByConst128(MacroAssembler *_masm) {
>
> wish the inputs and output register arrays were explicit.. easier to follow that way
Looking at this function some more.. I think you could remove this function and replace it with two calls to `montMul64`?
montMul64(xmm0_3, xmm0_3, xmm29_29, Scratch*, _masm);
montMul64(xmm4_7, xmm4_7, xmm29_29, Scratch*, _masm);
```
Scratch would have to be defined..
-------------
PR Review Comment: https://git.openjdk.org/jdk/pull/23860#discussion_r2008979257
More information about the hotspot-dev
mailing list