RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v3]

Jatin Bhateja jbhateja at openjdk.org
Thu Mar 6 09:34:53 UTC 2025


On Wed, 5 Mar 2025 18:27:44 GMT, Ferenc Rakoczi <duke at openjdk.org> wrote:

>> Hi @ferakocz , 
>> 
>> Thanks!, for efficient utilization of Decode ICache (please refer to Intel SDM section 3.4.2.5), code blocks should be aligned to 32-byte boundaries; a 64-byte aligned code is a superset of both 16 and 32 byte aligned addresses and also matches with the cacheline size. However, I can noticed that we have been using OptoLoopAlignment at places in AES-GCM also.
>> 
>> I introduced some errors in generate_dilithiumAlmostInverseNtt_avx512 implementation in anticipation of catching it through existing ML_DSA_Tests under 
>> test/jdk/sun/security/provider/acvp
>> 
>> But all the tests passed for me.
>> `java  -jar /home/jatinbha/sandboxes/jtreg/build/images/jtreg/lib/jtreg.jar -jdk:$JAVA_HOME -Djdk.test.lib.artifacts.ACVP-Server=/home/jatinbha/softwares/v1.1.0.38.zip -va -timeout:4 Launcher.java`
>> 
>> Can you please point out a test I need to use for validation
>
> I think the easiest is to put a for (int i = 0; i < 1000; i++) loop around  the switch statement in the run() method of  the ML_DSA_Test class (test/jdk/sun/security/provider/acvp/ML_DSA_Test.java). (This is because the intrinsics kick in after a few thousand calls of the method.)

Hi @ferakocz , Yes, we should modify the test or lower the compilation threshold with -Xbatch -XX:TieredCompileThreshold=0.1.

Alternatively, since the tests has a depedency on Automatic Cryptographic Validation Test server I have created a simplified test which cover all the security levels.

Kindly include [test/hotspot/jtreg/compiler/intrinsics/signature/TestModuleLatticeDSA.java
](https://github.com/ferakocz/jdk/pull/1)

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/23860#discussion_r1983009390


More information about the hotspot-dev mailing list