RFR: 8325991: Accelerate Poly1305 on x86_64 using AVX2 instructions [v11]
Srinivas Vamsi Parasa
duke at openjdk.org
Fri Mar 8 17:19:58 UTC 2024
On Mon, 4 Mar 2024 21:40:04 GMT, Srinivas Vamsi Parasa <duke at openjdk.org> wrote:
>> The goal of this PR is to accelerate the Poly1305 algorithm using AVX2 instructions (including IFMA) for x86_64 CPUs.
>>
>> This implementation is directly based on the AVX2 Poly1305 hash computation as implemented in Intel(R) Multi-Buffer Crypto for IPsec Library (url: https://github.com/intel/intel-ipsec-mb/blob/main/lib/avx2_t3/poly_fma_avx2.asm)
>>
>> This PR shows up to a 19x speedup on buffer sizes of 1 MB.
>
> Srinivas Vamsi Parasa has updated the pull request incrementally with one additional commit since the last revision:
>
> update asserts for vpmadd52l/hq
Planning to integrate this PR by Monday. Could you please let me know if there are any objections?
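
For anyone who wants to exercise the Poly1305 code path from Java before integration, the following is a minimal sketch rather than the benchmark behind the numbers quoted above. It assumes that Poly1305 is reached through the standard "ChaCha20-Poly1305" Cipher transformation; the class name Poly1305Exercise and the helper methods seal and open are hypothetical names used only for illustration. The timing line is a rough wall-clock reading without warm-up, so it is not a substitute for a proper JMH run.

```java
import javax.crypto.Cipher;
import javax.crypto.spec.IvParameterSpec;
import javax.crypto.spec.SecretKeySpec;
import java.security.SecureRandom;
import java.util.Arrays;

// Hypothetical helper class: round-trips a buffer through ChaCha20-Poly1305,
// which computes a Poly1305 tag over the ciphertext.
public class Poly1305Exercise {
    static final int TAG_BYTES = 16; // Poly1305 tag length

    // Encrypts plaintext and appends the 16-byte Poly1305 tag.
    static byte[] seal(byte[] key, byte[] nonce, byte[] plaintext) throws Exception {
        Cipher c = Cipher.getInstance("ChaCha20-Poly1305");
        c.init(Cipher.ENCRYPT_MODE, new SecretKeySpec(key, "ChaCha20"),
               new IvParameterSpec(nonce));
        return c.doFinal(plaintext);
    }

    // Verifies the tag and decrypts; throws AEADBadTagException on mismatch.
    static byte[] open(byte[] key, byte[] nonce, byte[] sealed) throws Exception {
        Cipher c = Cipher.getInstance("ChaCha20-Poly1305");
        c.init(Cipher.DECRYPT_MODE, new SecretKeySpec(key, "ChaCha20"),
               new IvParameterSpec(nonce));
        return c.doFinal(sealed);
    }

    public static void main(String[] args) throws Exception {
        SecureRandom rnd = new SecureRandom();
        byte[] key = new byte[32];       // 256-bit ChaCha20 key
        byte[] nonce = new byte[12];     // 96-bit nonce; never reuse with the same key
        byte[] msg = new byte[1 << 20];  // 1 MB buffer, the size quoted in the PR
        rnd.nextBytes(key);
        rnd.nextBytes(nonce);
        rnd.nextBytes(msg);

        long start = System.nanoTime();
        byte[] sealed = seal(key, nonce, msg);
        long elapsedMicros = (System.nanoTime() - start) / 1_000;

        byte[] opened = open(key, nonce, sealed);
        System.out.println("sealed length: " + sealed.length);
        System.out.println("round trip ok: " + Arrays.equals(msg, opened));
        System.out.println("seal time (us, no warm-up): " + elapsedMicros);
    }
}
```

Whether the AVX2 stub is actually selected depends on the CPU and on the JVM build, so a run of this sketch only confirms functional correctness of the round trip on a given machine.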
-------------
PR Comment: https://git.openjdk.org/jdk/pull/17881#issuecomment-1986094404
More information about the hotspot-compiler-dev mailing list