RFR: 8288047: Accelerate Poly1305 on x86_64 using AVX512 instructions [v20]

Volodymyr Paprotski duke at openjdk.org
Wed Nov 16 23:19:19 UTC 2022


On Wed, 16 Nov 2022 23:08:16 GMT, Vladimir Ivanov <vlivanov at openjdk.org> wrote:

>> src/hotspot/cpu/x86/stubGenerator_x86_64_poly.cpp line 917:
>> 
>>> 915:   // Cleanup
>>> 916:   __ vpxorq(xmm0, xmm0, xmm0, Assembler::AVX_512bit);
>>> 917:   __ vpxorq(xmm1, xmm1, xmm1, Assembler::AVX_512bit);
>> 
>> You could use T0, T1 in place of xmm0, xmm1 here.
>
> Or simply switch to `vzeroall` for `xmm0` - `xmm15`.

ah.. I remember thinking about doing that.. `vzeroall` isnt encoded yet and I figured since I already have to do the xmm16-29, might as well do them all.. should I add that instruction too?

-------------

PR: https://git.openjdk.org/jdk/pull/10582


More information about the hotspot-compiler-dev mailing list