RFR: 8288047: Accelerate Poly1305 on x86_64 using AVX512 instructions [v17]

Volodymyr Paprotski duke at openjdk.org
Wed Nov 16 21:34:26 UTC 2022


On Tue, 15 Nov 2022 19:44:16 GMT, Vladimir Ivanov <vlivanov at openjdk.org> wrote:

>> Volodymyr Paprotski has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 25 commits:
>> 
>>  - Vladimir's review comments
>>  - Merge remote-tracking branch 'origin/master' into avx512-poly
>>  - Merge remote-tracking branch 'origin/master' into avx512-poly
>>  - Vladimir's review
>>  - live review with Sandhya
>>  - jcheck
>>  - Sandhya's review
>>  - fix windows and 32b linux builds
>>  - add getLimbs to interface and reviews
>>  - fix 32-bit build
>>  - ... and 15 more: https://git.openjdk.org/jdk/compare/7357a1a3...8f5942d9
>
> src/hotspot/cpu/x86/stubGenerator_x86_64_poly.cpp line 1004:
> 
>> 1002:   __ jcc(Assembler::less, L_process16Loop);
>> 1003: 
>> 1004:   poly1305_process_blocks_avx512(input, length,
> 
> I'd like to see a comment here explaining what register effects are implicit.
> 
> `poly1305_process_blocks_avx512` has the following comment, but it doesn't mention xmm registers:
> 
>   // Register Map:
>   // reserved: rsp, rbp, rcx
>   // PARAMs: rdi, rbx, rsi, r8-r12
>   // poly1305_multiply_scalar clobbers: r13-r15, rax, rdx

Just redid the register allocation, comments, names, function parameters.. hope its better

-------------

PR: https://git.openjdk.org/jdk/pull/10582


More information about the hotspot-dev mailing list