RFR: 8296411: AArch64: Accelerated Poly1305 intrinsics [v2]
Andrew Haley
aph at openjdk.org
Wed May 24 10:21:56 UTC 2023
On Wed, 24 May 2023 10:07:47 GMT, Claes Redestad <redestad at openjdk.org> wrote:
>> Andrew Haley has updated the pull request incrementally with one additional commit since the last revision:
>>
>> Whitespace
>
> src/hotspot/cpu/aarch64/stubGenerator_aarch64.cpp line 7097:
>
>> 7095: // together partial products without any risk of needing to
>> 7096: // propagate a carry out.
>> 7097: wide_mul(U_0, U_0HI, S_0, R_0); wide_madd(U_0, U_0HI, S_1, RR_1); wide_madd(U_0, U_0HI, S_2, RR_0);
>
> What is `r` corresponding to here? This asserts that 'the top four bits of each 32-bit subword of "r" are zero'. If `r` is `R_0...R_2` it would seem broken since we're packing 26-bit values into `R_0...R_2` above in a way that would break this invariant?

No, it doesn't break the invariants.
R is the randomly-chosen 128-bit key. It is generated from an initial 128-bit-long string of random bits, then
`r &= 0x0ffffffc0ffffffc0ffffffc0fffffff`
This 128-bit-long string is split into 26-bit limbs before the intrinsic is called. The zero bits remain zero.
When we repack R into two 64-bit registers those zero bits are still zero.
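
To make that concrete, here is a minimal standalone sketch, not the code from the PR. The names `Key128`, `clamp`, `split_26`, `pack_26`, `top_nibbles_clear` and `check` are made up for illustration, and the limb layout (five 26-bit limbs, least significant first) is an assumption based on the description above. It clamps a key, splits it into 26-bit limbs, repacks the limbs into two 64-bit words, and checks that the top four bits of each 32-bit subword are still zero.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical helper type: lo holds bits 0..63 of r, hi holds bits 64..127.
struct Key128 { uint64_t lo, hi; };

constexpr uint64_t MASK26 = (UINT64_C(1) << 26) - 1;

// r &= 0x0ffffffc0ffffffc0ffffffc0fffffff, applied to each 64-bit half.
Key128 clamp(Key128 r) {
  return { r.lo & UINT64_C(0x0ffffffc0fffffff),
           r.hi & UINT64_C(0x0ffffffc0ffffffc) };
}

// Split the 128-bit value into five 26-bit limbs, least significant first.
std::array<uint64_t, 5> split_26(Key128 r) {
  return { r.lo & MASK26,                           // bits   0..25
           (r.lo >> 26) & MASK26,                   // bits  26..51
           ((r.lo >> 52) | (r.hi << 12)) & MASK26,  // bits  52..77
           (r.hi >> 14) & MASK26,                   // bits  78..103
           r.hi >> 40 };                            // bits 104..127
}

// Repack the limbs into two 64-bit words. Limb 2 straddles the boundary:
// its low 12 bits land in lo, its high 14 bits in hi.
Key128 pack_26(const std::array<uint64_t, 5>& l) {
  return { l[0] | (l[1] << 26) | (l[2] << 52),
           (l[2] >> 12) | (l[3] << 14) | (l[4] << 40) };
}

// True if the top four bits of each 32-bit subword of x are zero.
bool top_nibbles_clear(uint64_t x) {
  return (x & UINT64_C(0xf0000000f0000000)) == 0;
}

// Clamp, split, repack, and verify that nothing changed on the way.
void check(Key128 raw) {
  Key128 r = clamp(raw);
  Key128 p = pack_26(split_26(r));
  assert(p.lo == r.lo && p.hi == r.hi);  // repacking is lossless
  assert(top_nibbles_clear(p.lo) && top_nibbles_clear(p.hi));
}
```

Calling `check` with any raw key, for example all ones in both halves, should pass: splitting and repacking only moves bits around, so the bits cleared by the mask stay cleared.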
-------------
PR Review Comment: https://git.openjdk.org/jdk/pull/14085#discussion_r1203850178
More information about the hotspot-dev mailing list