RFR: JDK-8216437 : PPC64: Add intrinsic for GHASH algorithm [v26]

Suchismith Roy sroy at openjdk.org
Fri Feb 21 17:17:58 UTC 2025


On Fri, 21 Feb 2025 16:21:49 GMT, Martin Doerr <mdoerr at openjdk.org> wrote:

>> As per the algorithm mentioned section 6.4 in Power ISA, we need 2 loads to access from nearest aligned address(to the unaligned address) and the next aligned address. 
>> 
>> Without the vec_perm, I faced the issue of wrong control vectors generated due to Endianness. Hence I had to include them.
>
> Please understand my question correctly. I didn't propose to remove all `ver_perm`. The idea is to do the same job in the loop with one `vec_perm`.

Hi @TheRealMDoerr  Maybe my answer was not clear. I am not proposing to remove them. I am unable to decipher how to  reduce the 3 instructions to one, as I feel the below 2 lines are required , as per the algorithm.
 __ vec_perm(vTmp4, vHigh, vHigh, loadOrder);
 __ vec_perm(vTmp5, vLow, vLow, loadOrder);

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/20235#discussion_r1965901600


More information about the hotspot-dev mailing list