RFR: JDK-8216437 : PPC64: Add intrinsic for GHASH algorithm [v26]

Martin Doerr mdoerr at openjdk.org
Fri Feb 21 19:57:02 UTC 2025


On Fri, 21 Feb 2025 17:15:34 GMT, Suchismith Roy <sroy at openjdk.org> wrote:

>> Please understand my question correctly. I didn't propose to remove all `ver_perm`. The idea is to do the same job in the loop with one `vec_perm`.
>
> Hi @TheRealMDoerr  Maybe my answer was not clear. I am not proposing to remove them. I am unable to decipher how to  reduce the 3 instructions to one, as I feel the below 2 lines are required , as per the algorithm.
>  __ vec_perm(vTmp4, vHigh, vHigh, loadOrder);
>  __ vec_perm(vTmp5, vLow, vLow, loadOrder);

The purpose of the 3 `vec_perm` instructions is to extract 16 Bytes from two 16 Byte values loaded into vector registers. This can be done by 1 `vec_perm` instruction. But I think AIX should get fixed first before we figure out how to determine the vPerm value for that (probably lvsl + vxor before the loop).

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/20235#discussion_r1966095694


More information about the hotspot-dev mailing list