RFR: JDK-8216437 : PPC64: Add intrinsic for GHASH algorithm [v26]

Martin Doerr mdoerr at openjdk.org
Fri Feb 21 16:24:57 UTC 2025


On Fri, 21 Feb 2025 16:08:09 GMT, Suchismith Roy <sroy at openjdk.org> wrote:

>> src/hotspot/cpu/ppc/stubGenerator_ppc.cpp line 703:
>> 
>>> 701:       __ vec_perm(vTmp4, vHigh, vHigh, loadOrder);
>>> 702:       __ vec_perm(vTmp5, vLow, vLow, loadOrder);
>>> 703:       __ vec_perm(vH, vTmp5, vTmp4, vPerm);
>> 
>> Can we compute a different vPerm such that we only need one `vec_perm` instruction in the loop?
>
> As per the algorithm described in section 6.4 of the Power ISA, we need 2 loads: one from the nearest aligned address below the unaligned address and one from the next aligned address.
> 
> Without the vec_perm, the generated control vectors were wrong because of endianness, so I had to include them.

Please understand my question correctly. I didn't propose to remove all `vec_perm` instructions.
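
To illustrate the kind of thing the question is about, here is a rough sketch in plain C++, not HotSpot code. It assumes a simplified scalar model of the permute instruction (`vperm_model`, a hypothetical helper that picks each result byte from the 32-byte concatenation of its two inputs) and ignores how the real `vec_perm` macro numbers bytes on little-endian. Under that assumption, the `loadOrder` permutation can be folded into the control vector once, outside the loop, so that a single permute of `vLow` and `vHigh` gives the same result as the three permutes quoted above. All function names below are made up for illustration.

```cpp
#include <array>
#include <cstdint>

using Vec = std::array<uint8_t, 16>;

// Hypothetical scalar model of a byte permute: byte i of the result is
// byte (ctrl[i] & 0x1f) of the 32-byte concatenation a || b.
Vec vperm_model(const Vec& a, const Vec& b, const Vec& ctrl) {
  Vec r{};
  for (int i = 0; i < 16; ++i) {
    uint8_t k = ctrl[i] & 0x1f;
    r[i] = (k < 16) ? a[k] : b[k - 16];
  }
  return r;
}

// Same shape as the three permutes in the quoted code:
// reorder high, reorder low, then combine both with perm.
Vec three_perms(const Vec& low, const Vec& high,
                const Vec& loadOrder, const Vec& perm) {
  Vec t4 = vperm_model(high, high, loadOrder);
  Vec t5 = vperm_model(low, low, loadOrder);
  return vperm_model(t5, t4, perm);
}

// Fold loadOrder into perm. Bit 0x10 of each selector still chooses
// between the first and second source; the low 4 bits are redirected
// through loadOrder.
Vec fold_control(const Vec& loadOrder, const Vec& perm) {
  Vec c{};
  for (int i = 0; i < 16; ++i) {
    uint8_t k = perm[i] & 0x1f;
    c[i] = static_cast<uint8_t>((k & 0x10) | (loadOrder[k & 0x0f] & 0x0f));
  }
  return c;
}

// One permute per iteration, using a control vector computed up front.
Vec one_perm(const Vec& low, const Vec& high, const Vec& folded) {
  return vperm_model(low, high, folded);
}
```

Whether this carries over to the stub depends on how `vPerm` and `loadOrder` are actually produced there. If both stay constant across loop iterations, the folded control vector would only need to be computed once before the loop.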

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/20235#discussion_r1965808162


More information about the hotspot-dev mailing list