RFR: JDK-8216437 : PPC64: Add intrinsic for GHASH algorithm [v28]
Suchismith Roy
sroy at openjdk.org
Sun Mar 2 17:10:04 UTC 2025
On Fri, 28 Feb 2025 16:30:16 GMT, Martin Doerr <mdoerr at openjdk.org> wrote:
>> Suchismith Roy has updated the pull request incrementally with one additional commit since the last revision:
>>
>> use vsplitsb
>
> src/hotspot/cpu/ppc/stubGenerator_ppc.cpp line 574:
>
>> 572: masm->vsldoi(vLowProduct, vLowProduct, vLowProduct, 8); // Swap
>> 573: masm->vxor(vLowProduct, vLowProduct, vReducedLow); // Reduction using constant
>> 574: masm->vsldoi(vCombinedResult, vLowProduct, vLowProduct, 8); // Swap
>
> The part between the vpsumd instructions looks too complicated. Isn't it equivalent to the following?
>
> masm->vsldoi(vTmp8, vLowProduct, vHighProduct, 8);
> masm->vsldoi(vTmp9, vReducedLow, vReducedLow, 8);
> masm->vxor(vTmp8, vTmp8, vMidProduct);
> masm->vxor(vCombinedResult, vTmp8, vTmp9);
@TheRealMDoerr can you explain how it can be equivalent to these 4 instructions ?
we are extracting the different parts of midProduct here ,64 bits each, for the cross product.
I,e Xl * Hh +Hl*Xh , so the below 2 are required
masm->vsldoi(vTmp8, vMidProduct, vZero, 8);
masm->vsldoi(vTmp9, vZero, vMidProduct, 8);
-------------
PR Review Comment: https://git.openjdk.org/jdk/pull/20235#discussion_r1976673259
More information about the hotspot-dev
mailing list