RFR: JDK-8216437 : PPC64: Add intrinsic for GHASH algorithm [v19]
Suchismith Roy
sroy at openjdk.org
Fri Feb 7 16:58:15 UTC 2025
On Wed, 5 Feb 2025 14:42:17 GMT, Martin Doerr <mdoerr at openjdk.org> wrote:
>> Suchismith Roy has updated the pull request incrementally with one additional commit since the last revision:
>>
>> adapt Condition registers
>
> src/hotspot/cpu/ppc/stubGenerator_ppc.cpp line 655:
>
>> 653: // https://web.archive.org/web/20110609115824/https://software.intel.com/file/24918
>> 654: //
>> 655: Label loop;
>
> Please try if aligning the loop entry improves performance. I'd insert `__ align(32);` here.
This is not improving performance @TheRealMDoerr
> src/hotspot/cpu/ppc/stubGenerator_ppc.cpp line 658:
>
>> 656: __ bind(loop);
>> 657: __ vspltisb(vZero, 0);
>> 658: __ li(temp1, 0);
>
> I don't think these instructions should be inside of the loop.
vspltisb(vZero,0) is needed.
__ vsldoi(vTmp8, vTmp5, vZero, 8); // mL : Extract the lower 64 bits of M
__ vsldoi(vTmp9, vZero, vTmp5, 8); // mH : Extract the higher 64 bits of M
We need to extract appropriate bits and for that vZero needs to be initialised to 0 always.
-------------
PR Review Comment: https://git.openjdk.org/jdk/pull/20235#discussion_r1946858131
PR Review Comment: https://git.openjdk.org/jdk/pull/20235#discussion_r1946859665
More information about the hotspot-dev
mailing list