RFR: 8323116: [REDO] Computational test more than 2x slower when AVX instructions are used [v5]
Sandhya Viswanathan
sviswanathan at openjdk.org
Fri Apr 5 18:21:10 UTC 2024
On Fri, 5 Apr 2024 17:46:17 GMT, Vladimir Kozlov <kvn at openjdk.org> wrote:
>> Srinivas Vamsi Parasa has updated the pull request incrementally with one additional commit since the last revision:
>>
>> fix failure for KNL
>
> My new testing passed.
> But I want to hear an answer to @merykitty suggestion about using xmm15.
@vnkozlov If I understand the proposal from @merykitty correctly, the suggestion is to reserve xmm15 as non allocatable throughout. This sounds like a big overhead for cases where every xmm register is usable say in a Vector API kernel. From Vamsi's microbenchmark runs, he has clearly shown that the gain of his optimization is way more than any overhead of doing pxor just before the converts.
-------------
PR Comment: https://git.openjdk.org/jdk/pull/18503#issuecomment-2040372881
More information about the hotspot-compiler-dev
mailing list