RFR: 8323116: [REDO] Computational test more than 2x slower when AVX instructions are used [v5]
    Srinivas Vamsi Parasa 
    duke at openjdk.org
       
    Fri Apr  5 19:53:31 UTC 2024
    
    
  
On Fri, 5 Apr 2024 18:17:00 GMT, Sandhya Viswanathan <sviswanathan at openjdk.org> wrote:
>> My new testing passed. 
>> But I want to hear an answer to @merykitty's suggestion about using xmm15.
>
> @vnkozlov If I understand the proposal from @merykitty correctly, the suggestion is to reserve xmm15 as non-allocatable throughout. That sounds like a big overhead for cases where every xmm register is usable, say, in a Vector API kernel. Vamsi's microbenchmark runs clearly show that the gain from his optimization is far greater than any overhead of doing a pxor just before the converts.
> Okay. I will wait for the changes @sviswa7 suggested to use vlRegD and vlRegF.
Please see the updated commit which uses vlRegD and vlRegF.
-------------
PR Comment: https://git.openjdk.org/jdk/pull/18503#issuecomment-2040529692
    
    