RFR: 8320206: Some intrinsics/stubs missing vzeroupper on x86_64

Vladimir Kozlov kvn at openjdk.org
Fri Nov 17 03:00:29 UTC 2023


On Wed, 15 Nov 2023 21:28:46 GMT, Sandhya Viswanathan <sviswanathan at openjdk.org> wrote:

> The following intrinsics/stubs are missing vzeroupper:
> adler32 (since JDK17)
> count_positives (since JDK 9)
> chacha20 (since JDK 20)
> string indexOfChar (since JDK 9)
> 
> Adding the missing vzeroupper to avoid AVX-SSE transition penalties.

I understand that originally it was good decision to generate vzeroupper in all places where we use avx512 wide instructions (C2 generated SSE instructions at that time). May be it is time to check if C2 still generate SSE now.  If not we should fix it so that we can remove  vzeroupper in intrinsic stubs used by C2 only.

-------------

PR Comment: https://git.openjdk.org/jdk/pull/16678#issuecomment-1815671083


More information about the hotspot-compiler-dev mailing list