RFR: 8300208: Optimize Adler32 stub for AVX-512 targets. [v4]
Vladimir Kozlov
kvn at openjdk.org
Sun Jan 29 01:09:16 UTC 2023
On Fri, 27 Jan 2023 20:23:41 GMT, Jatin Bhateja <jbhateja at openjdk.org> wrote:
>> Patch optimizes Adler32 stub for AVX512 target.
>>
>> Main computation loop now uses zero extended lane widening load vector operation.
>>
>> New sequence also honors AVX3Thresholds so that implementation uses existing AVX2 instruction sequence on relevant targets
>> if input size is smaller than threshold limit (default 4096).
>>
>> Following are the result of an [existing JMH micro ](https://github.com/openjdk/jdk/blob/master/test/micro/org/openjdk/bench/java/util/TestAdler32.java)on various targets.
>>
>> **System Configurations : Turbo frequency scaling is disabled, all the data is collected at fixed frequency of 2.8 GHz.
>> SUT1 : Intel® Xeon® Platinum 8480+ Processor (Sapphire Rapids) 56C 2S
>> SUT2 : Intel(R) Xeon(R) Platinum 8380 CPU (Icelake Server) 40C 2S
>> SUT3 : Intel(R) Xeon(R) Platinum 8280 CPU (Cascadelake Server) 28C 2S**
>>
>>
>> ![image](https://user-images.githubusercontent.com/59989778/212934730-68717a61-191f-4dba-8c83-2eddf6007a47.png)
>>
>> ![image](https://user-images.githubusercontent.com/59989778/212934945-cada95ad-c93c-487f-bacc-928a2e3b5c21.png)
>>
>> ![image](https://user-images.githubusercontent.com/59989778/212935059-511aca3b-c736-40a2-bff6-89caf0664828.png)
>>
>>
>> Please review and share your feedback.
>>
>> Best Regards,
>> Jatin
>
> Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision:
>
> 8300208: Removing extra space.
My testing of latest version passed.
-------------
Marked as reviewed by kvn (Reviewer).
PR: https://git.openjdk.org/jdk/pull/12045
More information about the hotspot-compiler-dev
mailing list