RFR: 8300208: Optimize Adler32 stub for AVX-512 targets. [v4]
Vladimir Kozlov
kvn at openjdk.org
Sat Jan 28 01:55:16 UTC 2023
On Fri, 27 Jan 2023 20:23:41 GMT, Jatin Bhateja <jbhateja at openjdk.org> wrote:
>> Patch optimizes Adler32 stub for AVX512 target.
>>
>> Main computation loop now uses zero extended lane widening load vector operation.
>>
>> New sequence also honors AVX3Thresholds so that implementation uses existing AVX2 instruction sequence on relevant targets
>> if input size is smaller than threshold limit (default 4096).
>>
>> Following are the result of an [existing JMH micro ](https://github.com/openjdk/jdk/blob/master/test/micro/org/openjdk/bench/java/util/TestAdler32.java)on various targets.
>>
>> **System Configurations : Turbo frequency scaling is disabled, all the data is collected at fixed frequency of 2.8 GHz.
>> SUT1 : Intel® Xeon® Platinum 8480+ Processor (Sapphire Rapids) 56C 2S
>> SUT2 : Intel(R) Xeon(R) Platinum 8380 CPU (Icelake Server) 40C 2S
>> SUT3 : Intel(R) Xeon(R) Platinum 8280 CPU (Cascadelake Server) 28C 2S**
>>
>>
>> 
>>
>> 
>>
>> 
>>
>>
>> Please review and share your feedback.
>>
>> Best Regards,
>> Jatin
>
> Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision:
>
> 8300208: Removing extra space.
This needs to be tested before integration since stub code was changes since v00.
-------------
Changes requested by kvn (Reviewer).
PR: https://git.openjdk.org/jdk/pull/12045
More information about the hotspot-compiler-dev
mailing list