RFR(S): 8178811: Minimize the AVX <-> SSE transition penalty on x86

Deshpande, Vivek R vivek.r.deshpande at intel.com
Fri Apr 14 18:17:47 UTC 2017


Hi All

This fix minimizes the AVX to SSE and SSE to AVX transition penalty through generation of vzeroupper instruction. With this patch we see zero transitions with penalty per SPECjbb2015 jOPS on BDW and a significant reduction on SKX CPU event vector width mismatch from 65 to 0.01 per SPECjbb2015 jOPS. We have also implemented an enhancement to disable vzeroupper generation for Knights family where the instruction has high penalty and is not recommended. The option UseVzeroupper is used to control generation of vzeroupper instruction and gets set to false on the Knights family.
We observed ~3% gain on SPECJvm2008 composite result on Skylake.
Webrev:
http://cr.openjdk.java.net/~vdeshpande/8178811/webrev.00/
I have also updated the JBS entry.
https://bugs.openjdk.java.net/browse/JDK-8178811
Would you please review and sponsor it.

Regards,
Vivek

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20170414/0dee87ec/attachment.html>


More information about the hotspot-compiler-dev mailing list