RFR(M): 8181616: FMA Vectorization on x86
Deshpande, Vivek R
vivek.r.deshpande at intel.com
Tue Jun 6 18:28:17 UTC 2017
Hi Vladimir
Thanks for taking a look at the patch.
FMA was introduced with the Haswell microarchitecture, along with AVX2. The recommendation is to also check for AVX support before using FMA instructions.
Regards,
Vivek
-----Original Message-----
From: Vladimir Kozlov [mailto:vladimir.kozlov at oracle.com]
Sent: Monday, June 5, 2017 4:53 PM
To: Deshpande, Vivek R; hotspot-compiler-dev at openjdk.java.net
Cc: Viswanathan, Sandhya
Subject: Re: RFR(M): 8181616: FMA Vectorization on x86
Thank you, Vivek
Why do you restrict FMA to AVX?
- if (supports_fma() && UseSSE >= 2) {
+ if (supports_fma() && UseAVX > 0) {
Otherwise changes look good.
Thanks,
Vladimir
On 6/5/17 2:42 PM, Deshpande, Vivek R wrote:
> Hi All
>
>
>
> This enhancement generates vector FMA instructions on x86 using SuperWord analysis.
>
> We observed up to a 6x gain on Skylake and KNL with 512-bit vector FMA instructions, using the FMA API on independent arrays.
>
> Webrev:
>
> http://cr.openjdk.java.net/~vdeshpande/8181616/webrev.00/
>
> I have also updated the JBS entry.
>
> https://bugs.openjdk.java.net/browse/JDK-8181616
>
> Would you please review and sponsor it?
>
>
>
> Regards,
>
> Vivek
>
>
>
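
For illustration, the kind of loop the quoted message describes (the FMA API applied element-wise over independent arrays) might look like the sketch below. The class and method names are hypothetical and are not taken from the webrev. Math.fma is the standard Java 9+ API. Whether C2 actually emits vector FMA instructions for this loop depends on the CPU and on the JVM flags in effect.

```java
// Hypothetical example, not from the patch: an element-wise FMA loop
// over independent arrays, the shape SuperWord analysis looks for.
class FmaLoop {
    // Computes out[i] = a[i] * b[i] + c[i] with a single rounding per element.
    static float[] fmaArrays(float[] a, float[] b, float[] c) {
        float[] out = new float[a.length];
        for (int i = 0; i < a.length; i++) {
            // Math.fma(x, y, z) returns x * y + z as a fused multiply-add.
            out[i] = Math.fma(a[i], b[i], c[i]);
        }
        return out;
    }

    public static void main(String[] args) {
        float[] r = fmaArrays(new float[] {1f, 2f},
                              new float[] {3f, 4f},
                              new float[] {5f, 6f});
        System.out.println(java.util.Arrays.toString(r));
    }
}
```

Each iteration reads and writes only index i, so there are no loop-carried dependences, which is what allows adjacent iterations to be packed into one vector operation.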