[PATCH] 8217561 : X86: Add floating-point Math.min/max intrinsics, approval request

Vladimir Kozlov vladimir.kozlov at oracle.com
Wed Feb 20 17:53:11 UTC 2019


On 2/20/19 9:39 AM, Andrew Haley wrote:
> On 2/19/19 3:16 PM, B. Blaser wrote:
> 
>> So, I did some experiments and it seems that the most important
>> branch is the last one.
>> If data is well balanced like in the initial benchmark I posted,
>> branch prediction is poor and the intrinsic is 2x faster, in all
>> other cases it'd be better to avoid it:
> 
> I tried this an it seems to work.
> 
> This idea looks extremely promising. It's not perfect in that there
> are still datasets that confuse it, but it's surely an improvement
> over what we have today.
> 
> Vladimir, what say you? This is a useful heuristic which avoids
> performance regressions caused by min/max in almost all cases. I say
> we want it.

Yes, this would be nice.

> 
> NB: this heuristic is probably good for CMOV instructions too.

Agree, we had a lot of comments back when we implemented CMOV. There was big discussion what is better - short jump or 
cmove. It all depends on bench prediction as in this case.

Regards,
Vladimir


More information about the hotspot-compiler-dev mailing list