RFR: 8249893: AARCH64: optimize the construction of the value from the bits of the other two

Andrew Haley aph at redhat.com
Sun Oct 11 15:04:03 UTC 2020


Hi,

On 09/10/2020 20:01, Boris Ulasevich wrote:
> 
> Many thanks for looking at this again!
> 
> Benchmark link: [1]. Measurements on Cortex A73, Cortex A53 and Neoverse N1
> shows [2] 6-15% performance improvement for bench1 and 18-29% for bench2.

For me on ThunderX 2,

Before:

Benchmark           Mode  Cnt   Score   Error  Units
ConstructFF.bench1  avgt   10  15.170 ± 0.975  ns/op
ConstructFF.bench4  avgt   10  39.391 ± 2.617  ns/op

After:

ConstructFF.bench1  avgt   10  12.349 ± 2.535  ns/op
ConstructFF.bench4  avgt   10  24.353 ± 0.443  ns/op

So for this carefully-constructed benchmark, it looks like there's
a useful gain.

-- 
Andrew Haley  (he/him)
Java Platform Lead Engineer
Red Hat UK Ltd. <https://www.redhat.com>
https://keybase.io/andrewhaley
EAC8 43EB D3EF DB98 CC77 2FAD A5CD 6035 332F A671



More information about the hotspot-compiler-dev mailing list