Integrated: 8323116: [REDO] Computational test more than 2x slower when AVX instructions are used

Srinivas Vamsi Parasa duke at openjdk.org
Mon Apr 8 18:44:22 UTC 2024


On Tue, 26 Mar 2024 23:19:16 GMT, Srinivas Vamsi Parasa <duke at openjdk.org> wrote:

> The goal of this small PR is to improve the performance of convert instructions and address the slowdown when AVX>0 is used.
> 
> The performance data using the ComputePI.java benchmark (part of this PR) is as follows:
> 
> 
> Benchmark (ns/op) | Stock JDK | This PR (AVX=3) | Speedup
> -- | -- | -- | --
> ComputePI.compute_pi_dbl_flt | 511.34 | 510.989 | 1.0
> ComputePI.compute_pi_flt_dbl | 2024.06 | 518.695 | 3.9
> ComputePI.compute_pi_int_dbl | 695.482 | 453.054 | 1.5
> ComputePI.compute_pi_int_flt | 799.268 | 449.83 | 1.8
> ComputePI.compute_pi_long_dbl | 802.992 | 454.891 | 1.8
> ComputePI.compute_pi_long_flt | 628.62 | 463.617 | 1.4
> 
> 
> 
> Benchmark (ns/op) | Stock JDK | This PR (AVX=0) | Speedup
> -- | -- | -- | --
> ComputePI.compute_pi_dbl_flt | 473.778 | 472.529 | 1.0
> ComputePI.compute_pi_flt_dbl | 536.004 | 538.418 | 1.0
> ComputePI.compute_pi_int_dbl | 458.08 | 460.245 | 1.0
> ComputePI.compute_pi_int_flt | 477.305 | 476.975 | 1.0
> ComputePI.compute_pi_long_dbl | 455.132 | 455.064 | 1.0
> ComputePI.compute_pi_long_flt | 474.734 | 476.571 | 1.0
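
For readers without the PR at hand, the benchmark names suggest kernels that convert between numeric types inside a hot loop (for example `int_dbl` for int to double, `flt_dbl` for float to double). The following is only a minimal sketch of such a kernel, assuming a Leibniz-series loop. It is not the ComputePI.java source from the PR, and the class and method names are hypothetical.

```java
// Hypothetical sketch; NOT the ComputePI.java benchmark from the PR.
final class PiConversionSketch {

    // Leibniz series: pi = 4 * (1 - 1/3 + 1/5 - ...).
    // The (double) cast of the int denominator forces an
    // int -> double conversion on every iteration.
    static double computePiIntToDouble(int terms) {
        double sum = 0.0;
        double sign = 1.0;
        for (int k = 0; k < terms; k++) {
            int denominator = 2 * k + 1;
            sum += sign / (double) denominator;
            sign = -sign;
        }
        return 4.0 * sum;
    }

    // Same series with a float denominator, so each iteration performs
    // a float -> double conversion instead.
    // Odd integers are exact in float up to 2^24, so keep terms
    // below roughly 8 million.
    static double computePiFloatToDouble(int terms) {
        double sum = 0.0;
        double sign = 1.0;
        float denominator = 1.0f;
        for (int k = 0; k < terms; k++) {
            sum += sign / (double) denominator;
            sign = -sign;
            denominator += 2.0f;
        }
        return 4.0 * sum;
    }

    public static void main(String[] args) {
        int terms = 1_000_000;
        System.out.println(computePiIntToDouble(terms));
        System.out.println(computePiFloatToDouble(terms));
    }
}
```

The loop body does little besides the conversion, a divide, and an add, so the cost of the convert instruction the JIT emits is likely to dominate. The Speedup column in the tables above is the stock JDK time divided by the PR time, e.g. 2024.06 / 518.695 ≈ 3.9 for `compute_pi_flt_dbl` with AVX=3.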

This pull request has now been integrated.

Changeset: 7e5ef79f
Author:    vamsi-parasa <srinivas.vamsi.parasa at intel.com>
Committer: Sandhya Viswanathan <sviswanathan at openjdk.org>
URL:       https://git.openjdk.org/jdk/commit/7e5ef79f953877cde6389998edcfe3fecb9b900e
Stats:     223 lines in 4 files changed: 217 ins; 0 del; 6 mod

8323116: [REDO] Computational test more than 2x slower when AVX instructions are used

Reviewed-by: sviswanathan, kvn

-------------

PR: https://git.openjdk.org/jdk/pull/18503


More information about the hotspot-compiler-dev mailing list