RFR: 8353686: Optimize Math.cbrt for x86 64 bit platforms [v3]
Mohamed Issa
duke at openjdk.org
Tue May 6 21:45:34 UTC 2025
> The goal of this PR is to implement an x86_64 intrinsic for java.lang.Math.cbrt() using libm.
>
> The results of all tests posted below were captured with an [Intel® Xeon 6761P](https://www.intel.com/content/www/us/en/products/sku/241842/intel-xeon-6761p-processor-336m-cache-2-50-ghz/specifications.html) using [OpenJDK v25-b21](https://github.com/openjdk/jdk/releases/tag/jdk-25%2B21) as the baseline version.
>
> For performance data collected with the built in **cbrt** micro-benchmark, see the table below. Each result is the mean of 8 individual runs. Overall, the intrinsic provides a performance uplift of 37%.
>
> | Benchmark | Throughput with baseline (op/s) | Throughput with intrinsic (op/s) | Speedup |
> | :----------------: | :----------------------------------: | :----------------------------------: | :---------: |
> | MathBench.cbrt | 152465 | 208537 | 1.37x |
>
> Finally, the `jtreg:test/jdk/java/lang/Math/CubeRootTests.java` test passed with the changes.
Mohamed Issa has updated the pull request incrementally with one additional commit since the last revision:
Add new set of cbrt micro-benchmarks
-------------
Changes:
- all: https://git.openjdk.org/jdk/pull/24470/files
- new: https://git.openjdk.org/jdk/pull/24470/files/3212c669..57412f0d
Webrevs:
- full: https://webrevs.openjdk.org/?repo=jdk&pr=24470&range=02
- incr: https://webrevs.openjdk.org/?repo=jdk&pr=24470&range=01-02
Stats: 148 lines in 1 file changed: 148 ins; 0 del; 0 mod
Patch: https://git.openjdk.org/jdk/pull/24470.diff
Fetch: git fetch https://git.openjdk.org/jdk.git pull/24470/head:pull/24470
PR: https://git.openjdk.org/jdk/pull/24470
More information about the graal-dev
mailing list