RFR: 8376794: Enable copy and mismatch Partial Inlining for AMD AVX512 targets

Jatin Bhateja jbhateja at openjdk.org
Wed Feb 4 07:01:44 UTC 2026


Partial in-lining handles copy and mismatch for small array sizes less than -XX:ArrayOperationPartialInlineSize bytes through JIT code rather than calling optimized stubs thereby saving costly call overhead.

Enabling partial in-lining optimization for AMD EPYC servers supporting AVX-512 feature.

Following are the performance numbers on Turin at fixed frequency of 2.1GHz

<img width="440" height="440" alt="image" src="https://github.com/user-attachments/assets/14b55ee3-b65c-4247-8739-67f1b94dceb4" />
<img width="440" height="300" alt="image" src="https://github.com/user-attachments/assets/c00d6443-45a2-4277-961d-580ceea5da88" />

Kindly review and share your feedback.

Best Regards,
Jatin

-------------

Commit messages:
 - Extending micro-benchmark for short array mismatch
 - 8376794: Enable copy and mismatch Partial Inlining for AMD AVX512 targets

Changes: https://git.openjdk.org/jdk/pull/29519/files
  Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=29519&range=00
  Issue: https://bugs.openjdk.org/browse/JDK-8376794
  Stats: 75 lines in 2 files changed: 47 ins; 5 del; 23 mod
  Patch: https://git.openjdk.org/jdk/pull/29519.diff
  Fetch: git fetch https://git.openjdk.org/jdk.git pull/29519/head:pull/29519

PR: https://git.openjdk.org/jdk/pull/29519


More information about the hotspot-dev mailing list