RFR: 8341146: RISC-V: Unnecessary fences used for load-acquire in template interpreter

Feilong Jiang fjiang at openjdk.org
Sun Sep 29 10:57:06 UTC 2024


Hi, please consider.

RISC-V does not currently have plain load and store opcodes with aq or rl annotations, load-acquire and
store-release operations are implemented using fences instead. Initially, we followed the RISC-V spec
and placed FENCE RW,RW fence in front of load-acquire operation when porting the template interpreter.
The purpose is to enforce a store-release-to-load-acquire ordering (where there must be a FENCE RW,RW
between the store-release and load-acquire). But it turns out these fences are unnecessary for our use
cases in the template interpreter. In fact, we only need to do a single FENCE R,RW after a normal memory
load in order to implement a load-acquire operation. We should remove those unnecessary fences for both
performance reasons and for consistency with the rest of the port (i.e., C1 and C2 JIT).

Testing:
- [x] JCstress
- [x] hs-tier1 - hs-tier4
- [x] ~5% improvement on SPECJbb2005 score (-Xint -XX:+UseParallelGC)

-------------

Commit messages:
 - remove unnecessary membar in template interpreter

Changes: https://git.openjdk.org/jdk/pull/21248/files
  Webrev: https://webrevs.openjdk.org/?repo=jdk&pr=21248&range=00
  Issue: https://bugs.openjdk.org/browse/JDK-8341146
  Stats: 9 lines in 1 file changed: 0 ins; 9 del; 0 mod
  Patch: https://git.openjdk.org/jdk/pull/21248.diff
  Fetch: git fetch https://git.openjdk.org/jdk.git pull/21248/head:pull/21248

PR: https://git.openjdk.org/jdk/pull/21248


More information about the hotspot-compiler-dev mailing list