RFR: 8351949: RISC-V: Cleanup and enable store-load peephole for membars [v8]

Robbin Ehn rehn at openjdk.org
Thu Mar 27 11:22:48 UTC 2025


> Hi please consider.
> 
> |RVWMO| Patched|
> | ---------- | ---------- |
> |fence iorw,iorw| fence iorw,ow|
> |sw  t4,120(t2) | sw  t4,120(t2) |
> |fence ow,ir | unnecessary_membar_volatile_rvwmo |
> | sw  t6,128(t2) // Non-volatile | sw  t6,128(t2) // Non-volatile |
> |fence iorw,ow | fence iorw,ow|
> |sw  t5,124(t2) |sw  t5,124(t2) |
> 
> |TSO                                | Patched|
> | ---------- | ---------- |
> | lw	a4,120(t2) | lw	a6,120(t2) |
> | sw	a0,124(t2) | sw	t6,124(t2) |
> | fence	iorw,iorw | unnecessary_membar_volatile_tso |
> | sw	t4,120(t2) | sw	t4,120(t2) |
> | fence	ow,ir | unnecessary_membar_volatile_tso |
> | sw	t6,128(t2) | sw	t5,128(t2) |
> | sw	t5,124(t2) // Non-volatile| sw	a1,124(t2)  // Non-volatile |
> | fence	iorw,iorw  | unnecessary_membar_volatile_tso |
> |...  | ... |
> | sw	a3,120(t2) | sw	a0,120(t2) |
> | fence	ow,ir | fence	ow,ir |
> | lw	a7,124(t2) | lw	a5,124(t2) |
> 
> For the specific rvwmo volatile store + store + volatile store is around 30% faster on VF2.
> 
> The patch do:
> - Separate ztso and rvwmo in ad by using UseZtso predicate.
> - Match all that requires the same membar.
> - Make fence/fencei protected as they shouldn't be using directly.
> - Increased cost of membars to VOLATILE_REF_COST.
> - Added a real_empty pipe.
> - Change to pipe_slow on TSO (as x86).
> 
> Note that C2-rv64 is now superior to gcc/clang regrading fencing:
> https://godbolt.org/z/6E3YTP15j
> 
> Testing jcstress, tier1 and manually reading the generated assembly.
> Doing additional testing, but RFR it now as it may need some consideration.
> 
> /Robbin

Robbin Ehn has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains 11 additional commits since the last revision:

 - Merge branch 'master' into tso-merge
 - Merge branch 'master' into tso-merge
 - format comment
 - Merge branch 'master' into tso-merge
 - Review comments
 - Merge branch 'master' into tso-merge
 - Review comments
 - Fixed ws
 - Revert NC
 - Fixed comment
 - ... and 1 more: https://git.openjdk.org/jdk/compare/15ffecab...c2688a6a

-------------

Changes:
  - all: https://git.openjdk.org/jdk/pull/24035/files
  - new: https://git.openjdk.org/jdk/pull/24035/files/5eac8470..c2688a6a

Webrevs:
 - full: https://webrevs.openjdk.org/?repo=jdk&pr=24035&range=07
 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24035&range=06-07

  Stats: 32764 lines in 118 files changed: 1605 ins; 30778 del; 381 mod
  Patch: https://git.openjdk.org/jdk/pull/24035.diff
  Fetch: git fetch https://git.openjdk.org/jdk.git pull/24035/head:pull/24035

PR: https://git.openjdk.org/jdk/pull/24035


More information about the hotspot-dev mailing list