RFR: 8365147: AArch64: Replace DMB + LD + DMB with LDAR for C1 volatile field loads [v3]
Andrew Haley
aph at openjdk.org
Wed Nov 12 11:30:14 UTC 2025
On Wed, 12 Nov 2025 11:09:45 GMT, Samuel Chee <duke at openjdk.org> wrote:
>> Replaces the DMB ISH + LD + DMB ISHLD sequence with LDAR for volatile field loads - for example, AtomicLong::get.
>>
>> This is valid, as originally the DMBs were necessary due to the case described here - https://bugs.openjdk.org/browse/JDK-8179954. As in the rare case where the LD can be reordered with an LDAR or STLR from the C2 implementation for stores and loads, these DMBs are required.
>> However, acquire/release operations use a sequentially consistent model which does not allow reordering between them. Hence, the LD can be replaced with an LDAR to disallow reordering with a STLR/LDAR and the first DMB can be removed.
>>
>> The LDAR has acquire semantics, so it's impossible for memory accesses after to be reordered before; the DMB ISHLD is not required. Therefore, a singular LDAR is sufficient.
>
> Samuel Chee has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains five commits:
>
> - Address review comments. Refine.
>
> Change-Id: I9cc0308300548c1892d39791e00b41ef13c95e63
> - Merge from the main branch
> - Address review comments
>
> Change-Id: Ica13be8094ac0f057066042ef0a5ec5927b98dfd
> - Refine code generation for mem2reg_volatile
>
> The patch is contributed by @theRealAph.
>
> Change-Id: I7ab1854dd238cdce72a4ab218b5b4ee84ad39586
> - 8365147: AArch64: Replace DMB + LD + DMB with LDAR for C1 volatile loads
>
> Replaces the DMB ISH + LD + DMB ISHLD sequence with LDAR
> for volatile field loads - for example, AtomicLong::get.
>
> This is valid, as originally the DMBs were necessary due to
> the case described here - https://bugs.openjdk.org/browse/JDK-8179954.
> As in the rare case where the LD can be reordered with an LDAR
> or STLR from the C2 implementation for stores and loads, these
> DMBs are required.
> However, acquire/release operations use a sequentially consistent model
> which does not allow reordering between them. Hence, the LD can be
> replaced with an LDAR to disallow reordering with a STLR/LDAR
> and the first DMB can be removed.
>
> The LDAR has acquire semantics, so it's impossible for
> memory accesses after to be reordered before; the DMB ISHLD is
> not required. Therefore, a singular LDAR is sufficient.
>
> This excludes floats and doubles, as they do not have
> equivalent load-acquire instructions.
>
> Change-Id: Ia93607f8bb20c2d974fe6b2e586dd3239bb2728c
src/hotspot/cpu/aarch64/c1_LIRAssembler_aarch64.cpp line 948:
> 946: }
> 947:
> 948: void LIR_Assembler::load_generic(LIR_Address *from_addr, LIR_Opr dest,
Suggestion:
void LIR_Assembler::load_relaxed(LIR_Address *from_addr, LIR_Opr dest,
Standard terminology.
-------------
PR Review Comment: https://git.openjdk.org/jdk/pull/26748#discussion_r2517938298
More information about the hotspot-dev
mailing list