RFR: 8316180: Thread-local backoff for secondary_super_cache updates [v12]
Aleksey Shipilev
shade at openjdk.org
Thu Oct 19 08:03:21 UTC 2023
On Wed, 18 Oct 2023 08:34:04 GMT, Aleksey Shipilev <shade at openjdk.org> wrote:
>> See more details in the bug and related issues.
>>
>> This is the attempt to mitigate [JDK-8180450](https://bugs.openjdk.org/browse/JDK-8180450), while the more complex fix that would obviate the need for `secondary_super_cache` is being worked out. The goal for this fix is to improve performance in pathological cases, while keeping non-pathological cases out of extra risk, *and* staying simple enough and reliable for backports to currently supported JDK releases.
>>
>> This implements mitigation on most current architectures:
>> - ✅ x86_64: implemented
>> - 🔴 x86_32: considered, abandoned; cannot be easily done without blowing up code size
>> - ✅ AArch64: implemented
>> - 🔴 ARM32: considered, abandoned; needs cleanups and testing; see [JDK-8318414](https://bugs.openjdk.org/browse/JDK-8318414)
>> - ✅ PPC64: implemented, thanks @TheRealMDoerr
>> - ✅ S390: implemented, thanks @offamitkumar
>> - ✅ RISC-V: implemented, thanks @RealFYang
>> - ✅ Zero: does not need implementation
>>
>> Note that the code is supposed to be rather compact, because it is inlined in generated code. That is why, for example, we cannot easily do x86_32 version: we need a thread, so the easiest way would be to call into VM. But we cannot that easily: the code blowout would make some forward branches in external code non-short. I think we we cannot implement this mitigation on some architectures, so be it, it would be a sensible tradeoff for simplicity.
>>
>> Setting backoff at `0` effectively disables the mitigation, and gives us safety hatch if something goes wrong.
>>
>> I believe we can go in with `1000` as the default, given the experimental results mentioned in this PR.
>>
>> Additional testing:
>> - [x] Linux x86_64 fastdebug, `tier1 tier2 tier3`
>> - [x] Linux AArch64 fastdebug, `tier1 tier2 tier3`
>
> Aleksey Shipilev has updated the pull request incrementally with four additional commits since the last revision:
>
> - Editorial cleanups
> - RISC-V implementation
> - Mention ARM32 bug
> - Make sure benchmark runs with C1
Deeper performance evaluation shows that Dacapo:pmd has regressions, that get linearly worse as we get into larger backoffs. They reach 8% at backoff=10000. I am currently investigating the cause.
-------------
PR Comment: https://git.openjdk.org/jdk/pull/15718#issuecomment-1770271456
More information about the hotspot-dev
mailing list