RFR: 8354560: Exponentially delay subsequent native thread creation in case of EAGAIN [v3]

Yannik Stradmann duke at openjdk.org
Thu May 8 21:42:11 UTC 2025


On Tue, 6 May 2025 22:52:59 GMT, Yannik Stradmann <duke at openjdk.org> wrote:

>> This change introduces an exponential backoff when hitting `EAGAIN` during native thread creation in hotspot.
>> 
>> In contrast to the current solution, where we retry to create a native thread up to three times in a tight loop, hotspot will will thereby be more kind to an already depleted resource, reduce stress on the kernel and become more robust on systems under high load.
>> 
>> The proposed modifications to `os_linux.cpp` have substantially improved system stability in a mid-sized Jenkins cluster and have been in production within our systems over the past three years. I have verbatim ported these to the other platforms, which previously also relied on identical logic.
>
> Yannik Stradmann has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision:
> 
>  - Merge remote-tracking branch 'upstream/master' into robust_pthread
>  - Fix build on Windows: Sleep() only accepts milliseconds
>  - Exponentially delay native thread creation retries

> Sorry for the delay but I have been on vacation (and noone else picked this up).

No worries, thanks a lot for getting back to me!

-------------

PR Review: https://git.openjdk.org/jdk/pull/24682#pullrequestreview-2826419146


More information about the hotspot-runtime-dev mailing list