RFR: 8354560: Exponentially delay subsequent native thread creation in case of EAGAIN [v3]

Yannik Stradmann duke at openjdk.org
Tue May 6 22:52:59 UTC 2025


> This change introduces an exponential backoff when hitting `EAGAIN` during native thread creation in hotspot.
> 
> In contrast to the current solution, where we retry to create a native thread up to three times in a tight loop, hotspot will will thereby be more kind to an already depleted resource, reduce stress on the kernel and become more robust on systems under high load.
> 
> The proposed modifications to `os_linux.cpp` have substantially improved system stability in a mid-sized Jenkins cluster and have been in production within our systems over the past three years. I have verbatim ported these to the other platforms, which previously also relied on identical logic.

Yannik Stradmann has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision:

 - Merge remote-tracking branch 'upstream/master' into robust_pthread
 - Fix build on Windows: Sleep() only accepts milliseconds
 - Exponentially delay native thread creation retries

-------------

Changes:
  - all: https://git.openjdk.org/jdk/pull/24682/files
  - new: https://git.openjdk.org/jdk/pull/24682/files/d4841a00..14b2b82f

Webrevs:
 - full: https://webrevs.openjdk.org/?repo=jdk&pr=24682&range=02
 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=24682&range=01-02

  Stats: 87438 lines in 2152 files changed: 66142 ins; 13032 del; 8264 mod
  Patch: https://git.openjdk.org/jdk/pull/24682.diff
  Fetch: git fetch https://git.openjdk.org/jdk.git pull/24682/head:pull/24682

PR: https://git.openjdk.org/jdk/pull/24682


More information about the hotspot-runtime-dev mailing list