RFR: 8308094: Add a compilation timeout flag to catch long running compilations [v4]

Dean Long dlong at openjdk.org
Thu Aug 7 18:59:36 UTC 2025


On Thu, 7 Aug 2025 14:52:48 GMT, Manuel Hässig <mhaessig at openjdk.org> wrote:

>> This PR adds `-XX:CompileTaskTimeout` on Linux to limit the amount of time a compilation task can run. The goal of this is initially to be able to find and investigate long-running compilations.
>> 
>> The timeout is implemented using a POSIX timer that sends a `SIGALRM` to the compiler thread the compile task is running on. Each compiler thread registers a signal handler that triggers an assert upon receiving `SIGALRM`. This is currently only implemented for Linux, because it relies on `SIGEV_THREAD_ID` to get the signal delivered to the same thread that timed out.
>> 
>> Since `SIGALRM` is now used, the test `runtime/signal/TestSigalrm.java` now requires `vm.flagless` so it will not interfere with the compiler thread signal handlers.
>> 
>> Testing:
>>  - [ ] Github Actions
>>  - [ ] tier1, tier2 on all platforms
>>  - [ ] tier3, tier4 and Oracle internal testing on Linux fastdebug
>>  - [ ] tier1 through tier4 with `-XX:CompileTaskTimeout=60000` (one minute timeout) to see what fails (`compiler/codegen/TestAntiDependenciesHighMemUsage2.java`, `compiler/loopopts/TestMaxLoopOptsCountReached.java`, and `compiler/c2/TestScalarReplacementMaxLiveNodes.java` fail)
>
> Manuel Hässig has updated the pull request incrementally with one additional commit since the last revision:
> 
>   ASSERT

Thinking about _timeout_armed a little more, the fact that the signal handler received TIMEOUT_SIGNAL should be enough.  The value of _timeout_armed should be redundant, and your assert could be changed to:

assert(false, "compile task timed out");

and _timeout_armed could be removed. It's just an inexact mirror of the timer state.
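
To illustrate why the flag adds nothing, here is a minimal standalone sketch of the mechanism the PR description outlines: a POSIX timer created with `SIGEV_THREAD_ID` delivers `SIGALRM` to the thread that armed it. This is not the HotSpot code. `run_with_timeout`, `timeout_handler`, and `timed_out` are hypothetical names, and the handler records the timeout in a variable instead of asserting so that the sketch can be exercised. It is Linux-only, and older glibc versions need `-lrt` at link time.

```c
#define _GNU_SOURCE
#include <assert.h>   /* for the caller's checks */
#include <signal.h>
#include <string.h>
#include <sys/syscall.h>
#include <time.h>
#include <unistd.h>

/* glibc historically exposes the target thread id only as _sigev_un._tid. */
#ifndef sigev_notify_thread_id
#define sigev_notify_thread_id _sigev_un._tid
#endif

/* Hypothetical: set by the handler. HotSpot would assert here instead. */
static volatile sig_atomic_t timed_out = 0;

static void timeout_handler(int sig) {
  (void)sig;
  /* Receiving the signal is itself the proof that the timer expired,
     so no separate "armed" state is consulted. */
  timed_out = 1;
}

static long elapsed_ms(const struct timespec* start) {
  struct timespec now;
  clock_gettime(CLOCK_MONOTONIC, &now);
  return (now.tv_sec - start->tv_sec) * 1000L +
         (now.tv_nsec - start->tv_nsec) / 1000000L;
}

/* Simulates a task that takes work_ms under a timeout of timeout_ms.
   Returns 1 if the timeout fired, 0 if the work finished first, -1 on error. */
int run_with_timeout(long timeout_ms, long work_ms) {
  struct sigaction sa;
  memset(&sa, 0, sizeof(sa));
  sa.sa_handler = timeout_handler;
  sigemptyset(&sa.sa_mask);
  if (sigaction(SIGALRM, &sa, NULL) != 0) return -1;

  /* Ask the kernel to deliver SIGALRM to this thread specifically. */
  struct sigevent sev;
  memset(&sev, 0, sizeof(sev));
  sev.sigev_notify = SIGEV_THREAD_ID;
  sev.sigev_signo = SIGALRM;
  sev.sigev_notify_thread_id = (pid_t)syscall(SYS_gettid);

  timer_t timer;
  if (timer_create(CLOCK_MONOTONIC, &sev, &timer) != 0) return -1;

  /* Arm a one-shot timer (it_interval stays zero). */
  struct itimerspec its;
  memset(&its, 0, sizeof(its));
  its.it_value.tv_sec = timeout_ms / 1000;
  its.it_value.tv_nsec = (timeout_ms % 1000) * 1000000L;
  timed_out = 0;
  if (timer_settime(timer, 0, &its, NULL) != 0) {
    timer_delete(timer);
    return -1;
  }

  /* Stand-in for the compile task: spin until done or interrupted. */
  struct timespec start;
  clock_gettime(CLOCK_MONOTONIC, &start);
  while (!timed_out && elapsed_ms(&start) < work_ms) {
    /* busy wait */
  }

  /* Disarm and release the timer once the task is over. */
  memset(&its, 0, sizeof(its));
  timer_settime(timer, 0, &its, NULL);
  timer_delete(timer);
  return timed_out;
}
```

For example, `run_with_timeout(50, 500)` should report a timeout, while `run_with_timeout(2000, 10)` should not. In both cases the handler never needs to know whether the timer was armed.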

-------------

PR Comment: https://git.openjdk.org/jdk/pull/26023#issuecomment-3165377812