RFR: 8272873: C2: Inlining should not depend on absolute call site counts
Igor Veresov
iveresov at openjdk.java.net
Wed Aug 25 23:28:26 UTC 2021
On Tue, 24 Aug 2021 15:44:41 GMT, Igor Veresov <iveresov at openjdk.org> wrote:
> C2 considers absolute call site counts in its inlining decisions, which seems very wrong considering the asynchronous nature of profiling and background compilation (See InlineTree::should_inline()). It causes substantial over-inlining, which in presence of a depth-first inlining can lead to an early cut off. It also is inherently unstable. C2 already uses call frequency as an additional factor and it's better to consider only that in the inlining heuristic. I did extensive benchmarking it yielded almost no losses and single-digit wins (up to 5%) on some benchmarks. I think it's safe to remove/deprecate InlineFrequencyCount and continue using InlineFrequencyRatio instead. I found that converting the frequency computation to FP and setting InlineFrequencyRatio=0.25 (inline a method that is called a least 25% of the time) works best.
A > 1 ratio means the call site is evaluated more than once per invocation - it's in a loop. So 20 means that the call site should be a loop with more than 20 iterations. It kind of made sense.
The current value of 0.25 means that we'll try to inline call sites that are evaluated in at least 25% of the invocations of the caller.
-------------
PR: https://git.openjdk.java.net/jdk/pull/5238
More information about the hotspot-compiler-dev
mailing list