RFR: 8265836: OperatingSystemImpl.getCpuLoad() returns incorrect CPU load
Argha C
github.com+971473+argha-c at openjdk.java.net
Wed May 5 21:18:52 UTC 2021
On Wed, 5 May 2021 05:57:55 GMT, Hao Tang <github.com+7947546+tanghaoth90 at openjdk.org> wrote:
>> src/jdk.management/unix/classes/com/sun/management/internal/OperatingSystemImpl.java line 142:
>>
>>> 140: long usageNanos = containerMetrics.getCpuUsage();
>>> 141: if (numPeriods > 0 && usageNanos > 0) {
>>> 142: long quotaNanos = TimeUnit.MICROSECONDS.toNanos(quota * numPeriods);
>>
>> We happened to hit an exactly similar problem when running on a container with openjdk15.
>>
>> Given we effectively agree that the problem is `elapsedNanos` doesn't accurately reflect the cpu time allocated across all shares vs a single share, my proposal was to use `getCpuShares` as a multiplier for `periodLength` above.
>> Is there a good reason `getCpuQuota` is a better alternative?
>
> Hi Argha, thanks a lot for your suggestion. I think both "quota" and "share" are worth considering. Let us look into the implementation of `CgroupSubsystem::active_processor_count()` in OpenJDK HotSpot (https://github.com/openjdk/jdk/blob/master/src/hotspot/os/linux/cgroupSubsystem_linux.cpp).
Thanks for linking that. It sounds reasonable to me to prefer `quota` in that case.
-------------
PR: https://git.openjdk.java.net/jdk/pull/3656
More information about the serviceability-dev
mailing list