RFR: 8283199: Linux os::cpu_microcode_revision() stalls cold startup

David Holmes dholmes at openjdk.java.net
Thu Mar 17 07:27:50 UTC 2022


On Tue, 15 Mar 2022 17:21:43 GMT, Aleksey Shipilev <shade at openjdk.org> wrote:

> Noticed this when staring at timeline profiles for JVM startup. If you run any small startup workload on Linux, there is a significant time gap where JVM does nothing. I pinpointed that to fopen of `/proc/cpuinfo` due to [JDK-8249672](https://bugs.openjdk.java.net/browse/JDK-8249672). 
> 
> This *does not* reproduce if you run startup workloads continuously, as it looks as if `/proc/cpuinfo` is cached for about a second, and stalls reads after that, I suspect for CPU info updates, like frequency. This reproduces on at least two of my systems running Linux kernels 5.4 and 5.15.
> 
> Observe:
> 
> 
> $ for I in `seq 1 5`; do sleep 0.99; time cat /proc/cpuinfo > /dev/null; done
> real	0m0.022s
> real	0m0.003s
> real	0m0.004s
> real	0m0.004s
> real	0m0.004s
> 
> $ for I in `seq 1 5`; do sleep 1.00; time cat /proc/cpuinfo > /dev/null; done
> real	0m0.022s
> real	0m0.022s
> real	0m0.022s
> real	0m0.022s
> real	0m0.022s
> 
> 
> It directly impacts JVM startup:
> 
> 
> $ for I in `seq 1 3`; do sleep 1; time build/linux-x86_64-server-release/images/jdk/bin/java -Xms128m -Xmx128m -version > /dev/null 2>&1 ; done
> 
> real	0m0.046s
> real	0m0.045s
> real	0m0.046s
> 
> 
> Without the sleep:
> 
> 
> $ for I in `seq 1 3`; do time build/linux-x86_64-server-release/images/jdk/bin/java -Xms128m -Xmx128m -version > /dev/null 2>&1 ; done
> 
> real	0m0.026s
> real	0m0.026s
> real	0m0.023s
> 
> 
> There is another way to do this: read `/sys/devices/system/cpu/cpu0/microcode/version`, this is what this patch does. With `sleep 1`:
> 
> 
> $ for I in `seq 1 3`; do sleep 1; time build/linux-x86_64-server-release/images/jdk/bin/java -Xms128m -Xmx128m -version > /dev/null 2>&1 ; done
> 
> real	0m0.025s
> real	0m0.024s
> real	0m0.024s
> 
> 
> Which means it improves startup time from ~45ms to ~25ms, or about 1.8x!
> 
> `os::cpu_microcode_revision()` is currently used to generate the VM features string. It raises a bigger question if VM features string should be generated on startup, but it seems to take negligible time otherwise, see [JDK-8283200](https://bugs.openjdk.java.net/browse/JDK-8283200). Given [JDK-8249672](https://bugs.openjdk.java.net/browse/JDK-8249672) had been backported to update releases, let's do a pointed fix for this regression first.
> 
> Additional testing:
>  - [x] Linux x86_64 fastdebug `tier1`
>  - [x] Eyeballing reported microcode number with/without the fix -- same thing reported

What versions of Linux have `/sys/devices/system/cpu/cpu0/microcode/version`? And is it present in virtualized environments? What cost is there for systems that always have to take the second path?

Thanks,
David

src/hotspot/os_cpu/linux_x86/os_linux_x86.cpp line 475:

> 473:         sscanf(data, "%x", &result);
> 474:       }
> 475:       fclose(fp);

The diff would be much simpler if you added a return here and so avoided the need for the if statement below.

-------------

PR: https://git.openjdk.java.net/jdk/pull/7825


More information about the hotspot-runtime-dev mailing list