RFR: 8319650: Improve heap dump performance with class metadata caching [v3]
Coleen Phillimore
coleenp at openjdk.org
Thu Nov 9 14:28:18 UTC 2023
On Thu, 9 Nov 2023 10:45:18 GMT, Aleksey Shipilev <shade at openjdk.org> wrote:
>> See the rationale and some profiles in the bug.
>>
>> This PR implements simple caching for object instance dumping, which is overwhelmingly hottest path in heap dumping code. This allows to put the cache straight in `HeapObjectDumper`, which is effectively thread local. A simple hash table would then work to cache the class metadata.
>>
>> Given a simple workload from the bug, these are the motivational improvements and historical perspective:
>>
>>
>> $ for I in `seq 1 3`; do java -XX:+UseParallelGC -XX:+HeapDumpAfterFullGC -Xms8g -Xmx8g HeapDump.java 2>&1 | grep created; rm *.hprof; done
>>
>> # Baseline JDK 8u382
>> Heap dump file created [1440865228 bytes in 5.492 secs]
>> Heap dump file created [1440865228 bytes in 5.460 secs]
>> Heap dump file created [1440865228 bytes in 5.471 secs]
>>
>> # Baseline JDK 11.0.21
>> Heap dump file created [1446043070 bytes in 5.088 secs]
>> Heap dump file created [1446042039 bytes in 5.096 secs]
>> Heap dump file created [1446042835 bytes in 5.081 secs]
>>
>> # Baseline JDK 17.0.10
>> Heap dump file created [1446841531 bytes in 3.276 secs]
>> Heap dump file created [1446841673 bytes in 3.342 secs]
>> Heap dump file created [1446841590 bytes in 3.323 secs]
>>
>> # Baseline JDK 21.0.2 EA
>> Heap dump file created [1447828697 bytes in 5.516 secs]
>> Heap dump file created [1447829756 bytes in 5.461 secs]
>> Heap dump file created [1447829539 bytes in 5.435 secs]
>>
>> # Baseline JDK 22
>> Heap dump file created [1447285149 bytes in 5.232 secs]
>> Heap dump file created [1447285149 bytes in 5.211 secs]
>> Heap dump file created [1447284813 bytes in 5.222 secs]
>>
>> # Patched JDK 22
>> Heap dump file created [1447285087 bytes in 1.071 secs]
>> Heap dump file created [1447285149 bytes in 1.073 secs]
>> Heap dump file created [1447285073 bytes in 1.058 secs]
>>
>>
>> This amounts to 5x improvement, 1.2 GB/sec dump rate (somewhat I/O limited), and the best heap dump performance across all JDK releases. I am planning to backport this patch where possible: at least to JDK 21u, but hopefully to JDK 17u as well.
>>
>> There are other simple improvements, but this one is the top bottleneck.
>
> Aleksey Shipilev has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains three additional commits since the last revision:
>
> - Limit the table size
> - Merge branch 'master' into JDK-8319650-heapdump-cache-meta
> - Fix
This looks good. I have a couple of minor comments.
src/hotspot/share/services/heapDumper.cpp line 813:
> 811: };
> 812:
> 813: class DumperClassCacheTable : public StackObj {
Technically it's a ValueObj inside a StackObj, so don't really need to have StackObj here. To me, that it's means that you're going to use it as a local variable in a method, when it's really embedded in StackObj. One can argue all ways on this though, it's just my opinion.
src/hotspot/share/services/heapDumper.cpp line 825:
> 823: static constexpr int CACHE_TOP = 256;
> 824:
> 825: static unsigned int ptr_hash(void* const& s1) {
Should we change primitive_hash to this? (not this PR)
src/hotspot/share/services/heapDumper.cpp line 830:
> 828: }
> 829:
> 830: typedef ResourceHashtable<void*, DumperClassCacheTableEntry*, TABLE_SIZE, AnyObj::C_HEAP, mtServiceability,
If the key is InstanceKlass, why not use InstanceKlass* as the key instead of void*.
src/hotspot/share/services/heapDumper.cpp line 1041:
> 1039:
> 1040: // returns the size of the instance of the given class
> 1041: u4 DumperSupport::instance_size(Klass* k, DumperClassCacheTableEntry* class_cache_entry) {
Since you changed the parameters to instance_size(), can you make the argument InstanceKlass* instead of Klass* to remove the cast at 1045. I believe all the callers already pass InstanceKlass*.
src/hotspot/share/services/heapDumper.cpp line 1126:
> 1124: // dump the raw values of the instance fields of the given object
> 1125: void DumperSupport::dump_instance_fields(AbstractDumpWriter* writer, oop o, DumperClassCacheTableEntry* class_cache_entry) {
> 1126: assert(class_cache_entry, "Pre-condition: should be provided");
Needs explicit check against nullptr.
-------------
Marked as reviewed by coleenp (Reviewer).
PR Review: https://git.openjdk.org/jdk/pull/16545#pullrequestreview-1722576207
PR Review Comment: https://git.openjdk.org/jdk/pull/16545#discussion_r1388081172
PR Review Comment: https://git.openjdk.org/jdk/pull/16545#discussion_r1388031769
PR Review Comment: https://git.openjdk.org/jdk/pull/16545#discussion_r1388027181
PR Review Comment: https://git.openjdk.org/jdk/pull/16545#discussion_r1388029365
PR Review Comment: https://git.openjdk.org/jdk/pull/16545#discussion_r1388068721
More information about the hotspot-runtime-dev
mailing list