RFR: 8324573: HashMap::putAll should resize to sum of both map sizes [v3]
Joshua Cao
duke at openjdk.org
Thu Jan 25 00:29:40 UTC 2024
> This change mirrors what we did for ConcurrentHashMap in https://github.com/openjdk/jdk/pull/17116. When we add all entries from one map to another, we should resize the destination map to hold the sum of both maps' sizes.
>
> I used the command below to run the benchmarks. I set a large initial heap to reduce garbage collection noise.
>
> java -Xms25G -jar benchmarks.jar -p size=100000 -p addSize=100000 -gc true org.openjdk.bench.java.util.HashMapBench
>
>
> Before change
>
>
> Benchmark            (addSize)        (mapType)  (size)  Mode  Cnt   Score   Error  Units
> HashMapBench.putAll     100000         HASH_MAP  100000  avgt    4  22.927 ± 3.170  ms/op
> HashMapBench.putAll     100000  LINKED_HASH_MAP  100000  avgt    4  25.198 ± 2.189  ms/op
>
>
> After change
>
>
> Benchmark            (addSize)        (mapType)  (size)  Mode  Cnt   Score   Error  Units
> HashMapBench.putAll     100000         HASH_MAP  100000  avgt    4  16.780 ± 0.526  ms/op
> HashMapBench.putAll     100000  LINKED_HASH_MAP  100000  avgt    4  19.721 ± 0.349  ms/op
>
>
> We see average time improvements of roughly 27% for HashMap and 22% for LinkedHashMap.
>
> ---
>
> In the worst case, the two maps have identical keys, and we would resize aggressively when no resize is needed. I'm also adding a `putAllSameKeys` benchmark to cover this case.
>
> Before change:
>
>
> Benchmark                                       (addSize)        (mapType)  (size)  Mode  Cnt        Score   Error  Units
> HashMapBench.putAllSameKeys                        100000         HASH_MAP  100000  avgt             6.956          ms/op
> HashMapBench.putAllSameKeys:gc.alloc.rate          100000         HASH_MAP  100000  avgt          1091.383          MB/sec
> HashMapBench.putAllSameKeys:gc.alloc.rate.norm     100000         HASH_MAP  100000  avgt       7968871.917          B/op
> HashMapBench.putAllSameKeys:gc.count               100000         HASH_MAP  100000  avgt               ≈ 0          counts
> HashMapBench.putAllSameKeys                        100000  LINKED_HASH_MAP  100000  avgt             8.417          ms/op
> HashMapBench.putAllSameKeys:gc.alloc.rate          100000  LINKED_HASH_MAP  100000  avgt           992.543          MB/sec
> HashMapBench.putAllSameKeys:gc.alloc.rate.norm     100000  LINKED_HASH_MAP  100000  avgt       8768892.941          B/op
> HashMapBench.putAllSameKeys:gc.count               100000  LINKED_HASH_MAP  100000  avgt               ≈ 0          counts
>
>
> After change:
>
>
> Benchmark (addSize) (mapType)...
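
The trade-off in the quoted description (sizing for the sum helps with disjoint keys but over-allocates with identical keys) can be illustrated with a simplified model. This is a sketch under stated assumptions, not the JDK's implementation: `PutAllSizing`, `capacityFor`, `range`, and `sizeAfterPutAll` are hypothetical names, and the model assumes the default load factor of 0.75 and power-of-two table capacities starting at 16.

```java
import java.util.HashMap;
import java.util.Map;

public class PutAllSizing {
    // Hypothetical helper (not a JDK method): smallest power-of-two capacity
    // whose resize threshold (capacity * 0.75) can hold `entries` mappings.
    static int capacityFor(int entries) {
        int capacity = 16;
        while (capacity * 0.75 < entries && capacity < (1 << 30)) {
            capacity <<= 1;
        }
        return capacity;
    }

    // Builds a map with keys offset, offset + 1, ..., offset + count - 1.
    static Map<Integer, Integer> range(int offset, int count) {
        Map<Integer, Integer> map = new HashMap<>();
        for (int i = 0; i < count; i++) {
            map.put(offset + i, i);
        }
        return map;
    }

    // Copies dst, adds every entry of src, and reports the resulting size.
    static int sizeAfterPutAll(Map<Integer, Integer> dst, Map<Integer, Integer> src) {
        Map<Integer, Integer> copy = new HashMap<>(dst);
        copy.putAll(src);
        return copy.size();
    }

    public static void main(String[] args) {
        int n = 100_000;
        // Disjoint keys: the result really holds size + addSize entries.
        System.out.println("disjoint keys:  " + sizeAfterPutAll(range(0, n), range(n, n)));
        // Identical keys: the result is no larger than the original map.
        System.out.println("identical keys: " + sizeAfterPutAll(range(0, n), range(0, n)));
        // Modelled table capacity for each estimate.
        System.out.println("capacity for n:  " + capacityFor(n));
        System.out.println("capacity for 2n: " + capacityFor(2 * n));
    }
}
```

Under this model, a table sized for 100,000 entries has capacity 262,144, while one sized for 200,000 has capacity 524,288. When the keys are identical, the larger table is never needed, which is the cost the `putAllSameKeys` benchmark is meant to expose.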
Joshua Cao has updated the pull request incrementally with one additional commit since the last revision:
Use max of both sizes and other maps size in case of overflow
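
One possible reading of this commit message, as a minimal sketch: when the two `int` sizes are added, the sum can wrap to a negative value, so taking the maximum of the sum and the other map's size yields a usable estimate either way. The class and method names below are hypothetical, and the exact expression used in the patch is not shown in this message.

```java
public class CombinedSize {
    // Assumption: both arguments are non-negative map sizes.
    // If currentSize + otherSize overflows, the wrapped sum is negative,
    // so Math.max falls back to otherSize.
    static int combinedSize(int currentSize, int otherSize) {
        return Math.max(currentSize + otherSize, otherSize);
    }

    public static void main(String[] args) {
        System.out.println(combinedSize(100_000, 100_000));
        System.out.println(combinedSize(Integer.MAX_VALUE, 5));
    }
}
```

The fallback is conservative: on overflow the destination is sized only for the incoming map, which is at least as much room as sizing from the incoming map alone would provide.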
-------------
Changes:
- all: https://git.openjdk.org/jdk/pull/17544/files
- new: https://git.openjdk.org/jdk/pull/17544/files/a74c10da..4ecc08a7
Webrevs:
- full: https://webrevs.openjdk.org/?repo=jdk&pr=17544&range=02
- incr: https://webrevs.openjdk.org/?repo=jdk&pr=17544&range=01-02
Stats: 1 line in 1 file changed: 0 ins; 0 del; 1 mod
Patch: https://git.openjdk.org/jdk/pull/17544.diff
Fetch: git fetch https://git.openjdk.org/jdk.git pull/17544/head:pull/17544
PR: https://git.openjdk.org/jdk/pull/17544
More information about the core-libs-dev mailing list