RFR: 8372946 - TreeMap sub-map entry spliterator is expensive [v2]
Chen Liang
liach at openjdk.org
Mon Jan 26 16:04:29 UTC 2026
On Tue, 2 Dec 2025 17:00:54 GMT, Oli Gillespie <ogillespie at openjdk.org> wrote:
>> `TreeMap` sub-maps use the default `IteratorSpliterator` implementation for `TreeMap$EntrySetView` which is slow for some operations, because `EntrySetView.size()` iterates all elements. This is most trivially shown by something like `largeTreeMap.tailMap(0L, false).entrySet().limit(1).count()` taking a long time. This showed up in my application, where it was trivial to mitigate by switching to a for loop, but I think the fix is easy enough.
>>
>> `keySet()` does not have the same problem, as it provides a custom `Spliterator` implementation which is not `Spliterator.SIZED`, and returns `Long.MAX_VALUE` for `estimateSize()` (which is the recommended approach when the size is expensive to compute). I'm *assuming* this optimization was simply missed for the EntryIterator in the original implementation, but I don't know for sure.
>>
>> This patch fixes the issue by providing a custom spliterator for `EntrySetView`, which is not SIZED. The implementation is copied almost exactly from the equivalent `KeyIterator` classes in this file (`SubMapKeyIterator`, `DescendingSubMapKeyIterator`). The only difference is in `SubMapEntryIterator.getComparator`, for which I copied the implementation from `TreeMap$EntrySpliterator`.
>>
>>
>> Basic performance test: `map.tailMap(0L, false).entrySet().stream().limit(1).count()` for a `TreeMap` with `10_000_000` entries.
>>
>> Before (keySet is fast using `SubMapKeyIterator`, entrySet is slow using `IteratorSpliterator`):
>>
>> class java.util.TreeMap$KeySet
>> .stream().limit(1).count() took 0.046ms
>> spliterator = SubMapKeyIterator, estimateSize() = 9223372036854775807
>> class java.util.TreeMap$AscendingSubMap$AscendingEntrySetView
>> .stream().limit(1).count() took 218ms
>> spliterator = IteratorSpliterator, estimateSize() = 9999999
>>
>>
>> After (entrySet is now fast, using `SubMapEntryIterator`):
>>
>> class java.util.TreeMap$KeySet
>> .stream().limit(1).count() took 0.017ms
>> spliterator = SubMapKeyIterator, estimateSize() = 9223372036854775807
>> class java.util.TreeMap$AscendingSubMap$AscendingEntrySetView
>> .stream().limit(1).count() took 0.013ms
>> spliterator = SubMapEntryIterator, estimateSize() = 9223372036854775807
>
> Oli Gillespie has updated the pull request incrementally with one additional commit since the last revision:
>
> Fix failing test
Changes requested by liach (Reviewer).
src/java.base/share/classes/java/util/TreeMap.java line 2028:
> 2026: }
> 2027:
> 2028: public abstract Spliterator<Map.Entry<K,V>> spliterator();
I don't think you need this huge a patch. I think you should just do:
Suggestion:
public Spliterator<Map.Entry<K,V>> spliterator() {
return Spliterators.spliterator(iterator(), Spliterator.DISTINCT);
}
Your patch is introducing spliterator behavioral changes unrelated to the performance regression fix.
-------------
PR Review: https://git.openjdk.org/jdk/pull/28608#pullrequestreview-3706750477
PR Review Comment: https://git.openjdk.org/jdk/pull/28608#discussion_r2728206435
More information about the core-libs-dev
mailing list