RFR: 8305774: String.join(CharSequence, Iterable) can be optimized if Iterable is a Collection [v3]
jmehrens
duke at openjdk.org
Mon Apr 10 13:20:49 UTC 2023
On Mon, 10 Apr 2023 05:46:23 GMT, Tingjun Yuan <duke at openjdk.org> wrote:
>> In the current implementation of `String.join(CharSequence, Iterable)`, the temp array `elems` is always initialized with a length of 8. It will cause many array recreations when the `Iterable` contains more than 8 elements. Furthermore, it's very common that an `Iterable` is also a `Collection`. So if the `Iterable` is an instance of `Collection`, the initial length of the array can be `((Collection<?>)elements).size()`. It will not change the current behavior even if the `Collection` is modified asynchronously.
>>
>> I don't know whether this change requires a CSR request.
>
> Tingjun Yuan has updated the pull request incrementally with one additional commit since the last revision:
>
> use spliterator().estimateSize()
src/java.base/share/classes/java/lang/String.java line 3466:
> 3464: }
> 3465: int size = 0;
> 3466: for (CharSequence cs: elements) {
I would think you have to locally store the result of `elements.spliterator()` and then use Spliterators::iterator to adapt it back to an iterator. This should correctly handle [early-binding](https://docs.oracle.com/en/java/javase/17/docs/api/java.base/java/util/Spliterator.html#binding) spliterators.
I think in the loop the code should use ArraysSupport.newLength.
-------------
PR Review Comment: https://git.openjdk.org/jdk/pull/13383#discussion_r1161712521
More information about the core-libs-dev
mailing list