RFR: 8013395 StringBuffer.toString performance regression impacting embedded benchmarks

Peter Levart peter.levart at gmail.com
Mon May 13 22:09:15 UTC 2013


On 05/13/2013 09:05 PM, Alan Bateman wrote:
> On 13/05/2013 08:12, David Holmes wrote:
>> Thanks for all the feedback and discussion. I've rewritten the patch 
>> to use Peter's suggestion of caching the char[] instead of the actual 
>> String:
>>
>> http://cr.openjdk.java.net/~dholmes/8013395/webrev.v3/
>>
>> I too am concerned that any form of caching that avoids the 
>> array-copy (which is the crux of the issue) is going to incur a 2x 
>> space penalty. I can imagine old but well-behaved code that uses 
>> StringBuffer rather than StringBuilder where very long buffers are 
>> created and toString() only called once - and they may get immediate 
>> or subsequent OOME due to the extra char[]; or excessive GC activity 
>> might be induced if a lot of StringBuffers are used.
> Assuming the well-behaved case is where the StringBuffer is discarded 
> after calling toString then I wouldn't expect it to too different to 
> today. That is, you have the 2x penalty when toString is called now. 
> Clearly there are other usages which could be problematic.
>
> I'm not sure what to say about the copy-on-change-after-toString 
> approach. As the offset/count fields have been removed from String 
> then it could only be the count == value.length case. It would clearly 
> be a win in some cases but no help in others.
>
> -Alan.

Hi David, Alan,

It would be a win in cases where the capacity of StringBuilder/Buffer 
could be anticipated to match the final length of string. I tried to 
implement it just for exercise and it turns out to be tricky. Currently 
I have not been able to do it correctly without introducing some sort of 
synchronization to StringBuilder, which would severely impact 
performance, since StringBuilder is designed to be used by single 
thread. In case of sharing char array with String, the "shared" boolean 
flag would have to be checked/updated atomically together with any 
'value' de-referencing and array updating or there could be data races 
that would render String instances produced with such unsynchronized 
StringBuilder appear mutable.

While doing that, I noticed a synchronization bug in 
String.contentEquals method. If called with a StringBuffer argument 
while concurrent thread is modifying the StringBuffer, the method can 
either throw ArrayIndexOutOfBoundsException or return true even though 
the content of the StringBuffer has never been the same as the String's.

Here's a proposed patch:

http://cr.openjdk.java.net/~plevart/jdk8-tl/String.contentEquals/webrev.01/

Regards, Peter




More information about the core-libs-dev mailing list