Null-terminated Unicode strings in java.io on Windows

Krzysztof Żelechowski program.spe at home.pl
Mon Jan 28 08:24:26 UTC 2008


Dnia 25-01-2008, Pt o godzinie 22:16 +0000, Mark Wielaard pisze:
> Hi Robert,
> 
> Robert Lougher <rob.lougher at ...> writes:
> > This is getting a bit hostile for no reason....  Thinking about
> > alignment gives an interesting solution.
> > 
> > 1) Strings are not null-terminated
> > 2) For most strings the alignment gives the VM room to terminate in
> > place when GetStringChars is called
> > 3) Copy strings that can't be terminated in place.
> 
> Note that Strings have a backing [j]char array which can be shared between
> different Strings, and often are when read in in one go and then split in
> different sub-String objects. All these Strings have a shared slice of this
> backing jchar array, so there isn't any place to terminate it because that place
> will overlap with another slice that can belong to another String.

That changes the picture dramatically.  
Indeed, taking a prefix of an unmodifiable string 
requires copying data, as of the C language.  
I should have thought of that earlier, 
particularly because I gave run into this problem 
when I tried to make the source code for gmake straight.
I suppose K&R (or whoever they inherited the concept after) 
chose to z-term strings 
because they found out that writing a 0 at the end 
takes less memory than keeping a separate pointer to the end.
(This is true for character data only, 
that is why strings are so special).

Sorry for wasting your time.
Chris






More information about the core-libs-dev mailing list