RFR: 8369564: Provide a MemorySegment API to read strings with known lengths [v2]

Wed Oct 29 22:49:01 UTC 2025

On Wed, 29 Oct 2025 18:24:56 GMT, Liam Miller-Cushon <cushon at openjdk.org> wrote:

> > A user can easily convert between one or the other length representation by multiplying/dividing by the right scalar
> 
> That is true of e.g. UTF-16 but not of UTF-8, since the encoding is variable width and doing the conversion from bytes to characters is more expensive there.

Sorry, I don't mean 'character' but 'unit', or whatever it's called (I don't think 'code point' is the right word either). For instance, when reading a UTF-8 string, the unit would be one byte, for UTF-16 it would be two, for UTF-32 four. So a user would just need to divide by the unit size, at least that's the idea.

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/28043#discussion_r2475835009