RFR [8058875]: CharsetEncoder.maxBytesPerChar() should return 4 for UTF-8

Martin Buchholz martinrb at google.com
Mon Sep 22 21:23:11 UTC 2014


I think you are mistaken. It's maxBytesPerChar, not maxBytesPerCodepoint!


changeset:   3116:b44704ce8a08
user:        sherman
date:        2010-11-19 12:58 -0800
6957230: CharsetEncoder.maxBytesPerChar() reports 4 for UTF-8; should be 3
Summary: changged utf-8's CharsetEncoder.maxBytesPerChar to 3
Reviewed-by: alanb


On Mon, Sep 22, 2014 at 1:14 PM, Ivan Gerasimov <ivan.gerasimov at oracle.com>
wrote:

> Hello!
>
> The UTF-8 encoding allows characters that are 4 bytes long.
> However, CharsetEncoder.maxBytesPerChar() currently returns 3.0, which is
> not always enough.
>
> Would you please review the simple fix for this issue?
>
> BUGURL: https://bugs.openjdk.java.net/browse/JDK-8058875
> WEBREV: http://cr.openjdk.java.net/~igerasim/8058875/0/webrev/
>
> Sincerely yours,
> Ivan
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.openjdk.java.net/pipermail/nio-dev/attachments/20140922/adea3662/attachment-0001.html>


More information about the nio-dev mailing list