RFR 8124977 cmdline encoding challenges on Windows

Martin Buchholz martinrb at google.com
Fri Aug 7 13:50:03 UTC 2015


(only half paying attention)

On Wed, Aug 5, 2015 at 7:45 PM, Kirk Shoop <Kirk.Shoop at microsoft.com> wrote:

>
> This webrev uses GetCommandLineW


That sounds right - this should be the source of truth in win32land.


> on windows to retrieve the UCS16


"UCS16" confuses me.  Do you mean UTF-16 or UCS-2?  I'm guessing the former.
http://stackoverflow.com/questions/4592261/windows-api-ansi-and-wide-character-strings-is-it-utf8-or-ascii-utf-16-or-u


> commandline and also supports the 65001(UTF-8) codepage (set by chcp
> 65001) so that when -Dsun.jnu.encoding="UTF-8" is supplied the console
> output (stdout & stderr) will be in UTF8.
>

Is there a reason why UTF-8 is special here?  Shouldn't we have console
output always match the codepage of the console, when possible, without
effort on the user's part?  Is there a problem that we don't have a
translation table between code pages and Java charsets?



More information about the core-libs-dev mailing list