Unix paths as bytes
Martin Buchholz
martinrb at google.com
Mon May 4 23:24:43 PDT 2009
On Mon, May 4, 2009 at 19:54, Philip Jenvey <pjenvey at underboss.org> wrote:
>
> On May 4, 2009, at 3:41 PM, Martin Buchholz wrote:
>>
>> I believe that no implementation based on error handlers can work
>> because it cannot handle the situation where two different byte inputs
>> are converted to the same char sequence without error. The original
>> byte sequence cannot be reliably re-created.
>> What am I missing?
>
> There's no case where 2 different sets of bytes would convert to the same
> chars
I don't understand this. There are many locales with encodings with non-unique
representations. Until the UTF-8 security reform,
even UTF-8 had non-unique representations.
The Python PEP seems designed to be used with
any system encoding, not just UTF-8.
Martin
More information about the nio-dev
mailing list