Unix paths as bytes

Mon May 4 23:24:43 PDT 2009

On Mon, May 4, 2009 at 19:54, Philip Jenvey <pjenvey at underboss.org> wrote:
>
> On May 4, 2009, at 3:41 PM, Martin Buchholz wrote:
>>
>> I believe that no implementation based on error handlers can work
>> because it cannot handle the situation where two different byte inputs
>> are converted to the same char sequence without error.  The original
>> byte sequence cannot be reliably re-created.
>> What am I missing?
>
> There's no case where 2 different sets of bytes would convert to the same
> chars

I don't understand this.  There are many locales with encodings with non-unique
representations.  Until the UTF-8 security reform,
even UTF-8 had non-unique representations.
The Python PEP seems designed to be used with
any system encoding, not just UTF-8.

Martin