Re: [Python-Dev] PEP 383: Non-decodable Bytes in System Character Interfaces

Hrvoje Niksic Tue, 28 Apr 2009 05:46:38 -0700

Thomas Breuel wrote:

But the biggest problem with the proposal is that it isn't needed: if you want to be able to turn arbitrary byte sequences into unicode strings and back, just set your encoding to iso8859-15. That already works and it doesn't require any changes.

Are you proposing to unconditionally encode file names as iso8859-15, or to do so only when undecodable bytes are encountered?

If you unconditionally set encoding to iso8859-15, then you are effectively reverting to treating file names as bytes, regardless of the locale. You're also angering a lot of European users who expect iso8859-2, etc.
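To illustrate the first case, here is a minimal Python 3 sketch. The sample file name is my own assumption, not something from the thread. Decoding as iso8859-15 does round-trip every byte, but a name written under a UTF-8 locale comes out as different characters than the user typed:

```python
# Every byte value decodes under iso8859-15, so the round trip is lossless.
all_bytes = bytes(range(256))
assert all_bytes.decode("iso8859-15").encode("iso8859-15") == all_bytes

# Hypothetical file name created under a UTF-8 locale.
raw = "caf\u00e9".encode("utf-8")    # b'caf\xc3\xa9'
text = raw.decode("iso8859-15")      # each byte becomes one character

# The bytes are recoverable, but the text is no longer what the user typed.
assert text.encode("iso8859-15") == raw
assert text != "caf\u00e9"
print(repr(text))
```

The string is really just the bytes in disguise, which is what I mean by treating file names as bytes regardless of the locale.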

If you switch to iso8859-15 only in the presence of undecodable UTF-8, then you have the same round-trip problem as the PEP: both b'\xff' and b'\xc3\xbf' will be converted to u'\u00ff' without a way to unambiguously recover the original file name.
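The collision can be sketched in Python 3 as follows. The helper decode_filename is hypothetical, written only to model the "fall back to iso8859-15 when UTF-8 fails" scheme; it is not an existing API or part of the PEP:

```python
def decode_filename(raw: bytes) -> str:
    """Hypothetical scheme: try UTF-8 first, fall back to iso8859-15."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        return raw.decode("iso8859-15")

latin_name = b"\xff"       # not valid UTF-8, so the fallback is used
utf8_name = b"\xc3\xbf"    # valid UTF-8 encoding of U+00FF

a = decode_filename(latin_name)
b = decode_filename(utf8_name)

# Two different file names on disk map to the same string.
assert a == b == "\u00ff"

# Encoding back with UTF-8 recovers only one of them.
assert a.encode("utf-8") == utf8_name
assert a.encode("utf-8") != latin_name
```

Once both names have become the same string, nothing in that string says which encoding to use on the way back.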

