On 7/15/2018 4:09 PM, Jim Lee wrote:


On 07/15/18 12:37, MRAB wrote:
To me, Unicode and UTF-8 aren't things to be reserved for I18N. I use them as a matter of course because I find it a lot easier to stick with just one encoding, one that will work with _any_ text I have.

Which is exactly the same rationale for using any other single encoding (including ASCII).  If the text you deal with is not multi-lingual, why complicate matters by trying to support a plethora of encodings which will never be used (and the attendant opportunity for more bugs)?

What you are describing -- supporting hundreds of encodings with occasional bugs, including multiple encodings in a single string -- describes the text as bytes mess. Switching to unicode strings was a vast simplification. Being able to dump the hundreds of byte encoding would be a further simplification, but it will be decades or more before we can do that ;-)

Note that I'm *not* saying Unicode  is *bad*, just that it's an unnecessary complication for a great deal of programming tasks.

I do not understand what you mean by 'unicode is complication'. From the viewpoint of core developers, it is a simplification.

For a great deal more, it's absolutely necessary.  That why I said a "smart" language would make it easy to turn on and off.

What you mean by 'turn unicode off'?

--
Terry Jan Reedy


--
https://mail.python.org/mailman/listinfo/python-list

Reply via email to