Thanks for the answers.  That clears things up quite a bit.

What if your source file is saved as UTF-8?  Do you then have a proper
UTF-8 byte string, but the problem is that none of the standard Python
library methods know how to interpret it as UTF-8?

Well, the decode method knows how to decode those bytes into a `unicode`
object if you call it with 'utf-8' as the argument.

OK, good to know.
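
So, just to be sure I have the model right, a minimal Python 2 sketch
(assuming the source file itself is saved as UTF-8 and carries the usual
coding declaration):

    # -*- coding: utf-8 -*-
    # Python 2: the literal is a plain str holding raw UTF-8 bytes;
    # .decode('utf-8') turns it into a unicode object.
    s = 'naïve'
    u = s.decode('utf-8')
    print type(s), len(s)    # <type 'str'> 6  -- counts bytes
    print type(u), len(u)    # <type 'unicode'> 5  -- counts characters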

4. In Python 3.0, this silliness goes away, because all strings are
Unicode by default.

Yes and no. The problem just shifts: at some point you run into similar
trouble, only in the other direction. Data enters the program as bytes
and must leave it as bytes again, so you have to deal with encodings at
those points.

Yes, but that's still much better than having to litter your code with 'u' prefixes and .decode calls and so on. If I'm using a UTF-8-savvy text editor (as we all should be doing in the 21st century!), and type "foo = '2π'", I should get a string containing a '2' and a pi character, and all the text operations (like counting characters, etc.) should Just Work.
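
To make that concrete, a small Python 3 sketch of that same pi example
(again assuming the source file is saved as UTF-8):

    # Python 3: the literal is already a str of characters, not bytes.
    foo = '2π'
    print(len(foo))       # 2 -- counts characters, no u'' prefix needed
    print(foo.upper())    # 2Π -- text operations work on characters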

When I read and write files or sockets or whatever, of course I'll have to think about what encoding the text should be... but internal to my own source code, I shouldn't have to.
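
Something along these lines is all the boundary handling I'd expect to
need (a Python 3 sketch; the filename is just a placeholder):

    # Python 3: text is str inside the program; bytes appear only at the
    # edges, where an encoding has to be chosen.
    with open("notes.txt", "w", encoding="utf-8") as f:
        f.write("2π ≈ 6.28318\n")           # str -> bytes on the way out

    with open("notes.txt", "rb") as f:
        raw = f.read()                       # the raw bytes back

    print(raw)                  # b'2\xcf\x80 \xe2\x89\x88 6.28318\n'
    print(raw.decode("utf-8"))  # 2π ≈ 6.28318 -- decode on the way in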

I understand the need for a transition strategy, which is what we have in 2.x, and that's working well enough. But I'll be glad when it's over. :)

Cheers,
- Joe

