2012/3/13 kobi zamir <kobi.za...@gmail.com>

>
>
>> So I guess that you're also in the UTF-8 camp.
>>
>
> yes, but my opinion about utf-8 is just my opinion. i like python and
> python defaults to utf-8.
>

Python's internal representation is not UTF-8, but UTF-16, or UTF-32,
depends on build parameters. Thus python doesn't really support code points
above the BMP.
Of course, you cannot know the internal representation, since python
(cleverly) does not allow you to cast a unicode string to a sequence of
bytes without specifying the result encoding.

http://docs.python.org/c-api/unicode.html

(see also this very good
presentation<http://98.245.80.27/tcpc/OSCON2011/gbu.html>on internal
unicode representations in various languages).
_______________________________________________
Linux-il mailing list
Linux-il@cs.huji.ac.il
http://mailman.cs.huji.ac.il/mailman/listinfo/linux-il

Reply via email to