New submission from Mahmoud <[email protected]>:
Odd behaviour with str.encode or codecs.Codec.encode or simailar
functions, when dealing with uncode objects above ffff
with 2.6
>>> u'\u10380'.encode('utf')
'\xe1\x80\xb80'
with 3.x
'\u10380'.encode('utf')
'\xe1\x80\xb80'
correct output must be:
\xf0\x90\x8e\x80
----------
components: Unicode
messages: 93780
nosy: msaghaei
severity: normal
status: open
title: encoding uncode objects greater than FFFF
type: behavior
versions: Python 2.6, Python 2.7, Python 3.0, Python 3.1
_______________________________________
Python tracker <[email protected]>
<http://bugs.python.org/issue7090>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe:
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com