New submission from Arc Riley <arcri...@gmail.com>:

The following is a minimal example which does not work under Python
3.1.1 but functions as expected on Pyhton 2.6 and 3.0.

Python 3.1.1 believes the single UTF-8 glyph is two entirely different
(and illegal) unicode characters:

Traceback (most recent call last):
  File "t.py", line 2, in <module>
    print('𐑛')
UnicodeEncodeError: 'utf-8' codec can't encode character '\ud801' in
position 0: surrogates not allowed

Test system is Ubuntu 9.10-beta 32-bit

----------
components: Interpreter Core
files: t.py
messages: 93475
nosy: ArcRiley
severity: normal
status: open
title: utf-8 encoding error
type: behavior
versions: Python 3.1
Added file: http://bugs.python.org/file15028/t.py

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue7045>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to