New submission from Benjamin Peterson <benja...@python.org>:

Narrow build:
unicodedata.normalize("NFKC", "𝔘𝔫𝔦𝔠𝔬𝔡𝔢")
'𝔘𝔫𝔦𝔠𝔬𝔡𝔢'

Wide build:
>>> unicodedata.normalize("NFKC", "𝔘𝔫𝔦𝔠𝔬𝔡𝔢")
'Unicode'

Normalization needs to properly decode characters in the supplementary plane.

----------
components: Extension Modules
messages: 142008
nosy: benjamin.peterson
priority: normal
severity: normal
status: open
title: normalization is affected by unicode width
type: behavior
versions: Python 2.7, Python 3.2, Python 3.3

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue12746>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to