Martin v. Löwis <mar...@v.loewis.de> added the comment:

Tom: I don't think Unicode::GCString implements UAX#11 correctly (but this is 
really out of scope of this issue). In particular, it contains an ad-hoc 
decision to introduce the EA_Z east-asian width that UAX#11 doesn't talk about.

In most cases, it's probably reasonable to introduce this EA_Z feature. 
However, there are some significant deviations from UAX#11 here:
- combining characters are given EA_Z in sombok/data/custom.pl, even though 
UAX#11 assigns A or N. UAX#11 points out that the advance width depends on 
whether or not the terminal performs character combination or not. It's not 
clear whether Unicode::GCString aims for "strict" UAX#11, or "advance width".
- control characters are also given EA_Z, even though UAX#11 gives them EA_N. 
In this case, it's neither UAX#11 width nor advance width since control 
characters will have various effects on the terminal (in particular for the tab 
character)

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue12568>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to