[issue10459] missing character names in unicodedata (CJK...)

Marc-Andre Lemburg Fri, 19 Nov 2010 07:29:57 -0800

Marc-Andre Lemburg <m...@egenix.com> added the comment:

Vlastimil Brom wrote:
> 
> New submission from Vlastimil Brom <vlastimil.b...@gmail.com>:
> 
> I just noticed an ommision of come character names in unicodedata module.
> These are some CJK - Ideographs:
> 
> 龼 (0x9fbc) - 鿋 (0x9fcb)
>  (CJK Unified Ideographs [19968-40959] [0x4e00-0x9fff])
> 
> 𪜀 (0x2a700) - 𫜴 (0x2b734)
> (CJK Unified Ideographs Extension C [173824-177983] [0x2a700-0x2b73f])
> 
> 𫝀 (0x2b740) - 𫠝 (0x2b81d)
>  (CJK Unified Ideographs Extension D [177984-178207] [0x2b740-0x2b81f])
> 
> The names are probably to be generated - e.g. CJK UNIFIED IDEOGRAPH-2A700 ... 
> etc.


I don't think we should fill those rather big ranges with generated
names, unless there's a standard for this. There are quite a
few ranges in the Unicode database that are assigned, but don't
have a literal name associated with them.

----------
nosy: +lemburg

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue10459>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

[issue10459] missing character names in unicodedata (CJK...)

Reply via email to