Marc-Andre Lemburg <m...@egenix.com> added the comment: Vlastimil Brom wrote: > > New submission from Vlastimil Brom <vlastimil.b...@gmail.com>: > > I just noticed an ommision of come character names in unicodedata module. > These are some CJK - Ideographs: > > 龼 (0x9fbc) - 鿋 (0x9fcb) > (CJK Unified Ideographs [19968-40959] [0x4e00-0x9fff]) > > 𪜀 (0x2a700) - 𫜴 (0x2b734) > (CJK Unified Ideographs Extension C [173824-177983] [0x2a700-0x2b73f]) > > 𫝀 (0x2b740) - 𫠝 (0x2b81d) > (CJK Unified Ideographs Extension D [177984-178207] [0x2b740-0x2b81f]) > > The names are probably to be generated - e.g. CJK UNIFIED IDEOGRAPH-2A700 ... > etc.
I don't think we should fill those rather big ranges with generated names, unless there's a standard for this. There are quite a few ranges in the Unicode database that are assigned, but don't have a literal name associated with them. ---------- nosy: +lemburg _______________________________________ Python tracker <rep...@bugs.python.org> <http://bugs.python.org/issue10459> _______________________________________ _______________________________________________ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com