On Sun, Aug 16, 2015 at 07:35:17AM -0700, [email protected] wrote: > Hello Unicode Mailing List, > > There is significant discussion about the problems of adding capital letters > with individual under-bars in this mailing list for GNU APL. > > http://lists.gnu.org/archive/html/bug-apl/2015-08/msg00050.html > > Pretty much it adds up to the following problem: > > The string length functionality would view an 'A' code point combined with an > '_' code point as an item that has two elements, while something that looks > like 'A' Should be atomic, and return a length of one.
I think what you need is better “character” counting [1], rather than new precomposed characters. Regards, Khaled 1. http://unicode.org/reports/tr29/#Grapheme_Cluster_Boundaries

