Jim Jewett <jimjjew...@gmail.com> added the comment:

Why was the delta-processing removed from the casing functions?

As best I can tell, the whole point of going through multiple levels of 
indirection (courtesy splitbins) is to maximize compression and minimize the 
amount of cache that unicode might occupy.

By using deltas, only one record is needed for each combination of (upper - 
lower, upper - title), which is generally only one or two combinations per 
script.  

Without deltas, nearly every cased letter needs its own record, and the index 
tables also get bigger. (It seems to be about 2.6 times as large, but cache 
effects may be worse, since letters from the same script will no longer be in 
the same record or the same index chain.)

If it is a concern about not enough room for flags, then the decimal/digit 
chars could be combined.  They are always the same, unless the number isn't 
decimal (in which case the flag is enough).

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue12736>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to