On Tue, Jan 1, 2013 at 1:29 AM, Steven D'Aprano <[email protected]> wrote:
>
> 2 Since "wide builds" use so much extra memory for the average ASCII
> string, hardly anyone uses them.
On Windows (and I think OS X, too) a narrow build has been practical
since the wchar_t type is 16-bit. As to Linux I'm most familiar with
Debian, which uses a wide build. Do you know off-hand which distros
release a narrow build?
> But more important than the memory savings, it means that for the first
> time Python's handling of Unicode strings is correct for the entire range
> of all one million plus characters, not just the first 65 thousand.
Still, be careful not to split 'characters':
>>> list(normalize('NFC', '\u1ebf'))
['ế']
>>> list(normalize('NFD', '\u1ebf'))
['e', '̂', '́']
_______________________________________________
Tutor maillist - [email protected]
To unsubscribe or change subscription options:
http://mail.python.org/mailman/listinfo/tutor