On Mon, Mar 12, 2012 at 5:39 PM, E L <elyl...@cs.huji.ac.il> wrote:

> What's the advantage of using ucs-4 internally?
> Especially if the program needs to save memory (embedded devices are
> pretty common these days).
>

UTF-32 or UCS-4, is the only encoding form that allows random access to
each Unicode codepoint, each codepoint is 32 bits exactly. As I mentioned,
UTF-16 was created with the intention of having indexable codepoints, but
eventually there were too many of them (eg
http://www.fileformat.info/info/unicode/char/1f3e9/index.htm
https://plus.google.com/109925364564856140495/posts etc).
_______________________________________________
Linux-il mailing list
Linux-il@cs.huji.ac.il
http://mailman.cs.huji.ac.il/mailman/listinfo/linux-il

Reply via email to