Re: [Haskell-cafe] Re: PROPOSAL: New efficient Unicode string library.

Deborah Goldsmith Wed, 26 Sep 2007 18:50:07 -0700

On Sep 26, 2007, at 11:06 AM, Aaron Denney wrote:

UTF-16 has no advantage over UTF-8 in this respect, because of surrogate
pairs and combining characters.
Good point.

Well, not so much. As Duncan mentioned, it's a matter of what the most common case is. UTF-16 is effectively fixed-width for the majority of text in the majority of languages. Combining sequences and surrogate pairs are relatively infrequent.
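To make the fixed-width point concrete, here is a minimal sketch (not taken from the proposed library) that counts how many code units each code point needs in UTF-8 and in UTF-16. The names utf8Bytes, utf16Units and isFixedWidthUtf16 are hypothetical, chosen only for illustration, and the sketch ignores combining sequences, which span several code points in either encoding.

```haskell
import Data.Char (ord)

-- Number of 16-bit code units needed for one code point in UTF-16.
-- (Hypothetical helper, for illustration only.)
utf16Units :: Char -> Int
utf16Units c
  | ord c < 0x10000 = 1  -- Basic Multilingual Plane: a single unit
  | otherwise       = 2  -- supplementary planes: a surrogate pair

-- Number of bytes needed for one code point in UTF-8.
-- (Hypothetical helper, for illustration only.)
utf8Bytes :: Char -> Int
utf8Bytes c
  | n < 0x80    = 1
  | n < 0x800   = 2
  | n < 0x10000 = 3
  | otherwise   = 4
  where
    n = ord c

-- True when every code point in the string fits in one UTF-16 unit,
-- i.e. when the string can be indexed as if UTF-16 were fixed-width.
isFixedWidthUtf16 :: String -> Bool
isFixedWidthUtf16 = all ((== 1) . utf16Units)

main :: IO ()
main = mapM_ report samples
  where
    -- ASCII, Greek, Japanese, and one code point outside the BMP (U+1D11E).
    samples = ["hello", "\x3b1\x3b2\x3b3", "\x65e5\x672c\x8a9e", "\x1d11e"]
    report s = print (map utf8Bytes s, map utf16Units s, isFixedWidthUtf16 s)
```

In this sketch the ASCII, Greek and Japanese samples all take exactly one UTF-16 unit per code point while their UTF-8 widths vary from one to three bytes; only the last sample, which lies outside the Basic Multilingual Plane, needs a surrogate pair.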

Speaking as someone who has done a lot of Unicode implementation, I would say UTF-16 represents the best time/space tradeoff for an internal representation. As I mentioned, it's what's used in Windows, Mac OS X, ICU, and Java.


Deborah

_______________________________________________
Haskell-Cafe mailing list
Haskell-Cafe@haskell.org
http://www.haskell.org/mailman/listinfo/haskell-cafe
