Michael linked to a Unicode FAQ earlier; I want to highlight this:

"Q: How about using UTF-32 interfaces in my APIs?

"A: Except in some environments that store text as UTF-32 in memory,
most Unicode APIs are using UTF-16. With UTF-16 APIs the low level
indexing is at the storage or code unit level, with higher-level
mechanisms for graphemes or words specifying their boundaries in terms
of the code units. This provides efficiency at the low levels, and the
required functionality at the high levels."

The author is Mark Davis, President of the Unicode Consortium.

 http://unicode.org/faq/utf_bom.html#11

-j

_______________________________________________
r6rs-discuss mailing list
[email protected]
http://lists.r6rs.org/cgi-bin/mailman/listinfo/r6rs-discuss

Reply via email to