"Jason Orendorff" <[EMAIL PROTECTED]> writes: > (c) If you know Unicode, it's not hard to work with code units.
I know Unicode to some extent, but I find it very hard to work with UTF-16 code units. It's hard enough that, for example, most code examples in Gillam's book, despite being written in Java, deal in terms of Unicode scalar values rather than Java's native representation. > UTF-8 and UTF-16 were explicitly designed with this in mind. If you > don't know Unicode, you're unlikely to write correct code on top of > the R5.92RS libraries anyway. Hiding code units eliminates exactly > one pitfall--among *many*. It's a particularly hideous pitfall, however. For fun, try to grok Unicode normalization by studying the Java sample implementation at: http://www.unicode.org/reports/tr15/Normalizer.java -- Cheers =8-} Mike Friede, Völkerverständigung und überhaupt blabla _______________________________________________ r6rs-discuss mailing list [email protected] http://lists.r6rs.org/cgi-bin/mailman/listinfo/r6rs-discuss
