On Mar 25, 2007, at 10:32 PM, [EMAIL PROTECTED] wrote:

But I'll tell you what. Find a document, written by someone with
substantial Unicode experience, that recommends UTF-32 as the best overall
in-memory encoding.

For some "all-Scheme" systems, even UTF-32 may be suboptimal since string-ref would incur two additional instructions (shift and tag) while string- set! would take one instruction hit (untag) while ordinarily each could be done with a single machine instruction. A representation of strings as an array of tagged characters may be a win for all Scheme operations and would only lose for cross-language communication (which may lose anyways depending on
the encoding of the interfaced-to environment, or the number of types of
foreign libraries or operating systems).

I would not expect a Unicode expert to know about implementation details of optimizing Scheme implementations, which are far different from the details and constraints of a C library, a browser, or a stand-alone XSLT processor). I would take their advice as a rule-of-thumb (as in follow it when you don't
know any better).  I trust that the editors know better.

Aziz,,,

_______________________________________________
r6rs-discuss mailing list
[email protected]
http://lists.r6rs.org/cgi-bin/mailman/listinfo/r6rs-discuss

Reply via email to