On Thu, Feb 15, 2001 at 02:31:03PM -0800, Hong Zhang wrote:
> Personally I like the UTF-8 encoding. The solution to the
> variable length can be handled by a special (virtual)
> function like
I'm expecting that the virtual, internal representation will not
be in a UTF but will simply be an array of codepoints. Manipulating
UTF8 internally is horrible because it's a variable length encoding,
so you need to keep track of where you are both in terms of characters
and bytes. Yuck, yuck, yuck.
--
Calm down, it's *only* ones and zeroes.