At 12:34 PM -0400 7/17/02, Mark J. Reed wrote:
>On Wed, Jul 17, 2002 at 12:13:47PM -0400, Dan Sugalski wrote:
>>  I thought Java used UTF-16. It's a variable-width encoding, so it
>>  should be fine. (Though I bet a lot of folks will be rather surprised
>>  when it happens...)
>UTF-16 isn't technically a variable-width encoding, since
>surrogate codes are still considered single characters - even
>though they only have meaning when combined in pairs.  It's much
>the same as multiple combining characters coming together to represent
>a single abstract entity that is also not really a "character"; the
>chief difference is that surrogates don't mean anything at all on their own.

Yeah, I see that's how the standard defines it, but... Looks like a 
serious dodge to me. :)
-- 
                                         Dan

--------------------------------------"it's like this"-------------------
Dan Sugalski                          even samurai
[EMAIL PROTECTED]                         have teddy bears and even
                                       teddy bears get drunk

Reply via email to