On Tue, 2009-09-22 at 20:57 -0400, Aubrey Jaffer wrote:
> Unicode doesn't play well with a character datatype.  Downcasing or
> foldcasing a single scalar-value can result in a length 2 string.

That is not a problem with Unicode.  That is a problem with 
the assumption that there is a bijection between upcase
and downcase characters - an assumption violated by one
character in one language.  

> If anyone cares, other Unicode-supporting language development efforts
> seem to be moving away from the character datatype:

>  Accoring to <http://javascript.crockford.com/survey.html>, JavaScript
>  lacks chars:

>  String is a sequence of zero or more Unicode characters. There is no
>  separate character type.  A character is represented as a string of
>  length 1.

A sequence of what now?   What exactly is it represented as a 
string of length 1?



-t



_______________________________________________
r6rs-discuss mailing list
[email protected]
http://lists.r6rs.org/cgi-bin/mailman/listinfo/r6rs-discuss

Reply via email to