There is certainly a hole in the codes left for surrogates that starts at 
0xD8000.  And yeah, one doesn’t want to represent a surrogate code using 
surrogates.  

But, the surrogates are in the range 0xD800-0xDFFF.  The characters from 
0xE000-0xFFFF must also be represented as single 16-bit values, that is, 
without surrogates.  That includes the Private Use Area and the Compatibility 
Area.  

So, the test should be 0xFFFF.  If the code is above that, then use surrogates, 
otherwise use itself.  That is, if it can fit in 16 bits, do that.  

Dar Scott


On May 17, 2014, at 1:03 PM, Richmond <richmondmathew...@gmail.com> wrote:

> On 17/05/14 20:54, Scott Rossi wrote:
>> <snip>
>> 
>> 
>> One item that would be helpful to know is at what value does a unicode
>> character start being represented as pairs?  Is 65536 the upper limit for
>> single value characters?
>> 
>> 
> 
> <snip>
> 
> Representing as pairs starts at Hex D800 = Decimal 55296
> 
> Richmond.
> 
> 
> 
> _______________________________________________
> use-livecode mailing list
> use-livecode@lists.runrev.com
> Please visit this url to subscribe, unsubscribe and manage your subscription 
> preferences:
> http://lists.runrev.com/mailman/listinfo/use-livecode


_______________________________________________
use-livecode mailing list
use-livecode@lists.runrev.com
Please visit this url to subscribe, unsubscribe and manage your subscription 
preferences:
http://lists.runrev.com/mailman/listinfo/use-livecode

Reply via email to