On Tue, Jul 8, 2014 at 9:37 PM, Stephan Beal <sgb...@googlemail.com> wrote:

> No characters between 128 and 255 are valid UTF-8, to avoid confusion with
> the many encodings which use that range.
>

For the record, that's apparently wrong. My local man pages (and
experimentation with the termbox API) say otherwise:

   Encoding
       The  following  byte sequences are used to represent a character.
 The sequence to be used depends on the UCS
       code number of the character:

       0x00000000 - 0x0000007F:
           0xxxxxxx

       0x00000080 - 0x000007FF:
           110xxxxx 10xxxxxx

So the range is used, but it encodes to two UTF-8 characters.


-- 
----- stephan beal
http://wanderinghorse.net/home/stephan/
http://gplus.to/sgbeal
"Freedom is sloppy. But since tyranny's the only guaranteed byproduct of
those who insist on a perfect world, freedom will have to do." -- Bigby Wolf
_______________________________________________
fossil-users mailing list
fossil-users@lists.fossil-scm.org
http://lists.fossil-scm.org:8080/cgi-bin/mailman/listinfo/fossil-users

Reply via email to