Rainer:

> I am still tempted to allow only octets in the range of 1..255. ;)

I think this restriction is at least technically possible, because 0x00
never appears as part of a multi-octet character in UTF-8: every octet
of a multi-octet sequence has its high bit set.  See the table here:
http://www.cl.cam.ac.uk/~mgk25/unicode.html#utf-8
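
For anyone who wants to check the claim rather than read the table, here
is a rough sketch in Python. It relies on Python's built-in UTF-8 encoder,
and the function name is only illustrative. It walks every Unicode scalar
value and collects those whose encoding contains a 0x00 octet. Only U+0000
itself should turn up.

```python
def code_points_with_nul():
    """Return every code point whose UTF-8 encoding contains octet 0x00."""
    hits = []
    for cp in range(0x110000):
        # Surrogates (U+D800..U+DFFF) are not valid scalar values and
        # cannot be encoded as UTF-8, so skip them.
        if 0xD800 <= cp <= 0xDFFF:
            continue
        if 0 in chr(cp).encode("utf-8"):
            hits.append(cp)
    return hits


if __name__ == "__main__":
    print(code_points_with_nul())
```

So a rule of "octets 1..255 only" would exclude exactly one character,
U+0000, and nothing else.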

However, I don't know whether excluding this octet value is necessary.
What do other UTF-8 based protocols do?

Anton.


