On 10/11/2012 05:21 AM, eryksun wrote:
> On Thu, Oct 11, 2012 at 5:04 AM, Dave Angel <d...@davea.name> wrote:
>>
>> Actually, the upper limit for a decoded utf-8 character is at least 6
>> bytes.  I think it's 6, but it's no less than 6.
> 
> Yes, but what would be the point? Unicode only has 17 planes, up to
> code 0x10ffff. It's limited by UTF-16.

More importantly, it was restricted by RFC 3629 in 2003, which I had
completely missed.  The last time I wrote a UTF-8 encoder was before
that, probably around 1997.

    http://tools.ietf.org/html/rfc3629

Thanks for pointing it out.
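
For the archives, here is a quick sketch of what that restriction means
in practice.  It assumes Python 3, and utf8_length is just a helper name
made up for this example, not anything from the standard library.  It
checks the encoded size at the edges of each length range, up to the
0x10ffff ceiling:

```python
# Highest code point permitted by RFC 3629 (the last one in plane 16).
MAX_CODE_POINT = 0x10FFFF


def utf8_length(code_point):
    """Return the number of bytes UTF-8 uses for a single code point.

    Hypothetical helper for illustration; it relies on the built-in
    codec, so surrogates (0xD800-0xDFFF) would raise UnicodeEncodeError.
    """
    return len(chr(code_point).encode("utf-8"))


# First and last code points of the 1-, 2-, 3- and 4-byte ranges.
BOUNDARIES = (0x00, 0x7F, 0x80, 0x7FF, 0x800, 0xFFFF, 0x10000, MAX_CODE_POINT)

for cp in BOUNDARIES:
    print("U+%06X -> %d byte(s)" % (cp, utf8_length(cp)))

# The longest encoding seen across the boundaries above.
longest = max(utf8_length(cp) for cp in BOUNDARIES)
print("longest:", longest)
```

Nothing in that range needs more than 4 bytes, and Python 3's chr()
refuses anything above 0x10ffff, so the 5- and 6-byte forms from the
older definition never come up.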



-- 

DaveA
_______________________________________________
Tutor maillist  -  Tutor@python.org
To unsubscribe or change subscription options:
http://mail.python.org/mailman/listinfo/tutor