On 10/11/2012 05:21 AM, eryksun wrote: > On Thu, Oct 11, 2012 at 5:04 AM, Dave Angel <d...@davea.name> wrote: >> >> Actually, the upper limit for a decoded utf-8 character is at least 6 >> bytes. I think it's 6, but it's no less than 6. > > Yes, but what would be the point? Unicode only has 17 planes, up to > code 0x10ffff. It's limited by UTF-16.
More importantly, it was restricted by the 2003 rfc 3629, which I had completely missed. Last time I wrote a utf-8 encoder was before that probably about 1997. http://tools.ietf.org/html/rfc3629 Thanks for pointing it out. -- DaveA _______________________________________________ Tutor maillist - Tutor@python.org To unsubscribe or change subscription options: http://mail.python.org/mailman/listinfo/tutor