On 15/04/2013, random...@fastmail.us <random...@fastmail.us> wrote:
> On Mon, Apr 15, 2013, at 10:58, Martti Kühne wrote:
>> According to a quick google those chars can become as wide as 6
>> bytes,
>
> No, they can't. I have no idea what your source on this is.

In UTF-8 the maximum encoded character length is 6 bytes [1]

[1] Linux docs: man 7 utf-8

This is more than a four-byte integer ('‿')
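
For anyone who wants to check the byte counts being argued about, here is a minimal Python sketch. It assumes the original UTF-8 layout, in which sequences of up to 6 bytes cover code points up to 0x7FFFFFFF; RFC 3629 later restricted UTF-8 to code points up to U+10FFFF, all of which fit in 4 bytes. The name utf8_len is made up for this example and is not part of any library.

```python
# Upper code point bound for each sequence length (1..6 bytes) in the
# original UTF-8 layout. RFC 3629 stops at U+10FFFF, i.e. 4 bytes.
LIMITS = [0x7F, 0x7FF, 0xFFFF, 0x1FFFFF, 0x3FFFFFF, 0x7FFFFFFF]


def utf8_len(cp: int) -> int:
    """Return the number of bytes needed to encode code point cp.

    Hypothetical helper: it only counts bytes, it does not encode.
    """
    if cp < 0:
        raise ValueError("code point must be non-negative")
    for nbytes, limit in enumerate(LIMITS, start=1):
        if cp <= limit:
            return nbytes
    raise ValueError("code point does not fit in 6 bytes")


if __name__ == "__main__":
    undertie = "\u203f"  # the character quoted in the message above
    print(utf8_len(ord(undertie)))        # computed by range lookup
    print(len(undertie.encode("utf-8")))  # what Python's own codec emits
    print(utf8_len(0x10FFFF))             # highest Unicode code point
    print(utf8_len(0x7FFFFFFF))           # highest value in the old layout
```

Python's built-in codec only goes up to U+10FFFF, so the 5- and 6-byte cases can be counted with this helper but not produced with str.encode.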