Right now, we go by how Ruby 1.8.x handles String.size. It'll be
Unicode-safe in the future.
On Wed, Jan 7, 2009 at 18:04, zbowling wrote:
>
> Welcome to UTF-8.
>
> This is something I consult on all the time. The days that encoding
> length equaled character size length and even equaled represe
Welcome to UTF-8.
This is something I consult on all the time. The days that encoding
length equaled character size length and even equaled representation
length are long gone. It's something you have to break your mind of
(and it doesn't help that languages like C and C++ call a byte a
"char".
Yes, we count by byte.
2009/1/7 benjackson :
>
> Just sent out the following tweet through the API:
>
> @gabrielemcrise acho que é um misto de pioneirismo +hype+base de
> usuários. E também o API, que cercou o serviço de ferramentas
> interessantes
>
> The international characters are being count
Just sent out the following tweet through the API:
@gabrielemcrise acho que é um misto de pioneirismo +hype+base de
usuários. E também o API, que cercou o serviço de ferramentas
interessantes
The international characters are being counted more than once and the
tweet shows up as:
@gabrielemcris