On Nov 22, 2007 1:04 PM, Daniel Önnerby <[EMAIL PROTECTED]> wrote:
> In the future I am using UTF8 encoded databases since the conversion of
> strings is a small thing for the system. The advantages of using UTF8
> are many:
> 1. Faster in most cases
> 2. Smaller databases (30% smaller in benchmark test database)
> 3. Less memory usage OR more information will fit in memory.

Well of course it comes at no surprise that if your database is
primarily US-ASCII text, UTF-8 will be better.  Smaller sizes mean
smaller comparisons and more packed b-trees.  UTF-16 is only good if
you have a lot of text that would be encoded with >= 2 UTF-8 code
units.

-- 
Cory Nelson

Reply via email to